1
|
Panda A, Tuller T. Determinants of associations between codon and amino acid usage patterns of microbial communities and the environment inferred based on a cross-biome metagenomic analysis. NPJ Biofilms Microbiomes 2023; 9:5. [PMID: 36693851 PMCID: PMC9873608 DOI: 10.1038/s41522-023-00372-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2022] [Accepted: 01/11/2023] [Indexed: 01/25/2023] Open
Abstract
Codon and amino acid usage were associated with almost every aspect of microbial life. However, how the environment may impact the codon and amino acid choice of microbial communities at the habitat level is not clearly understood. Therefore, in this study, we analyzed codon and amino acid usage patterns of a large number of environmental samples collected from diverse ecological niches. Our results suggested that samples derived from similar environmental niches, in general, show overall similar codon and amino acid distribution as compared to samples from other habitats. To substantiate the relative impact of the environment, we considered several factors, such as their similarity in GC content, or in functional or taxonomic abundance. Our analysis demonstrated that none of these factors can fully explain the trends that we observed at the codon or amino acid level implying a direct environmental influence on them. Further, our analysis demonstrated different levels of selection on codon bias in different microbial communities with the highest bias in host-associated environments such as the digestive system or oral samples and the lowest level of selection in soil and water samples. Considering a large number of metagenomic samples here we showed that microorganisms collected from similar environmental backgrounds exhibit similar patterns of codon and amino acid usage irrespective of the location or time from where the samples were collected. Thus our study suggested a direct impact of the environment on codon and amino usage of microorganisms that cannot be explained considering the influence of other factors.
Collapse
Affiliation(s)
- Arup Panda
- Department of Biomedical Engineering, Tel Aviv University, Tel Aviv, 69978, Israel
| | - Tamir Tuller
- Department of Biomedical Engineering, Tel Aviv University, Tel Aviv, 69978, Israel.
| |
Collapse
|
2
|
Wang Z, Lu W, Rajapaksha P, Wilkop T, Cai Y, Wei Y. Comparison of in vitro and in vivo oligomeric states of a wild type and mutant trimeric inner membrane multidrug transporter. Biochem Biophys Rep 2018; 16:122-129. [PMID: 30417130 PMCID: PMC6216019 DOI: 10.1016/j.bbrep.2018.10.006] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Many membrane proteins exist and function as oligomers or protein complexes. Routine analytical methods involve extraction and solubilization of the proteins with detergents, which could disturb their actual oligomeric state. AcrB is a trimeric inner membrane multidrug transporter in E. coli. In previous studies, we created a mutant AcrBP223G, which behaves like a monomer when extracted from the cell membrane. However, the actual oligomeric state of AcrBP223G in cell membranes remained unclear, which complicated the interpretation of the mechanism by which the mutation affects function. Here we used several complementary methods to determine the oligomeric state of AcrBP223G in E. coli cell membranes. Two sets of quantitative fluorescent techniques were exploited. For these, we created fluorescent tagged AcrB, AcrB-CFP and AcrB-YPet. Fluorescence resonance energy transfer (FRET) and fluorescence recovery after photobleaching (FRAP) were employed to characterize independently the efficiency of energy transfer between co-expressed AcrB-CFP and AcrB-YPet, and the diffusion coefficient of AcrB-YPet and AcrBP223G-YPet in live E. coli cells. Second, we introduced Cys pairs at the inter-subunit interface and used controlled oxidation to probe inter-subunit distances. The results from all studies converge on the conclusion that AcrBP223G exists as a trimer in cell membranes, which dissociates during the purification steps. The small change in trimer affinity and structure leads to a significant loss of AcrB activity. In addition, throughout this study we developed protocols and established benchmark values, useful for further studies on membrane protein associations in cell membranes.
Collapse
Affiliation(s)
- Zhaoshuai Wang
- Department of Chemistry, University of Kentucky, Lexington, KY 40506, United States
| | - Wei Lu
- Department of Chemistry, University of Kentucky, Lexington, KY 40506, United States
| | - Prasangi Rajapaksha
- Department of Chemistry, University of Kentucky, Lexington, KY 40506, United States
| | - Thomas Wilkop
- Light Microscopy Core, University of Kentucky, Lexington, KY 40536, United States
| | - Yuguang Cai
- Department of Chemistry, University of Kentucky, Lexington, KY 40506, United States
| | - Yinan Wei
- Department of Chemistry, University of Kentucky, Lexington, KY 40506, United States
| |
Collapse
|
3
|
Danchin A, Ouzounis C, Tokuyasu T, Zucker JD. No wisdom in the crowd: genome annotation in the era of big data - current status and future prospects. Microb Biotechnol 2018; 11:588-605. [PMID: 29806194 PMCID: PMC6011933 DOI: 10.1111/1751-7915.13284] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Science and engineering rely on the accumulation and dissemination of knowledge to make discoveries and create new designs. Discovery-driven genome research rests on knowledge passed on via gene annotations. In response to the deluge of sequencing big data, standard annotation practice employs automated procedures that rely on majority rules. We argue this hinders progress through the generation and propagation of errors, leading investigators into blind alleys. More subtly, this inductive process discourages the discovery of novelty, which remains essential in biological research and reflects the nature of biology itself. Annotation systems, rather than being repositories of facts, should be tools that support multiple modes of inference. By combining deduction, induction and abduction, investigators can generate hypotheses when accurate knowledge is extracted from model databases. A key stance is to depart from 'the sequence tells the structure tells the function' fallacy, placing function first. We illustrate our approach with examples of critical or unexpected pathways, using MicroScope to demonstrate how tools can be implemented following the principles we advocate. We end with a challenge to the reader.
Collapse
Affiliation(s)
- Antoine Danchin
- Integromics, Institute of Cardiometabolism and Nutrition, Hôpital de la Pitié-Salpêtrière, 47 Boulevard de l'Hôpital, 75013, Paris, France
- School of Biomedical Sciences, Li KaShing Faculty of Medicine, Hong Kong University, 21 Sassoon Road, Pokfulam, Hong Kong
| | - Christos Ouzounis
- Biological Computation and Process Laboratory, Centre for Research and Technology Hellas, Chemical Process and Energy Resources Institute, Thessalonica, 57001, Greece
| | - Taku Tokuyasu
- Shenzhen Institutes of Advanced Technology, Institute of Synthetic Biology, Shenzhen University Town, 1068 Xueyuan Avenue, Shenzhen, China
| | - Jean-Daniel Zucker
- Integromics, Institute of Cardiometabolism and Nutrition, Hôpital de la Pitié-Salpêtrière, 47 Boulevard de l'Hôpital, 75013, Paris, France
| |
Collapse
|
4
|
He LS, Zhang PW, Huang JM, Zhu FC, Danchin A, Wang Y. The Enigmatic Genome of an Obligate Ancient Spiroplasma Symbiont in a Hadal Holothurian. Appl Environ Microbiol 2018; 84:e01965-17. [PMID: 29054873 PMCID: PMC5734040 DOI: 10.1128/aem.01965-17] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2017] [Accepted: 10/15/2017] [Indexed: 11/20/2022] Open
Abstract
Protective symbiosis has been reported in many organisms, but the molecular mechanisms of the mutualistic interactions between the symbionts and their hosts are unclear. Here, we sequenced the 424-kbp genome of "Candidatus Spiroplasma holothuricola," which dominated the hindgut microbiome of a sea cucumber, a major scavenger captured in the Mariana Trench (6,140 m depth). Phylogenetic relationships indicated that the dominant bacterium in the hindgut was derived from a basal group of Spiroplasma species. In this organism, the genes responsible for the biosynthesis of amino acids, glycolysis, and sugar transporters were lost, strongly suggesting endosymbiosis. The highly decayed genome consists of two chromosomes and harbors genes coding for proteolysis, microbial toxin, restriction-methylation systems, and clustered regularly interspaced short palindromic repeats (CRISPRs), composed of three cas genes and 76 CRISPR spacers. The holothurian host is probably protected against invading viruses from sediments by the CRISPRs/Cas and restriction systems of the endosymbiotic spiroplasma. The protective endosymbiosis indicates the important ecological role of the ancient Spiroplasma symbiont in the maintenance of hadal ecosystems.IMPORTANCE Sea cucumbers are major inhabitants in hadal trenches. They collect microbes in surface sediment and remain tolerant against potential pathogenic bacteria and viruses. This study presents the genome of endosymbiotic spiroplasmas in the gut of a sea cucumber captured in the Mariana Trench. The extreme reduction of the genome and loss of essential metabolic pathways strongly support its endosymbiotic lifestyle. Moreover, a considerable part of the genome was occupied by a CRISPR/Cas system to provide immunity against viruses and antimicrobial toxin-encoding genes for the degradation of microbes. This novel species of Spiroplasma is probably an important protective symbiont for the sea cucumbers in the hadal zone.
Collapse
Affiliation(s)
- Li-Sheng He
- Institute of Deep-Sea Science and Engineering, Chinese Academy of Sciences, Sanya, Hainan, China
| | - Pei-Wei Zhang
- Institute of Deep-Sea Science and Engineering, Chinese Academy of Sciences, Sanya, Hainan, China
| | - Jiao-Mei Huang
- Institute of Deep-Sea Science and Engineering, Chinese Academy of Sciences, Sanya, Hainan, China
| | - Fang-Chao Zhu
- Institute of Deep-Sea Science and Engineering, Chinese Academy of Sciences, Sanya, Hainan, China
| | - Antoine Danchin
- Institute of Cardiometabolism and Nutrition, Hôpital de la Pitié-Salpêtrière, Paris, France
| | - Yong Wang
- Institute of Deep-Sea Science and Engineering, Chinese Academy of Sciences, Sanya, Hainan, China
| |
Collapse
|
5
|
Francois CM, Duret L, Simon L, Mermillod-Blondin F, Malard F, Konecny-Dupré L, Planel R, Penel S, Douady CJ, Lefébure T. No Evidence That Nitrogen Limitation Influences the Elemental Composition of Isopod Transcriptomes and Proteomes. Mol Biol Evol 2016; 33:2605-20. [PMID: 27401232 DOI: 10.1093/molbev/msw131] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023] Open
Abstract
The field of stoichiogenomics aims at understanding the influence of nutrient limitations on the elemental composition of the genome, transcriptome, and proteome. The 20 amino acids and the 4 nt differ in the number of nutrients they contain, such as nitrogen (N). Thus, N limitation shall theoretically select for changes in the composition of proteins or RNAs through preferential use of N-poor amino acids or nucleotides, which will decrease the N-budget of an organism. While these N-saving mechanisms have been evidenced in microorganisms, they remain controversial in multicellular eukaryotes. In this study, we used 13 surface and subterranean isopod species pairs that face strongly contrasted N limitations, either in terms of quantity or quality. We combined in situ nutrient quantification and transcriptome sequencing to test if N limitation selected for N-savings through changes in the expression and composition of the transcriptome and proteome. No evidence of N-savings was found in the total N-budget of transcriptomes or proteomes or in the average protein N-cost. Nevertheless, subterranean species evolving in N-depleted habitats displayed lower N-usage at their third codon positions. To test if this convergent compositional change was driven by natural selection, we developed a method to detect the strand-asymmetric signature that stoichiogenomic selection should leave in the substitution pattern. No such signature was evidenced, indicating that the observed stoichiogenomic-like patterns were attributable to nonadaptive processes. The absence of stoichiogenomic signal despite strong N limitation within a powerful phylogenetic framework casts doubt on the existence of stoichiogenomic mechanisms in metazoans.
Collapse
Affiliation(s)
- Clémentine M Francois
- Univ Lyon, Université Claude Bernard Lyon 1, CNRS, ENTPE, Laboratoire d'Ecologie des Hydrosystèmes Naturels et Anthropisés UMR5023, Villeurbanne, France
| | - Laurent Duret
- Univ Lyon, Université Claude Bernard Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR5558, Villeurbanne, France
| | - Laurent Simon
- Univ Lyon, Université Claude Bernard Lyon 1, CNRS, ENTPE, Laboratoire d'Ecologie des Hydrosystèmes Naturels et Anthropisés UMR5023, Villeurbanne, France
| | - Florian Mermillod-Blondin
- Univ Lyon, Université Claude Bernard Lyon 1, CNRS, ENTPE, Laboratoire d'Ecologie des Hydrosystèmes Naturels et Anthropisés UMR5023, Villeurbanne, France
| | - Florian Malard
- Univ Lyon, Université Claude Bernard Lyon 1, CNRS, ENTPE, Laboratoire d'Ecologie des Hydrosystèmes Naturels et Anthropisés UMR5023, Villeurbanne, France
| | - Lara Konecny-Dupré
- Univ Lyon, Université Claude Bernard Lyon 1, CNRS, ENTPE, Laboratoire d'Ecologie des Hydrosystèmes Naturels et Anthropisés UMR5023, Villeurbanne, France
| | - Rémi Planel
- Univ Lyon, Université Claude Bernard Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR5558, Villeurbanne, France
| | - Simon Penel
- Univ Lyon, Université Claude Bernard Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR5558, Villeurbanne, France
| | - Christophe J Douady
- Univ Lyon, Université Claude Bernard Lyon 1, CNRS, ENTPE, Laboratoire d'Ecologie des Hydrosystèmes Naturels et Anthropisés UMR5023, Villeurbanne, France Institut Universitaire de France, Paris, France
| | - Tristan Lefébure
- Univ Lyon, Université Claude Bernard Lyon 1, CNRS, ENTPE, Laboratoire d'Ecologie des Hydrosystèmes Naturels et Anthropisés UMR5023, Villeurbanne, France
| |
Collapse
|
6
|
Danchin A. Archives or Palimpsests? Bacterial Genomes Unveil a Scenario for the Origin of Life. ACTA ACUST UNITED AC 2015. [DOI: 10.1162/biot.2007.2.1.52] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
|
7
|
Saidijam M, Patching SG. Amino acid composition analysis of secondary transport proteins from Escherichia coli with relation to functional classification, ligand specificity and structure. J Biomol Struct Dyn 2015; 33:2205-20. [DOI: 10.1080/07391102.2014.998283] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Affiliation(s)
- Massoud Saidijam
- Department of Molecular Medicine and Genetics, Research Centre for Molecular Medicine, School of Medicine, Hamadan University of Medical Sciences , Hamadan, Iran
| | - Simon G. Patching
- Department of Molecular Medicine and Genetics, Research Centre for Molecular Medicine, School of Medicine, Hamadan University of Medical Sciences , Hamadan, Iran
| |
Collapse
|
8
|
Danchin A, Sekowska A. The logic of metabolism and its fuzzy consequences. Environ Microbiol 2013; 16:19-28. [PMID: 24387040 DOI: 10.1111/1462-2920.12270] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2013] [Revised: 08/02/2013] [Accepted: 08/26/2013] [Indexed: 12/26/2022]
Abstract
Intermediary metabolism molecules are orchestrated into logical pathways stemming from history (L-amino acids, D-sugars) and dynamic constraints (hydrolysis of pyrophosphate or amide groups is the driving force of anabolism). Beside essential metabolites, numerous variants derive from programmed or accidental changes. Broken down, variants enter standard pathways, producing further variants. Macromolecule modification alters enzyme reactions specificity. Metabolism conform thermodynamic laws, precluding strict accuracy. Hence, for each regular pathway, a wealth of variants inputs and produces metabolites that are similar to but not the exact replicas of core metabolites. As corollary, a shadow, paralogous metabolism, is associated to standard metabolism. We focus on a logic of paralogous metabolism based on diversion of the core metabolic mimics into pathways where they are modified to minimize their input in the core pathways where they create havoc. We propose that a significant proportion of paralogues of well-characterized enzymes have evolved as the natural way to cope with paralogous metabolites. A second type of denouement uses a process where protecting/deprotecting unwanted metabolites - conceptually similar to the procedure used in the laboratory of an organic chemist - is used to enter a completely new catabolic pathway.
Collapse
Affiliation(s)
- Antoine Danchin
- Building G1, AMAbiotics SAS, 2 rue Gaston Crémieux, Evry, 91000, France
| | | |
Collapse
|
9
|
Heffron JD, Jenkins AL, Bozue JA, Kaatz LK, Cote CK, Welkos SL. Phenotypic changes in spores and vegetative cells of Bacillus anthracis associated with BenK. Microb Pathog 2012. [PMID: 23178382 DOI: 10.1016/j.micpath.2012.11.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
A transposon insertional mutagenesis spore library of the pathogen Bacillus anthracis was screened to identify mutants altered in germination kinetics. One mutant exhibited an accelerated rate of germination in association with disruption of benK. This gene encodes a putative protein with high homology to membrane transporters that facilitate benzoate transport. We hypothesized that BenK may not be only spore associated, but also have a vegetative cell role. A reporter strain with a translational fusion of benK to green fluorescent protein demonstrated that full-length BenK was present in vegetative cells and that a BenK degradation product was present in spores by detecting the reporter using fluorescence and Western blot analysis. A minimum inhibitory concentration assay indicated that vegetative cells of a benK::Kan mutant were more susceptible to the antimicrobial effects of Na-benzoate. The mutant spores germinated to a greater extent within 1 h than the wild type in an in vitro fluorescence assay. The disruption of benK also resulted in spores that were less readily phagocytosed in a macrophage assay. Despite these altered in vitro phenotypes, no apparent effect of the BenK protein on virulence in the intranasal mouse model or the guinea pig competitive assay was observed. This work shows that, although the BenK protein does not impact fitness or virulence in an infection model, it is involved in other aspects of both the spore and vegetative forms of the organism.
Collapse
Affiliation(s)
- Jared D Heffron
- United States Army Medical Research Institute of Infectious Diseases (USAMRIID), Bacteriology Division, 1425 Porter Street, Fort Detrick, Frederick, MD 21702-5011, USA
| | | | | | | | | | | |
Collapse
|
10
|
Medema MH, Zhou M, van Hijum SAFT, Gloerich J, Wessels HJCT, Siezen RJ, Strous M. A predicted physicochemically distinct sub-proteome associated with the intracellular organelle of the anammox bacterium Kuenenia stuttgartiensis. BMC Genomics 2010; 11:299. [PMID: 20459862 PMCID: PMC2881027 DOI: 10.1186/1471-2164-11-299] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2010] [Accepted: 05/12/2010] [Indexed: 12/31/2022] Open
Abstract
BACKGROUND Anaerobic ammonium-oxidizing (anammox) bacteria perform a key step in global nitrogen cycling. These bacteria make use of an organelle to oxidize ammonia anaerobically to nitrogen (N2) and so contribute approximately 50% of the nitrogen in the atmosphere. It is currently unknown which proteins constitute the organellar proteome and how anammox bacteria are able to specifically target organellar and cell-envelope proteins to their correct final destinations. Experimental approaches are complicated by the absence of pure cultures and genetic accessibility. However, the genome of the anammox bacterium Candidatus "Kuenenia stuttgartiensis" has recently been sequenced. Here, we make use of these genome data to predict the organellar sub-proteome and address the molecular basis of protein sorting in anammox bacteria. RESULTS Two training sets representing organellar (30 proteins) and cell envelope (59 proteins) proteins were constructed based on previous experimental evidence and comparative genomics. Random forest (RF) classifiers trained on these two sets could differentiate between organellar and cell envelope proteins with ~89% accuracy using 400 features consisting of frequencies of two adjacent amino acid combinations. A physicochemically distinct organellar sub-proteome containing 562 proteins was predicted with the best RF classifier. This set included almost all catabolic and respiratory factors encoded in the genome. Apparently, the cytoplasmic membrane performs no catabolic functions. We predict that the Tat-translocation system is located exclusively in the organellar membrane, whereas the Sec-translocation system is located on both the organellar and cytoplasmic membranes. Canonical signal peptides were predicted and validated experimentally, but a specific (N- or C-terminal) signal that could be used for protein targeting to the organelle remained elusive. CONCLUSIONS A physicochemically distinct organellar sub-proteome was predicted from the genome of the anammox bacterium K. stuttgartiensis. This result provides strong in silico support for the existing experimental evidence for the existence of an organelle in this bacterium, and is an important step forward in unravelling a geochemically relevant case of cytoplasmic differentiation in bacteria. The predicted dual location of the Sec-translocation system and the apparent absence of a specific N- or C-terminal signal in the organellar proteins suggests that additional chaperones may be necessary that act on an as-yet unknown property of the targeted proteins.
Collapse
Affiliation(s)
- Marnix H Medema
- Department of Microbiology, Radboud University Nijmegen, Toernooiveld 1, 6525 ED Nijmegen, the Netherlands
| | | | | | | | | | | | | |
Collapse
|
11
|
Codon Usage Patterns in Corynebacterium glutamicum: Mutational Bias, Natural Selection and Amino Acid Conservation. Comp Funct Genomics 2010; 2010:343569. [PMID: 20445740 PMCID: PMC2860111 DOI: 10.1155/2010/343569] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2009] [Revised: 01/29/2010] [Accepted: 02/04/2010] [Indexed: 11/17/2022] Open
Abstract
The alternative synonymous codons in Corynebacterium glutamicum, a well-known bacterium used in industry for the production of amino acid, have been investigated by multivariate analysis. As C. glutamicum is a GC-rich organism, G and C are expected to predominate at the third position of codons. Indeed, overall codon usage analyses have indicated that C and/or G ending codons are predominant in this organism. Through multivariate statistical analysis, apart from mutational selection, we identified three other trends of codon usage variation among the genes. Firstly, the majority of highly expressed genes are scattered towards the positive end of the first axis, whereas the majority of lowly expressed genes are clustered towards the other end of the first axis. Furthermore, the distinct difference in the two sets of genes was that the C ending codons are predominate in putatively highly expressed genes, suggesting that the C ending codons are translationally optimal in this organism. Secondly, the majority of the putatively highly expressed genes have a tendency to locate on the leading strand, which indicates that replicational and transciptional selection might be invoked. Thirdly, highly expressed genes are more conserved than lowly expressed genes by synonymous and nonsynonymous substitutions among orthologous genes fromthe genomes of C. glutamicum and C. diphtheriae. We also analyzed other factors such as the length of genes and hydrophobicity that might influence codon usage and found their contributions to be weak.
Collapse
|
12
|
de Lorenzo V. Recombinant bacteria for environmental release: what went wrong and what we have learnt from it. Clin Microbiol Infect 2009; 15 Suppl 1:63-5. [DOI: 10.1111/j.1469-0691.2008.02683.x] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
13
|
Abstract
ORFan genes can constitute a large fraction of a bacterial genome, but due to their lack of homologs, their functions have remained largely unexplored. To determine if particular features of ORFan-encoded proteins promote their presence in a genome, we analyzed properties of ORFans that originated over a broad evolutionary timescale. We also compared ORFan genes to another class of acquired genes, heterogeneous occurrence in prokaryotes (HOPs), which have homologs in other bacteria. A total of 54 ORFan and HOP genes selected from different phylogenetic depths in the Escherichia coli lineage were cloned, expressed, purified, and subjected to circular dichroism (CD) spectroscopy. A majority of genes could be expressed, but only 18 yielded sufficient soluble protein for spectral analysis. Of these, half were significantly alpha-helical, three were predominantly beta-sheet, and six were of intermediate/indeterminate structure. Although a higher proportion of HOPs yielded soluble proteins with resolvable secondary structures, ORFans resembled HOPs with regard to most of the other features tested. Overall, we found that those ORFan and HOP genes that have persisted in the E. coli lineage were more likely to encode soluble and folded proteins, more likely to display environmental modulation of their gene expression, and by extrapolation, are more likely to be functional.
Collapse
Affiliation(s)
- Hema Prasad Narra
- Department of Biochemistry & Molecular Biophysics, University of Arizona, Tucson, AZ, USA
| | - Matthew H. J. Cordes
- Department of Biochemistry & Molecular Biophysics, University of Arizona, Tucson, AZ, USA
| | - Howard Ochman
- Department of Biochemistry & Molecular Biophysics, University of Arizona, Tucson, AZ, USA
| |
Collapse
|
14
|
Riley M, Staley JT, Danchin A, Wang TZ, Brettin TS, Hauser LJ, Land ML, Thompson LS. Genomics of an extreme psychrophile, Psychromonas ingrahamii. BMC Genomics 2008; 9:210. [PMID: 18460197 PMCID: PMC2405808 DOI: 10.1186/1471-2164-9-210] [Citation(s) in RCA: 76] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2007] [Accepted: 05/06/2008] [Indexed: 11/10/2022] Open
Abstract
Background The genome sequence of the sea-ice bacterium Psychromonas ingrahamii 37, which grows exponentially at -12C, may reveal features that help to explain how this extreme psychrophile is able to grow at such low temperatures. Determination of the whole genome sequence allows comparison with genes of other psychrophiles and mesophiles. Results Correspondence analysis of the composition of all P. ingrahamii proteins showed that (1) there are 6 classes of proteins, at least one more than other bacteria, (2) integral inner membrane proteins are not sharply separated from bulk proteins suggesting that, overall, they may have a lower hydrophobic character, and (3) there is strong opposition between asparagine and the oxygen-sensitive amino acids methionine, arginine, cysteine and histidine and (4) one of the previously unseen clusters of proteins has a high proportion of "orphan" hypothetical proteins, raising the possibility these are cold-specific proteins. Based on annotation of proteins by sequence similarity, (1) P. ingrahamii has a large number (61) of regulators of cyclic GDP, suggesting that this bacterium produces an extracellular polysaccharide that may help sequester water or lower the freezing point in the vicinity of the cell. (2) P. ingrahamii has genes for production of the osmolyte, betaine choline, which may balance the osmotic pressure as sea ice freezes. (3) P. ingrahamii has a large number (11) of three-subunit TRAP systems that may play an important role in the transport of nutrients into the cell at low temperatures. (4) Chaperones and stress proteins may play a critical role in transforming nascent polypeptides into 3-dimensional configurations that permit low temperature growth. (5) Metabolic properties of P. ingrahamii were deduced. Finally, a few small sets of proteins of unknown function which may play a role in psychrophily have been singled out as worthy of future study. Conclusion The results of this genomic analysis provide a springboard for further investigations into mechanisms of psychrophily. Focus on the role of asparagine excess in proteins, targeted phenotypic characterizations and gene expression investigations are needed to ascertain if and how the organism regulates various proteins in response to growth at lower temperatures.
Collapse
Affiliation(s)
- Monica Riley
- Bay Paul Center, Marine Biological Laboratory, Woods Hole, MA 02543, USA.
| | | | | | | | | | | | | | | |
Collapse
|
15
|
Silva-Rocha R, de Lorenzo V. Mining logic gates in prokaryotic transcriptional regulation networks. FEBS Lett 2008; 582:1237-44. [PMID: 18275855 DOI: 10.1016/j.febslet.2008.01.060] [Citation(s) in RCA: 59] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2008] [Accepted: 01/28/2008] [Indexed: 10/22/2022]
Abstract
Prokaryotic transcriptional networks possess a large number of regulatory modules that formally implement many of the logic gates that are typical of digital, Boolean circuits. Yet, natural regulatory elements appear most often compressed and exaggeratedly context-dependent for any reliable circuit engineering barely comparable to electronic counterparts. To overcome this impasse, we argue that designing new functions with biological parts requires (i) the recognition of logic gates not yet assigned but surely present in the meta-genome, (ii) the orthogonalization and disambiguation of natural regulatory modules and (iii) the development of ways to tackle the connectivity and the definition of boundaries between minimal biological components.
Collapse
Affiliation(s)
- Rafael Silva-Rocha
- Centro Nacional de Biotecnología, CSIC, Campus de Cantoblanco, Madrid 28049, Spain
| | | |
Collapse
|
16
|
Touchon M, Rocha EPC. From GC skews to wavelets: a gentle guide to the analysis of compositional asymmetries in genomic data. Biochimie 2007; 90:648-59. [PMID: 17988781 DOI: 10.1016/j.biochi.2007.09.015] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2007] [Accepted: 09/21/2007] [Indexed: 12/29/2022]
Abstract
Compositional asymmetries are pervasive in DNA sequences. They are the result of the asymmetric interactions between DNA and cellular mechanisms such as replication and transcription. Here, we review many of the methods that have been proposed over the years to analyse compositional asymmetries in DNA sequences. Among these we list GC skews, oligonucleotide skews and wavelets, which among other uses have been extensively employed to delimitate origins and termini of replication in genomes. We also review the use of multivariate methods, such as factorial correspondence analysis, discriminant analysis and analysis of variance, which allow assigning compositional strand asymmetries to the different biological processes shaping sequence composition. Finally, we review methods that have been used to infer substitution matrices and allow understanding the mutational processes underlying strand asymmetry. We focus on replication asymmetries because they have been more thoroughly studied, but the methods may be adapted, and often are, to other problems. Although strand asymmetry has been studied more frequently through compositional skews of nucleotides or oligonucleotides, we recall that, depending on the goal of the analysis, other methods may be more appropriate to answer certain biological questions. We also refer to programs freely available to analyse strand asymmetry.
Collapse
Affiliation(s)
- Marie Touchon
- Atelier de Bioinformatique, Université Pierre et Marie Curie-Paris 6, Paris, France
| | | |
Collapse
|
17
|
Perlstein EO, de Bivort BL, Kunes S, Schreiber SL. Evolutionarily conserved optimization of amino acid biosynthesis. J Mol Evol 2007; 65:186-96. [PMID: 17684697 DOI: 10.1007/s00239-007-0013-x] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2007] [Accepted: 04/17/2007] [Indexed: 11/30/2022]
Abstract
The "cognate bias hypothesis" states that early in evolutionary history the biosynthetic enzymes for amino acid x gradually lost residues of x, thereby reducing the threshold for deleterious effects of x scarcity. The resulting reduction in cognate amino acid composition of the enzymes comprising a particular amino acid biosynthetic pathway is predicted to confer a selective growth advantage on cells. Bioinformatic evidence from protein-sequence data of two bacterial species previously demonstrated reduced cognate bias in amino acid biosynthetic pathways. Here we show that cognate bias in amino acid biosynthesis is present in the other domains of life-Archaebacteria and Eukaryota. We also observe evolutionarily conserved underrepresentations (e.g., glycine in methionine biosynthesis) and overrepresentations (e.g., tryptophan in asparagine biosynthesis) of amino acids in noncognate biosynthetic pathways, which can be explained by secondary amino acid metabolism. Additionally, we experimentally validate the cognate bias hypothesis using the yeast Saccharomyces cerevisiae. Specifically, we show that the degree to which growth declines following amino acid deprivation is negatively correlated with the degree to which an amino acid is underrepresented in the enzymes that comprise its cognate biosynthetic pathway. Moreover, we demonstrate that cognate fold representation is more predictive of growth advantage than a host of other potential growth-limiting factors, including an amino acid's metabolic cost or its intracellular concentration and compartmental distribution.
Collapse
Affiliation(s)
- Ethan O Perlstein
- Howard Hughes Medical Institute, Broad Institute of Harvard and MIT, 7 Cambridge Center, Cambridge, MA 02142, USA.
| | | | | | | |
Collapse
|
18
|
Bragg JG, Wagner A. Protein carbon content evolves in response to carbon availability and may influence the fate of duplicated genes. Proc Biol Sci 2007; 274:1063-70. [PMID: 17264057 PMCID: PMC2124476 DOI: 10.1098/rspb.2006.0290] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2006] [Revised: 01/06/2007] [Accepted: 01/09/2007] [Indexed: 11/12/2022] Open
Abstract
Natural selection can influence even the lowest level of biological organization, the atomic composition of biological macromolecules. In analysing genome-scale gene expression data, we find that ancestral yeast strains preferentially express proteins with low carbon content during carbon limitation, relative to strains selected in the laboratory under carbon limitation. The likely reason is that the artificially selected strains acquire adaptations that refine their response to the limitation or partly circumvent the limiting condition. This finding extends previous work which shows that natural selection can act on the atomic costs of proteins. We also show that genes with high carbon and nitrogen content are less likely to have duplicates, indicating that atomic composition also plays a role in evolution by gene duplication. Taken together, our results contribute to the emerging view that protein atomic composition influences genome and transcriptome evolution.
Collapse
Affiliation(s)
- Jason G Bragg
- Biology Department, University of New MexicoAlbuquerque, NM 87131, USA
| | - Andreas Wagner
- Department of Biochemistry, University of ZurichWinterthurerstrasse 190, 8057 Zurich, Switzerland
| |
Collapse
|