1
|
Rybina AA, Glushak RA, Bessonova TA, Dakhnovets AI, Rudenko AY, Ozhiganov RM, Kaznadzey AD, Tutukina MN, Gelfand MS. Phylogeny and structural modeling of the transcription factor CsqR (YihW) from Escherichia coli. Sci Rep 2024; 14:7852. [PMID: 38570624 PMCID: PMC10991401 DOI: 10.1038/s41598-024-58492-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Accepted: 03/29/2024] [Indexed: 04/05/2024] Open
Abstract
CsqR (YihW) is a local transcription factor that controls expression of yih genes involved in degradation of sulfoquinovose in Escherichia coli. We recently showed that expression of the respective gene cassette might be regulated by lactose. Here, we explore the phylogenetic and functional traits of CsqR. Phylogenetic analysis revealed that CsqR had a conserved Met25. Western blot demonstrated that CsqR was synthesized in the bacterial cell as two protein forms, 28.5 (CsqR-l) and 26 kDa (CsqR-s), the latter corresponding to start of translation at Met25. CsqR-s was dramatically activated during growth with sulfoquinovose as a sole carbon source, and displaced CsqR-l in the stationary phase during growth on rich medium. Molecular dynamic simulations revealed two possible states of the CsqR-s structure, with the interdomain linker being represented by either a disordered loop or an ɑ-helix. This helix allowed the hinge-like motion of the N-terminal domain resulting in a switch of CsqR-s between two conformational states, "open" and "compact". We then modeled the interaction of both CsqR forms with putative effectors sulfoquinovose, sulforhamnose, sulfoquinovosyl glycerol, and lactose, and revealed that they all preferred the same pocket in CsqR-l, while in CsqR-s there were two possible options dependent on the linker structure.
Collapse
Affiliation(s)
- Anna A Rybina
- Skolkovo Institute of Science and Technology, Moscow, Russia, 121205.
| | - Roman A Glushak
- Faculty of Biology, Lomonosov Moscow State University, Moscow, Russia, 119234
| | - Tatiana A Bessonova
- Institute of Cell Biophysics RAS (Federal Research Center "Pushchino Scientific Center for Biological Research RAS"), Pushchino, Russia, 142290
| | | | - Alexander Yu Rudenko
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, Russia, 119991
| | - Ratislav M Ozhiganov
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, Russia, 119991
| | - Anna D Kaznadzey
- Institute for Information Transmission Problems RAS, Moscow, Russia, 127051
| | - Maria N Tutukina
- Skolkovo Institute of Science and Technology, Moscow, Russia, 121205
- Institute of Cell Biophysics RAS (Federal Research Center "Pushchino Scientific Center for Biological Research RAS"), Pushchino, Russia, 142290
- Institute for Information Transmission Problems RAS, Moscow, Russia, 127051
| | - Mikhail S Gelfand
- Skolkovo Institute of Science and Technology, Moscow, Russia, 121205
- Institute for Information Transmission Problems RAS, Moscow, Russia, 127051
| |
Collapse
|
2
|
Garushyants SK, Sane M, Selifanova MV, Agashe D, Bazykin GA, Gelfand MS. Mutational Signatures in Wild Type Escherichia coli Strains Reveal Predominance of DNA Polymerase Errors. Genome Biol Evol 2024; 16:evae035. [PMID: 38401265 PMCID: PMC10995721 DOI: 10.1093/gbe/evae035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2023] [Revised: 02/13/2024] [Accepted: 02/17/2024] [Indexed: 02/26/2024] Open
Abstract
While mutational processes operating in the Escherichia coli genome have been revealed by multiple laboratory experiments, the contribution of these processes to accumulation of bacterial polymorphism and evolution in natural environments is unknown. To address this question, we reconstruct signatures of distinct mutational processes from experimental data on E. coli hypermutators, and ask how these processes contribute to differences between naturally occurring E. coli strains. We show that both mutations accumulated in the course of evolution of wild-type strains in nature and in the lab-grown nonmutator laboratory strains are explained predominantly by the low fidelity of DNA polymerases II and III. By contrast, contributions specific to disruption of DNA repair systems cannot be detected, suggesting that temporary accelerations of mutagenesis associated with such disruptions are unimportant for within-species evolution. These observations demonstrate that accumulation of diversity in bacterial strains in nature is predominantly associated with errors of DNA polymerases.
Collapse
Affiliation(s)
- Sofya K Garushyants
- A.A. Kharkevich Institute for Information Transmission Problems, RAS, Moscow, Russia
| | - Mrudula Sane
- National Centre for Biological Sciences, Bengaluru, India
| | - Maria V Selifanova
- Faculty of Bioengineering and Bioinformatics, M.V. Lomonosov Moscow State University, Moscow, Russia
| | - Deepa Agashe
- National Centre for Biological Sciences, Bengaluru, India
| | - Georgii A Bazykin
- A.A. Kharkevich Institute for Information Transmission Problems, RAS, Moscow, Russia
| | - Mikhail S Gelfand
- A.A. Kharkevich Institute for Information Transmission Problems, RAS, Moscow, Russia
- Center for Molecular and Cellular Biology, Skolkovo Institute of Science and Technology (Skoltech), Moscow, Russia
| |
Collapse
|
3
|
Bulygin I, Shatov V, Rykachevskiy A, Raiko A, Bernstein A, Burnaev E, Gelfand MS. Absence of enterotypes in the human gut microbiomes reanalyzed with non-linear dimensionality reduction methods. PeerJ 2023; 11:e15838. [PMID: 37701837 PMCID: PMC10494839 DOI: 10.7717/peerj.15838] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2022] [Accepted: 07/12/2023] [Indexed: 09/14/2023] Open
Abstract
Enterotypes of the human gut microbiome have been proposed to be a powerful prognostic tool to evaluate the correlation between lifestyle, nutrition, and disease. However, the number of enterotypes suggested in the literature ranged from two to four. The growth of available metagenome data and the use of exact, non-linear methods of data analysis challenges the very concept of clusters in the multidimensional space of bacterial microbiomes. Using several published human gut microbiome datasets of variable 16S rRNA regions, we demonstrate the presence of a lower-dimensional structure in the microbiome space, with high-dimensional data concentrated near a low-dimensional non-linear submanifold, but the absence of distinct and stable clusters that could represent enterotypes. This observation is robust with regard to diverse combinations of dimensionality reduction techniques and clustering algorithms.
Collapse
Affiliation(s)
- Ivan Bulygin
- Skolkovo Institute of Science and Technology, Moscow, Russia
| | | | | | - Arsenii Raiko
- Skolkovo Institute of Science and Technology, Moscow, Russia
| | | | - Evgeny Burnaev
- Skolkovo Institute of Science and Technology, Moscow, Russia
- Artificial Intelligence Research Institute (AIRI), Moscow, Russia
| | - Mikhail S. Gelfand
- Skolkovo Institute of Science and Technology, Moscow, Russia
- Institute for Information Transmission Problems, Moscow, Russia
| |
Collapse
|
4
|
Gaydukova SA, Moldovan MA, Vallesi A, Heaphy SM, Atkins JF, Gelfand MS, Baranov PV. Nontriplet feature of genetic code in Euplotes ciliates is a result of neutral evolution. Proc Natl Acad Sci U S A 2023; 120:e2221683120. [PMID: 37216548 PMCID: PMC10235951 DOI: 10.1073/pnas.2221683120] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Accepted: 04/12/2023] [Indexed: 05/24/2023] Open
Abstract
The triplet nature of the genetic code is considered a universal feature of known organisms. However, frequent stop codons at internal mRNA positions in Euplotes ciliates ultimately specify ribosomal frameshifting by one or two nucleotides depending on the context, thus posing a nontriplet feature of the genetic code of these organisms. Here, we sequenced transcriptomes of eight Euplotes species and assessed evolutionary patterns arising at frameshift sites. We show that frameshift sites are currently accumulating more rapidly by genetic drift than they are removed by weak selection. The time needed to reach the mutational equilibrium is several times longer than the age of Euplotes and is expected to occur after a several-fold increase in the frequency of frameshift sites. This suggests that Euplotes are at an early stage of the spread of frameshifting in expression of their genome. In addition, we find the net fitness burden of frameshift sites to be noncritical for the survival of Euplotes. Our results suggest that fundamental genome-wide changes such as a violation of the triplet character of genetic code can be introduced and maintained solely by neutral evolution.
Collapse
Affiliation(s)
- Sofya A. Gaydukova
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow199911, Russia
| | - Mikhail A. Moldovan
- A. A. Kharkevich Institute for Information Transmission Problems RAS, Moscow127051, Russia
| | - Adriana Vallesi
- Laboratory of Eukaryotic Microbiology and Animal Biology, School of Biosciences and Veterinary Medicine, University of Camerino, Camerino62032, Italy
| | - Stephen M. Heaphy
- School of Biochemistry and Cell Biology, University College Cork, CorkT12 XF62, Ireland
| | - John F. Atkins
- School of Biochemistry and Cell Biology, University College Cork, CorkT12 XF62, Ireland
- Department of Human Genetics, University of Utah, Salt Lake City, UT84112
| | - Mikhail S. Gelfand
- A. A. Kharkevich Institute for Information Transmission Problems RAS, Moscow127051, Russia
| | - Pavel V. Baranov
- School of Biochemistry and Cell Biology, University College Cork, CorkT12 XF62, Ireland
| |
Collapse
|
5
|
Kobets VA, Ulianov SV, Galitsyna AA, Doronin SA, Mikhaleva EA, Gelfand MS, Shevelyov YY, Razin SV, Khrameeva EE. HiConfidence: a novel approach uncovering the biological signal in Hi-C data affected by technical biases. Brief Bioinform 2023; 24:7033301. [PMID: 36759336 PMCID: PMC10025441 DOI: 10.1093/bib/bbad044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2022] [Revised: 01/04/2023] [Accepted: 01/20/2023] [Indexed: 02/11/2023] Open
Abstract
The chromatin interaction assays, particularly Hi-C, enable detailed studies of genome architecture in multiple organisms and model systems, resulting in a deeper understanding of gene expression regulation mechanisms mediated by epigenetics. However, the analysis and interpretation of Hi-C data remain challenging due to technical biases, limiting direct comparisons of datasets obtained in different experiments and laboratories. As a result, removing biases from Hi-C-generated chromatin contact matrices is a critical data analysis step. Our novel approach, HiConfidence, eliminates biases from the Hi-C data by weighing chromatin contacts according to their consistency between replicates so that low-quality replicates do not substantially influence the result. The algorithm is effective for the analysis of global changes in chromatin structures such as compartments and topologically associating domains. We apply the HiConfidence approach to several Hi-C datasets with significant technical biases, that could not be analyzed effectively using existing methods, and obtain meaningful biological conclusions. In particular, HiConfidence aids in the study of how changes in histone acetylation pattern affect chromatin organization in Drosophila melanogaster S2 cells. The method is freely available at GitHub: https://github.com/victorykobets/HiConfidence.
Collapse
Affiliation(s)
- Victoria A Kobets
- Skolkovo Institute of Science and Technology, Moscow, 121205, Russia
| | - Sergey V Ulianov
- Institute of Gene Biology, Russian Academy of Sciences, Moscow, 119334, Russia
- Faculty of Biology, M.V. Lomonosov Moscow State University, Moscow, 119992, Russia
| | - Aleksandra A Galitsyna
- Skolkovo Institute of Science and Technology, Moscow, 121205, Russia
- Institute of Gene Biology, Russian Academy of Sciences, Moscow, 119334, Russia
- A.A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, 127051, Russia
| | - Semen A Doronin
- Institute of Molecular Genetics of National Research Centre "Kurchatov Institute", Moscow, 123182, Russia
| | - Elena A Mikhaleva
- Institute of Molecular Genetics of National Research Centre "Kurchatov Institute", Moscow, 123182, Russia
| | - Mikhail S Gelfand
- Skolkovo Institute of Science and Technology, Moscow, 121205, Russia
- A.A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, 127051, Russia
| | - Yuri Y Shevelyov
- Institute of Molecular Genetics of National Research Centre "Kurchatov Institute", Moscow, 123182, Russia
| | - Sergey V Razin
- Institute of Gene Biology, Russian Academy of Sciences, Moscow, 119334, Russia
- Faculty of Biology, M.V. Lomonosov Moscow State University, Moscow, 119992, Russia
| | | |
Collapse
|
6
|
Tutukina MN, Dakhnovets AI, Kaznadzey AD, Gelfand MS, Ozoline ON. Sense and antisense RNA products of the uxuR gene can affect motility and chemotaxis acting independent of the UxuR protein. Front Mol Biosci 2023; 10:1121376. [PMID: 36936992 PMCID: PMC10016265 DOI: 10.3389/fmolb.2023.1121376] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2022] [Accepted: 02/06/2023] [Indexed: 02/19/2023] Open
Abstract
Small non-coding and antisense RNAs are widespread in all kingdoms of life, however, the diversity of their functions in bacteria is largely unknown. Here, we study RNAs synthesised from divergent promoters located in the 3'-end of the uxuR gene, encoding transcription factor regulating hexuronate metabolism in Escherichia coli. These overlapping promoters were predicted in silico with rather high scores, effectively bound RNA polymerase in vitro and in vivo and were capable of initiating transcription in sense and antisense directions. The genome-wide correlation between in silico promoter scores and RNA polymerase binding in vitro and in vivo was higher for promoters located on the antisense strands of the genes, however, sense promoters within the uxuR gene were more active. Both regulatory RNAs synthesised from the divergent promoters inhibited expression of genes associated with the E. coli motility and chemotaxis independent of a carbon source on which bacteria had been grown. Direct effects of these RNAs were confirmed for the fliA gene encoding σ28 subunit of RNA polymerase. In addition to intracellular sRNAs, promoters located within the uxuR gene could initiate synthesis of transcripts found in the fraction of RNAs secreted in the extracellular medium. Their profile was also carbon-independent suggesting that intragenic uxuR transcripts have a specific regulatory role not directly related to the function of the protein in which gene they are encoded.
Collapse
Affiliation(s)
- Maria N. Tutukina
- Center for Molecular and Cellular Biology, Skolkovo Institute of Science and Technology, Moscow, Russia
- Lab of Functional Genomics and Cellular Stress, Institute of Cell Biophysics RAS, FRC PRCBR, Pushchino, Russia
- RTC “Bioinformatics”, A. A. Kharkevich Institute for Information Transmission Problems RAS, Moscow, Russia
- *Correspondence: Maria N. Tutukina, , Olga N. Ozoline,
| | - Artemiy I. Dakhnovets
- Center for Molecular and Cellular Biology, Skolkovo Institute of Science and Technology, Moscow, Russia
- Department of Biotechnology, M. V. Lomonosov Moscow State University, Moscow, Russia
| | - Anna D. Kaznadzey
- RTC “Bioinformatics”, A. A. Kharkevich Institute for Information Transmission Problems RAS, Moscow, Russia
| | - Mikhail S. Gelfand
- Center for Molecular and Cellular Biology, Skolkovo Institute of Science and Technology, Moscow, Russia
- RTC “Bioinformatics”, A. A. Kharkevich Institute for Information Transmission Problems RAS, Moscow, Russia
| | - Olga N. Ozoline
- Lab of Functional Genomics and Cellular Stress, Institute of Cell Biophysics RAS, FRC PRCBR, Pushchino, Russia
- *Correspondence: Maria N. Tutukina, , Olga N. Ozoline,
| |
Collapse
|
7
|
Semenkov IN, Shelyakin PV, Nikolaeva DD, Tutukina MN, Sharapova AV, Lednev SA, Sarana YV, Gelfand MS, Krechetov PP, Koroleva TV. Data on the temporal changes in soil properties and microbiome composition after a jet-fuel contamination during the pot and field experiments. Data Brief 2022; 46:108860. [PMID: 36632439 PMCID: PMC9826931 DOI: 10.1016/j.dib.2022.108860] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2022] [Revised: 12/20/2022] [Accepted: 12/21/2022] [Indexed: 12/29/2022] Open
Abstract
The soil response to a jet-fuel contamination is uncertain. In this article, original data on the influence of a jet-fuel spillage on the topsoil properties are presented. The data set is obtained during a one-year long pot and field experiments with Dystric Arenosols, Fibric Histosols and Albic Luvisols. Kerosene loads were 1, 5, 10, 25 and 100 g/kg. The data set includes information about temporal changes in kerosene concentration; physicochemical properties, such as рН, moisture, cation exchange capacity, content of soil organic matter, available P and K, exchangeable NH4 +, and water-soluble NO3 -; and biological properties, such as biological consumption of oxygen, and cellulolytic activity. Also, we provide sequencing data on variable regions of 16S ribosomal RNA of microbial communities from the respective soil samples.
Collapse
Key Words
- AL, Albic Luvisols
- ASV, amplicon sequence variant
- Bearing capacity
- CA, cellulolytic activity
- CEC, cation exchange capacity
- DA, Dystric Arenosols
- DNA, deoxyribonucleic acid
- EDTA, Ethylenediaminetetraacetic acid
- Ecological indicators
- FH, Fibric Histosols
- Gasoline
- Kav, available potassium
- NH4+, exchangeable ammonium
- NO3–, water-soluble nitrate
- PCR, polymerase chain reaction
- Pav, available phosphorus
- SOM, soil organic matter
- Soil metagenome
- Soil pollution
- Total petroleum hydrocarbons
- WMO, World Meteorological Organization
- Xenobiotic compounds
- qPCR, real-time polymerase chain reaction
- rRNA, ribosomal ribonucleic acid
Collapse
Affiliation(s)
- Ivan N. Semenkov
- M.V. Lomonosov Moscow State University, 119991 Moscow, Russia,Correspondence.
| | - Pavel V. Shelyakin
- Institute for Information Transmission Problems (Kharkevich Institute), Russian Academy of Sciences, 127051 Moscow, Russia,Vavilov Institute of General Genetics, Russian Academy of Sciences, 119333 Moscow, Russia
| | | | - Maria N. Tutukina
- Institute for Information Transmission Problems (Kharkevich Institute), Russian Academy of Sciences, 127051 Moscow, Russia,Skolkovo Institute of Science and Technology, 121205 Moscow, Russia
| | | | | | - Yuliya V. Sarana
- Skolkovo Institute of Science and Technology, 121205 Moscow, Russia
| | - Mikhail S. Gelfand
- Institute for Information Transmission Problems (Kharkevich Institute), Russian Academy of Sciences, 127051 Moscow, Russia,Skolkovo Institute of Science and Technology, 121205 Moscow, Russia
| | | | | |
Collapse
|
8
|
Grigorashvili EI, Chervontseva ZS, Gelfand MS. Predicting RNA secondary structure by a neural network: what features may be learned? PeerJ 2022; 10:e14335. [PMID: 36530406 PMCID: PMC9756865 DOI: 10.7717/peerj.14335] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2022] [Accepted: 10/12/2022] [Indexed: 12/14/2022] Open
Abstract
Deep learning is a class of machine learning techniques capable of creating internal representation of data without explicit preprogramming. Hence, in addition to practical applications, it is of interest to analyze what features of biological data may be learned by such models. Here, we describe PredPair, a deep learning neural network trained to predict base pairs in RNA structure from sequence alone, without any incorporated prior knowledge, such as the stacking energies or possible spatial structures. PredPair learned the Watson-Crick and wobble base-pairing rules and created an internal representation of the stacking energies and helices. Application to independent experimental (DMS-Seq) data on nucleotide accessibility in mRNA showed that the nucleotides predicted as paired indeed tend to be involved in the RNA structure. The performance of the constructed model was comparable with the state-of-the-art method based on the thermodynamic approach, but with a higher false positives rate. On the other hand, it successfully predicted pseudoknots. t-SNE clusters of embeddings of RNA sequences created by PredPair tend to contain embeddings from particular Rfam families, supporting the predictions of PredPair being in line with biological classification.
Collapse
Affiliation(s)
| | | | - Mikhail S. Gelfand
- Center of Molecular and Cellular Biology, Skolkovo Institute of Science and Technology, Moscow, Russia,Institute of Information Transmission Problems, Moscow, Russia
| |
Collapse
|
9
|
Ozerova AM, Gelfand MS. Recapitulation of the embryonic transcriptional program in holometabolous insect pupae. Sci Rep 2022; 12:17570. [PMID: 36266393 PMCID: PMC9584902 DOI: 10.1038/s41598-022-22188-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2022] [Accepted: 10/11/2022] [Indexed: 01/13/2023] Open
Abstract
Holometabolous insects are predominantly motionless during metamorphosis, when no active feeding is observed and the body is enclosed in a hardened cuticle. These physiological properties as well as undergoing processes resemble embryogenesis, since at the pupal stage organs and systems of the imago are formed. Therefore, recapitulation of the embryonic expression program during metamorphosis could be hypothesized. To assess this hypothesis at the transcriptome level, we have performed a comprehensive analysis of the developmental datasets available in the public domain. Indeed, for most datasets, the pupal gene expression resembles the embryonic rather than the larval pattern, interrupting gradual changes in the transcriptome. Moreover, changes in the transcriptome profile during the pupa-to-imago transition are positively correlated with those at the embryo-to-larvae transition, suggesting that similar expression programs are activated. Gene sets that change their expression level during the larval stage and revert it to the embryonic-like state during the metamorphosis are enriched with genes associated with metabolism and development.
Collapse
Affiliation(s)
- Alexandra M. Ozerova
- grid.454320.40000 0004 0555 3608Skolkovo Institute of Science and Technology, Moscow, Russia
| | - Mikhail S. Gelfand
- grid.454320.40000 0004 0555 3608Skolkovo Institute of Science and Technology, Moscow, Russia ,grid.435025.50000 0004 0619 6198Institute for Information Transmission Problems (Kharkevich Institute), RAS, Moscow, Russia
| |
Collapse
|
10
|
Ashniev GA, Sernova NV, Shevkoplias AE, Rodionov ID, Rodionova IA, Vitreschak AG, Gelfand MS, Rodionov DA. Evolution of transcriptional regulation of histidine metabolism in Gram-positive bacteria. BMC Genomics 2022; 23:558. [PMID: 36008760 PMCID: PMC9413887 DOI: 10.1186/s12864-022-08796-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Accepted: 07/27/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The histidine metabolism and transport (his) genes are controlled by a variety of RNA-dependent regulatory systems among diverse taxonomic groups of bacteria including T-box riboswitches in Firmicutes and Actinobacteria and RNA attenuators in Proteobacteria. Using a comparative genomic approach, we previously identified a novel DNA-binding transcription factor (named HisR) that controls the histidine metabolism genes in diverse Gram-positive bacteria from the Firmicutes phylum. RESULTS Here we report the identification of HisR-binding sites within the regulatory regions of the histidine metabolism and transport genes in 395 genomes representing the Bacilli, Clostridia, Negativicutes, and Tissierellia classes of Firmicutes, as well as in 97 other HisR-encoding genomes from the Actinobacteria, Proteobacteria, and Synergistetes phyla. HisR belongs to the TrpR family of transcription factors, and their predicted DNA binding motifs have a similar 20-bp palindromic structure but distinct lineage-specific consensus sequences. The predicted HisR-binding motif was validated in vitro using DNA binding assays with purified protein from the human gut bacterium Ruminococcus gnavus. To fill a knowledge gap in the regulation of histidine metabolism genes in Firmicutes genomes that lack a hisR repressor gene, we systematically searched their upstream regions for potential RNA regulatory elements. As result, we identified 158 T-box riboswitches preceding the histidine biosynthesis and/or transport genes in 129 Firmicutes genomes. Finally, novel candidate RNA attenuators were identified upstream of the histidine biosynthesis operons in six species from the Bacillus cereus group, as well as in five Eubacteriales and six Erysipelotrichales species. CONCLUSIONS The obtained distribution of the HisR transcription factor and two RNA-mediated regulatory mechanisms for histidine metabolism genes across over 600 species of Firmicutes is discussed from functional and evolutionary points of view.
Collapse
Affiliation(s)
- German A Ashniev
- A.A. Kharkevich Institute for Information Transmission Problems, RAS, Moscow, Russia
| | - Natalia V Sernova
- A.A. Kharkevich Institute for Information Transmission Problems, RAS, Moscow, Russia
| | - Aleksei E Shevkoplias
- A.A. Kharkevich Institute for Information Transmission Problems, RAS, Moscow, Russia.,National Research University Higher School of Economics, Moscow, Russia
| | - Ivan D Rodionov
- University of California San Diego, La Jolla, San Diego, CA, USA
| | | | - Alexey G Vitreschak
- A.A. Kharkevich Institute for Information Transmission Problems, RAS, Moscow, Russia
| | - Mikhail S Gelfand
- A.A. Kharkevich Institute for Information Transmission Problems, RAS, Moscow, Russia.,Skolkovo Institute of Science and Technology, Moscow, Russia
| | - Dmitry A Rodionov
- Sanford-Burnham Medical Research Institute, La Jolla, San Diego, CA, USA.
| |
Collapse
|
11
|
Bessonova TA, Fando MS, Kostareva OS, Tutukina MN, Ozoline ON, Gelfand MS, Nikulin AD, Tishchenko SV. Differential Impact of Hexuronate Regulators ExuR and UxuR on the Escherichia coli Proteome. Int J Mol Sci 2022; 23:ijms23158379. [PMID: 35955512 PMCID: PMC9369180 DOI: 10.3390/ijms23158379] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2022] [Revised: 07/19/2022] [Accepted: 07/26/2022] [Indexed: 11/16/2022] Open
Abstract
ExuR and UxuR are paralogous proteins belonging to the GntR family of transcriptional regulators. Both are known to control hexuronic acid metabolism in a variety of Gammaproteobacteria but the relative impact of each of them is still unclear. Here, we apply 2D difference electrophoresis followed by mass-spectrometry to characterise the changes in the Escherichia coli proteome in response to a uxuR or exuR deletion. Our data clearly show that the effects are different: deletion of uxuR resulted in strongly enhanced expression of D-mannonate dehydratase UxuA and flagellar protein FliC, and in a reduced amount of outer membrane porin OmpF, while the absence of ExuR did not significantly alter the spectrum of detected proteins. Consequently, the physiological roles of proteins predicted as homologs seem to be far from identical. Effects of uxuR deletion were largely dependent on the cultivation conditions: during growth with glucose, UxuA and FliC were dramatically altered, while during growth with glucuronate, activation of both was not so prominent. During the growth with glucose, maximal activation was detected for FliC. This was further confirmed by expression analysis and physiological tests, thus suggesting the involvement of UxuR in the regulation of bacterial motility and biofilm formation.
Collapse
Affiliation(s)
- Tatiana A. Bessonova
- Institute of Cell Biophysics, Russian Academy of Sciences, PSCBR RAS, Institutskaya, 3, Pushchino 142290, Russia; (T.A.B.); (O.N.O.)
| | - Maria S. Fando
- Institute of Protein Research, Russian Academy of Sciences, Institutskaya, 4, Pushchino 142290, Russia; (M.S.F.); (O.S.K.); (A.D.N.); (S.V.T.)
| | - Olga S. Kostareva
- Institute of Protein Research, Russian Academy of Sciences, Institutskaya, 4, Pushchino 142290, Russia; (M.S.F.); (O.S.K.); (A.D.N.); (S.V.T.)
| | - Maria N. Tutukina
- Institute of Cell Biophysics, Russian Academy of Sciences, PSCBR RAS, Institutskaya, 3, Pushchino 142290, Russia; (T.A.B.); (O.N.O.)
- Skolkovo Institute of Science and Technology, Bolshoy Boulevard 30 Build 1, Moscow 121205, Russia;
- Institute for Information Transmission Problems, Russian Academy of Sciences, Bolshoy Karetny Per 19 Build 1, Moscow 127051, Russia
- Correspondence:
| | - Olga N. Ozoline
- Institute of Cell Biophysics, Russian Academy of Sciences, PSCBR RAS, Institutskaya, 3, Pushchino 142290, Russia; (T.A.B.); (O.N.O.)
| | - Mikhail S. Gelfand
- Skolkovo Institute of Science and Technology, Bolshoy Boulevard 30 Build 1, Moscow 121205, Russia;
- Institute for Information Transmission Problems, Russian Academy of Sciences, Bolshoy Karetny Per 19 Build 1, Moscow 127051, Russia
| | - Alexey D. Nikulin
- Institute of Protein Research, Russian Academy of Sciences, Institutskaya, 4, Pushchino 142290, Russia; (M.S.F.); (O.S.K.); (A.D.N.); (S.V.T.)
| | - Svetlana V. Tishchenko
- Institute of Protein Research, Russian Academy of Sciences, Institutskaya, 4, Pushchino 142290, Russia; (M.S.F.); (O.S.K.); (A.D.N.); (S.V.T.)
| |
Collapse
|
12
|
Moldovan MA, Chervontseva ZS, Nogina DS, Gelfand MS. A hierarchy in clusters of cephalopod mRNA editing sites. Sci Rep 2022; 12:3447. [PMID: 35236910 PMCID: PMC8891338 DOI: 10.1038/s41598-022-07460-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2021] [Accepted: 02/07/2022] [Indexed: 11/09/2022] Open
Abstract
RNA editing in the form of substituting adenine with inosine (A-to-I editing) is the most frequent type of RNA editing in many metazoan species. In most species, A-to-I editing sites tend to form clusters and editing at clustered sites depends on editing of the adjacent sites. Although functionally important in some specific cases, A-to-I editing usually is rare. The exception occurs in soft-bodied coleoid cephalopods, where tens of thousands of potentially important A-to-I editing sites have been identified, making coleoids an ideal model for studying of properties and evolution of A-to-I editing sites. Here, we apply several diverse techniques to demonstrate a strong tendency of coleoid RNA editing sites to cluster along the transcript. We show that clustering of editing sites and correlated editing substantially contribute to the transcriptome diversity that arises due to extensive RNA editing. Moreover, we identify three distinct types of editing site clusters, varying in size, and describe RNA structural features and mechanisms likely underlying formation of these clusters. In particular, these observations may explain sequence conservation at large distances around editing sites and the observed dependency of editing on mutations in the vicinity of editing sites.
Collapse
Affiliation(s)
- Mikhail A Moldovan
- Skolkovo Institute of Science and Technology, Bolshoy Boulevard 30, bld. 1, Moscow, Russia, 121205.
| | - Zoe S Chervontseva
- A.A.Kharkevich Institute for Information Transmission Problems (RAS), Bolshoy Karetny Per. 19, bld.1, Moscow, Russia, 127051
| | - Daria S Nogina
- A.A.Kharkevich Institute for Information Transmission Problems (RAS), Bolshoy Karetny Per. 19, bld.1, Moscow, Russia, 127051.,Faculty of Bioengineering and Bioinformatics, M.V. Lomonosov Moscow State University, Leninskie Gory 1, Moscow, Russia, 119991
| | - Mikhail S Gelfand
- Skolkovo Institute of Science and Technology, Bolshoy Boulevard 30, bld. 1, Moscow, Russia, 121205.,A.A.Kharkevich Institute for Information Transmission Problems (RAS), Bolshoy Karetny Per. 19, bld.1, Moscow, Russia, 127051
| |
Collapse
|
13
|
Shelyakin PV, Semenkov IN, Tutukina MN, Nikolaeva DD, Sharapova AV, Sarana YV, Lednev SA, Smolenkov AD, Gelfand MS, Krechetov PP, Koroleva TV. The Influence of Kerosene on Microbiomes of Diverse Soils. Life (Basel) 2022; 12:life12020221. [PMID: 35207510 PMCID: PMC8878009 DOI: 10.3390/life12020221] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2021] [Revised: 01/17/2022] [Accepted: 01/27/2022] [Indexed: 01/04/2023] Open
Abstract
One of the most important challenges for soil science is to determine the limits for the sustainable functioning of contaminated ecosystems. The response of soil microbiomes to kerosene pollution is still poorly understood. Here, we model the impact of kerosene leakage on the composition of the topsoil microbiome in pot and field experiments with different loads of added kerosene (loads up to 100 g/kg; retention time up to 360 days). At four time points we measured kerosene concentration and sequenced variable regions of 16S ribosomal RNA in the microbial communities. Mainly alkaline Dystric Arenosols with low content of available phosphorus and soil organic matter had an increased fraction of Actinobacteriota, Firmicutes, Nitrospirota, Planctomycetota, and, to a lesser extent, Acidobacteriota and Verrucomicobacteriota. In contrast, in highly acidic Fibric Histosols, rich in soil organic matter and available phosphorus, the fraction of Acidobacteriota was higher, while the fraction of Actinobacteriota was lower. Albic Luvisols occupied an intermediate position in terms of both physicochemical properties and microbiome composition. The microbiomes of different soils show similar response to equal kerosene loads. In highly contaminated soils, the proportion of anaerobic bacteria-metabolizing hydrocarbons increased, whereas the proportion of aerobic bacteria decreased. During the field experiment, the soil microbiome recovered much faster than in the pot experiments, possibly due to migration of microorganisms from the polluted area. The microbial community of Fibric Histosols recovered in 6 months after kerosene had been loaded, while microbiomes of Dystric Arenosols and Albic Luvisols did not restore even after a year.
Collapse
Affiliation(s)
- Pavel V. Shelyakin
- Institute for Information Transmission Problems (Kharkevich Institute), Russian Academy of Sciences, 127051 Moscow, Russia; (P.V.S.); (M.N.T.); (D.D.N.); (M.S.G.)
- Department of Computational Biology, N.I. Vavilov Institute of General Genetics, Russian Academy of Sciences, 119333 Moscow, Russia
| | - Ivan N. Semenkov
- Faculty of Geography, M.V. Lomonosov Moscow State University, 119991 Moscow, Russia; (A.V.S.); (S.A.L.); (P.P.K.); (T.V.K.)
- Correspondence:
| | - Maria N. Tutukina
- Institute for Information Transmission Problems (Kharkevich Institute), Russian Academy of Sciences, 127051 Moscow, Russia; (P.V.S.); (M.N.T.); (D.D.N.); (M.S.G.)
- Center of Molecular and Cellular Biology, Skolkovo Institute of Science and Technology, 121205 Moscow, Russia;
- Lab of Functional Genomics and Cellular Stress, Institute of Cell Biophysics RAS, 142290 Moscow, Russia
| | - Daria D. Nikolaeva
- Institute for Information Transmission Problems (Kharkevich Institute), Russian Academy of Sciences, 127051 Moscow, Russia; (P.V.S.); (M.N.T.); (D.D.N.); (M.S.G.)
- Center of Molecular and Cellular Biology, Skolkovo Institute of Science and Technology, 121205 Moscow, Russia;
| | - Anna V. Sharapova
- Faculty of Geography, M.V. Lomonosov Moscow State University, 119991 Moscow, Russia; (A.V.S.); (S.A.L.); (P.P.K.); (T.V.K.)
| | - Yulia V. Sarana
- Center of Molecular and Cellular Biology, Skolkovo Institute of Science and Technology, 121205 Moscow, Russia;
| | - Sergey A. Lednev
- Faculty of Geography, M.V. Lomonosov Moscow State University, 119991 Moscow, Russia; (A.V.S.); (S.A.L.); (P.P.K.); (T.V.K.)
| | | | - Mikhail S. Gelfand
- Institute for Information Transmission Problems (Kharkevich Institute), Russian Academy of Sciences, 127051 Moscow, Russia; (P.V.S.); (M.N.T.); (D.D.N.); (M.S.G.)
- Center of Molecular and Cellular Biology, Skolkovo Institute of Science and Technology, 121205 Moscow, Russia;
| | - Pavel P. Krechetov
- Faculty of Geography, M.V. Lomonosov Moscow State University, 119991 Moscow, Russia; (A.V.S.); (S.A.L.); (P.P.K.); (T.V.K.)
| | - Tatiana V. Koroleva
- Faculty of Geography, M.V. Lomonosov Moscow State University, 119991 Moscow, Russia; (A.V.S.); (S.A.L.); (P.P.K.); (T.V.K.)
| |
Collapse
|
14
|
Abstract
Over the past decade, genome-wide assays for chromatin interactions in single cells have enabled the study of individual nuclei at unprecedented resolution and throughput. Current chromosome conformation capture techniques survey contacts for up to tens of thousands of individual cells, improving our understanding of genome function in 3D. However, these methods recover a small fraction of all contacts in single cells, requiring specialised processing of sparse interactome data. In this review, we highlight recent advances in methods for the interpretation of single-cell genomic contacts. After discussing the strengths and limitations of these methods, we outline frontiers for future development in this rapidly moving field.
Collapse
Affiliation(s)
- Aleksandra A Galitsyna
- Skolkovo Institute of Science and Technology, Skolkovo, Russia
- Institute for Information Transmission Problems, RAS, Moscow, Russia
- Institute of Gene Biology, RAS, Moscow, Russia
| | - Mikhail S Gelfand
- Skolkovo Institute of Science and Technology, Skolkovo, Russia
- Institute for Information Transmission Problems, RAS, Moscow, Russia
| |
Collapse
|
15
|
Chervova A, Fatykhov B, Koblov A, Shvarov E, Preobrazhenskaya J, Vinogradov D, Ponomarev GV, Gelfand MS, Kazanov MD. Analysis of gene expression and mutation data points on contribution of transcription to the mutagenesis by APOBEC enzymes. NAR Cancer 2021; 3:zcab025. [PMID: 34316712 PMCID: PMC8253550 DOI: 10.1093/narcan/zcab025] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2020] [Revised: 06/04/2021] [Accepted: 06/14/2021] [Indexed: 11/30/2022] Open
Abstract
Since the discovery of the role of the APOBEC enzymes in human cancers, the mechanisms of this type of mutagenesis remain little understood. Theoretically, targeting of single-stranded DNA by the APOBEC enzymes could occur during cellular processes leading to the unwinding of DNA double-stranded structure. Some evidence points to the importance of replication in the APOBEC mutagenesis, while the role of transcription is still underexplored. Here, we analyzed gene expression and whole genome sequencing data from five types of human cancers with substantial APOBEC activity to estimate the involvement of transcription in the APOBEC mutagenesis and compare its impact with that of replication. Using the TCN motif as the mutation signature of the APOBEC enzymes, we observed a correlation of active APOBEC mutagenesis with gene expression, confirmed the increase of APOBEC-induced mutations in early-replicating regions and estimated the relative impact of transcription and replication on the APOBEC mutagenesis. We also found that the known effect of higher density of APOBEC-induced mutations on the lagging strand was highest in middle-replicating regions and observed higher APOBEC mutation density on the sense strand, the latter bias positively correlated with the gene expression level.
Collapse
Affiliation(s)
- Almira Chervova
- Institute of Oncology, Radiology and Nuclear Medicine, Dmitry Rogachev National Medical Research Center of Pediatric Hematology, Oncology and Immunology, Moscow, 117997, Russia
| | - Bulat Fatykhov
- Department of Control and Applied Mathematics, Moscow Institute of Physics and Technology, Dolgoprudny, 141700, Russia
| | | | | | - Julia Preobrazhenskaya
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow, 119234, Russia
| | - Dmitry Vinogradov
- Research and Training Center of Bioinformatics, Institute for Information Transmission Problems (the Kharkevich Institute, RAS), Moscow, 127051, Russia
| | - Gennady V Ponomarev
- Research and Training Center of Bioinformatics, Institute for Information Transmission Problems (the Kharkevich Institute, RAS), Moscow, 127051, Russia
| | - Mikhail S Gelfand
- Research and Training Center of Bioinformatics, Institute for Information Transmission Problems (the Kharkevich Institute, RAS), Moscow, 127051, Russia
| | - Marat D Kazanov
- Research and Training Center of Bioinformatics, Institute for Information Transmission Problems (the Kharkevich Institute, RAS), Moscow, 127051, Russia
| |
Collapse
|
16
|
Stetsenko A, Stehantsev P, Dranenko NO, Gelfand MS, Guskov A. Structural and biochemical characterization of a novel ZntB (CmaX) transporter protein from Pseudomonas aeruginosa. Int J Biol Macromol 2021; 184:760-767. [PMID: 34175341 DOI: 10.1016/j.ijbiomac.2021.06.130] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2021] [Revised: 06/16/2021] [Accepted: 06/18/2021] [Indexed: 11/19/2022]
Abstract
The 2-TM-GxN family of membrane proteins is widespread in prokaryotes and plays an important role in transport of divalent cations. The canonical signature motif, which is also a selectivity filter, has a composition of Gly-Met-Asn. Some members though deviate from this composition, however no data are available as to whether this has any functional implications. Here we report the functional and structural analysis of CmaX protein from a pathogenic Pseudomonas aeruginosa bacterium, which has a Gly-Ile-Asn signature motif. CmaX readily transports Zn2+, Mg2+, Cd2+, Ni2+ and Co2+ ions, but it does not utilize proton-symport as does ZntB from Escherichia coli. Together with the bioinformatics analysis, our data suggest that deviations from the canonical signature motif do not reveal any changes in substrate selectivity or transport and easily alter in course of evolution.
Collapse
Affiliation(s)
- Artem Stetsenko
- Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, the Netherlands
| | - Pavlo Stehantsev
- Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, the Netherlands
| | - Natalia O Dranenko
- Institute for Information Transmission Problems (Kharkevich Institute) RAS, Moscow, Russia
| | - Mikhail S Gelfand
- Institute for Information Transmission Problems (Kharkevich Institute) RAS, Moscow, Russia; Skolkovo Institute of Science and Technology, Moscow, Russia
| | - Albert Guskov
- Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, the Netherlands; Moscow Institute of Physics and Technology, Dolgoprudny, Russia.
| |
Collapse
|
17
|
Abstract
The ribosome is an essential cellular machine performing protein biosynthesis. Its structure and composition are highly conserved in all species. However, some bacteria have been reported to have an incomplete set of ribosomal proteins. We have analyzed ribosomal protein composition in 214 small bacterial genomes (<1 Mb) and found that although the ribosome composition is fairly stable, some ribosomal proteins may be absent, especially in bacteria with dramatically reduced genomes. The protein composition of the large subunit is less conserved than that of the small subunit. We have identified the set of frequently lost ribosomal proteins and demonstrated that they tend to be positioned on the ribosome surface and have fewer contacts to other ribosome components. Moreover, some proteins are lost in an evolutionary correlated manner. The reduction of ribosomal RNA is also common, with deletions mostly occurring in free loops. Finally, the loss of the anti-Shine-Dalgarno sequence is associated with the loss of a higher number of ribosomal proteins.
Collapse
Affiliation(s)
- Daria D Nikolaeva
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow, Russia.,Institute for Information Transmission Problems (Kharkevich Institute), Moscow, Russia
| | - Mikhail S Gelfand
- Institute for Information Transmission Problems (Kharkevich Institute), Moscow, Russia.,Center of Life Sciences, Skolkovo Institute of Science and Technology, Moscow, Russia
| | - Sofya K Garushyants
- Institute for Information Transmission Problems (Kharkevich Institute), Moscow, Russia
| |
Collapse
|
18
|
Suvorova IA, Gelfand MS. Comparative Analysis of the IclR-Family of Bacterial Transcription Factors and Their DNA-Binding Motifs: Structure, Positioning, Co-Evolution, Regulon Content. Front Microbiol 2021; 12:675815. [PMID: 34177859 PMCID: PMC8222616 DOI: 10.3389/fmicb.2021.675815] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2021] [Accepted: 05/14/2021] [Indexed: 11/13/2022] Open
Abstract
The IclR-family is a large group of transcription factors (TFs) regulating various biological processes in diverse bacteria. Using comparative genomics techniques, we have identified binding motifs of IclR-family TFs, reconstructed regulons and analyzed their content, finding co-occurrences between the regulated COGs (clusters of orthologous genes), useful for future functional characterizations of TFs and their regulated genes. We describe two main types of IclR-family motifs, similar in sequence but different in the arrangement of the half-sites (boxes), with GKTYCRYW3-4RYGRAMC and TGRAACAN1-2TGTTYCA consensuses, and also predict that TFs in 32 orthologous groups have binding sites comprised of three boxes with alternating direction, which implies two possible alternative modes of dimerization of TFs. We identified trends in site positioning relative to the translational gene start, and show that TFs in 94 orthologous groups bind tandem sites with 18-22 nucleotides between their centers. We predict protein-DNA contacts via the correlation analysis of nucleotides in binding sites and amino acids of the DNA-binding domain of TFs, and show that the majority of interacting positions and predicted contacts are similar for both types of motifs and conform well both to available experimental data and to general protein-DNA interaction trends.
Collapse
Affiliation(s)
- Inna A Suvorova
- Institute for Information Transmission Problems of Russian Academy of Sciences (The Kharkevich Institute), Moscow, Russia
| | - Mikhail S Gelfand
- Institute for Information Transmission Problems of Russian Academy of Sciences (The Kharkevich Institute), Moscow, Russia.,Skolkovo Institute of Science and Technology, Moscow, Russia
| |
Collapse
|
19
|
Seferbekova Z, Zabelkin A, Yakovleva Y, Afasizhev R, Dranenko NO, Alexeev N, Gelfand MS, Bochkareva OO. High Rates of Genome Rearrangements and Pathogenicity of Shigella spp. Front Microbiol 2021; 12:628622. [PMID: 33912145 PMCID: PMC8072062 DOI: 10.3389/fmicb.2021.628622] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2020] [Accepted: 03/22/2021] [Indexed: 02/01/2023] Open
Abstract
Shigella are pathogens originating within the Escherichia lineage but frequently classified as a separate genus. Shigella genomes contain numerous insertion sequences (ISs) that lead to pseudogenisation of affected genes and an increase of non-homologous recombination. Here, we study 414 genomes of E. coli and Shigella strains to assess the contribution of genomic rearrangements to Shigella evolution. We found that Shigella experienced exceptionally high rates of intragenomic rearrangements and had a decreased rate of homologous recombination compared to pathogenic and non-pathogenic E. coli. The high rearrangement rate resulted in independent disruption of syntenic regions and parallel rearrangements in different Shigella lineages. Specifically, we identified two types of chromosomally encoded E3 ubiquitin-protein ligases acquired independently by all Shigella strains that also showed a high level of sequence conservation in the promoter and further in the 5′-intergenic region. In the only available enteroinvasive E. coli (EIEC) strain, which is a pathogenic E. coli with a phenotype intermediate between Shigella and non-pathogenic E. coli, we found a rate of genome rearrangements comparable to those in other E. coli and no functional copies of the two Shigella-specific E3 ubiquitin ligases. These data indicate that the accumulation of ISs influenced many aspects of genome evolution and played an important role in the evolution of intracellular pathogens. Our research demonstrates the power of comparative genomics-based on synteny block composition and an important role of non-coding regions in the evolution of genomic islands.
Collapse
Affiliation(s)
- Zaira Seferbekova
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow, Russia.,Institute for Information Transmission Problems (The Kharkevich Institute, RAS), Moscow, Russia
| | - Alexey Zabelkin
- Computer Technologies Laboratory, ITMO University, Saint Petersburg, Russia.,JetBrains Research, Saint Petersburg, Russia.,Bioinformatics Institute, Saint Petersburg, Russia
| | - Yulia Yakovleva
- Bioinformatics Institute, Saint Petersburg, Russia.,Department of Cytology and Histology, Saint Petersburg State University, Saint Petersburg, Russia
| | - Robert Afasizhev
- Institute for Information Transmission Problems (The Kharkevich Institute, RAS), Moscow, Russia
| | - Natalia O Dranenko
- Institute for Information Transmission Problems (The Kharkevich Institute, RAS), Moscow, Russia
| | - Nikita Alexeev
- Computer Technologies Laboratory, ITMO University, Saint Petersburg, Russia
| | - Mikhail S Gelfand
- Institute for Information Transmission Problems (The Kharkevich Institute, RAS), Moscow, Russia.,Skolkovo Institute of Science and Technology, Moscow, Russia
| | - Olga O Bochkareva
- Institute for Information Transmission Problems (The Kharkevich Institute, RAS), Moscow, Russia.,Institute of Science and Technology (IST Austria), Klosterneuburg, Austria
| |
Collapse
|
20
|
Fedorov AK, Gelfand MS. Towards practical applications in quantum computational biology. Nat Comput Sci 2021; 1:114-119. [PMID: 38217223 DOI: 10.1038/s43588-021-00024-z] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/09/2020] [Accepted: 01/12/2021] [Indexed: 01/15/2024]
Abstract
Fascinating progress in understanding our world at the smallest scales moves us to the border of a new technological revolution governed by quantum physics. By taking advantage of quantum phenomena, quantum computing devices allow a speedup in solving diverse tasks. In this Perspective, we discuss the potential impact of quantum computing on computational biology. Bearing in mind the limitations of existing quantum computing devices, we attempt to indicate promising directions for further research in the emerging area of quantum computational biology.
Collapse
Affiliation(s)
- A K Fedorov
- Russian Quantum Center, Moscow, Russia.
- Moscow Institute of Physics and Technology, Dolgoprudny, Russia.
| | - M S Gelfand
- Skolkovo Institute of Science and Technology, Moscow, Russia
- Kharkevitch Institute for Information Transmission Problems, Moscow, Russia
| |
Collapse
|
21
|
Abstract
The ribosome is an essential cellular machine performing protein biosynthesis. Its structure and composition are highly conserved in all species. However, some bacteria have been reported to have an incomplete set of ribosomal proteins. We have analyzed ribosomal protein composition in 214 small bacterial genomes (<1 Mb) and found that although the ribosome composition is fairly stable, some ribosomal proteins may be absent, especially in bacteria with dramatically reduced genomes. The protein composition of the large subunit is less conserved than that of the small subunit. We have identified the set of frequently lost ribosomal proteins and demonstrated that they tend to be positioned on the ribosome surface and have fewer contacts to other ribosome components. Moreover, some proteins are lost in an evolutionary correlated manner. The reduction of ribosomal RNA is also common, with deletions mostly occurring in free loops. Finally, the loss of the anti-Shine-Dalgarno sequence is associated with the loss of a higher number of ribosomal proteins.
Collapse
Affiliation(s)
- Daria D Nikolaeva
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow, Russia
- Institute for Information Transmission Problems (Kharkevich Institute), Moscow, Russia
| | - Mikhail S Gelfand
- Institute for Information Transmission Problems (Kharkevich Institute), Moscow, Russia
- Center of Life Sciences, Skolkovo Institute of Science and Technology, Moscow, Russia
| | - Sofya K Garushyants
- Institute for Information Transmission Problems (Kharkevich Institute), Moscow, Russia
| |
Collapse
|
22
|
Abstract
Background Protein phosphorylation is the best studied post-translational modification strongly influencing protein function. Phosphorylated amino acids not only differ in physico-chemical properties from non-phosphorylated counterparts, but also exhibit different evolutionary patterns, tending to mutate to and originate from negatively charged amino acids (NCAs). The distribution of phosphosites along protein sequences is non-uniform, as phosphosites tend to cluster, forming so-called phospho-islands. Methods Here, we have developed a hidden Markov model-based procedure for the identification of phospho-islands and studied the properties of the obtained phosphorylation clusters. To check robustness of evolutionary analysis, we consider different models for the reconstructions of ancestral phosphorylation states. Results Clustered phosphosites differ from individual phosphosites in several functional and evolutionary aspects including underrepresentation of phosphotyrosines, higher conservation, more frequent mutations to NCAs. The spectrum of tissues, frequencies of specific phosphorylation contexts, and mutational patterns observed near clustered sites also are different.
Collapse
Affiliation(s)
| | - Mikhail S Gelfand
- Skolkovo Institute of Science and Technology, Moscow, Russia.,A. A. Kharkevich Institute for Information Transmission Problems, Moscow, Russia
| |
Collapse
|
23
|
Rozenwald MB, Galitsyna AA, Sapunov GV, Khrameeva EE, Gelfand MS. A machine learning framework for the prediction of chromatin folding in Drosophila using epigenetic features. PeerJ Comput Sci 2020; 6:e307. [PMID: 33816958 PMCID: PMC7924456 DOI: 10.7717/peerj-cs.307] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2020] [Accepted: 09/30/2020] [Indexed: 05/03/2023]
Abstract
Technological advances have lead to the creation of large epigenetic datasets, including information about DNA binding proteins and DNA spatial structure. Hi-C experiments have revealed that chromosomes are subdivided into sets of self-interacting domains called Topologically Associating Domains (TADs). TADs are involved in the regulation of gene expression activity, but the mechanisms of their formation are not yet fully understood. Here, we focus on machine learning methods to characterize DNA folding patterns in Drosophila based on chromatin marks across three cell lines. We present linear regression models with four types of regularization, gradient boosting, and recurrent neural networks (RNN) as tools to study chromatin folding characteristics associated with TADs given epigenetic chromatin immunoprecipitation data. The bidirectional long short-term memory RNN architecture produced the best prediction scores and identified biologically relevant features. Distribution of protein Chriz (Chromator) and histone modification H3K4me3 were selected as the most informative features for the prediction of TADs characteristics. This approach may be adapted to any similar biological dataset of chromatin features across various cell lines and species. The code for the implemented pipeline, Hi-ChiP-ML, is publicly available: https://github.com/MichalRozenwald/Hi-ChIP-ML.
Collapse
Affiliation(s)
- Michal B. Rozenwald
- Faculty of Computer Science, National Research University Higher School of Economics, Moscow, Russia
| | | | - Grigory V. Sapunov
- Faculty of Computer Science, National Research University Higher School of Economics, Moscow, Russia
- Intento, Inc., Berkeley, CA, USA
| | | | - Mikhail S. Gelfand
- Skolkovo Institute of Science and Technology, Moscow, Russia
- A.A. Kharkevich Institute for Information Transmission Problems, RAS, Moscow, Russia
| |
Collapse
|
24
|
Moldovan M, Chervontseva Z, Bazykin G, Gelfand MS. Adaptive evolution at mRNA editing sites in soft-bodied cephalopods. PeerJ 2020; 8:e10456. [PMID: 33312772 PMCID: PMC7703385 DOI: 10.7717/peerj.10456] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2020] [Accepted: 11/09/2020] [Indexed: 12/11/2022] Open
Abstract
Background The bulk of variability in mRNA sequence arises due to mutation—change in DNA sequence which is heritable if it occurs in the germline. However, variation in mRNA can also be achieved by post-transcriptional modification including mRNA editing, changes in mRNA nucleotide sequence that mimic the effect of mutations. Such modifications are not inherited directly; however, as the processes affecting them are encoded in the genome, they have a heritable component, and therefore can be shaped by selection. In soft-bodied cephalopods, adenine-to-inosine RNA editing is very frequent, and much of it occurs at nonsynonymous sites, affecting the sequence of the encoded protein. Methods We study selection regimes at coleoid A-to-I editing sites, estimate the prevalence of positive selection, and analyze interdependencies between the editing level and contextual characteristics of editing site. Results Here, we show that mRNA editing of individual nonsynonymous sites in cephalopods originates in evolution through substitutions at regions adjacent to these sites. As such substitutions mimic the effect of the substitution at the edited site itself, we hypothesize that they are favored by selection if the inosine is selectively advantageous to adenine at the edited position. Consistent with this hypothesis, we show that edited adenines are more frequently substituted with guanine, an informational analog of inosine, in the course of evolution than their unedited counterparts, and for heavily edited adenines, these transitions are favored by positive selection. Our study shows that coleoid editing sites may enhance adaptation, which, together with recent observations on Drosophila and human editing sites, points at a general role of RNA editing in the molecular evolution of metazoans.
Collapse
Affiliation(s)
- Mikhail Moldovan
- Skolkovo Institute of Science and Technology, Moscow, Russian Federation
| | - Zoe Chervontseva
- Skolkovo Institute of Science and Technology, Moscow, Russian Federation.,A.A.Kharkevich Institute for Information Transmission Problems (RAS), Moscow, Russian Federation
| | - Georgii Bazykin
- Skolkovo Institute of Science and Technology, Moscow, Russian Federation.,A.A.Kharkevich Institute for Information Transmission Problems (RAS), Moscow, Russian Federation
| | - Mikhail S Gelfand
- Skolkovo Institute of Science and Technology, Moscow, Russian Federation.,A.A.Kharkevich Institute for Information Transmission Problems (RAS), Moscow, Russian Federation
| |
Collapse
|
25
|
Osterman IA, Chervontseva ZS, Evfratov SA, Sorokina AV, Rodin VA, Rubtsova MP, Komarova ES, Zatsepin TS, Kabilov MR, Bogdanov AA, Gelfand MS, Dontsova OA, Sergiev PV. Translation at first sight: the influence of leading codons. Nucleic Acids Res 2020; 48:6931-6942. [PMID: 32427319 PMCID: PMC7337518 DOI: 10.1093/nar/gkaa430] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2020] [Revised: 05/07/2020] [Accepted: 05/14/2020] [Indexed: 01/31/2023] Open
Abstract
First triplets of mRNA coding region affect the yield of translation. We have applied the flowseq method to analyze >30 000 variants of the codons 2-11 of the fluorescent protein reporter to identify factors affecting the protein synthesis. While the negative influence of mRNA secondary structure on translation has been confirmed, a positive role of rare codons at the beginning of a coding sequence for gene expression has not been observed. The identity of triplets proximal to the start codon contributes more to the protein yield then more distant ones. Additional in-frame start codons enhance translation, while Shine-Dalgarno-like motifs downstream the initiation codon are inhibitory. The metabolic cost of amino acids affects the yield of protein in the poor medium. The most efficient translation was observed for variants with features resembling those of native Escherichia coli genes.
Collapse
Affiliation(s)
- Ilya A Osterman
- Skolkovo Institute of Science and Technology, Skolkovo, Moscow region 143025, Russia.,Lomonosov Moscow State University, Moscow 119992, Russia
| | - Zoe S Chervontseva
- Skolkovo Institute of Science and Technology, Skolkovo, Moscow region 143025, Russia.,A.A.Kharkevich Institute for Information Transmission Problems, Moscow 127051, Russia
| | | | - Alena V Sorokina
- Skolkovo Institute of Science and Technology, Skolkovo, Moscow region 143025, Russia
| | | | - Maria P Rubtsova
- Skolkovo Institute of Science and Technology, Skolkovo, Moscow region 143025, Russia.,Lomonosov Moscow State University, Moscow 119992, Russia
| | - Ekaterina S Komarova
- Skolkovo Institute of Science and Technology, Skolkovo, Moscow region 143025, Russia.,Lomonosov Moscow State University, Moscow 119992, Russia
| | - Timofei S Zatsepin
- Skolkovo Institute of Science and Technology, Skolkovo, Moscow region 143025, Russia.,Lomonosov Moscow State University, Moscow 119992, Russia
| | - Marsel R Kabilov
- Institute of Chemical Biology and Fundamental Medicine, Siberian Branch of the Russian Academy of Sciences, Novosibirsk 630090, Russia
| | | | - Mikhail S Gelfand
- Skolkovo Institute of Science and Technology, Skolkovo, Moscow region 143025, Russia.,A.A.Kharkevich Institute for Information Transmission Problems, Moscow 127051, Russia
| | - Olga A Dontsova
- Skolkovo Institute of Science and Technology, Skolkovo, Moscow region 143025, Russia.,Lomonosov Moscow State University, Moscow 119992, Russia.,Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Moscow 117997, Russia
| | - Petr V Sergiev
- Skolkovo Institute of Science and Technology, Skolkovo, Moscow region 143025, Russia.,Lomonosov Moscow State University, Moscow 119992, Russia
| |
Collapse
|
26
|
Soutourina O, Dubois T, Monot M, Shelyakin PV, Saujet L, Boudry P, Gelfand MS, Dupuy B, Martin-Verstraete I. Genome-Wide Transcription Start Site Mapping and Promoter Assignments to a Sigma Factor in the Human Enteropathogen Clostridioides difficile. Front Microbiol 2020; 11:1939. [PMID: 32903654 PMCID: PMC7438776 DOI: 10.3389/fmicb.2020.01939] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2020] [Accepted: 07/23/2020] [Indexed: 12/12/2022] Open
Abstract
The emerging human enteropathogen Clostridioides difficile is the main cause of diarrhea associated with antibiotherapy. Regulatory pathways underlying the adaptive responses remain understudied and the global view of C. difficile promoter structure is still missing. In the genome of C. difficile 630, 22 genes encoding sigma factors are present suggesting a complex pattern of transcription in this bacterium. We present here the first transcriptional map of the C. difficile genome resulting from the identification of transcriptional start sites (TSS), promoter motifs and operon structures. By 5′-end RNA-seq approach, we mapped more than 1000 TSS upstream of genes. In addition to these primary TSS, this analysis revealed complex structure of transcriptional units such as alternative and internal promoters, potential RNA processing events and 5′ untranslated regions. By following an in silico iterative strategy that used as an input previously published consensus sequences and transcriptomic analysis, we identified candidate promoters upstream of most of protein-coding and non-coding RNAs genes. This strategy also led to refine consensus sequences of promoters recognized by major sigma factors of C. difficile. Detailed analysis focuses on the transcription in the pathogenicity locus and regulatory genes, as well as regulons of transition phase and sporulation sigma factors as important components of C. difficile regulatory network governing toxin gene expression and spore formation. Among the still uncharacterized regulons of the major sigma factors of C. difficile, we defined the SigL regulon by combining transcriptome and in silico analyses. We showed that the SigL regulon is largely involved in amino-acid degradation, a metabolism crucial for C. difficile gut colonization. Finally, we combined our TSS mapping, in silico identification of promoters and RNA-seq data to improve gene annotation and to suggest operon organization in C. difficile. These data will considerably improve our knowledge of global regulatory circuits controlling gene expression in C. difficile and will serve as a useful rich resource for scientific community both for the detailed analysis of specific genes and systems biology studies.
Collapse
Affiliation(s)
- Olga Soutourina
- Laboratoire Pathogenèses des Bactéries Anaérobies, Institut Pasteur, UMR CNRS 2001, Université de Paris, Paris, France.,Institut Universitaire de France, Paris, France.,Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), Gif-sur-Yvette, France
| | - Thomas Dubois
- Laboratoire Pathogenèses des Bactéries Anaérobies, Institut Pasteur, UMR CNRS 2001, Université de Paris, Paris, France
| | - Marc Monot
- Laboratoire Pathogenèses des Bactéries Anaérobies, Institut Pasteur, UMR CNRS 2001, Université de Paris, Paris, France
| | | | - Laure Saujet
- Laboratoire Pathogenèses des Bactéries Anaérobies, Institut Pasteur, UMR CNRS 2001, Université de Paris, Paris, France
| | - Pierre Boudry
- Laboratoire Pathogenèses des Bactéries Anaérobies, Institut Pasteur, UMR CNRS 2001, Université de Paris, Paris, France
| | - Mikhail S Gelfand
- Institute for Information Transmission Problems, Moscow, Russia.,Skolkovo Institute of Science and Technology, Moscow, Russia
| | - Bruno Dupuy
- Laboratoire Pathogenèses des Bactéries Anaérobies, Institut Pasteur, UMR CNRS 2001, Université de Paris, Paris, France
| | - Isabelle Martin-Verstraete
- Laboratoire Pathogenèses des Bactéries Anaérobies, Institut Pasteur, UMR CNRS 2001, Université de Paris, Paris, France.,Institut Universitaire de France, Paris, France
| |
Collapse
|
27
|
Samborskaia MD, Galitsyna A, Pletenev I, Trofimova A, Mironov AA, Gelfand MS, Khrameeva EE. Cumulative contact frequency of a chromatin region is an intrinsic property linked to its function. PeerJ 2020; 8:e9566. [PMID: 32864204 PMCID: PMC7425636 DOI: 10.7717/peerj.9566] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2020] [Accepted: 06/27/2020] [Indexed: 12/18/2022] Open
Abstract
Regulation of gene transcription is a complex process controlled by many factors, including the conformation of chromatin in the nucleus. Insights into chromatin conformation on both local and global scales can be provided by the Hi-C (high-throughput chromosomes conformation capture) method. One of the drawbacks of Hi-C analysis and interpretation is the presence of systematic biases, such as different accessibility to enzymes, amplification, and mappability of DNA regions, which all result in different visibility of the regions. Iterative correction (IC) is one of the most popular techniques developed for the elimination of these systematic biases. IC is based on the assumption that all chromatin regions have an equal number of observed contacts in Hi-C. In other words, the IC procedure is equalizing the experimental visibility approximated by the cumulative contact frequency (CCF) for all genomic regions. However, the differences in experimental visibility might be explained by biological factors such as chromatin openness, which is characteristic of distinct chromatin states. Here we show that CCF is positively correlated with active transcription. It is associated with compartment organization, since compartment A demonstrates higher CCF and gene expression levels than compartment B. Notably, this observation holds for a wide range of species, including human, mouse, and Drosophila. Moreover, we track the CCF state for syntenic blocks between human and mouse and conclude that active state assessed by CCF is an intrinsic property of the DNA region, which is independent of local genomic and epigenomic context. Our findings establish a missing link between Hi-C normalization procedures removing CCF from the data and poorly investigated and possibly relevant biological factors contributing to CCF.
Collapse
Affiliation(s)
- Margarita D Samborskaia
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow, Russia
| | - Aleksandra Galitsyna
- Center of Life Sciences, Skolkovo Institute of Science and Technology, Moscow, Russia.,A.A. Kharkevich Institute for Information Transmission Problems, RAS, Moscow, Russia.,Institute of Gene Biology, RAS, Moscow, Russia
| | - Ilya Pletenev
- Center of Life Sciences, Skolkovo Institute of Science and Technology, Moscow, Russia
| | - Anna Trofimova
- Center of Life Sciences, Skolkovo Institute of Science and Technology, Moscow, Russia
| | - Andrey A Mironov
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow, Russia.,A.A. Kharkevich Institute for Information Transmission Problems, RAS, Moscow, Russia
| | - Mikhail S Gelfand
- Center of Life Sciences, Skolkovo Institute of Science and Technology, Moscow, Russia.,A.A. Kharkevich Institute for Information Transmission Problems, RAS, Moscow, Russia
| | - Ekaterina E Khrameeva
- Center of Life Sciences, Skolkovo Institute of Science and Technology, Moscow, Russia
| |
Collapse
|
28
|
Komarova ES, Chervontseva ZS, Osterman IA, Evfratov SA, Rubtsova MP, Zatsepin TS, Semashko TA, Kostryukova ES, Bogdanov AA, Gelfand MS, Dontsova OA, Sergiev PV. Influence of the spacer region between the Shine-Dalgarno box and the start codon for fine-tuning of the translation efficiency in Escherichia coli. Microb Biotechnol 2020; 13:1254-1261. [PMID: 32202698 PMCID: PMC7264876 DOI: 10.1111/1751-7915.13561] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2019] [Revised: 02/06/2020] [Accepted: 03/02/2020] [Indexed: 01/27/2023] Open
Abstract
Translation efficiency contributes several orders of magnitude difference in the overall yield of exogenous gene expression in bacteria. In diverse bacteria, the translation initiation site, whose sequence is the primary determinant of the translation performance, is comprised of the start codon and the Shine-Dalgarno box located upstream. Here, we have examined how the sequence of a spacer between these main components of the translation initiation site contributes to the yield of synthesized protein. We have created a library of reporter constructs with the randomized spacer region, performed fluorescently activated cell sorting and applied next-generation sequencing analysis (the FlowSeq protocol). As a result, we have identified sequence motifs for the spacer region between the Shine-Dalgarno box and AUG start codon that may modulate the translation efficiency in a 100-fold range.
Collapse
Affiliation(s)
- Ekaterina S. Komarova
- Skolkovo Institute of Science and TechnologyMoscow143025Russia
- Department of ChemistryFaculty of Bioengineering and BioinformaticsInstitute of Functional GenomicsA.N. Belozersky Institute of Physico‐Chemical BiologyLomonosov Moscow State UniversityMoscow119992Russia
| | - Zoya S. Chervontseva
- Skolkovo Institute of Science and TechnologyMoscow143025Russia
- A.A. Kharkevich Institute for Information Transmission Problems RASMoscow127051Russia
| | - Ilya A. Osterman
- Skolkovo Institute of Science and TechnologyMoscow143025Russia
- Department of ChemistryFaculty of Bioengineering and BioinformaticsInstitute of Functional GenomicsA.N. Belozersky Institute of Physico‐Chemical BiologyLomonosov Moscow State UniversityMoscow119992Russia
| | - Sergey A. Evfratov
- Department of ChemistryFaculty of Bioengineering and BioinformaticsInstitute of Functional GenomicsA.N. Belozersky Institute of Physico‐Chemical BiologyLomonosov Moscow State UniversityMoscow119992Russia
| | - Maria P. Rubtsova
- Skolkovo Institute of Science and TechnologyMoscow143025Russia
- Department of ChemistryFaculty of Bioengineering and BioinformaticsInstitute of Functional GenomicsA.N. Belozersky Institute of Physico‐Chemical BiologyLomonosov Moscow State UniversityMoscow119992Russia
| | - Timofei S. Zatsepin
- Skolkovo Institute of Science and TechnologyMoscow143025Russia
- Department of ChemistryFaculty of Bioengineering and BioinformaticsInstitute of Functional GenomicsA.N. Belozersky Institute of Physico‐Chemical BiologyLomonosov Moscow State UniversityMoscow119992Russia
| | | | - Elena S. Kostryukova
- Research Institute for Physical‐Chemical MedicineFMBAMoscow119435Russia
- Moscow Institute of Physics and TechnologyMoscow region141700Russia
| | - Alexey A. Bogdanov
- Department of ChemistryFaculty of Bioengineering and BioinformaticsInstitute of Functional GenomicsA.N. Belozersky Institute of Physico‐Chemical BiologyLomonosov Moscow State UniversityMoscow119992Russia
| | - Mikhail S. Gelfand
- Skolkovo Institute of Science and TechnologyMoscow143025Russia
- A.A. Kharkevich Institute for Information Transmission Problems RASMoscow127051Russia
- National Research University Higher School of EconomicsMoscow125319Russia
| | - Olga A. Dontsova
- Skolkovo Institute of Science and TechnologyMoscow143025Russia
- Department of ChemistryFaculty of Bioengineering and BioinformaticsInstitute of Functional GenomicsA.N. Belozersky Institute of Physico‐Chemical BiologyLomonosov Moscow State UniversityMoscow119992Russia
- Shemyakin‐Ovchinnikov Institute of Bioorganic ChemistryMoscow117997Russia
| | - Petr V. Sergiev
- Skolkovo Institute of Science and TechnologyMoscow143025Russia
- Department of ChemistryFaculty of Bioengineering and BioinformaticsInstitute of Functional GenomicsA.N. Belozersky Institute of Physico‐Chemical BiologyLomonosov Moscow State UniversityMoscow119992Russia
| |
Collapse
|
29
|
Babenko VV, Podgorny OV, Manuvera VA, Kasianov AS, Manolov AI, Grafskaia EN, Shirokov DA, Kurdyumov AS, Vinogradov DV, Nikitina AS, Kovalchuk SI, Anikanov NA, Butenko IO, Pobeguts OV, Matyushkina DS, Rakitina DV, Kostryukova ES, Zgoda VG, Baskova IP, Trukhan VM, Gelfand MS, Govorun VM, Schiöth HB, Lazarev VN. Draft genome sequences of Hirudo medicinalis and salivary transcriptome of three closely related medicinal leeches. BMC Genomics 2020; 21:331. [PMID: 32349672 PMCID: PMC7191736 DOI: 10.1186/s12864-020-6748-0] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2019] [Accepted: 04/21/2020] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND Salivary cell secretion (SCS) plays a critical role in blood feeding by medicinal leeches, making them of use for certain medical purposes even today. RESULTS We annotated the Hirudo medicinalis genome and performed RNA-seq on salivary cells isolated from three closely related leech species, H. medicinalis, Hirudo orientalis, and Hirudo verbana. Differential expression analysis verified by proteomics identified salivary cell-specific gene expression, many of which encode previously unknown salivary components. However, the genes encoding known anticoagulants have been found to be expressed not only in salivary cells. The function-related analysis of the unique salivary cell genes enabled an update of the concept of interactions between salivary proteins and components of haemostasis. CONCLUSIONS Here we report a genome draft of Hirudo medicinalis and describe identification of novel salivary proteins and new homologs of genes encoding known anticoagulants in transcriptomes of three medicinal leech species. Our data provide new insights in genetics of blood-feeding lifestyle in leeches.
Collapse
Affiliation(s)
- Vladislav V Babenko
- Federal Research and Clinical Centre of Physical-Chemical Medicine of Federal Medical Biological Agency, 1a Malaya Pirogovskaya Str, Moscow, 119435, Russia.
| | - Oleg V Podgorny
- Federal Research and Clinical Centre of Physical-Chemical Medicine of Federal Medical Biological Agency, 1a Malaya Pirogovskaya Str, Moscow, 119435, Russia
- Koltzov Institute of Developmental Biology, Russian Academy of Sciences, 26 Vavilov str, Moscow, 119334, Russia
| | - Valentin A Manuvera
- Federal Research and Clinical Centre of Physical-Chemical Medicine of Federal Medical Biological Agency, 1a Malaya Pirogovskaya Str, Moscow, 119435, Russia
- Moscow Institute of Physics and Technology, 9 Institutskiy per., Dolgoprudny, Moscow Region, 141700, Russia
| | - Artem S Kasianov
- Moscow Institute of Physics and Technology, 9 Institutskiy per., Dolgoprudny, Moscow Region, 141700, Russia
- Vavilov Institute of General Genetics, Russian Academy of Sciences, 3 Gubkina str, Moscow, 119991, Russia
| | - Alexander I Manolov
- Federal Research and Clinical Centre of Physical-Chemical Medicine of Federal Medical Biological Agency, 1a Malaya Pirogovskaya Str, Moscow, 119435, Russia
| | - Ekaterina N Grafskaia
- Federal Research and Clinical Centre of Physical-Chemical Medicine of Federal Medical Biological Agency, 1a Malaya Pirogovskaya Str, Moscow, 119435, Russia
- Moscow Institute of Physics and Technology, 9 Institutskiy per., Dolgoprudny, Moscow Region, 141700, Russia
| | - Dmitriy A Shirokov
- Federal Research and Clinical Centre of Physical-Chemical Medicine of Federal Medical Biological Agency, 1a Malaya Pirogovskaya Str, Moscow, 119435, Russia
| | - Alexey S Kurdyumov
- Federal Research and Clinical Centre of Physical-Chemical Medicine of Federal Medical Biological Agency, 1a Malaya Pirogovskaya Str, Moscow, 119435, Russia
| | - Dmitriy V Vinogradov
- A.A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, 19 Bol'shoi Karetnyi per, Moscow, 127051, Russia
- Skolkovo Institute of Science and Technology, 3 Nobelya Ulitsa str, Moscow, 121205, Russia
| | - Anastasia S Nikitina
- Federal Research and Clinical Centre of Physical-Chemical Medicine of Federal Medical Biological Agency, 1a Malaya Pirogovskaya Str, Moscow, 119435, Russia
- Moscow Institute of Physics and Technology, 9 Institutskiy per., Dolgoprudny, Moscow Region, 141700, Russia
| | - Sergey I Kovalchuk
- Federal Research and Clinical Centre of Physical-Chemical Medicine of Federal Medical Biological Agency, 1a Malaya Pirogovskaya Str, Moscow, 119435, Russia
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, 16/10 Miklukho-Maklaya str, Moscow, 117997, Russia
| | - Nickolay A Anikanov
- Federal Research and Clinical Centre of Physical-Chemical Medicine of Federal Medical Biological Agency, 1a Malaya Pirogovskaya Str, Moscow, 119435, Russia
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, 16/10 Miklukho-Maklaya str, Moscow, 117997, Russia
| | - Ivan O Butenko
- Federal Research and Clinical Centre of Physical-Chemical Medicine of Federal Medical Biological Agency, 1a Malaya Pirogovskaya Str, Moscow, 119435, Russia
| | - Olga V Pobeguts
- Federal Research and Clinical Centre of Physical-Chemical Medicine of Federal Medical Biological Agency, 1a Malaya Pirogovskaya Str, Moscow, 119435, Russia
| | - Daria S Matyushkina
- Federal Research and Clinical Centre of Physical-Chemical Medicine of Federal Medical Biological Agency, 1a Malaya Pirogovskaya Str, Moscow, 119435, Russia
| | - Daria V Rakitina
- Federal Research and Clinical Centre of Physical-Chemical Medicine of Federal Medical Biological Agency, 1a Malaya Pirogovskaya Str, Moscow, 119435, Russia
| | - Elena S Kostryukova
- Federal Research and Clinical Centre of Physical-Chemical Medicine of Federal Medical Biological Agency, 1a Malaya Pirogovskaya Str, Moscow, 119435, Russia
| | - Victor G Zgoda
- V.N. Orekhovich Research Institute of Biomedical Chemistry, Russian Academy of Medical Sciences, 10 Pogodinskaja str, Moscow, 119832, Russia
| | - Isolda P Baskova
- Faculty of Biology, Lomonosov Moscow State University, 1-12 Leninskie Gory, Moscow, 119991, Russia
| | - Vladimir M Trukhan
- I.M. Sechenov First Moscow State Medical University of the Ministry of Healthcare of the Russian Federation (Sechenovskiy University), Trubetskaya str., 8-2, Moscow, 119991, Russia
| | - Mikhail S Gelfand
- A.A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, 19 Bol'shoi Karetnyi per, Moscow, 127051, Russia
- Skolkovo Institute of Science and Technology, 3 Nobelya Ulitsa str, Moscow, 121205, Russia
- Faculty of Computer Science, National Research University Higher School of Economics, 20 Myasnitskaya str, Moscow, 101000, Russia
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, 1-73 Leninskie Gory, Moscow, 119991, Russia
| | - Vadim M Govorun
- Federal Research and Clinical Centre of Physical-Chemical Medicine of Federal Medical Biological Agency, 1a Malaya Pirogovskaya Str, Moscow, 119435, Russia
- Moscow Institute of Physics and Technology, 9 Institutskiy per., Dolgoprudny, Moscow Region, 141700, Russia
| | - Helgi B Schiöth
- I.M. Sechenov First Moscow State Medical University of the Ministry of Healthcare of the Russian Federation (Sechenovskiy University), Trubetskaya str., 8-2, Moscow, 119991, Russia
- Functional Pharmacology, Department of Neuroscience, Uppsala University, Husargatan 3, Uppsala, 75124, Sweden
| | - Vassili N Lazarev
- Federal Research and Clinical Centre of Physical-Chemical Medicine of Federal Medical Biological Agency, 1a Malaya Pirogovskaya Str, Moscow, 119435, Russia
- Moscow Institute of Physics and Technology, 9 Institutskiy per., Dolgoprudny, Moscow Region, 141700, Russia
| |
Collapse
|
30
|
Sigalova OM, Chaplin AV, Bochkareva OO, Shelyakin PV, Filaretov VA, Akkuratov EE, Burskaia V, Gelfand MS. Chlamydia pan-genomic analysis reveals balance between host adaptation and selective pressure to genome reduction. BMC Genomics 2019; 20:710. [PMID: 31510914 PMCID: PMC6740158 DOI: 10.1186/s12864-019-6059-5] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2019] [Accepted: 08/22/2019] [Indexed: 01/23/2023] Open
Abstract
Background Chlamydia are ancient intracellular pathogens with reduced, though strikingly conserved genome. Despite their parasitic lifestyle and isolated intracellular environment, these bacteria managed to avoid accumulation of deleterious mutations leading to subsequent genome degradation characteristic for many parasitic bacteria. Results We report pan-genomic analysis of sixteen species from genus Chlamydia including identification and functional annotation of orthologous genes, and characterization of gene gains, losses, and rearrangements. We demonstrate the overall genome stability of these bacteria as indicated by a large fraction of common genes with conserved genomic locations. On the other hand, extreme evolvability is confined to several paralogous gene families such as polymorphic membrane proteins and phospholipase D, and likely is caused by the pressure from the host immune system. Conclusions This combination of a large, conserved core genome and a small, evolvable periphery likely reflect the balance between the selective pressure towards genome reduction and the need to adapt to escape from the host immunity. Electronic supplementary material The online version of this article (10.1186/s12864-019-6059-5) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Olga M Sigalova
- Kharkevich Institute for Information Transmission Problems, RAS, Moscow, Russia.,current address: European Molecular Biology Laboratory, Heidelberg, Germany
| | - Andrei V Chaplin
- Microbiology and Virology Department, Pirogov Russian National Research Medical University, Moscow, Russia
| | - Olga O Bochkareva
- Kharkevich Institute for Information Transmission Problems, RAS, Moscow, Russia. .,current address: Institute of Science and Technology Austria, Klosterneuburg, Austria.
| | - Pavel V Shelyakin
- Kharkevich Institute for Information Transmission Problems, RAS, Moscow, Russia.,Center for Life Sciences, Skolkovo Institute of Science and Technology, Moscow, Russia.,Vavilov Institute of General Genetics, RAS, Moscow, Russia
| | | | - Evgeny E Akkuratov
- Institute of Translational Biomedicine, St. Petersburg State University, St. Petersburg, Russia.,current address: Science for Life Laboratory, Department of Applied Physics, Royal Institute of Technology, Stockholm, Sweden
| | - Valentina Burskaia
- Center for Life Sciences, Skolkovo Institute of Science and Technology, Moscow, Russia
| | - Mikhail S Gelfand
- Kharkevich Institute for Information Transmission Problems, RAS, Moscow, Russia.,Center for Life Sciences, Skolkovo Institute of Science and Technology, Moscow, Russia.,Faculty of Computer Science, Higher School of Economics, Moscow, Russia
| |
Collapse
|
31
|
Fedonin GG, Eroshkin A, Cieplak P, Matveev EV, Ponomarev GV, Gelfand MS, Ratnikov BI, Kazanov MD. Predictive models of protease specificity based on quantitative protease-activity profiling data. Biochim Biophys Acta Proteins Proteom 2019; 1867:140253. [PMID: 31330204 DOI: 10.1016/j.bbapap.2019.07.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/08/2019] [Revised: 07/09/2019] [Accepted: 07/17/2019] [Indexed: 10/26/2022]
Abstract
Bioinformatics-based prediction of protease substrates can help to elucidate regulatory proteolytic pathways that control a broad range of biological processes such as apoptosis and blood coagulation. The majority of published predictive models are position weight matrices (PWM) reflecting specificity of proteases toward target sequence. These models are typically derived from experimental data on positions of hydrolyzed peptide bonds and show a reasonable predictive power. New emerging techniques that not only register the cleavage position but also measure catalytic efficiency of proteolysis are expected to improve the quality of predictions or at least substantially reduce the number of tested substrates required for confident predictions. The main goal of this study was to develop new prediction models based on such data and to estimate the performance of the constructed models. We used data on catalytic efficiency of proteolysis measured for eight major human matrix metalloproteinases to construct predictive models of protease specificity using a variety of regression analysis techniques. The obtained results suggest that efficiency-based (quantitative) models show a comparable performance with conventional PWM-based algorithms, while less training data are required. The derived list of candidate cleavage sites in human secreted proteins may serve as a starting point for experimental analysis.
Collapse
Affiliation(s)
- Gennady G Fedonin
- Central Research Institute of Epidemiology, Moscow 111123, Russia; A.A.Kharkevich Institute of Information Transmission Problems, Moscow 127051, Russia; Moscow Institute of Physics and Technology, Dolgoprudny 141700, Russia
| | - Alexey Eroshkin
- Sanford-Burnham-Prebys Medical Discovery Institute, La Jolla, CA 92037, USA
| | - Piotr Cieplak
- Sanford-Burnham-Prebys Medical Discovery Institute, La Jolla, CA 92037, USA
| | | | - Gennady V Ponomarev
- A.A.Kharkevich Institute of Information Transmission Problems, Moscow 127051, Russia
| | - Mikhail S Gelfand
- A.A.Kharkevich Institute of Information Transmission Problems, Moscow 127051, Russia; Skolkovo Institute of Science and Technology, Moscow 121205, Russia; National Research University Higher School of Economics, Moscow 101000, Russia
| | - Boris I Ratnikov
- Sanford-Burnham-Prebys Medical Discovery Institute, La Jolla, CA 92037, USA
| | - Marat D Kazanov
- A.A.Kharkevich Institute of Information Transmission Problems, Moscow 127051, Russia; Skolkovo Institute of Science and Technology, Moscow 121205, Russia; Dmitry Rogachev National Medical Research Center of Pediatric Hematology, Oncology and Immunology, Moscow 117997, Russia.
| |
Collapse
|
32
|
Suvorova IA, Gelfand MS. Comparative Genomic Analysis of the Regulation of Aromatic Metabolism in Betaproteobacteria. Front Microbiol 2019; 10:642. [PMID: 30984152 PMCID: PMC6449761 DOI: 10.3389/fmicb.2019.00642] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2018] [Accepted: 03/14/2019] [Indexed: 01/23/2023] Open
Abstract
Aromatic compounds are a common carbon and energy source for many microorganisms, some of which can even degrade toxic chloroaromatic xenobiotics. This comparative study of aromatic metabolism in 32 Betaproteobacteria species describes the links between several transcription factors (TFs) that control benzoate (BenR, BenM, BoxR, BzdR), catechol (CatR, CatM, BenM), chlorocatechol (ClcR), methylcatechol (MmlR), 2,4-dichlorophenoxyacetate (TfdR, TfdS), phenol (AphS, AphR, AphT), biphenyl (BphS), and toluene (TbuT) metabolism. We characterize the complexity and variability in the organization of aromatic metabolism operons and the structure of regulatory networks that may differ even between closely related species. Generally, the upper parts of pathways, rare pathway variants, and degradative pathways of exotic and complex, in particular, xenobiotic compounds are often controlled by a single TF, while the regulation of more common and/or central parts of the aromatic metabolism may vary widely and often involves several TFs with shared and/or dual, or cascade regulation. The most frequent and at the same time variable connections exist between AphS, AphR, AphT, and BenR. We have identified a novel LysR-family TF that regulates the metabolism of catechol (or some catechol derivative) and either substitutes CatR(M)/BenM, or shares functions with it. We have also predicted several new members of aromatic metabolism regulons, in particular, some COGs regulated by several different TFs.
Collapse
Affiliation(s)
- Inna A Suvorova
- Institute for Information Transmission Problems RAS (The Kharkevich Institute), Moscow, Russia
| | - Mikhail S Gelfand
- Institute for Information Transmission Problems RAS (The Kharkevich Institute), Moscow, Russia.,Faculty of Computer Science, Higher School of Economics, Moscow, Russia.,Center of Life Sciences, Skolkovo Institute of Science and Technology, Moscow, Russia
| |
Collapse
|
33
|
Shelyakin PV, Bochkareva OO, Karan AA, Gelfand MS. Micro-evolution of three Streptococcus species: selection, antigenic variation, and horizontal gene inflow. BMC Evol Biol 2019; 19:83. [PMID: 30917781 PMCID: PMC6437910 DOI: 10.1186/s12862-019-1403-6] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2017] [Accepted: 02/25/2019] [Indexed: 02/07/2023] Open
Abstract
Background The genus Streptococcus comprises pathogens that strongly influence the health of humans and animals. Genome sequencing of multiple Streptococcus strains demonstrated high variability in gene content and order even in closely related strains of the same species and created a newly emerged object for genomic analysis, the pan-genome. Here we analysed the genome evolution of 25 strains of Streptococcus suis, 50 strains of Streptococcus pyogenes and 28 strains of Streptococcus pneumoniae. Results Fractions of the pan-genome, unique, periphery, and universal genes differ in size, functional composition, the level of nucleotide substitutions, and predisposition to horizontal gene transfer and genomic rearrangements. The density of substitutions in intergenic regions appears to be correlated with selection acting on adjacent genes, implying that more conserved genes tend to have more conserved regulatory regions. The total pan-genome of the genus is open, but only due to strain-specific genes, whereas other pan-genome fractions reach saturation. We have identified the set of genes with phylogenies inconsistent with species and non-conserved location in the chromosome; these genes are rare in at least one species and have likely experienced recent horizontal transfer between species. The strain-specific fraction is enriched with mobile elements and hypothetical proteins, but also contains a number of candidate virulence-related genes, so it may have a strong impact on adaptability and pathogenicity. Mapping the rearrangements to the phylogenetic tree revealed large parallel inversions in all species. A parallel inversion of length 15 kB with breakpoints formed by genes encoding surface antigen proteins PhtD and PhtB in S. pneumoniae leads to replacement of gene fragments that likely indicates the action of an antigen variation mechanism. Conclusions Members of genus Streptococcus have a highly dynamic, open pan-genome, that potentially confers them with the ability to adapt to changing environmental conditions, i.e. antibiotic resistance or transmission between different hosts. Hence, integrated analysis of all aspects of genome evolution is important for the identification of potential pathogens and design of drugs and vaccines. Electronic supplementary material The online version of this article (10.1186/s12862-019-1403-6) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Pavel V Shelyakin
- Vavilov Institute of General Genetics Russian Academy of Sciences, Gubkina str. 3, Moscow, 119991, Russia. .,Kharkevich Institute for Information Transmission Problems, 19, Bolshoy Karetny per., Moscow, 127051, Russia. .,Center of Life Sciences, Skolkovo Institute of Science and Technology, Moscow, Russia.
| | - Olga O Bochkareva
- Kharkevich Institute for Information Transmission Problems, 19, Bolshoy Karetny per., Moscow, 127051, Russia.,Center of Life Sciences, Skolkovo Institute of Science and Technology, Moscow, Russia
| | - Anna A Karan
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow, Russia
| | - Mikhail S Gelfand
- Kharkevich Institute for Information Transmission Problems, 19, Bolshoy Karetny per., Moscow, 127051, Russia.,Center of Life Sciences, Skolkovo Institute of Science and Technology, Moscow, Russia.,Faculty of Computer Science, Higher School of Economics, Moscow, Russia
| |
Collapse
|
34
|
Bochkareva OO, Moroz EV, Davydov II, Gelfand MS. Genome rearrangements and selection in multi-chromosome bacteria Burkholderia spp. BMC Genomics 2018; 19:965. [PMID: 30587126 PMCID: PMC6307245 DOI: 10.1186/s12864-018-5245-1] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2018] [Accepted: 11/14/2018] [Indexed: 11/30/2022] Open
Abstract
BACKGROUND The genus Burkholderia consists of species that occupy remarkably diverse ecological niches. Its best known members are important pathogens, B. mallei and B. pseudomallei, which cause glanders and melioidosis, respectively. Burkholderia genomes are unusual due to their multichromosomal organization, generally comprised of 2-3 chromosomes. RESULTS We performed integrated genomic analysis of 127 Burkholderia strains. The pan-genome is open with the saturation to be reached between 86,000 and 88,000 genes. The reconstructed rearrangements indicate a strong avoidance of intra-replichore inversions that is likely caused by selection against the transfer of large groups of genes between the leading and the lagging strands. Translocated genes also tend to retain their position in the leading or the lagging strand, and this selection is stronger for large syntenies. Integrated reconstruction of chromosome rearrangements in the context of strains phylogeny reveals parallel rearrangements that may indicate inversion-based phase variation and integration of new genomic islands. In particular, we detected parallel inversions in the second chromosomes of B. pseudomallei with breakpoints formed by genes encoding membrane components of multidrug resistance complex, that may be linked to a phase variation mechanism. Two genomic islands, spreading horizontally between chromosomes, were detected in the B. cepacia group. CONCLUSIONS This study demonstrates the power of integrated analysis of pan-genomes, chromosome rearrangements, and selection regimes. Non-random inversion patterns indicate selective pressure, inversions are particularly frequent in a recent pathogen B. mallei, and, together with periods of positive selection at other branches, may indicate adaptation to new niches. One such adaptation could be a possible phase variation mechanism in B. pseudomallei.
Collapse
Affiliation(s)
- Olga O. Bochkareva
- Kharkevich Institute for Information Transmission Problems, Moscow, Russia
- Center of Life Sciences Skolkovo Institute of Science and Technology, Moscow, Russia
| | - Elena V. Moroz
- Kharkevich Institute for Information Transmission Problems, Moscow, Russia
| | - Iakov I. Davydov
- Department of Ecology and Evolution & Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Mikhail S. Gelfand
- Kharkevich Institute for Information Transmission Problems, Moscow, Russia
- Center of Life Sciences Skolkovo Institute of Science and Technology, Moscow, Russia
- Faculty of Computer Science, Higher School of Economics, Moscow, Russia
| |
Collapse
|
35
|
Shelyakin PV, Garushyants SK, Nikitin MA, Mudrova SV, Berumen M, Speksnijder AGCL, Hoeksema BW, Fontaneto D, Gelfand MS, Ivanenko VN. Microbiomes of gall-inducing copepod crustaceans from the corals Stylophora pistillata (Scleractinia) and Gorgonia ventalina (Alcyonacea). Sci Rep 2018; 8:11563. [PMID: 30069039 PMCID: PMC6070567 DOI: 10.1038/s41598-018-29953-y] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2018] [Accepted: 07/18/2018] [Indexed: 12/31/2022] Open
Abstract
Corals harbor complex and diverse microbial communities that strongly impact host fitness and resistance to diseases, but these microbes themselves can be influenced by stresses, like those caused by the presence of macroscopic symbionts. In addition to directly influencing the host, symbionts may transmit pathogenic microbial communities. We analyzed two coral gall-forming copepod systems by using 16S rRNA gene metagenomic sequencing: (1) the sea fan Gorgonia ventalina with copepods of the genus Sphaerippe from the Caribbean and (2) the scleractinian coral Stylophora pistillata with copepods of the genus Spaniomolgus from the Saudi Arabian part of the Red Sea. We show that bacterial communities in these two systems were substantially different with Actinobacteria, Alphaproteobacteria, and Betaproteobacteria more prevalent in samples from Gorgonia ventalina, and Gammaproteobacteria in Stylophora pistillata. In Stylophora pistillata, normal coral microbiomes were enriched with the common coral symbiont Endozoicomonas and some unclassified bacteria, while copepod and gall-tissue microbiomes were highly enriched with the family ME2 (Oceanospirillales) or Rhodobacteraceae. In Gorgonia ventalina, no bacterial group had significantly different prevalence in the normal coral tissues, copepods, and injured tissues. The total microbiome composition of polyps injured by copepods was different. Contrary to our expectations, the microbial community composition of the injured gall tissues was not directly affected by the microbiome of the gall-forming symbiont copepods.
Collapse
Affiliation(s)
- Pavel V Shelyakin
- Kharkevich Institute for Information Transmission Problems RAS, B. Karetny per. 19, Moscow, 127051, Russia.,Vavilov Institute of General Genetics RAS, Gubkina str. 3, Moscow, 119333, Russia
| | - Sofya K Garushyants
- Kharkevich Institute for Information Transmission Problems RAS, B. Karetny per. 19, Moscow, 127051, Russia.,Center for Data-Intensive Biomedicine and Biotechnology, Skolkovo Institute of Science and Technology, Nobel str. 1, Moscow, 121205, Russia
| | - Mikhail A Nikitin
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow, 119992, Russia
| | - Sofya V Mudrova
- Red Sea Research Center, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955, Saudi Arabia
| | - Michael Berumen
- Red Sea Research Center, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955, Saudi Arabia
| | | | - Bert W Hoeksema
- Naturalis Biodiversity Center, Leiden, 2332 AA, The Netherlands
| | - Diego Fontaneto
- National Research Council, Institute of Ecosystem Study, Verbania, 28922, Italy
| | - Mikhail S Gelfand
- Kharkevich Institute for Information Transmission Problems RAS, B. Karetny per. 19, Moscow, 127051, Russia.,Center for Data-Intensive Biomedicine and Biotechnology, Skolkovo Institute of Science and Technology, Nobel str. 1, Moscow, 121205, Russia.,Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow, 119992, Russia.,Faculty of Computer Science, Higher School of Economics, Kochnovsky pr. 3, Moscow, 125319, Russia
| | - Viatcheslav N Ivanenko
- Naturalis Biodiversity Center, Leiden, 2332 AA, The Netherlands. .,Department of Invertebrate Zoology, Biological Faculty, Lomonosov Moscow State University, Moscow, 119992, Russia.
| |
Collapse
|
36
|
Abstract
Sequencing of complete nuclear genomes of Neanderthal and Denisovan stimulated studies about their relationship with modern humans demonstrating, in particular, that DNA alleles from both Neanderthal and Denisovan genomes are present in genomes of modern humans. The Papuan genome is a unique object because it contains both Neanderthal and Denisovan alleles. Here, we have shown that the Papuan genomes contain different gene functional groups inherited from each of the ancient people. The Papuan genomes demonstrate a relative prevalence of Neanderthal alleles in genes responsible for the regulation of transcription and neurogenesis. The enrichment of specific functional groups with Denisovan alleles is less pronounced; these groups are responsible for bone and tissue remodeling. This analysis shows that introgression of alleles from Neanderthals and Denisovans to Papuans occurred independently and retention of these alleles may carry specific adaptive advantages.
Collapse
Affiliation(s)
- Evgeny E. Akkuratov
- St. Petersburg State University, Institute of Translational Biomedicine, St. Petersburg, Russia
| | - Mikhail S. Gelfand
- Center for Data-Intensive Biomedicine and Biotechnology, Skolkovo Institute for Science and Technology, Moscow, Russia
- Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, Russia
- Faculty of Computer Science, National Research University – Higher School of Economics, Moscow, Russia
- Department of Bioengineering and Bioinformatics, M.V. Lomonosov Moscow State University, Moscow, Russia
| | - Ekaterina E. Khrameeva
- Center for Data-Intensive Biomedicine and Biotechnology, Skolkovo Institute for Science and Technology, Moscow, Russia
- Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, Russia
| |
Collapse
|
37
|
Garushyants SK, Beliavskaia AY, Malko DB, Logacheva MD, Rautian MS, Gelfand MS. Comparative Genomic Analysis of Holospora spp., Intranuclear Symbionts of Paramecia. Front Microbiol 2018; 9:738. [PMID: 29713316 PMCID: PMC5911502 DOI: 10.3389/fmicb.2018.00738] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2017] [Accepted: 03/29/2018] [Indexed: 11/16/2022] Open
Abstract
While most endosymbiotic bacteria are transmitted only vertically, Holospora spp., an alphaproteobacterium from the Rickettsiales order, can desert its host and invade a new one. All bacteria from the genus Holospora are intranuclear symbionts of ciliates Paramecium spp. with strict species and nuclear specificity. Comparative metabolic reconstruction based on the newly sequenced genome of Holospora curviuscula, a macronuclear symbiont of Paramecium bursaria, and known genomes of other Holospora species shows that even though all Holospora spp. can persist outside the host, they cannot synthesize most of the essential small molecules, such as amino acids, and lack some central energy metabolic pathways, including glycolysis and the citric acid cycle. As the main energy source, Holospora spp. likely rely on nucleotides pirated from the host. Holospora-specific genes absent from other Rickettsiales are possibly involved in the lifestyle switch from the infectious to the reproductive form and in cell invasion.
Collapse
Affiliation(s)
- Sofya K Garushyants
- Skolkovo Institute of Science and Technology, Moscow, Russia.,Kharkevitch Institute for Information Transmission Problems, Moscow, Russia
| | - Alexandra Y Beliavskaia
- Institute of Integrative Biology, University of Liverpool, Liverpool, United Kingdom.,Department of Invertebrate Zoology, Faculty of Biology, Saint Petersburg State University, Saint Petersburg, Russia
| | | | - Maria D Logacheva
- Skolkovo Institute of Science and Technology, Moscow, Russia.,Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow, Russia
| | - Maria S Rautian
- Department of Invertebrate Zoology, Faculty of Biology, Saint Petersburg State University, Saint Petersburg, Russia
| | - Mikhail S Gelfand
- Skolkovo Institute of Science and Technology, Moscow, Russia.,Kharkevitch Institute for Information Transmission Problems, Moscow, Russia.,National Research Center for Hematology, Moscow, Russia.,National Research University Higher School of Economics, Moscow, Russia
| |
Collapse
|
38
|
Mazin PV, Jiang X, Fu N, Han D, Guo M, Gelfand MS, Khaitovich P. Conservation, evolution, and regulation of splicing during prefrontal cortex development in humans, chimpanzees, and macaques. RNA 2018; 24:585-596. [PMID: 29363555 PMCID: PMC5855957 DOI: 10.1261/rna.064931.117] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/19/2017] [Accepted: 01/10/2018] [Indexed: 05/03/2023]
Abstract
Changes in splicing are known to affect the function and regulation of genes. We analyzed splicing events that take place during the postnatal development of the prefrontal cortex in humans, chimpanzees, and rhesus macaques based on data obtained from 168 individuals. Our study revealed that among the 38,822 quantified alternative exons, 15% are differentially spliced among species, and more than 6% splice differently at different ages. Mutations in splicing acceptor and/or donor sites might explain more than 14% of all splicing differences among species and up to 64% of high-amplitude differences. A reconstructed trans-regulatory network containing 21 RNA-binding proteins explains a further 4% of splicing variations within species. While most age-dependent splicing patterns are conserved among the three species, developmental changes in intron retention are substantially more pronounced in humans.
Collapse
Affiliation(s)
- Pavel V Mazin
- Center for Data-Intensive Biomedicine and Biotechnology, Skolkovo Institute of Science and Technology, Moscow 143028, Russia
- Institute for Information Transmission Problems (Kharkevich Institute), Russian Academy of Sciences, Moscow 127051, Russia
- Faculty of Computer Science, Higher School of Economics, Moscow 125319, Russia
| | - Xi Jiang
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai 200031, China
| | - Ning Fu
- Center for Data-Intensive Biomedicine and Biotechnology, Skolkovo Institute of Science and Technology, Moscow 143028, Russia
| | - Dingding Han
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai 200031, China
| | - Meng Guo
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai 200031, China
- University of Chinese Academy of Sciences, Beijing 100049, China
- School of Life Science and Technology, ShanghaiTech University, Shanghai 201210, China
| | - Mikhail S Gelfand
- Center for Data-Intensive Biomedicine and Biotechnology, Skolkovo Institute of Science and Technology, Moscow 143028, Russia
- Institute for Information Transmission Problems (Kharkevich Institute), Russian Academy of Sciences, Moscow 127051, Russia
- Faculty of Computer Science, Higher School of Economics, Moscow 125319, Russia
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow 119992, Russia
| | - Philipp Khaitovich
- Center for Data-Intensive Biomedicine and Biotechnology, Skolkovo Institute of Science and Technology, Moscow 143028, Russia
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai 200031, China
- School of Life Science and Technology, ShanghaiTech University, Shanghai 201210, China
- Max Planck Institute for Evolutionary Anthropology, Leipzig 04103, Germany
| |
Collapse
|
39
|
Bochkareva OO, Dranenko NO, Ocheredko ES, Kanevsky GM, Lozinsky YN, Khalaycheva VA, Artamonova II, Gelfand MS. Genome rearrangements and phylogeny reconstruction in Yersinia pestis. PeerJ 2018; 6:e4545. [PMID: 29607260 PMCID: PMC5877447 DOI: 10.7717/peerj.4545] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2017] [Accepted: 03/07/2018] [Indexed: 12/20/2022] Open
Abstract
Genome rearrangements have played an important role in the evolution of Yersinia pestis from its progenitor Yersinia pseudotuberculosis. Traditional phylogenetic trees for Y. pestis based on sequence comparison have short internal branches and low bootstrap supports as only a small number of nucleotide substitutions have occurred. On the other hand, even a small number of genome rearrangements may resolve topological ambiguities in a phylogenetic tree. We reconstructed phylogenetic trees based on genome rearrangements using several popular approaches such as Maximum likelihood for Gene Order and the Bayesian model of genome rearrangements by inversions. We also reconciled phylogenetic trees for each of the three CRISPR loci to obtain an integrated scenario of the CRISPR cassette evolution. Analysis of contradictions between the obtained evolutionary trees yielded numerous parallel inversions and gain/loss events. Our data indicate that an integrated analysis of sequence-based and inversion-based trees enhances the resolution of phylogenetic reconstruction. In contrast, reconstructions of strain relationships based on solely CRISPR loci may not be reliable, as the history is obscured by large deletions, obliterating the order of spacer gains. Similarly, numerous parallel gene losses preclude reconstruction of phylogeny based on gene content.
Collapse
Affiliation(s)
- Olga O Bochkareva
- Kharkevich Institute for Information Transmission Problems, Moscow, Russia.,Center for Data-Intensive Biomedicine and Biotechnology, Skolkovo Institute of Science and Technology, Moscow, Russia
| | - Natalia O Dranenko
- Department of Molecular and Chemical Physics, Moscow Institute of Physics and Technology, Moscow, Russia
| | - Elena S Ocheredko
- Department of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow, Russia
| | - German M Kanevsky
- Higher Chemical College of the Russian Academy of Sciences, D. Mendeleev University of Chemical Technology of Russia, Moscow, Russia
| | - Yaroslav N Lozinsky
- Department of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow, Russia
| | | | - Irena I Artamonova
- Kharkevich Institute for Information Transmission Problems, Moscow, Russia.,Vavilov Institute of General Genetics Russian Academy of Sciences, Moscow, Russia
| | - Mikhail S Gelfand
- Kharkevich Institute for Information Transmission Problems, Moscow, Russia.,Center for Data-Intensive Biomedicine and Biotechnology, Skolkovo Institute of Science and Technology, Moscow, Russia.,Department of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow, Russia.,Faculty of Computer Science, Higher School of Economics, Moscow, Russia
| |
Collapse
|
40
|
Moldovan MA, Petrova SA, Gelfand MS. Comparative genomic analysis of fungal TPP-riboswitches. Fungal Genet Biol 2018; 114:34-41. [PMID: 29548845 DOI: 10.1016/j.fgb.2018.03.004] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2017] [Revised: 02/17/2018] [Accepted: 03/08/2018] [Indexed: 12/11/2022]
Abstract
Riboswitches are conserved RNA structures located in non-coding regions of mRNA and able to bind small molecules (e.g. metabolites) changing conformation upon binding. This feature enables them to function as regulators of gene expression. The thiamin pyrophosphate (TPP) riboswitch is the only type of riboswitches found not only in bacteria, but also in eukaryotes - in plants, green algae, protists, and fungi. Two main mechanisms of fungal TPP riboswitch action, involving alternative splicing, have been established so far. Here, we report a large-scale bioinformatic study of riboswitch structural features, action mechanisms, and distribution along the fungal taxonomy groups. For each putatively regulated gene, we reconstruct the riboswitch structure, identify other components of the regulation machinery, and establish mechanisms of riboswitch-mediated regulation. In addition to three genes known to be regulated by TPP riboswitches, thiazole synthase THI4, hydroxymethilpyrimidine-syntase NMT1, and putative transporter NCU01977, we identify two new genes, a putative thiamin transporter THI9 and a transporter of unknown specificity. While the riboswitch sequence and structure remain highly conserved in all species and genes, the mode of riboswitch-mediated regulation varies between regulated genes. The riboswitch usage varies strongly between fungal taxa, with the largest number of riboswitch-regulated genes found in Pezizomycotina and no riboswitch-mediated regulation established in Saccaromycotina.
Collapse
Affiliation(s)
- Mikhail A Moldovan
- A.A. Kharkevich Institute for Information Transmission Problems, RAS, Bolshoy Karetny per. 19, Moscow 127051, Russia; Faculty of Bioengineering and Bioinformatics, M.V. Lomonosov Moscow State University, Vorobievy Gory 1-73, Moscow 119991, Russia,; Center for Data-Intensive Biomedicine and Biotechnology, Skolkovo Institute of Science and Technology, Moscow 143028, Russia.
| | - Svetlana A Petrova
- A.A. Kharkevich Institute for Information Transmission Problems, RAS, Bolshoy Karetny per. 19, Moscow 127051, Russia
| | - Mikhail S Gelfand
- A.A. Kharkevich Institute for Information Transmission Problems, RAS, Bolshoy Karetny per. 19, Moscow 127051, Russia; Faculty of Bioengineering and Bioinformatics, M.V. Lomonosov Moscow State University, Vorobievy Gory 1-73, Moscow 119991, Russia,; Center for Data-Intensive Biomedicine and Biotechnology, Skolkovo Institute of Science and Technology, Moscow 143028, Russia; Faculty of Computer Science, Higher School of Economics, Kochnovsky pr. 3, Moscow 125319, Russia
| |
Collapse
|
41
|
Moldovan MA, Gelfand MS. Pangenomic Definition of Prokaryotic Species and the Phylogenetic Structure of Prochlorococcus spp. Front Microbiol 2018; 9:428. [PMID: 29593678 PMCID: PMC5857598 DOI: 10.3389/fmicb.2018.00428] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2017] [Accepted: 02/23/2018] [Indexed: 11/13/2022] Open
Abstract
The pangenome is the collection of all groups of orthologous genes (OGGs) from a set of genomes. We apply the pangenome analysis to propose a definition of prokaryotic species based on identification of lineage-specific gene sets. While being similar to the classical biological definition based on allele flow, it does not rely on DNA similarity levels and does not require analysis of homologous recombination. Hence this definition is relatively objective and independent of arbitrary thresholds. A systematic analysis of 110 accepted species with the largest numbers of sequenced strains yields results largely consistent with the existing nomenclature. However, it has revealed that abundant marine cyanobacteria Prochlorococcus marinus should be divided into two species. As a control we have confirmed the paraphyletic origin of Yersinia pseudotuberculosis (with embedded, monophyletic Y. pestis) and Burkholderia pseudomallei (with B. mallei). We also demonstrate that by our definition and in accordance with recent studies Escherichia coli and Shigella spp. are one species.
Collapse
Affiliation(s)
- Mikhail A. Moldovan
- A.A.Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences (RAS), Moscow, Russia
- Faculty of Bioengineering and Bioinformatics, M.V. Lomonosov Moscow State University, Moscow, Russia
- Center for Data-Intensive Biomedicine and Biotechnology, Skolkovo Institute of Science and Technology, Moscow, Russia
| | - Mikhail S. Gelfand
- A.A.Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences (RAS), Moscow, Russia
- Faculty of Bioengineering and Bioinformatics, M.V. Lomonosov Moscow State University, Moscow, Russia
- Center for Data-Intensive Biomedicine and Biotechnology, Skolkovo Institute of Science and Technology, Moscow, Russia
- Faculty of Computer Science, Higher School of Economics, Moscow, Russia
| |
Collapse
|
42
|
Kaznadzey A, Shelyakin P, Belousova E, Eremina A, Shvyreva U, Bykova D, Emelianenko V, Korosteleva A, Tutukina M, Gelfand MS. The genes of the sulphoquinovose catabolism in Escherichia coli are also associated with a previously unknown pathway of lactose degradation. Sci Rep 2018; 8:3177. [PMID: 29453395 PMCID: PMC5816610 DOI: 10.1038/s41598-018-21534-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2017] [Accepted: 02/06/2018] [Indexed: 12/29/2022] Open
Abstract
Comparative genomics analysis of conserved gene cassettes demonstrated resemblance between a recently described cassette of genes involved in sulphoquinovose degradation in Escherichia coli K-12 MG1655 and a Bacilli cassette linked with lactose degradation. Six genes from both cassettes had similar functions related to carbohydrate metabolism, namely, hydrolase, aldolase, kinase, isomerase, transporter, and transcription factor. The Escherichia coli sulphoglycolysis cassette was thus predicted to be associated with lactose degradation. This prediction was confirmed experimentally: expression of genes coding for aldolase (yihT), isomerase (yihS), and kinase (yihV) was dramatically increased during growth on lactose. These genes were previously shown to be activated during growth on sulphoquinovose, so our observation may indicate multi-functional capabilities of the respective proteins. Transcription starts for yihT, yihV and yihW were mapped in silico, in vitro and in vivo. Out of three promoters for yihT, one was active only during growth on lactose. We further showed that switches in yihT transcription are controlled by YihW, a DeoR-family transcription factor in the Escherichia coli cassette. YihW acted as a carbon source-dependent dual regulator involved in sustaining the baseline growth in the absence of lac-operon, with function either complementary, or opposite to a global regulator of carbohydrate metabolism, cAMP-CRP.
Collapse
Affiliation(s)
- Anna Kaznadzey
- A. A. Kharkevich Institute for Information Transmission Problems, RAS, Bolshoy Karetny per. 19, Moscow, 127051, Russia
| | - Pavel Shelyakin
- A. A. Kharkevich Institute for Information Transmission Problems, RAS, Bolshoy Karetny per. 19, Moscow, 127051, Russia
- N. I. Vavilov Institute of General Genetics, RAS, ul. Gubkina 3, Moscow, 119991, Russia
| | - Evgeniya Belousova
- M. V. Lomonosov Moscow State University, Vorobievy Gory 1-73, Moscow, 119991, Russia
| | - Aleksandra Eremina
- The University of Edinburgh, Alexander Crum Brown Rd, Edinburgh, Scotland, EH9 3FF, UK
| | - Uliana Shvyreva
- Institute of Cell Biophysics, RAS, Institutskaya 3, Pushchino, 142290, Russia
| | - Darya Bykova
- M. V. Lomonosov Moscow State University, Vorobievy Gory 1-73, Moscow, 119991, Russia
| | - Vera Emelianenko
- M. V. Lomonosov Moscow State University, Vorobievy Gory 1-73, Moscow, 119991, Russia
| | | | - Maria Tutukina
- Institute of Cell Biophysics, RAS, Institutskaya 3, Pushchino, 142290, Russia.
- Center for Data-Intensive Biomedicine and Biotechnology, Skolkovo Institute of Science and Technology, Moscow, 143028, Russia.
| | - Mikhail S Gelfand
- A. A. Kharkevich Institute for Information Transmission Problems, RAS, Bolshoy Karetny per. 19, Moscow, 127051, Russia
- M. V. Lomonosov Moscow State University, Vorobievy Gory 1-73, Moscow, 119991, Russia
- Center for Data-Intensive Biomedicine and Biotechnology, Skolkovo Institute of Science and Technology, Moscow, 143028, Russia
- Faculty of Computer Science, Higher School of Economics, Kochnovsky pr. 3, Moscow, 125319, Russia
| |
Collapse
|
43
|
Kaznadzey A, Shelyakin P, Gelfand MS. Sugar Lego: gene composition of bacterial carbohydrate metabolism genomic loci. Biol Direct 2017; 12:28. [PMID: 29178959 PMCID: PMC5702140 DOI: 10.1186/s13062-017-0200-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2017] [Accepted: 11/20/2017] [Indexed: 11/25/2022] Open
Abstract
Background Bacterial carbohydrate metabolism is extremely diverse, since carbohydrates serve as a major energy source and are involved in a variety of cellular processes. Bacterial genes belonging to same metabolic pathway are often co-localized in the chromosome, but it is not a strict rule. Gene co-localization in linked to co-evolution and co-regulation. This study focuses on a large-scale analysis of bacterial genomic loci related to the carbohydrate metabolism. Results We demonstrate that only 53% of 148,000 studied genes from over six hundred bacterial genomes are co-localized in bacterial genomes with other carbohydrate metabolism genes, which points to a significant role of singleton genes. Co-localized genes form cassettes, ranging in size from two to fifteen genes. Two major factors influencing the cassette-forming tendency are gene function and bacterial phylogeny. We have obtained a comprehensive picture of co-localization preferences of genes for nineteen major carbohydrate metabolism functional classes, over two hundred gene orthologous clusters, and thirty bacterial classes, and characterized the cassette variety in size and content among different species, highlighting a significant role of short cassettes. The preference towards co-localization of carbohydrate metabolism genes varies between 40 and 76% for bacterial taxa. Analysis of frequently co-localized genes yielded forty-five significant pairwise links between genes belonging to different functional classes. The number of such links per class range from zero to eight, demonstrating varying preferences of respective genes towards a specific chromosomal neighborhood. Genes from eleven functional classes tend to co-localize with genes from the same class, indicating an important role of clustering of genes with similar functions. At that, in most cases such co-localization does not originate from local duplication events. Conclusions Overall, we describe a complex web formed by evolutionary relationships of bacterial carbohydrate metabolism genes, manifested as co-localization patterns. Reviewers This article was reviewed by Daria V. Dibrova (A.N. Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, Russia), nominated by Armen Mulkidjanian (University of Osnabrück, Germany), Igor Rogozin (NCBI, NLM, NIH, USA) and Yuri Wolf (NCBI, NLM, NIH, USA). Electronic supplementary material The online version of this article (10.1186/s13062-017-0200-7) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Anna Kaznadzey
- A.A.Kharkevich Institute for Information Transmission Problems, RAS, Bolshoy Karetny per. 19, Moscow, 127051, Russia.
| | - Pavel Shelyakin
- A.A.Kharkevich Institute for Information Transmission Problems, RAS, Bolshoy Karetny per. 19, Moscow, 127051, Russia.,Vavilov Institute of General Genetics, Gubkin 3, Moscow, 119991, Russia
| | - Mikhail S Gelfand
- A.A.Kharkevich Institute for Information Transmission Problems, RAS, Bolshoy Karetny per. 19, Moscow, 127051, Russia.,Center for Data-Intensive Biomedicine and Biotechnology, Skolkovo Institute of Science and Technology, Moscow, 143028, Russia.,Faculty of Computer Science, Higher School of Economics, Kochnovsky pr. 3, Moscow, 125319, Russia.,Faculty of Bioengineering and Bioinformatics, M.V.Lomonosov Moscow State University, Vorobievy Gory 1-73, Moscow, 119991, Russia
| |
Collapse
|
44
|
Evfratov SA, Osterman IA, Komarova ES, Pogorelskaya AM, Rubtsova MP, Zatsepin TS, Semashko TA, Kostryukova ES, Mironov AA, Burnaev E, Krymova E, Gelfand MS, Govorun VM, Bogdanov AA, Sergiev PV, Dontsova OA. Application of sorting and next generation sequencing to study 5΄-UTR influence on translation efficiency in Escherichia coli. Nucleic Acids Res 2017; 45:3487-3502. [PMID: 27899632 PMCID: PMC5389652 DOI: 10.1093/nar/gkw1141] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2016] [Accepted: 10/31/2016] [Indexed: 12/24/2022] Open
Abstract
Yield of protein per translated mRNA may vary by four orders of magnitude. Many studies analyzed the influence of mRNA features on the translation yield. However, a detailed understanding of how mRNA sequence determines its propensity to be translated is still missing. Here, we constructed a set of reporter plasmid libraries encoding CER fluorescent protein preceded by randomized 5΄ untranslated regions (5΄-UTR) and Red fluorescent protein (RFP) used as an internal control. Each library was transformed into Escherchia coli cells, separated by efficiency of CER mRNA translation by a cell sorter and subjected to next generation sequencing. We tested efficiency of translation of the CER gene preceded by each of 48 natural 5΄-UTR sequences and introduced random and designed mutations into natural and artificially selected 5΄-UTRs. Several distinct properties could be ascribed to a group of 5΄-UTRs most efficient in translation. In addition to known ones, several previously unrecognized features that contribute to the translation enhancement were found, such as low proportion of cytidine residues, multiple SD sequences and AG repeats. The latter could be identified as translation enhancer, albeit less efficient than SD sequence in several natural 5΄-UTRs.
Collapse
Affiliation(s)
- Sergey A Evfratov
- Department of Chemistry, Faculty of Bioinformatics and Bioengeneering, Lomonosov Moscow State University, Moscow, 119992, Russia
| | - Ilya A Osterman
- Department of Chemistry, Faculty of Bioinformatics and Bioengeneering, Lomonosov Moscow State University, Moscow, 119992, Russia.,Skolkovo Institute of Science and Technology, Skolkovo, Moscow, 143025, Russia.,A.N. Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, 119992, Russia
| | - Ekaterina S Komarova
- Department of Chemistry, Faculty of Bioinformatics and Bioengeneering, Lomonosov Moscow State University, Moscow, 119992, Russia
| | - Alexandra M Pogorelskaya
- Department of Chemistry, Faculty of Bioinformatics and Bioengeneering, Lomonosov Moscow State University, Moscow, 119992, Russia
| | - Maria P Rubtsova
- Department of Chemistry, Faculty of Bioinformatics and Bioengeneering, Lomonosov Moscow State University, Moscow, 119992, Russia.,Skolkovo Institute of Science and Technology, Skolkovo, Moscow, 143025, Russia.,A.N. Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, 119992, Russia
| | - Timofei S Zatsepin
- Department of Chemistry, Faculty of Bioinformatics and Bioengeneering, Lomonosov Moscow State University, Moscow, 119992, Russia.,Skolkovo Institute of Science and Technology, Skolkovo, Moscow, 143025, Russia.,A.N. Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, 119992, Russia
| | - Tatiana A Semashko
- Research Institute for Physical-Chemical Medicine, FMBA, Moscow, 119435, Russia
| | - Elena S Kostryukova
- Research Institute for Physical-Chemical Medicine, FMBA, Moscow, 119435, Russia.,Moscow Institute of Physics and Technology, Dolgoprpudny, Moscow, 141700, Russia
| | - Andrey A Mironov
- Department of Chemistry, Faculty of Bioinformatics and Bioengeneering, Lomonosov Moscow State University, Moscow, 119992, Russia.,A.N. Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, 119992, Russia
| | - Evgeny Burnaev
- Skolkovo Institute of Science and Technology, Skolkovo, Moscow, 143025, Russia.,A.A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, 127051, Russia
| | - Ekaterina Krymova
- A.A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, 127051, Russia
| | - Mikhail S Gelfand
- Department of Chemistry, Faculty of Bioinformatics and Bioengeneering, Lomonosov Moscow State University, Moscow, 119992, Russia.,Skolkovo Institute of Science and Technology, Skolkovo, Moscow, 143025, Russia.,A.N. Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, 119992, Russia.,A.A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, 127051, Russia.,National Research University Higher School of Economics, Moscow, 123458, Russia
| | - Vadim M Govorun
- Research Institute for Physical-Chemical Medicine, FMBA, Moscow, 119435, Russia
| | - Alexey A Bogdanov
- Department of Chemistry, Faculty of Bioinformatics and Bioengeneering, Lomonosov Moscow State University, Moscow, 119992, Russia.,A.N. Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, 119992, Russia
| | - Petr V Sergiev
- Department of Chemistry, Faculty of Bioinformatics and Bioengeneering, Lomonosov Moscow State University, Moscow, 119992, Russia.,Skolkovo Institute of Science and Technology, Skolkovo, Moscow, 143025, Russia.,A.N. Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, 119992, Russia
| | - Olga A Dontsova
- Department of Chemistry, Faculty of Bioinformatics and Bioengeneering, Lomonosov Moscow State University, Moscow, 119992, Russia.,Skolkovo Institute of Science and Technology, Skolkovo, Moscow, 143025, Russia.,A.N. Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, 119992, Russia
| |
Collapse
|
45
|
Rakitina DV, Manolov AI, Kanygina AV, Garushyants SK, Baikova JP, Alexeev DG, Ladygina VG, Kostryukova ES, Larin AK, Semashko TA, Karpova IY, Babenko VV, Ismagilova RK, Malanin SY, Gelfand MS, Ilina EN, Gorodnichev RB, Lisitsyna ES, Aleshkin GI, Scherbakov PL, Khalif IL, Shapina MV, Maev IV, Andreev DN, Govorun VM. Genome analysis of E. coli isolated from Crohn's disease patients. BMC Genomics 2017; 18:544. [PMID: 28724357 PMCID: PMC5517970 DOI: 10.1186/s12864-017-3917-x] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2017] [Accepted: 07/02/2017] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND Escherichia coli (E. coli) has been increasingly implicated in the pathogenesis of Crohn's disease (CD). The phylogeny of E. coli isolated from Crohn's disease patients (CDEC) was controversial, and while genotyping results suggested heterogeneity, the sequenced strains of E. coli from CD patients were closely related. RESULTS We performed the shotgun genome sequencing of 28 E. coli isolates from ten CD patients and compared genomes from these isolates with already published genomes of CD strains and other pathogenic and non-pathogenic strains. CDEC was shown to belong to A, B1, B2 and D phylogenetic groups. The plasmid and several operons from the reference CD-associated E. coli strain LF82 were demonstrated to be more often present in CDEC genomes belonging to different phylogenetic groups than in genomes of commensal strains. The operons include carbon-source induced invasion GimA island, prophage I, iron uptake operons I and II, capsular assembly pathogenetic island IV and propanediol and galactitol utilization operons. CONCLUSIONS Our findings suggest that CDEC are phylogenetically diverse. However, some strains isolated from independent sources possess highly similar chromosome or plasmids. Though no CD-specific genes or functional domains were present in all CD-associated strains, some genes and operons are more often found in the genomes of CDEC than in commensal E. coli. They are principally linked to gut colonization and utilization of propanediol and other sugar alcohols.
Collapse
Affiliation(s)
- Daria V. Rakitina
- Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow, Russia
| | - Alexander I. Manolov
- Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow, Russia
| | | | - Sofya K. Garushyants
- Skolkovo Institute of Science and Technology, Moscow, Russia
- A.A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, Russia
| | - Julia P. Baikova
- Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow, Russia
| | - Dmitry G. Alexeev
- Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow, Russia
- Moscow Institute for Physics and Technology, Moscow, Russia
| | - Valentina G. Ladygina
- Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow, Russia
| | - Elena S. Kostryukova
- Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow, Russia
| | - Andrei K. Larin
- Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow, Russia
| | - Tatiana A. Semashko
- Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow, Russia
| | - Irina Y. Karpova
- Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow, Russia
| | - Vladislav V. Babenko
- Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow, Russia
| | - Ruzilya K. Ismagilova
- Skolkovo Institute of Science and Technology, Moscow, Russia
- Institute of Fundamental Medicine and Biology of Kazan Federal University, Kazan, Russia
| | - Sergei Y. Malanin
- Skolkovo Institute of Science and Technology, Moscow, Russia
- Institute of Fundamental Medicine and Biology of Kazan Federal University, Kazan, Russia
| | - Mikhail S. Gelfand
- Skolkovo Institute of Science and Technology, Moscow, Russia
- A.A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, Russia
- Faculty of Computer Science, National Research University Higher School of Economics, Moscow, Russia
- Faculty of Bioengineering and Bioinformatics, Moscow State University, Moscow, Russia
| | - Elena N. Ilina
- Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow, Russia
| | - Roman B. Gorodnichev
- Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow, Russia
| | - Eugenia S. Lisitsyna
- Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow, Russia
| | - Gennady I. Aleshkin
- The Gamaleya Research Institute for Epidemiology and Microbiology of the Russian Academy of Medical Science, Moscow, Russia
| | - Petr L. Scherbakov
- Central Scientific Institute of Gastroenterology, Moscow Clinical Research Centre, Moscow, Russia
| | - Igor L. Khalif
- State Scientific Center of Coloproctology, Ministry of Health of Russian Federation, Moscow, Russia
| | - Marina V. Shapina
- State Scientific Center of Coloproctology, Ministry of Health of Russian Federation, Moscow, Russia
| | - Igor V. Maev
- Moscow State University of Medicine and Dentistry named after A.I. Evdokimov, Ministry of Health of Russian Federation, Moscow, Russia
| | - Dmitry N. Andreev
- Moscow State University of Medicine and Dentistry named after A.I. Evdokimov, Ministry of Health of Russian Federation, Moscow, Russia
| | - Vadim M. Govorun
- Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow, Russia
- Moscow Institute for Physics and Technology, Moscow, Russia
| |
Collapse
|
46
|
Ulianov SV, Galitsyna AA, Flyamer IM, Golov AK, Khrameeva EE, Imakaev MV, Abdennur NA, Gelfand MS, Gavrilov AA, Razin SV. Activation of the alpha-globin gene expression correlates with dramatic upregulation of nearby non-globin genes and changes in local and large-scale chromatin spatial structure. Epigenetics Chromatin 2017; 10:35. [PMID: 28693562 PMCID: PMC5504709 DOI: 10.1186/s13072-017-0142-4] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2016] [Accepted: 07/03/2017] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND In homeotherms, the alpha-globin gene clusters are located within permanently open genome regions enriched in housekeeping genes. Terminal erythroid differentiation results in dramatic upregulation of alpha-globin genes making their expression comparable to the rRNA transcriptional output. Little is known about the influence of the erythroid-specific alpha-globin gene transcription outburst on adjacent, widely expressed genes and large-scale chromatin organization. Here, we have analyzed the total transcription output, the overall chromatin contact profile, and CTCF binding within the 2.7 Mb segment of chicken chromosome 14 harboring the alpha-globin gene cluster in cultured lymphoid cells and cultured erythroid cells before and after induction of terminal erythroid differentiation. RESULTS We found that, similarly to mammalian genome, the chicken genomes is organized in TADs and compartments. Full activation of the alpha-globin gene transcription in differentiated erythroid cells is correlated with upregulation of several adjacent housekeeping genes and the emergence of abundant intergenic transcription. An extended chromosome region encompassing the alpha-globin cluster becomes significantly decompacted in differentiated erythroid cells, and depleted in CTCF binding and CTCF-anchored chromatin loops, while the sub-TAD harboring alpha-globin gene cluster and the upstream major regulatory element (MRE) becomes highly enriched with chromatin interactions as compared to lymphoid and proliferating erythroid cells. The alpha-globin gene domain and the neighboring loci reside within the A-like chromatin compartment in both lymphoid and erythroid cells and become further segregated from the upstream gene desert upon terminal erythroid differentiation. CONCLUSIONS Our findings demonstrate that the effects of tissue-specific transcription activation are not restricted to the host genomic locus but affect the overall chromatin structure and transcriptional output of the encompassing topologically associating domain.
Collapse
Affiliation(s)
- Sergey V Ulianov
- Institute of Gene Biology of the Russian Academy of Sciences, Moscow, Russia 119334.,Faculty of Biology, M.V. Lomonosov Moscow State University, Moscow, Russia 119992
| | - Aleksandra A Galitsyna
- Institute of Gene Biology of the Russian Academy of Sciences, Moscow, Russia 119334.,Faculty of Bioengineering and Bioinformatics, M.V. Lomonosov Moscow State University, Moscow, Russia 119992.,Institute for Information Transmission Problems (the Kharkevich Institute) of the Russian Academy of Sciences, Moscow, Russia 127051
| | - Ilya M Flyamer
- Institute of Gene Biology of the Russian Academy of Sciences, Moscow, Russia 119334.,Faculty of Biology, M.V. Lomonosov Moscow State University, Moscow, Russia 119992.,MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, UK
| | - Arkadiy K Golov
- Institute of Gene Biology of the Russian Academy of Sciences, Moscow, Russia 119334
| | - Ekaterina E Khrameeva
- Skolkovo Institute of Science and Technology, Skolkovo, Russia 143026.,Institute for Information Transmission Problems (the Kharkevich Institute) of the Russian Academy of Sciences, Moscow, Russia 127051
| | - Maxim V Imakaev
- Department of Physics, Massachusetts Institute of Technology, Cambridge, MA 02139 USA
| | - Nezar A Abdennur
- Computational and Systems Biology Graduate Program, Massachusetts Institute of Technology, Cambridge, MA USA
| | - Mikhail S Gelfand
- Faculty of Bioengineering and Bioinformatics, M.V. Lomonosov Moscow State University, Moscow, Russia 119992.,Skolkovo Institute of Science and Technology, Skolkovo, Russia 143026.,Institute for Information Transmission Problems (the Kharkevich Institute) of the Russian Academy of Sciences, Moscow, Russia 127051.,Faculty of Computer Science, Higher School of Economics, Moscow, Russia 125319
| | - Alexey A Gavrilov
- Institute of Gene Biology of the Russian Academy of Sciences, Moscow, Russia 119334
| | - Sergey V Razin
- Institute of Gene Biology of the Russian Academy of Sciences, Moscow, Russia 119334.,Faculty of Biology, M.V. Lomonosov Moscow State University, Moscow, Russia 119992
| |
Collapse
|
47
|
Kublanov IV, Sigalova OM, Gavrilov SN, Lebedinsky AV, Rinke C, Kovaleva O, Chernyh NA, Ivanova N, Daum C, Reddy TBK, Klenk HP, Spring S, Göker M, Reva ON, Miroshnichenko ML, Kyrpides NC, Woyke T, Gelfand MS, Bonch-Osmolovskaya EA. Genomic Analysis of Caldithrix abyssi, the Thermophilic Anaerobic Bacterium of the Novel Bacterial Phylum Calditrichaeota. Front Microbiol 2017; 8:195. [PMID: 28265262 PMCID: PMC5317091 DOI: 10.3389/fmicb.2017.00195] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2016] [Accepted: 01/26/2017] [Indexed: 11/13/2022] Open
Abstract
The genome of Caldithrix abyssi, the first cultivated representative of a phylum-level bacterial lineage, was sequenced within the framework of Genomic Encyclopedia of Bacteria and Archaea (GEBA) project. The genomic analysis revealed mechanisms allowing this anaerobic bacterium to ferment peptides or to implement nitrate reduction with acetate or molecular hydrogen as electron donors. The genome encoded five different [NiFe]- and [FeFe]-hydrogenases, one of which, group 1 [NiFe]-hydrogenase, is presumably involved in lithoheterotrophic growth, three other produce H2 during fermentation, and one is apparently bidirectional. The ability to reduce nitrate is determined by a nitrate reductase of the Nap family, while nitrite reduction to ammonia is presumably catalyzed by an octaheme cytochrome c nitrite reductase εHao. The genome contained genes of respiratory polysulfide/thiosulfate reductase, however, elemental sulfur and thiosulfate were not used as the electron acceptors for anaerobic respiration with acetate or H2, probably due to the lack of the gene of the maturation protein. Nevertheless, elemental sulfur and thiosulfate stimulated growth on fermentable substrates (peptides), being reduced to sulfide, most probably through the action of the cytoplasmic sulfide dehydrogenase and/or NAD(P)-dependent [NiFe]-hydrogenase (sulfhydrogenase) encoded by the genome. Surprisingly, the genome of this anaerobic microorganism encoded all genes for cytochrome c oxidase, however, its maturation machinery seems to be non-operational due to genomic rearrangements of supplementary genes. Despite the fact that sugars were not among the substrates reported when C. abyssi was first described, our genomic analysis revealed multiple genes of glycoside hydrolases, and some of them were predicted to be secreted. This finding aided in bringing out four carbohydrates that supported the growth of C. abyssi: starch, cellobiose, glucomannan and xyloglucan. The genomic analysis demonstrated the ability of C. abyssi to synthesize nucleotides and most amino acids and vitamins. Finally, the genomic sequence allowed us to perform a phylogenomic analysis, based on 38 protein sequences, which confirmed the deep branching of this lineage and justified the proposal of a novel phylum Calditrichaeota.
Collapse
Affiliation(s)
- Ilya V Kublanov
- Winogradsky Institute of Microbiology, Research Center of Biotechnology, Russian Academy of Sciences Moscow, Russia
| | - Olga M Sigalova
- A.A.Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences Moscow, Russia
| | - Sergey N Gavrilov
- Winogradsky Institute of Microbiology, Research Center of Biotechnology, Russian Academy of Sciences Moscow, Russia
| | - Alexander V Lebedinsky
- Winogradsky Institute of Microbiology, Research Center of Biotechnology, Russian Academy of Sciences Moscow, Russia
| | - Christian Rinke
- Australian Centre for Ecogenomics, School of Chemistry and Molecular Biosciences, University of Queensland, St Lucia QLD, Australia
| | - Olga Kovaleva
- Winogradsky Institute of Microbiology, Research Center of Biotechnology, Russian Academy of Sciences Moscow, Russia
| | - Nikolai A Chernyh
- Winogradsky Institute of Microbiology, Research Center of Biotechnology, Russian Academy of Sciences Moscow, Russia
| | | | - Chris Daum
- DOE Joint Genome Institute, Walnut Creek CA, USA
| | - T B K Reddy
- DOE Joint Genome Institute, Walnut Creek CA, USA
| | | | - Stefan Spring
- Leibniz Institute DSMZ - German Collection of Microorganisms and Cell Cultures Braunschweig, Germany
| | - Markus Göker
- Leibniz Institute DSMZ - German Collection of Microorganisms and Cell Cultures Braunschweig, Germany
| | - Oleg N Reva
- Center for Bioinformatics and Computational Biology, Department of Biochemistry, University of Pretoria Pretoria, South Africa
| | - Margarita L Miroshnichenko
- Winogradsky Institute of Microbiology, Research Center of Biotechnology, Russian Academy of Sciences Moscow, Russia
| | | | - Tanja Woyke
- DOE Joint Genome Institute, Walnut CreekCA, USA; Biological Data Management and Technology Center, Lawrence Berkeley National Laboratory, BerkeleyCA, USA
| | - Mikhail S Gelfand
- A.A.Kharkevich Institute for Information Transmission Problems, Russian Academy of SciencesMoscow, Russia; Department of Bioengineering and Bioinformatics, M.V. Lomonosov Moscow State UniversityMoscow, Russia; Skolkovo Institute of Science and TechnologyMoscow, Russia; Faculty of Computer Science, National Research University - Higher School of EconomicsMoscow, Russia
| | | |
Collapse
|
48
|
Hahn J, Tsoy OV, Thalmann S, Čuklina J, Gelfand MS, Evguenieva-Hackenberg E. Small Open Reading Frames, Non-Coding RNAs and Repetitive Elements in Bradyrhizobium japonicum USDA 110. PLoS One 2016; 11:e0165429. [PMID: 27788207 PMCID: PMC5082802 DOI: 10.1371/journal.pone.0165429] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2016] [Accepted: 10/11/2016] [Indexed: 11/18/2022] Open
Abstract
Small open reading frames (sORFs) and genes for non-coding RNAs are poorly investigated components of most genomes. Our analysis of 1391 ORFs recently annotated in the soybean symbiont Bradyrhizobium japonicum USDA 110 revealed that 78% of them contain less than 80 codons. Twenty-one of these sORFs are conserved in or outside Alphaproteobacteria and most of them are similar to genes found in transposable elements, in line with their broad distribution. Stabilizing selection was demonstrated for sORFs with proteomic evidence and bll1319_ISGA which is conserved at the nucleotide level in 16 alphaproteobacterial species, 79 species from other taxa and 49 other Proteobacteria. Further we used Northern blot hybridization to validate ten small RNAs (BjsR1 to BjsR10) belonging to new RNA families. We found that BjsR1 and BjsR3 have homologs outside the genus Bradyrhizobium, and BjsR5, BjsR6, BjsR7, and BjsR10 have up to four imperfect copies in Bradyrhizobium genomes. BjsR8, BjsR9, and BjsR10 are present exclusively in nodules, while the other sRNAs are also expressed in liquid cultures. We also found that the level of BjsR4 decreases after exposure to tellurite and iron, and this down-regulation contributes to survival under high iron conditions. Analysis of additional small RNAs overlapping with 3’-UTRs revealed two new repetitive elements named Br-REP1 and Br-REP2. These REP elements may play roles in the genomic plasticity and gene regulation and could be useful for strain identification by PCR-fingerprinting. Furthermore, we studied two potential toxin genes in the symbiotic island and confirmed toxicity of the yhaV homolog bll1687 but not of the newly annotated higB homolog blr0229_ISGA in E. coli. Finally, we revealed transcription interference resulting in an antisense RNA complementary to blr1853, a gene induced in symbiosis. The presented results expand our knowledge on sORFs, non-coding RNAs and repetitive elements in B. japonicum and related bacteria.
Collapse
Affiliation(s)
- Julia Hahn
- Institute of Microbiology and Molecular Biology, Justus-Liebig-University, Giessen, Germany
| | - Olga V. Tsoy
- A. A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Bolshoi Karetny Ln. 19, Moscow, 127051, Russia
| | - Sebastian Thalmann
- Institute of Microbiology and Molecular Biology, Justus-Liebig-University, Giessen, Germany
| | - Jelena Čuklina
- A. A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Bolshoi Karetny Ln. 19, Moscow, 127051, Russia
- ETH, Institute of Molecular Systems Biology, Zürich, Switzerland
| | - Mikhail S. Gelfand
- A. A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Bolshoi Karetny Ln. 19, Moscow, 127051, Russia
- Skolkovo Institute of Science and Technology, Nobel Str. 3, Moscow, 143026, Russia
- Faculty of Bioengineering and Bioinformatics, M. V. Lomonosov Moscow State University, Vorobyevy Gory 1–73, Moscow, 119234, Russia
- Faculty of Computer Science, Higher School of Economics, Kochnovsky Dr. 3, Moscow, 125319, Russia
- * E-mail: (EEH); (MSG)
| | - Elena Evguenieva-Hackenberg
- Institute of Microbiology and Molecular Biology, Justus-Liebig-University, Giessen, Germany
- * E-mail: (EEH); (MSG)
| |
Collapse
|
49
|
Korostelev YD, Zharov IA, Mironov AA, Rakhmaininova AB, Gelfand MS. Identification of Position-Specific Correlations between DNA-Binding Domains and Their Binding Sites. Application to the MerR Family of Transcription Factors. PLoS One 2016; 11:e0162681. [PMID: 27690309 PMCID: PMC5045206 DOI: 10.1371/journal.pone.0162681] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2015] [Accepted: 08/26/2016] [Indexed: 11/25/2022] Open
Abstract
The large and increasing volume of genomic data analyzed by comparative methods provides information about transcription factors and their binding sites that, in turn, enables statistical analysis of correlations between factors and sites, uncovering mechanisms and evolution of specific protein-DNA recognition. Here we present an online tool, Prot-DNA-Korr, designed to identify and analyze crucial protein-DNA pairs of positions in a family of transcription factors. Correlations are identified by analysis of mutual information between columns of protein and DNA alignments. The algorithm reduces the effects of common phylogenetic history and of abundance of closely related proteins and binding sites. We apply it to five closely related subfamilies of the MerR family of bacterial transcription factors that regulate heavy metal resistance systems. We validate the approach using known 3D structures of MerR-family proteins in complexes with their cognate DNA binding sites and demonstrate that a significant fraction of correlated positions indeed form specific side-chain-to-base contacts. The joint distribution of amino acids and nucleotides hence may be used to predict changes of specificity for point mutations in transcription factors.
Collapse
Affiliation(s)
- Yuriy D. Korostelev
- A.A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, 19-1 Bolshoy Karetny pereulok, Moscow, Russia, 127994
- Department of Bioengineering and Bioinformatics, Moscow State University, 1-73 Vorobievy Gory, Moscow, Russia, 119991
| | - Ilya A. Zharov
- A.A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, 19-1 Bolshoy Karetny pereulok, Moscow, Russia, 127994
| | - Andrey A. Mironov
- A.A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, 19-1 Bolshoy Karetny pereulok, Moscow, Russia, 127994
- Department of Bioengineering and Bioinformatics, Moscow State University, 1-73 Vorobievy Gory, Moscow, Russia, 119991
| | - Alexandra B. Rakhmaininova
- A.A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, 19-1 Bolshoy Karetny pereulok, Moscow, Russia, 127994
| | - Mikhail S. Gelfand
- A.A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, 19-1 Bolshoy Karetny pereulok, Moscow, Russia, 127994
- Department of Bioengineering and Bioinformatics, Moscow State University, 1-73 Vorobievy Gory, Moscow, Russia, 119991
- * E-mail:
| |
Collapse
|
50
|
Ivanenko VN, Antonenko EA, Gelfand MS, Yager J, Ferrari FD. Changes in segmentation and setation along the anterior/posterior axis of the homonomous trunk limbs of a remipede (Crustacea, Arthropoda). PeerJ 2016; 4:e2305. [PMID: 27602276 PMCID: PMC4991865 DOI: 10.7717/peerj.2305] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2016] [Accepted: 07/08/2016] [Indexed: 11/24/2022] Open
Abstract
This study describes the segmentation and setation at different developmental stages of the homonomous trunk limbs of the remipede Speleonectes tulumensisYager, 1987 collected in anchialine caves of the Yucatan Peninsula. Most homonomous trunk limbs originate ventrolaterally and are composed of two protopodal segments, three exopodal segments and four endopodal segments; contralateral limb pairs are united by a sternal bar. However, the last few posterior limbs originate ventrally, are smaller sized, and have regressively fewer segments, suggesting that limb development passes through several intermediate steps beginning with a limb bud. A terminal stage of development is proposed for specimens on which the posterior somite bears a simple bilobate limb bud, and the adjacent somite bears a limb with a protopod comprised of a coxapod and basipod, and with three exopodal and four endopodal segments. On each trunk limb there are 20 serially homologous groups of setae, and the numbers of setae on different limbs usually varies. These groups of setae are arranged linearly and are identified based on the morphology of the setae and their position on the segments. The number of setae in these groups increases gradually from the anterior homonomous limb to a maximum between limbs 8–12; the number then decreases sharply on the more posterior limbs. Changes in the number of setae, which reach a maximum between trunk limbs 8–12, differ from changes in segmentation which vary only over the last few posterior trunk limbs. Following a vector analysis that identified a spatial pattern for these 20 groups of setae among the different homonomous limbs, the hypothesis was confirmed that the number of setae in any given group and any given limb is correlated with the group, with the position of the somite along the body axis, and with the number of somites present on the specimens. This is the first vector analysis used to analyze a pattern of developmental changes in serially homologs of an arthropod. Development of remipede limbs are compared and contrasted with similar copepod limbs. Architecture, particularly the sternal bar uniting contralateral limb pairs, proposed as homologous, and development of trunk limb segmentation of the remipede is generally similar to that of copepods, but the remipede limb differs in several ways including an additional endopodal segment, the proximal, that appears simultaneously with the protopod during development.
Collapse
Affiliation(s)
- Viacheslav N Ivanenko
- Department of Invertebrate Zoology, Biological Faculty, Lomonosov Moscow State University, Moscow, Russia
| | - Ekaterina A Antonenko
- Faculty of Mechanics and Mathematics, Lomonosov Moscow State University, Moscow, Russia
| | - Mikhail S Gelfand
- A.A.Kharkevich Institute for Information Transmission Problems, Moscow, Russia; Skolkovo Institute od Science and Technology, Skolkovo, Moscow Region, Russia; Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow, Russia
| | - Jill Yager
- Department of Invertebrate Zoology, National Museum of Natural History, Smithsonian Institution , Washington D.C. , USA
| | | |
Collapse
|