26
|
Soutourina O, Dubois T, Monot M, Shelyakin PV, Saujet L, Boudry P, Gelfand MS, Dupuy B, Martin-Verstraete I. Genome-Wide Transcription Start Site Mapping and Promoter Assignments to a Sigma Factor in the Human Enteropathogen Clostridioides difficile. Front Microbiol 2020; 11:1939. [PMID: 32903654 PMCID: PMC7438776 DOI: 10.3389/fmicb.2020.01939] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2020] [Accepted: 07/23/2020] [Indexed: 12/12/2022] Open
Abstract
The emerging human enteropathogen Clostridioides difficile is the main cause of diarrhea associated with antibiotherapy. Regulatory pathways underlying the adaptive responses remain understudied and the global view of C. difficile promoter structure is still missing. In the genome of C. difficile 630, 22 genes encoding sigma factors are present suggesting a complex pattern of transcription in this bacterium. We present here the first transcriptional map of the C. difficile genome resulting from the identification of transcriptional start sites (TSS), promoter motifs and operon structures. By 5′-end RNA-seq approach, we mapped more than 1000 TSS upstream of genes. In addition to these primary TSS, this analysis revealed complex structure of transcriptional units such as alternative and internal promoters, potential RNA processing events and 5′ untranslated regions. By following an in silico iterative strategy that used as an input previously published consensus sequences and transcriptomic analysis, we identified candidate promoters upstream of most of protein-coding and non-coding RNAs genes. This strategy also led to refine consensus sequences of promoters recognized by major sigma factors of C. difficile. Detailed analysis focuses on the transcription in the pathogenicity locus and regulatory genes, as well as regulons of transition phase and sporulation sigma factors as important components of C. difficile regulatory network governing toxin gene expression and spore formation. Among the still uncharacterized regulons of the major sigma factors of C. difficile, we defined the SigL regulon by combining transcriptome and in silico analyses. We showed that the SigL regulon is largely involved in amino-acid degradation, a metabolism crucial for C. difficile gut colonization. Finally, we combined our TSS mapping, in silico identification of promoters and RNA-seq data to improve gene annotation and to suggest operon organization in C. difficile. These data will considerably improve our knowledge of global regulatory circuits controlling gene expression in C. difficile and will serve as a useful rich resource for scientific community both for the detailed analysis of specific genes and systems biology studies.
Collapse
|
27
|
Samborskaia MD, Galitsyna A, Pletenev I, Trofimova A, Mironov AA, Gelfand MS, Khrameeva EE. Cumulative contact frequency of a chromatin region is an intrinsic property linked to its function. PeerJ 2020; 8:e9566. [PMID: 32864204 PMCID: PMC7425636 DOI: 10.7717/peerj.9566] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2020] [Accepted: 06/27/2020] [Indexed: 12/18/2022] Open
Abstract
Regulation of gene transcription is a complex process controlled by many factors, including the conformation of chromatin in the nucleus. Insights into chromatin conformation on both local and global scales can be provided by the Hi-C (high-throughput chromosomes conformation capture) method. One of the drawbacks of Hi-C analysis and interpretation is the presence of systematic biases, such as different accessibility to enzymes, amplification, and mappability of DNA regions, which all result in different visibility of the regions. Iterative correction (IC) is one of the most popular techniques developed for the elimination of these systematic biases. IC is based on the assumption that all chromatin regions have an equal number of observed contacts in Hi-C. In other words, the IC procedure is equalizing the experimental visibility approximated by the cumulative contact frequency (CCF) for all genomic regions. However, the differences in experimental visibility might be explained by biological factors such as chromatin openness, which is characteristic of distinct chromatin states. Here we show that CCF is positively correlated with active transcription. It is associated with compartment organization, since compartment A demonstrates higher CCF and gene expression levels than compartment B. Notably, this observation holds for a wide range of species, including human, mouse, and Drosophila. Moreover, we track the CCF state for syntenic blocks between human and mouse and conclude that active state assessed by CCF is an intrinsic property of the DNA region, which is independent of local genomic and epigenomic context. Our findings establish a missing link between Hi-C normalization procedures removing CCF from the data and poorly investigated and possibly relevant biological factors contributing to CCF.
Collapse
|
28
|
Komarova ES, Chervontseva ZS, Osterman IA, Evfratov SA, Rubtsova MP, Zatsepin TS, Semashko TA, Kostryukova ES, Bogdanov AA, Gelfand MS, Dontsova OA, Sergiev PV. Influence of the spacer region between the Shine-Dalgarno box and the start codon for fine-tuning of the translation efficiency in Escherichia coli. Microb Biotechnol 2020; 13:1254-1261. [PMID: 32202698 PMCID: PMC7264876 DOI: 10.1111/1751-7915.13561] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2019] [Revised: 02/06/2020] [Accepted: 03/02/2020] [Indexed: 01/27/2023] Open
Abstract
Translation efficiency contributes several orders of magnitude difference in the overall yield of exogenous gene expression in bacteria. In diverse bacteria, the translation initiation site, whose sequence is the primary determinant of the translation performance, is comprised of the start codon and the Shine-Dalgarno box located upstream. Here, we have examined how the sequence of a spacer between these main components of the translation initiation site contributes to the yield of synthesized protein. We have created a library of reporter constructs with the randomized spacer region, performed fluorescently activated cell sorting and applied next-generation sequencing analysis (the FlowSeq protocol). As a result, we have identified sequence motifs for the spacer region between the Shine-Dalgarno box and AUG start codon that may modulate the translation efficiency in a 100-fold range.
Collapse
|
29
|
Babenko VV, Podgorny OV, Manuvera VA, Kasianov AS, Manolov AI, Grafskaia EN, Shirokov DA, Kurdyumov AS, Vinogradov DV, Nikitina AS, Kovalchuk SI, Anikanov NA, Butenko IO, Pobeguts OV, Matyushkina DS, Rakitina DV, Kostryukova ES, Zgoda VG, Baskova IP, Trukhan VM, Gelfand MS, Govorun VM, Schiöth HB, Lazarev VN. Draft genome sequences of Hirudo medicinalis and salivary transcriptome of three closely related medicinal leeches. BMC Genomics 2020; 21:331. [PMID: 32349672 PMCID: PMC7191736 DOI: 10.1186/s12864-020-6748-0] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2019] [Accepted: 04/21/2020] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND Salivary cell secretion (SCS) plays a critical role in blood feeding by medicinal leeches, making them of use for certain medical purposes even today. RESULTS We annotated the Hirudo medicinalis genome and performed RNA-seq on salivary cells isolated from three closely related leech species, H. medicinalis, Hirudo orientalis, and Hirudo verbana. Differential expression analysis verified by proteomics identified salivary cell-specific gene expression, many of which encode previously unknown salivary components. However, the genes encoding known anticoagulants have been found to be expressed not only in salivary cells. The function-related analysis of the unique salivary cell genes enabled an update of the concept of interactions between salivary proteins and components of haemostasis. CONCLUSIONS Here we report a genome draft of Hirudo medicinalis and describe identification of novel salivary proteins and new homologs of genes encoding known anticoagulants in transcriptomes of three medicinal leech species. Our data provide new insights in genetics of blood-feeding lifestyle in leeches.
Collapse
|
30
|
Sigalova OM, Chaplin AV, Bochkareva OO, Shelyakin PV, Filaretov VA, Akkuratov EE, Burskaia V, Gelfand MS. Chlamydia pan-genomic analysis reveals balance between host adaptation and selective pressure to genome reduction. BMC Genomics 2019; 20:710. [PMID: 31510914 PMCID: PMC6740158 DOI: 10.1186/s12864-019-6059-5] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2019] [Accepted: 08/22/2019] [Indexed: 01/23/2023] Open
Abstract
Background Chlamydia are ancient intracellular pathogens with reduced, though strikingly conserved genome. Despite their parasitic lifestyle and isolated intracellular environment, these bacteria managed to avoid accumulation of deleterious mutations leading to subsequent genome degradation characteristic for many parasitic bacteria. Results We report pan-genomic analysis of sixteen species from genus Chlamydia including identification and functional annotation of orthologous genes, and characterization of gene gains, losses, and rearrangements. We demonstrate the overall genome stability of these bacteria as indicated by a large fraction of common genes with conserved genomic locations. On the other hand, extreme evolvability is confined to several paralogous gene families such as polymorphic membrane proteins and phospholipase D, and likely is caused by the pressure from the host immune system. Conclusions This combination of a large, conserved core genome and a small, evolvable periphery likely reflect the balance between the selective pressure towards genome reduction and the need to adapt to escape from the host immunity. Electronic supplementary material The online version of this article (10.1186/s12864-019-6059-5) contains supplementary material, which is available to authorized users.
Collapse
|
31
|
Fedonin GG, Eroshkin A, Cieplak P, Matveev EV, Ponomarev GV, Gelfand MS, Ratnikov BI, Kazanov MD. Predictive models of protease specificity based on quantitative protease-activity profiling data. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2019; 1867:140253. [PMID: 31330204 DOI: 10.1016/j.bbapap.2019.07.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/08/2019] [Revised: 07/09/2019] [Accepted: 07/17/2019] [Indexed: 10/26/2022]
Abstract
Bioinformatics-based prediction of protease substrates can help to elucidate regulatory proteolytic pathways that control a broad range of biological processes such as apoptosis and blood coagulation. The majority of published predictive models are position weight matrices (PWM) reflecting specificity of proteases toward target sequence. These models are typically derived from experimental data on positions of hydrolyzed peptide bonds and show a reasonable predictive power. New emerging techniques that not only register the cleavage position but also measure catalytic efficiency of proteolysis are expected to improve the quality of predictions or at least substantially reduce the number of tested substrates required for confident predictions. The main goal of this study was to develop new prediction models based on such data and to estimate the performance of the constructed models. We used data on catalytic efficiency of proteolysis measured for eight major human matrix metalloproteinases to construct predictive models of protease specificity using a variety of regression analysis techniques. The obtained results suggest that efficiency-based (quantitative) models show a comparable performance with conventional PWM-based algorithms, while less training data are required. The derived list of candidate cleavage sites in human secreted proteins may serve as a starting point for experimental analysis.
Collapse
|
32
|
Suvorova IA, Gelfand MS. Comparative Genomic Analysis of the Regulation of Aromatic Metabolism in Betaproteobacteria. Front Microbiol 2019; 10:642. [PMID: 30984152 PMCID: PMC6449761 DOI: 10.3389/fmicb.2019.00642] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2018] [Accepted: 03/14/2019] [Indexed: 01/23/2023] Open
Abstract
Aromatic compounds are a common carbon and energy source for many microorganisms, some of which can even degrade toxic chloroaromatic xenobiotics. This comparative study of aromatic metabolism in 32 Betaproteobacteria species describes the links between several transcription factors (TFs) that control benzoate (BenR, BenM, BoxR, BzdR), catechol (CatR, CatM, BenM), chlorocatechol (ClcR), methylcatechol (MmlR), 2,4-dichlorophenoxyacetate (TfdR, TfdS), phenol (AphS, AphR, AphT), biphenyl (BphS), and toluene (TbuT) metabolism. We characterize the complexity and variability in the organization of aromatic metabolism operons and the structure of regulatory networks that may differ even between closely related species. Generally, the upper parts of pathways, rare pathway variants, and degradative pathways of exotic and complex, in particular, xenobiotic compounds are often controlled by a single TF, while the regulation of more common and/or central parts of the aromatic metabolism may vary widely and often involves several TFs with shared and/or dual, or cascade regulation. The most frequent and at the same time variable connections exist between AphS, AphR, AphT, and BenR. We have identified a novel LysR-family TF that regulates the metabolism of catechol (or some catechol derivative) and either substitutes CatR(M)/BenM, or shares functions with it. We have also predicted several new members of aromatic metabolism regulons, in particular, some COGs regulated by several different TFs.
Collapse
|
33
|
Shelyakin PV, Bochkareva OO, Karan AA, Gelfand MS. Micro-evolution of three Streptococcus species: selection, antigenic variation, and horizontal gene inflow. BMC Evol Biol 2019; 19:83. [PMID: 30917781 PMCID: PMC6437910 DOI: 10.1186/s12862-019-1403-6] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2017] [Accepted: 02/25/2019] [Indexed: 02/07/2023] Open
Abstract
Background The genus Streptococcus comprises pathogens that strongly influence the health of humans and animals. Genome sequencing of multiple Streptococcus strains demonstrated high variability in gene content and order even in closely related strains of the same species and created a newly emerged object for genomic analysis, the pan-genome. Here we analysed the genome evolution of 25 strains of Streptococcus suis, 50 strains of Streptococcus pyogenes and 28 strains of Streptococcus pneumoniae. Results Fractions of the pan-genome, unique, periphery, and universal genes differ in size, functional composition, the level of nucleotide substitutions, and predisposition to horizontal gene transfer and genomic rearrangements. The density of substitutions in intergenic regions appears to be correlated with selection acting on adjacent genes, implying that more conserved genes tend to have more conserved regulatory regions. The total pan-genome of the genus is open, but only due to strain-specific genes, whereas other pan-genome fractions reach saturation. We have identified the set of genes with phylogenies inconsistent with species and non-conserved location in the chromosome; these genes are rare in at least one species and have likely experienced recent horizontal transfer between species. The strain-specific fraction is enriched with mobile elements and hypothetical proteins, but also contains a number of candidate virulence-related genes, so it may have a strong impact on adaptability and pathogenicity. Mapping the rearrangements to the phylogenetic tree revealed large parallel inversions in all species. A parallel inversion of length 15 kB with breakpoints formed by genes encoding surface antigen proteins PhtD and PhtB in S. pneumoniae leads to replacement of gene fragments that likely indicates the action of an antigen variation mechanism. Conclusions Members of genus Streptococcus have a highly dynamic, open pan-genome, that potentially confers them with the ability to adapt to changing environmental conditions, i.e. antibiotic resistance or transmission between different hosts. Hence, integrated analysis of all aspects of genome evolution is important for the identification of potential pathogens and design of drugs and vaccines. Electronic supplementary material The online version of this article (10.1186/s12862-019-1403-6) contains supplementary material, which is available to authorized users.
Collapse
|
34
|
Bochkareva OO, Moroz EV, Davydov II, Gelfand MS. Genome rearrangements and selection in multi-chromosome bacteria Burkholderia spp. BMC Genomics 2018; 19:965. [PMID: 30587126 PMCID: PMC6307245 DOI: 10.1186/s12864-018-5245-1] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2018] [Accepted: 11/14/2018] [Indexed: 11/30/2022] Open
Abstract
BACKGROUND The genus Burkholderia consists of species that occupy remarkably diverse ecological niches. Its best known members are important pathogens, B. mallei and B. pseudomallei, which cause glanders and melioidosis, respectively. Burkholderia genomes are unusual due to their multichromosomal organization, generally comprised of 2-3 chromosomes. RESULTS We performed integrated genomic analysis of 127 Burkholderia strains. The pan-genome is open with the saturation to be reached between 86,000 and 88,000 genes. The reconstructed rearrangements indicate a strong avoidance of intra-replichore inversions that is likely caused by selection against the transfer of large groups of genes between the leading and the lagging strands. Translocated genes also tend to retain their position in the leading or the lagging strand, and this selection is stronger for large syntenies. Integrated reconstruction of chromosome rearrangements in the context of strains phylogeny reveals parallel rearrangements that may indicate inversion-based phase variation and integration of new genomic islands. In particular, we detected parallel inversions in the second chromosomes of B. pseudomallei with breakpoints formed by genes encoding membrane components of multidrug resistance complex, that may be linked to a phase variation mechanism. Two genomic islands, spreading horizontally between chromosomes, were detected in the B. cepacia group. CONCLUSIONS This study demonstrates the power of integrated analysis of pan-genomes, chromosome rearrangements, and selection regimes. Non-random inversion patterns indicate selective pressure, inversions are particularly frequent in a recent pathogen B. mallei, and, together with periods of positive selection at other branches, may indicate adaptation to new niches. One such adaptation could be a possible phase variation mechanism in B. pseudomallei.
Collapse
|
35
|
Shelyakin PV, Garushyants SK, Nikitin MA, Mudrova SV, Berumen M, Speksnijder AGCL, Hoeksema BW, Fontaneto D, Gelfand MS, Ivanenko VN. Microbiomes of gall-inducing copepod crustaceans from the corals Stylophora pistillata (Scleractinia) and Gorgonia ventalina (Alcyonacea). Sci Rep 2018; 8:11563. [PMID: 30069039 PMCID: PMC6070567 DOI: 10.1038/s41598-018-29953-y] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2018] [Accepted: 07/18/2018] [Indexed: 12/31/2022] Open
Abstract
Corals harbor complex and diverse microbial communities that strongly impact host fitness and resistance to diseases, but these microbes themselves can be influenced by stresses, like those caused by the presence of macroscopic symbionts. In addition to directly influencing the host, symbionts may transmit pathogenic microbial communities. We analyzed two coral gall-forming copepod systems by using 16S rRNA gene metagenomic sequencing: (1) the sea fan Gorgonia ventalina with copepods of the genus Sphaerippe from the Caribbean and (2) the scleractinian coral Stylophora pistillata with copepods of the genus Spaniomolgus from the Saudi Arabian part of the Red Sea. We show that bacterial communities in these two systems were substantially different with Actinobacteria, Alphaproteobacteria, and Betaproteobacteria more prevalent in samples from Gorgonia ventalina, and Gammaproteobacteria in Stylophora pistillata. In Stylophora pistillata, normal coral microbiomes were enriched with the common coral symbiont Endozoicomonas and some unclassified bacteria, while copepod and gall-tissue microbiomes were highly enriched with the family ME2 (Oceanospirillales) or Rhodobacteraceae. In Gorgonia ventalina, no bacterial group had significantly different prevalence in the normal coral tissues, copepods, and injured tissues. The total microbiome composition of polyps injured by copepods was different. Contrary to our expectations, the microbial community composition of the injured gall tissues was not directly affected by the microbiome of the gall-forming symbiont copepods.
Collapse
|
36
|
Akkuratov EE, Gelfand MS, Khrameeva EE. Neanderthal and Denisovan ancestry in Papuans: A functional study. J Bioinform Comput Biol 2018; 16:1840011. [DOI: 10.1142/s0219720018400115] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Sequencing of complete nuclear genomes of Neanderthal and Denisovan stimulated studies about their relationship with modern humans demonstrating, in particular, that DNA alleles from both Neanderthal and Denisovan genomes are present in genomes of modern humans. The Papuan genome is a unique object because it contains both Neanderthal and Denisovan alleles. Here, we have shown that the Papuan genomes contain different gene functional groups inherited from each of the ancient people. The Papuan genomes demonstrate a relative prevalence of Neanderthal alleles in genes responsible for the regulation of transcription and neurogenesis. The enrichment of specific functional groups with Denisovan alleles is less pronounced; these groups are responsible for bone and tissue remodeling. This analysis shows that introgression of alleles from Neanderthals and Denisovans to Papuans occurred independently and retention of these alleles may carry specific adaptive advantages.
Collapse
|
37
|
Garushyants SK, Beliavskaia AY, Malko DB, Logacheva MD, Rautian MS, Gelfand MS. Comparative Genomic Analysis of Holospora spp., Intranuclear Symbionts of Paramecia. Front Microbiol 2018; 9:738. [PMID: 29713316 PMCID: PMC5911502 DOI: 10.3389/fmicb.2018.00738] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2017] [Accepted: 03/29/2018] [Indexed: 11/16/2022] Open
Abstract
While most endosymbiotic bacteria are transmitted only vertically, Holospora spp., an alphaproteobacterium from the Rickettsiales order, can desert its host and invade a new one. All bacteria from the genus Holospora are intranuclear symbionts of ciliates Paramecium spp. with strict species and nuclear specificity. Comparative metabolic reconstruction based on the newly sequenced genome of Holospora curviuscula, a macronuclear symbiont of Paramecium bursaria, and known genomes of other Holospora species shows that even though all Holospora spp. can persist outside the host, they cannot synthesize most of the essential small molecules, such as amino acids, and lack some central energy metabolic pathways, including glycolysis and the citric acid cycle. As the main energy source, Holospora spp. likely rely on nucleotides pirated from the host. Holospora-specific genes absent from other Rickettsiales are possibly involved in the lifestyle switch from the infectious to the reproductive form and in cell invasion.
Collapse
|
38
|
Mazin PV, Jiang X, Fu N, Han D, Guo M, Gelfand MS, Khaitovich P. Conservation, evolution, and regulation of splicing during prefrontal cortex development in humans, chimpanzees, and macaques. RNA (NEW YORK, N.Y.) 2018; 24:585-596. [PMID: 29363555 PMCID: PMC5855957 DOI: 10.1261/rna.064931.117] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/19/2017] [Accepted: 01/10/2018] [Indexed: 05/03/2023]
Abstract
Changes in splicing are known to affect the function and regulation of genes. We analyzed splicing events that take place during the postnatal development of the prefrontal cortex in humans, chimpanzees, and rhesus macaques based on data obtained from 168 individuals. Our study revealed that among the 38,822 quantified alternative exons, 15% are differentially spliced among species, and more than 6% splice differently at different ages. Mutations in splicing acceptor and/or donor sites might explain more than 14% of all splicing differences among species and up to 64% of high-amplitude differences. A reconstructed trans-regulatory network containing 21 RNA-binding proteins explains a further 4% of splicing variations within species. While most age-dependent splicing patterns are conserved among the three species, developmental changes in intron retention are substantially more pronounced in humans.
Collapse
|
39
|
Bochkareva OO, Dranenko NO, Ocheredko ES, Kanevsky GM, Lozinsky YN, Khalaycheva VA, Artamonova II, Gelfand MS. Genome rearrangements and phylogeny reconstruction in Yersinia pestis. PeerJ 2018; 6:e4545. [PMID: 29607260 PMCID: PMC5877447 DOI: 10.7717/peerj.4545] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2017] [Accepted: 03/07/2018] [Indexed: 12/20/2022] Open
Abstract
Genome rearrangements have played an important role in the evolution of Yersinia pestis from its progenitor Yersinia pseudotuberculosis. Traditional phylogenetic trees for Y. pestis based on sequence comparison have short internal branches and low bootstrap supports as only a small number of nucleotide substitutions have occurred. On the other hand, even a small number of genome rearrangements may resolve topological ambiguities in a phylogenetic tree. We reconstructed phylogenetic trees based on genome rearrangements using several popular approaches such as Maximum likelihood for Gene Order and the Bayesian model of genome rearrangements by inversions. We also reconciled phylogenetic trees for each of the three CRISPR loci to obtain an integrated scenario of the CRISPR cassette evolution. Analysis of contradictions between the obtained evolutionary trees yielded numerous parallel inversions and gain/loss events. Our data indicate that an integrated analysis of sequence-based and inversion-based trees enhances the resolution of phylogenetic reconstruction. In contrast, reconstructions of strain relationships based on solely CRISPR loci may not be reliable, as the history is obscured by large deletions, obliterating the order of spacer gains. Similarly, numerous parallel gene losses preclude reconstruction of phylogeny based on gene content.
Collapse
|
40
|
Moldovan MA, Petrova SA, Gelfand MS. Comparative genomic analysis of fungal TPP-riboswitches. Fungal Genet Biol 2018; 114:34-41. [PMID: 29548845 DOI: 10.1016/j.fgb.2018.03.004] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2017] [Revised: 02/17/2018] [Accepted: 03/08/2018] [Indexed: 12/11/2022]
Abstract
Riboswitches are conserved RNA structures located in non-coding regions of mRNA and able to bind small molecules (e.g. metabolites) changing conformation upon binding. This feature enables them to function as regulators of gene expression. The thiamin pyrophosphate (TPP) riboswitch is the only type of riboswitches found not only in bacteria, but also in eukaryotes - in plants, green algae, protists, and fungi. Two main mechanisms of fungal TPP riboswitch action, involving alternative splicing, have been established so far. Here, we report a large-scale bioinformatic study of riboswitch structural features, action mechanisms, and distribution along the fungal taxonomy groups. For each putatively regulated gene, we reconstruct the riboswitch structure, identify other components of the regulation machinery, and establish mechanisms of riboswitch-mediated regulation. In addition to three genes known to be regulated by TPP riboswitches, thiazole synthase THI4, hydroxymethilpyrimidine-syntase NMT1, and putative transporter NCU01977, we identify two new genes, a putative thiamin transporter THI9 and a transporter of unknown specificity. While the riboswitch sequence and structure remain highly conserved in all species and genes, the mode of riboswitch-mediated regulation varies between regulated genes. The riboswitch usage varies strongly between fungal taxa, with the largest number of riboswitch-regulated genes found in Pezizomycotina and no riboswitch-mediated regulation established in Saccaromycotina.
Collapse
|
41
|
Moldovan MA, Gelfand MS. Pangenomic Definition of Prokaryotic Species and the Phylogenetic Structure of Prochlorococcus spp. Front Microbiol 2018; 9:428. [PMID: 29593678 PMCID: PMC5857598 DOI: 10.3389/fmicb.2018.00428] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2017] [Accepted: 02/23/2018] [Indexed: 11/13/2022] Open
Abstract
The pangenome is the collection of all groups of orthologous genes (OGGs) from a set of genomes. We apply the pangenome analysis to propose a definition of prokaryotic species based on identification of lineage-specific gene sets. While being similar to the classical biological definition based on allele flow, it does not rely on DNA similarity levels and does not require analysis of homologous recombination. Hence this definition is relatively objective and independent of arbitrary thresholds. A systematic analysis of 110 accepted species with the largest numbers of sequenced strains yields results largely consistent with the existing nomenclature. However, it has revealed that abundant marine cyanobacteria Prochlorococcus marinus should be divided into two species. As a control we have confirmed the paraphyletic origin of Yersinia pseudotuberculosis (with embedded, monophyletic Y. pestis) and Burkholderia pseudomallei (with B. mallei). We also demonstrate that by our definition and in accordance with recent studies Escherichia coli and Shigella spp. are one species.
Collapse
|
42
|
Kaznadzey A, Shelyakin P, Belousova E, Eremina A, Shvyreva U, Bykova D, Emelianenko V, Korosteleva A, Tutukina M, Gelfand MS. The genes of the sulphoquinovose catabolism in Escherichia coli are also associated with a previously unknown pathway of lactose degradation. Sci Rep 2018; 8:3177. [PMID: 29453395 PMCID: PMC5816610 DOI: 10.1038/s41598-018-21534-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2017] [Accepted: 02/06/2018] [Indexed: 12/29/2022] Open
Abstract
Comparative genomics analysis of conserved gene cassettes demonstrated resemblance between a recently described cassette of genes involved in sulphoquinovose degradation in Escherichia coli K-12 MG1655 and a Bacilli cassette linked with lactose degradation. Six genes from both cassettes had similar functions related to carbohydrate metabolism, namely, hydrolase, aldolase, kinase, isomerase, transporter, and transcription factor. The Escherichia coli sulphoglycolysis cassette was thus predicted to be associated with lactose degradation. This prediction was confirmed experimentally: expression of genes coding for aldolase (yihT), isomerase (yihS), and kinase (yihV) was dramatically increased during growth on lactose. These genes were previously shown to be activated during growth on sulphoquinovose, so our observation may indicate multi-functional capabilities of the respective proteins. Transcription starts for yihT, yihV and yihW were mapped in silico, in vitro and in vivo. Out of three promoters for yihT, one was active only during growth on lactose. We further showed that switches in yihT transcription are controlled by YihW, a DeoR-family transcription factor in the Escherichia coli cassette. YihW acted as a carbon source-dependent dual regulator involved in sustaining the baseline growth in the absence of lac-operon, with function either complementary, or opposite to a global regulator of carbohydrate metabolism, cAMP-CRP.
Collapse
|
43
|
Kaznadzey A, Shelyakin P, Gelfand MS. Sugar Lego: gene composition of bacterial carbohydrate metabolism genomic loci. Biol Direct 2017; 12:28. [PMID: 29178959 PMCID: PMC5702140 DOI: 10.1186/s13062-017-0200-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2017] [Accepted: 11/20/2017] [Indexed: 11/25/2022] Open
Abstract
Background Bacterial carbohydrate metabolism is extremely diverse, since carbohydrates serve as a major energy source and are involved in a variety of cellular processes. Bacterial genes belonging to same metabolic pathway are often co-localized in the chromosome, but it is not a strict rule. Gene co-localization in linked to co-evolution and co-regulation. This study focuses on a large-scale analysis of bacterial genomic loci related to the carbohydrate metabolism. Results We demonstrate that only 53% of 148,000 studied genes from over six hundred bacterial genomes are co-localized in bacterial genomes with other carbohydrate metabolism genes, which points to a significant role of singleton genes. Co-localized genes form cassettes, ranging in size from two to fifteen genes. Two major factors influencing the cassette-forming tendency are gene function and bacterial phylogeny. We have obtained a comprehensive picture of co-localization preferences of genes for nineteen major carbohydrate metabolism functional classes, over two hundred gene orthologous clusters, and thirty bacterial classes, and characterized the cassette variety in size and content among different species, highlighting a significant role of short cassettes. The preference towards co-localization of carbohydrate metabolism genes varies between 40 and 76% for bacterial taxa. Analysis of frequently co-localized genes yielded forty-five significant pairwise links between genes belonging to different functional classes. The number of such links per class range from zero to eight, demonstrating varying preferences of respective genes towards a specific chromosomal neighborhood. Genes from eleven functional classes tend to co-localize with genes from the same class, indicating an important role of clustering of genes with similar functions. At that, in most cases such co-localization does not originate from local duplication events. Conclusions Overall, we describe a complex web formed by evolutionary relationships of bacterial carbohydrate metabolism genes, manifested as co-localization patterns. Reviewers This article was reviewed by Daria V. Dibrova (A.N. Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, Russia), nominated by Armen Mulkidjanian (University of Osnabrück, Germany), Igor Rogozin (NCBI, NLM, NIH, USA) and Yuri Wolf (NCBI, NLM, NIH, USA). Electronic supplementary material The online version of this article (10.1186/s13062-017-0200-7) contains supplementary material, which is available to authorized users.
Collapse
|
44
|
Evfratov SA, Osterman IA, Komarova ES, Pogorelskaya AM, Rubtsova MP, Zatsepin TS, Semashko TA, Kostryukova ES, Mironov AA, Burnaev E, Krymova E, Gelfand MS, Govorun VM, Bogdanov AA, Sergiev PV, Dontsova OA. Application of sorting and next generation sequencing to study 5΄-UTR influence on translation efficiency in Escherichia coli. Nucleic Acids Res 2017; 45:3487-3502. [PMID: 27899632 PMCID: PMC5389652 DOI: 10.1093/nar/gkw1141] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2016] [Accepted: 10/31/2016] [Indexed: 12/24/2022] Open
Abstract
Yield of protein per translated mRNA may vary by four orders of magnitude. Many studies analyzed the influence of mRNA features on the translation yield. However, a detailed understanding of how mRNA sequence determines its propensity to be translated is still missing. Here, we constructed a set of reporter plasmid libraries encoding CER fluorescent protein preceded by randomized 5΄ untranslated regions (5΄-UTR) and Red fluorescent protein (RFP) used as an internal control. Each library was transformed into Escherchia coli cells, separated by efficiency of CER mRNA translation by a cell sorter and subjected to next generation sequencing. We tested efficiency of translation of the CER gene preceded by each of 48 natural 5΄-UTR sequences and introduced random and designed mutations into natural and artificially selected 5΄-UTRs. Several distinct properties could be ascribed to a group of 5΄-UTRs most efficient in translation. In addition to known ones, several previously unrecognized features that contribute to the translation enhancement were found, such as low proportion of cytidine residues, multiple SD sequences and AG repeats. The latter could be identified as translation enhancer, albeit less efficient than SD sequence in several natural 5΄-UTRs.
Collapse
|
45
|
Rakitina DV, Manolov AI, Kanygina AV, Garushyants SK, Baikova JP, Alexeev DG, Ladygina VG, Kostryukova ES, Larin AK, Semashko TA, Karpova IY, Babenko VV, Ismagilova RK, Malanin SY, Gelfand MS, Ilina EN, Gorodnichev RB, Lisitsyna ES, Aleshkin GI, Scherbakov PL, Khalif IL, Shapina MV, Maev IV, Andreev DN, Govorun VM. Genome analysis of E. coli isolated from Crohn's disease patients. BMC Genomics 2017; 18:544. [PMID: 28724357 PMCID: PMC5517970 DOI: 10.1186/s12864-017-3917-x] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2017] [Accepted: 07/02/2017] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND Escherichia coli (E. coli) has been increasingly implicated in the pathogenesis of Crohn's disease (CD). The phylogeny of E. coli isolated from Crohn's disease patients (CDEC) was controversial, and while genotyping results suggested heterogeneity, the sequenced strains of E. coli from CD patients were closely related. RESULTS We performed the shotgun genome sequencing of 28 E. coli isolates from ten CD patients and compared genomes from these isolates with already published genomes of CD strains and other pathogenic and non-pathogenic strains. CDEC was shown to belong to A, B1, B2 and D phylogenetic groups. The plasmid and several operons from the reference CD-associated E. coli strain LF82 were demonstrated to be more often present in CDEC genomes belonging to different phylogenetic groups than in genomes of commensal strains. The operons include carbon-source induced invasion GimA island, prophage I, iron uptake operons I and II, capsular assembly pathogenetic island IV and propanediol and galactitol utilization operons. CONCLUSIONS Our findings suggest that CDEC are phylogenetically diverse. However, some strains isolated from independent sources possess highly similar chromosome or plasmids. Though no CD-specific genes or functional domains were present in all CD-associated strains, some genes and operons are more often found in the genomes of CDEC than in commensal E. coli. They are principally linked to gut colonization and utilization of propanediol and other sugar alcohols.
Collapse
|
46
|
Ulianov SV, Galitsyna AA, Flyamer IM, Golov AK, Khrameeva EE, Imakaev MV, Abdennur NA, Gelfand MS, Gavrilov AA, Razin SV. Activation of the alpha-globin gene expression correlates with dramatic upregulation of nearby non-globin genes and changes in local and large-scale chromatin spatial structure. Epigenetics Chromatin 2017; 10:35. [PMID: 28693562 PMCID: PMC5504709 DOI: 10.1186/s13072-017-0142-4] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2016] [Accepted: 07/03/2017] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND In homeotherms, the alpha-globin gene clusters are located within permanently open genome regions enriched in housekeeping genes. Terminal erythroid differentiation results in dramatic upregulation of alpha-globin genes making their expression comparable to the rRNA transcriptional output. Little is known about the influence of the erythroid-specific alpha-globin gene transcription outburst on adjacent, widely expressed genes and large-scale chromatin organization. Here, we have analyzed the total transcription output, the overall chromatin contact profile, and CTCF binding within the 2.7 Mb segment of chicken chromosome 14 harboring the alpha-globin gene cluster in cultured lymphoid cells and cultured erythroid cells before and after induction of terminal erythroid differentiation. RESULTS We found that, similarly to mammalian genome, the chicken genomes is organized in TADs and compartments. Full activation of the alpha-globin gene transcription in differentiated erythroid cells is correlated with upregulation of several adjacent housekeeping genes and the emergence of abundant intergenic transcription. An extended chromosome region encompassing the alpha-globin cluster becomes significantly decompacted in differentiated erythroid cells, and depleted in CTCF binding and CTCF-anchored chromatin loops, while the sub-TAD harboring alpha-globin gene cluster and the upstream major regulatory element (MRE) becomes highly enriched with chromatin interactions as compared to lymphoid and proliferating erythroid cells. The alpha-globin gene domain and the neighboring loci reside within the A-like chromatin compartment in both lymphoid and erythroid cells and become further segregated from the upstream gene desert upon terminal erythroid differentiation. CONCLUSIONS Our findings demonstrate that the effects of tissue-specific transcription activation are not restricted to the host genomic locus but affect the overall chromatin structure and transcriptional output of the encompassing topologically associating domain.
Collapse
|
47
|
Kublanov IV, Sigalova OM, Gavrilov SN, Lebedinsky AV, Rinke C, Kovaleva O, Chernyh NA, Ivanova N, Daum C, Reddy TBK, Klenk HP, Spring S, Göker M, Reva ON, Miroshnichenko ML, Kyrpides NC, Woyke T, Gelfand MS, Bonch-Osmolovskaya EA. Genomic Analysis of Caldithrix abyssi, the Thermophilic Anaerobic Bacterium of the Novel Bacterial Phylum Calditrichaeota. Front Microbiol 2017; 8:195. [PMID: 28265262 PMCID: PMC5317091 DOI: 10.3389/fmicb.2017.00195] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2016] [Accepted: 01/26/2017] [Indexed: 11/13/2022] Open
Abstract
The genome of Caldithrix abyssi, the first cultivated representative of a phylum-level bacterial lineage, was sequenced within the framework of Genomic Encyclopedia of Bacteria and Archaea (GEBA) project. The genomic analysis revealed mechanisms allowing this anaerobic bacterium to ferment peptides or to implement nitrate reduction with acetate or molecular hydrogen as electron donors. The genome encoded five different [NiFe]- and [FeFe]-hydrogenases, one of which, group 1 [NiFe]-hydrogenase, is presumably involved in lithoheterotrophic growth, three other produce H2 during fermentation, and one is apparently bidirectional. The ability to reduce nitrate is determined by a nitrate reductase of the Nap family, while nitrite reduction to ammonia is presumably catalyzed by an octaheme cytochrome c nitrite reductase εHao. The genome contained genes of respiratory polysulfide/thiosulfate reductase, however, elemental sulfur and thiosulfate were not used as the electron acceptors for anaerobic respiration with acetate or H2, probably due to the lack of the gene of the maturation protein. Nevertheless, elemental sulfur and thiosulfate stimulated growth on fermentable substrates (peptides), being reduced to sulfide, most probably through the action of the cytoplasmic sulfide dehydrogenase and/or NAD(P)-dependent [NiFe]-hydrogenase (sulfhydrogenase) encoded by the genome. Surprisingly, the genome of this anaerobic microorganism encoded all genes for cytochrome c oxidase, however, its maturation machinery seems to be non-operational due to genomic rearrangements of supplementary genes. Despite the fact that sugars were not among the substrates reported when C. abyssi was first described, our genomic analysis revealed multiple genes of glycoside hydrolases, and some of them were predicted to be secreted. This finding aided in bringing out four carbohydrates that supported the growth of C. abyssi: starch, cellobiose, glucomannan and xyloglucan. The genomic analysis demonstrated the ability of C. abyssi to synthesize nucleotides and most amino acids and vitamins. Finally, the genomic sequence allowed us to perform a phylogenomic analysis, based on 38 protein sequences, which confirmed the deep branching of this lineage and justified the proposal of a novel phylum Calditrichaeota.
Collapse
|
48
|
Hahn J, Tsoy OV, Thalmann S, Čuklina J, Gelfand MS, Evguenieva-Hackenberg E. Small Open Reading Frames, Non-Coding RNAs and Repetitive Elements in Bradyrhizobium japonicum USDA 110. PLoS One 2016; 11:e0165429. [PMID: 27788207 PMCID: PMC5082802 DOI: 10.1371/journal.pone.0165429] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2016] [Accepted: 10/11/2016] [Indexed: 11/18/2022] Open
Abstract
Small open reading frames (sORFs) and genes for non-coding RNAs are poorly investigated components of most genomes. Our analysis of 1391 ORFs recently annotated in the soybean symbiont Bradyrhizobium japonicum USDA 110 revealed that 78% of them contain less than 80 codons. Twenty-one of these sORFs are conserved in or outside Alphaproteobacteria and most of them are similar to genes found in transposable elements, in line with their broad distribution. Stabilizing selection was demonstrated for sORFs with proteomic evidence and bll1319_ISGA which is conserved at the nucleotide level in 16 alphaproteobacterial species, 79 species from other taxa and 49 other Proteobacteria. Further we used Northern blot hybridization to validate ten small RNAs (BjsR1 to BjsR10) belonging to new RNA families. We found that BjsR1 and BjsR3 have homologs outside the genus Bradyrhizobium, and BjsR5, BjsR6, BjsR7, and BjsR10 have up to four imperfect copies in Bradyrhizobium genomes. BjsR8, BjsR9, and BjsR10 are present exclusively in nodules, while the other sRNAs are also expressed in liquid cultures. We also found that the level of BjsR4 decreases after exposure to tellurite and iron, and this down-regulation contributes to survival under high iron conditions. Analysis of additional small RNAs overlapping with 3’-UTRs revealed two new repetitive elements named Br-REP1 and Br-REP2. These REP elements may play roles in the genomic plasticity and gene regulation and could be useful for strain identification by PCR-fingerprinting. Furthermore, we studied two potential toxin genes in the symbiotic island and confirmed toxicity of the yhaV homolog bll1687 but not of the newly annotated higB homolog blr0229_ISGA in E. coli. Finally, we revealed transcription interference resulting in an antisense RNA complementary to blr1853, a gene induced in symbiosis. The presented results expand our knowledge on sORFs, non-coding RNAs and repetitive elements in B. japonicum and related bacteria.
Collapse
|
49
|
Korostelev YD, Zharov IA, Mironov AA, Rakhmaininova AB, Gelfand MS. Identification of Position-Specific Correlations between DNA-Binding Domains and Their Binding Sites. Application to the MerR Family of Transcription Factors. PLoS One 2016; 11:e0162681. [PMID: 27690309 PMCID: PMC5045206 DOI: 10.1371/journal.pone.0162681] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2015] [Accepted: 08/26/2016] [Indexed: 11/25/2022] Open
Abstract
The large and increasing volume of genomic data analyzed by comparative methods provides information about transcription factors and their binding sites that, in turn, enables statistical analysis of correlations between factors and sites, uncovering mechanisms and evolution of specific protein-DNA recognition. Here we present an online tool, Prot-DNA-Korr, designed to identify and analyze crucial protein-DNA pairs of positions in a family of transcription factors. Correlations are identified by analysis of mutual information between columns of protein and DNA alignments. The algorithm reduces the effects of common phylogenetic history and of abundance of closely related proteins and binding sites. We apply it to five closely related subfamilies of the MerR family of bacterial transcription factors that regulate heavy metal resistance systems. We validate the approach using known 3D structures of MerR-family proteins in complexes with their cognate DNA binding sites and demonstrate that a significant fraction of correlated positions indeed form specific side-chain-to-base contacts. The joint distribution of amino acids and nucleotides hence may be used to predict changes of specificity for point mutations in transcription factors.
Collapse
|
50
|
Ivanenko VN, Antonenko EA, Gelfand MS, Yager J, Ferrari FD. Changes in segmentation and setation along the anterior/posterior axis of the homonomous trunk limbs of a remipede (Crustacea, Arthropoda). PeerJ 2016; 4:e2305. [PMID: 27602276 PMCID: PMC4991865 DOI: 10.7717/peerj.2305] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2016] [Accepted: 07/08/2016] [Indexed: 11/24/2022] Open
Abstract
This study describes the segmentation and setation at different developmental stages of the homonomous trunk limbs of the remipede Speleonectes tulumensisYager, 1987 collected in anchialine caves of the Yucatan Peninsula. Most homonomous trunk limbs originate ventrolaterally and are composed of two protopodal segments, three exopodal segments and four endopodal segments; contralateral limb pairs are united by a sternal bar. However, the last few posterior limbs originate ventrally, are smaller sized, and have regressively fewer segments, suggesting that limb development passes through several intermediate steps beginning with a limb bud. A terminal stage of development is proposed for specimens on which the posterior somite bears a simple bilobate limb bud, and the adjacent somite bears a limb with a protopod comprised of a coxapod and basipod, and with three exopodal and four endopodal segments. On each trunk limb there are 20 serially homologous groups of setae, and the numbers of setae on different limbs usually varies. These groups of setae are arranged linearly and are identified based on the morphology of the setae and their position on the segments. The number of setae in these groups increases gradually from the anterior homonomous limb to a maximum between limbs 8–12; the number then decreases sharply on the more posterior limbs. Changes in the number of setae, which reach a maximum between trunk limbs 8–12, differ from changes in segmentation which vary only over the last few posterior trunk limbs. Following a vector analysis that identified a spatial pattern for these 20 groups of setae among the different homonomous limbs, the hypothesis was confirmed that the number of setae in any given group and any given limb is correlated with the group, with the position of the somite along the body axis, and with the number of somites present on the specimens. This is the first vector analysis used to analyze a pattern of developmental changes in serially homologs of an arthropod. Development of remipede limbs are compared and contrasted with similar copepod limbs. Architecture, particularly the sternal bar uniting contralateral limb pairs, proposed as homologous, and development of trunk limb segmentation of the remipede is generally similar to that of copepods, but the remipede limb differs in several ways including an additional endopodal segment, the proximal, that appears simultaneously with the protopod during development.
Collapse
|