1
Loewenthal G, Wygoda E, Nagar N, Glick L, Mayrose I, Pupko T. The evolutionary dynamics that retain long neutral genomic sequences in face of indel deletion bias: a model and its application to human introns. Open Biol 2022; 12:220223. [PMID: 36514983 PMCID: PMC9748784 DOI: 10.1098/rsob.220223] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 12/15/2022]
Abstract
Insertions and deletions (indels) of short DNA segments are common evolutionary events. Numerous studies have shown that deletions occur more often than insertions in both prokaryotes and eukaryotes. This raises the question of why neutral sequences are not eradicated from the genome. We suggest that this is due to a phenomenon we term border-induced selection. In this scenario, a neutral sequence is bordered by conserved regions. Deletions occurring near the borders occasionally extend into the conserved region and are thereby subject to strong purifying selection. Thus, for short neutral sequences, an insertion bias is expected. Here, we develop a set of increasingly complex models of indel dynamics that incorporate border-induced selection. Furthermore, we show that short conserved sequences within the neutrally evolving sequence help explain: (i) the presence of very long sequences; (ii) the high variance of sequence lengths; and (iii) the possible emergence of multimodality in sequence length distributions. Finally, we fitted our models to the human intron length distribution, as introns are thought to be mostly neutral and bordered by conserved exons. We show that when accounting for the occurrence of short conserved sequences within introns, we reproduce the main features, including the presence of long introns and the multimodality of the intron length distribution.
Affiliation(s)
- Gil Loewenthal
  - The Shmunis School of Biomedicine and Cancer Research, Tel Aviv University, Tel Aviv 69978, Israel
- Elya Wygoda
  - The Shmunis School of Biomedicine and Cancer Research, Tel Aviv University, Tel Aviv 69978, Israel
- Natan Nagar
  - The Shmunis School of Biomedicine and Cancer Research, Tel Aviv University, Tel Aviv 69978, Israel
- Lior Glick
  - School of Plant Sciences and Food Security, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel
- Itay Mayrose
  - School of Plant Sciences and Food Security, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel
- Tal Pupko
  - The Shmunis School of Biomedicine and Cancer Research, Tel Aviv University, Tel Aviv 69978, Israel
2
Loewenthal G, Rapoport D, Avram O, Moshe A, Wygoda E, Itzkovitch A, Israeli O, Azouri D, Cartwright RA, Mayrose I, Pupko T. A probabilistic model for indel evolution: differentiating insertions from deletions. Mol Biol Evol 2021; 38:5769-5781. [PMID: 34469521 PMCID: PMC8662616 DOI: 10.1093/molbev/msab266] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Indexed: 12/27/2022]
Abstract
Insertions and deletions (indels) are common molecular evolutionary events. However, probabilistic models for indel evolution are under-developed due to their computational complexity. Here, we introduce several improvements to indel modeling: 1) While previous models for indel evolution assumed that the rates and length distributions of insertions and deletions are equal, here we propose a richer model that explicitly distinguishes between the two; 2) we introduce numerous summary statistics that allow approximate Bayesian computation-based parameter estimation; 3) we develop a method to correct for biases introduced by alignment programs, when inferring indel parameters from empirical data sets; and 4) using a model-selection scheme, we test whether the richer model better fits biological data compared with the simpler model. Our analyses suggest that both our inference scheme and the model-selection procedure achieve high accuracy on simulated data. We further demonstrate that our proposed richer model better fits a large number of empirical data sets and that, for the majority of these data sets, the deletion rate is higher than the insertion rate.
Affiliation(s)
- Gil Loewenthal
  - The Shmunis School of Biomedicine and Cancer Research, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel
- Dana Rapoport
  - The Shmunis School of Biomedicine and Cancer Research, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel
- Oren Avram
  - The Shmunis School of Biomedicine and Cancer Research, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel
- Asher Moshe
  - The Shmunis School of Biomedicine and Cancer Research, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel
- Elya Wygoda
  - The Shmunis School of Biomedicine and Cancer Research, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel
- Alon Itzkovitch
  - The Shmunis School of Biomedicine and Cancer Research, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel
- Omer Israeli
  - The Shmunis School of Biomedicine and Cancer Research, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel
- Dana Azouri
  - The Shmunis School of Biomedicine and Cancer Research, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel; School of Plant Sciences and Food Security, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel
- Reed A Cartwright
  - The Biodesign Institute, Arizona State University, Tempe, Arizona, USA; School of Life Sciences, Arizona State University, Tempe, Arizona, USA
- Itay Mayrose
  - School of Plant Sciences and Food Security, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel
- Tal Pupko
  - The Shmunis School of Biomedicine and Cancer Research, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel
3
De Lise F, Strazzulli A, Iacono R, Curci N, Di Fenza M, Maurelli L, Moracci M, Cobucci-Ponzano B. Programmed Deviations of Ribosomes From Standard Decoding in Archaea. Front Microbiol 2021; 12:688061. [PMID: 34149676 PMCID: PMC8211752 DOI: 10.3389/fmicb.2021.688061] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/30/2021] [Accepted: 05/04/2021] [Indexed: 11/13/2022]
Abstract
Genetic code decoding, initially considered to be universal and immutable, is now known to be flexible. In fact, in specific genes, ribosomes deviate from the standard translational rules in a programmed way, a phenomenon globally termed recoding. Translational recoding, which has been found in all domains of life, includes a group of events occurring during gene translation, namely stop codon readthrough, programmed ±1 frameshifting, and ribosome bypassing. These events regulate protein expression at the translational level, and their mechanisms are well known and characterized in viruses, bacteria and eukaryotes. In this review we summarize the current state of the art of recoding in the third domain of life. In Archaea, translational recoding has been shown, and extensively studied, to regulate the decoding of the 21st and the 22nd amino acids, selenocysteine and pyrrolysine, respectively, and only one case of programmed -1 frameshifting has been reported so far, in Saccharolobus solfataricus P2. Further putative events of translational recoding have been hypothesized in other archaeal species, but they have not yet been extensively studied or confirmed. Although this phenomenon could have some implication for the physiology and adaptation of life in extreme environments, this field is still underexplored and genes whose expression could be regulated by recoding are still poorly characterized. The study of these recoding episodes in Archaea is urgently needed.
Affiliation(s)
- Federica De Lise
  - Institute of Biosciences and BioResources - National Research Council of Italy, Naples, Italy
- Andrea Strazzulli
  - Department of Biology, University of Naples Federico II, Complesso Universitario di Monte S. Angelo, Naples, Italy; Task Force on Microbiome Studies, University of Naples Federico II, Naples, Italy
- Roberta Iacono
  - Institute of Biosciences and BioResources - National Research Council of Italy, Naples, Italy; Department of Biology, University of Naples Federico II, Complesso Universitario di Monte S. Angelo, Naples, Italy
- Nicola Curci
  - Institute of Biosciences and BioResources - National Research Council of Italy, Naples, Italy; Department of Biology, University of Naples Federico II, Complesso Universitario di Monte S. Angelo, Naples, Italy
- Mauro Di Fenza
  - Institute of Biosciences and BioResources - National Research Council of Italy, Naples, Italy
- Luisa Maurelli
  - Institute of Biosciences and BioResources - National Research Council of Italy, Naples, Italy
- Marco Moracci
  - Institute of Biosciences and BioResources - National Research Council of Italy, Naples, Italy; Department of Biology, University of Naples Federico II, Complesso Universitario di Monte S. Angelo, Naples, Italy; Task Force on Microbiome Studies, University of Naples Federico II, Naples, Italy
4
Kırtel O, Versluys M, Van den Ende W, Toksoy Öner E. Fructans of the saline world. Biotechnol Adv 2018; 36:1524-1539. [DOI: 10.1016/j.biotechadv.2018.06.009] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Received: 01/08/2018] [Revised: 06/08/2018] [Accepted: 06/14/2018] [Indexed: 10/28/2022]
5
Abstract
A previous study of prokaryotic genomes identified large reservoirs of putative mobile promoters (PMPs), that is, homologous promoter sequences associated with nonhomologous coding sequences. Here we extend this data set to identify the full complement of mobile promoters in sequenced prokaryotic genomes. The expanded search identifies nearly 40,000 PMP sequences, 90% of which occur in noncoding regions of the genome. To gain further insight from this data set, we develop a birth-death-diversification model for mobile genetic elements subject to sequence diversification; applying the model to PMPs we are able to quantify the relative importance of duplication, loss, horizontal gene transfer (HGT), and diversification to the maintenance of the PMP reservoir. The model predicts low rates of HGT relative to the duplication and loss of PMP copies, rapid dynamics of PMP families, and a pool of PMPs that exist as a single copy in a genome at any given time, despite their mobility. We report evidence of these "singletons" at high frequencies in prokaryotic genomes. We also demonstrate that including selection, either for or against PMPs, was not necessary to describe the observed data.
6
Translational recoding in archaea. Extremophiles 2012; 16:793-803. [DOI: 10.1007/s00792-012-0482-8] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Received: 06/21/2012] [Accepted: 09/09/2012] [Indexed: 12/31/2022]
7
Abstract
Transcriptional activation or ‘rewiring’ of silent genes is an important, yet poorly understood, phenomenon in prokaryotic genomes. Anecdotal evidence from experimental evolution studies in bacterial systems has shown how promptly adaptation occurs under appropriate selective pressure. In many cases, a partial or complete promoter is mobilized to silent genes from elsewhere in the genome. We term such recruited regulatory sequences Putative Mobile Promoters (PMPs), and we hypothesize that they have a large impact on the rapid adaptation of novel or cryptic functions. Querying all publicly available prokaryotic genomes (1362) uncovered >4000 families of highly conserved PMPs (50 to 100 nt long with ≥80% nt identity) in 1043 genomes from 424 different genera. The genomes with the largest number of PMP families are Anabaena variabilis (28 families), Geobacter uraniireducens (27 families) and Cyanothece PCC7424 (25 families). Family size varied from 2 to 93 homologous promoters (in Desulfurivibrio alkaliphilus). Some PMPs are present only in particular species, but some are conserved across distant genera. The identified PMPs represent a conservative dataset of very recent or conserved events of mobilization of non-coding DNA, and thus they constitute evidence of an extensive reservoir of recyclable regulatory sequences for rapid transcriptional rewiring.
Affiliation(s)
- Mariana Matus-Garcia
  - Department of Agrotechnology and Food Sciences, Laboratory of Systems and Synthetic Biology, Wageningen University, 6703HB Wageningen, The Netherlands
8
Choi JY, Eoff RL, Pence MG, Wang J, Martin MV, Kim EJ, Folkmann LM, Guengerich FP. Roles of the four DNA polymerases of the crenarchaeon Sulfolobus solfataricus and accessory proteins in DNA replication. J Biol Chem 2011; 286:31180-93. [PMID: 21784862 DOI: 10.1074/jbc.m111.258038] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.5] [Indexed: 12/14/2022]
Abstract
The hyperthermophilic crenarchaeon Sulfolobus solfataricus P2 encodes three B-family DNA polymerase genes, B1 (Dpo1), B2 (Dpo2), and B3 (Dpo3), and one Y-family DNA polymerase gene, Dpo4, which are related to eukaryotic counterparts. Both mRNAs and proteins of all four DNA polymerases were constitutively expressed in all growth phases. Dpo2 and Dpo3 possessed very low DNA polymerase and 3' to 5' exonuclease activities in vitro. Steady-state kinetic efficiencies (k_cat/K_m) for correct nucleotide insertion by Dpo2 and Dpo3 were several orders of magnitude less than those of Dpo1 and Dpo4. Both the accessory proteins proliferating cell nuclear antigen and the clamp loader replication factor C facilitated DNA synthesis with Dpo3, as with Dpo1 and Dpo4, but very weakly with Dpo2. DNA synthesis by Dpo2 and Dpo3 was remarkably decreased by single-stranded binding protein, in contrast to Dpo1 and Dpo4. DNA synthesis in the presence of proliferating cell nuclear antigen, replication factor C, and single-stranded binding protein was most processive with Dpo1, whereas DNA lesion bypass was most effective with Dpo4. Both Dpo2 and Dpo3, but not Dpo1, bypassed hypoxanthine and 8-oxoguanine. Dpo2 and Dpo3 bypassed uracil and cis-syn cyclobutane thymine dimer, respectively. High concentrations of Dpo2 or Dpo3 did not attenuate DNA synthesis by Dpo1 or Dpo4. We conclude that Dpo2 and Dpo3 are much less functional and more thermolabile than Dpo1 and Dpo4 in vitro but have bypass activities across hypoxanthine, 8-oxoguanine, and either uracil or cis-syn cyclobutane thymine dimer, suggesting their catalytically limited roles in translesion DNA synthesis past deaminated, oxidized base lesions and/or UV-induced damage.
Affiliation(s)
- Jeong-Yun Choi
  - Department of Biochemistry and Center in Molecular Toxicology, Vanderbilt University School of Medicine, Nashville, Tennessee 37232-0146, USA
9
Muro EM, Mah N, Moreno-Hagelsieb G, Andrade-Navarro MA. The pseudogenes of Mycobacterium leprae reveal the functional relevance of gene order within operons. Nucleic Acids Res 2010; 39:1732-8. [PMID: 21051341 PMCID: PMC3061063 DOI: 10.1093/nar/gkq1067] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Indexed: 11/18/2022]
Abstract
Almost 50 years after the discovery of the prokaryotic operon, the functional relevance of gene order within operons remains unclear. In this work, we take advantage of the eroded genome of Mycobacterium leprae to add evidence supporting the notion that functionally less important genes tend to be located at the end of operons. M. leprae’s genome includes 1133 pseudogenes and 1614 protein-coding genes and can be compared with the closely related genome of M. tuberculosis. Assuming M. leprae’s pseudogenes to represent dispensable genes, we have studied the position of these pseudogenes in the operons of M. leprae and of their orthologs in M. tuberculosis. We observed that both tend to be located in the 3′ (downstream) half of the operon (P-values of 0.03 and 0.18, respectively). Analysis of pseudogenes in all available prokaryotic genomes confirms this trend (P-value of 7.1 × 10⁻⁷). In a complementary analysis, we found a significant tendency for essential genes to be located in the 5′ (upstream) half of the operon (P-value of 0.006). Our work provides an indication that, in prokarya, functionally less important genes tend to be located at the end of operons, while more relevant genes tend to be located toward operon starts.
Affiliation(s)
- Enrique M Muro
  - Computational Biology and Data Mining Group, Max Delbrück Center for Molecular Medicine, Robert-Rössle Strasse 10, 13125, Berlin, Germany
10
Kuo CH, Ochman H. The extinction dynamics of bacterial pseudogenes. PLoS Genet 2010; 6. [PMID: 20700439 PMCID: PMC2916853 DOI: 10.1371/journal.pgen.1001050] [Citation(s) in RCA: 127] [Impact Index Per Article: 8.5] [Received: 05/04/2010] [Accepted: 07/06/2010] [Indexed: 01/02/2023]
Abstract
Pseudogenes are usually considered to be completely neutral sequences whose evolution is shaped by random mutations and chance events. It is possible, however, for disrupted genes to generate products that are deleterious due either to the energetic costs of their transcription and translation or to the formation of toxic proteins. We found that after their initial formation, the youngest pseudogenes in Salmonella genomes have a very high likelihood of being removed by deletional processes and are eliminated too rapidly to be governed by a strictly neutral model of stochastic loss. Those few highly degraded pseudogenes that have persisted in Salmonella genomes correspond to genes with low expression levels and low connectivity in gene networks, such that their inactivation and any initial deleterious effects associated with their inactivation are buffered. Although pseudogenes have long been considered the paradigm of neutral evolution, the distribution of pseudogenes among Salmonella strains indicates that removal of many of these apparently functionless regions is attributable to positive selection.
Affiliation(s)
- Chih-Horng Kuo
  - Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona, United States of America
- Howard Ochman
  - Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona, United States of America
11
Yokobori SI, Itoh T, Yoshinari S, Nomura N, Sako Y, Yamagishi A, Oshima T, Kita K, Watanabe YI. Gain and loss of an intron in a protein-coding gene in Archaea: the case of an archaeal RNA pseudouridine synthase gene. BMC Evol Biol 2009; 9:198. [PMID: 19671140 PMCID: PMC2738675 DOI: 10.1186/1471-2148-9-198] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.3] [Received: 04/21/2009] [Accepted: 08/11/2009] [Indexed: 01/13/2023]
Abstract
Background: We previously found the first examples of splicing of archaeal pre-mRNAs for homologs of the eukaryotic CBF5 protein (also known as dyskerin in humans) in Aeropyrum pernix, Sulfolobus solfataricus, S. tokodaii, and S. acidocaldarius, and also showed that crenarchaeal species in the orders Desulfurococcales and Sulfolobales, except for Hyperthermus butylicus, Pyrodictium occultum, Pyrolobus fumarii, and Ignicoccus islandicus, contain the (putative) cbf5 intron. However, the exact timing of the intron insertion was not determined, and the putative secondary loss of the intron in some lineages was not verified. Results: In the present study, we determined approximately two-thirds of the entire coding region of crenarchaeal Cbf5 sequences from 43 species. A phylogenetic analysis of our data and information from the available genome sequences suggested that the (putative) cbf5 intron existed in the common ancestor of the orders Desulfurococcales and Sulfolobales and that probably at least two independent lineages in the order Desulfurococcales lost the (putative) intron. Conclusion: This finding is the first observation of a lineage-specific loss of a pre-mRNA intron in Archaea. As the insertion or deletion of introns in protein-coding genes in Archaea has not yet been seriously considered, our finding suggests the possible difficulty of accurately and completely predicting protein-coding genes in Archaea.
Affiliation(s)
- Shin-ichi Yokobori
  - Department of Molecular Biology, School of Life Science, Tokyo University of Pharmacy and Life Science, Horinouchi, Hachioji, Tokyo 192-0392, Japan
12
Abstract
Elevated levels of genetic drift are hypothesized to be a dominant factor that influences genome size evolution across all life-forms. However, increased levels of drift appear to be correlated with genome expansion in eukaryotes but with genome contraction in bacteria, suggesting that these two groups of organisms experience vastly different mutational inputs and selective constraints. To determine the contribution of small insertion and deletion events to the differences in genome organization between eukaryotes and prokaryotes, we systematically surveyed 17 taxonomic groups across the three domains of life. Based on over 5,000 indel events in noncoding regions, we found that deletional events outnumbered insertions in all groups examined. The extent of deletional bias, when measured by the ratio of the total length of insertions to that of deletions, revealed a marked disparity between eukaryotes and prokaryotes: whereas the ratio was close to one in the three eukaryotic groups examined, deletions outweighed insertions by at least a factor of 10 in most prokaryotes. Moreover, the strength of deletional bias is associated with the proportion of coding regions in prokaryotic genomes. Considering that genetic drift is a stochastic process and does not discriminate the exact nature of mutations, the degree of bias toward deletions provides an explanation for the differential responses of eukaryotes and prokaryotes to elevated levels of drift. Furthermore, deletional bias, rather than natural selection, is the primary mechanism by which the compact gene packing within most prokaryotic genomes is maintained.
Affiliation(s)
- Chih-Horng Kuo
  - Department of Ecology & Evolutionary Biology, University of Arizona, USA
13
Cortez D, Forterre P, Gribaldo S. A hidden reservoir of integrative elements is the major source of recently acquired foreign genes and ORFans in archaeal and bacterial genomes. Genome Biol 2009; 10:R65. [PMID: 19531232 PMCID: PMC2718499 DOI: 10.1186/gb-2009-10-6-r65] [Citation(s) in RCA: 95] [Impact Index Per Article: 5.9] [Received: 03/25/2009] [Revised: 06/04/2009] [Accepted: 06/16/2009] [Indexed: 11/10/2022]
Abstract
A large-scale survey of potential recently acquired integrative elements in 119 archaeal and bacterial genomes reveals that many recently acquired genes have originated from integrative elements. Background: Archaeal and bacterial genomes contain a number of genes of foreign origin that arose from recent horizontal gene transfer, but the role of integrative elements (IEs), such as viruses, plasmids, and transposable elements, in this process has not been extensively quantified. Moreover, it is not known whether IEs play an important role in the origin of ORFans (open reading frames without matches in current sequence databases), whose proportion remains stable despite the growing number of complete sequenced genomes. Results: We have performed a large-scale survey of potential recently acquired IEs in 119 archaeal and bacterial genomes. We developed an accurate in silico Markov model-based strategy to identify clusters of genes that show atypical sequence composition (clusters of atypical genes or CAGs) and are thus likely to be recently integrated foreign elements, including IEs. Our method identified a high number of new CAGs. Probabilistic analysis of gene content indicates that 56% of these new CAGs are likely IEs, whereas only 7% likely originated via horizontal gene transfer from distant cellular sources. Thirty-four percent of CAGs remain unassigned, which may reflect a still poor sampling of IEs associated with bacterial and archaeal diversity. Moreover, our study contributes to the issue of the origin of ORFans, because 39% of these are found inside CAGs, many of which likely represent recently acquired IEs. Conclusions: Our results strongly indicate that archaeal and bacterial genomes contain an impressive proportion of recently acquired foreign genes (including ORFans) coming from a still largely unexplored reservoir of IEs.
Affiliation(s)
- Diego Cortez
  - Institut Pasteur, Département de Microbiologie, Unité de Biologie Moléculaire du Gène chez les Extrêmophiles, Paris, France
14
van Passel MWJ, Marri PR, Ochman H. The emergence and fate of horizontally acquired genes in Escherichia coli. PLoS Comput Biol 2008; 4:e1000059. [PMID: 18404206 PMCID: PMC2275313 DOI: 10.1371/journal.pcbi.1000059] [Citation(s) in RCA: 64] [Impact Index Per Article: 3.8] [Received: 10/03/2007] [Accepted: 03/14/2008] [Indexed: 11/18/2022]
Abstract
Bacterial species, and even strains within species, can vary greatly in their gene contents and metabolic capabilities. We examine the evolution of this diversity by assessing the distribution and ancestry of each gene in 13 sequenced isolates of Escherichia coli and Shigella. We focus on the emergence and demise of two specific classes of genes, ORFans (genes with no homologs in present databases) and HOPs (genes with distant homologs), since these genes, in contrast to most conserved ancestral sequences, are known to be a major source of the novel features in each strain. We find that the rates of gain and loss of these genes vary greatly among strains as well as through time, and that ORFans and HOPs show very different behavior with respect to their emergence and demise. Although HOPs, which mostly represent gene acquisitions from other bacteria, originate more frequently, ORFans are much more likely to persist. This difference suggests that many adaptive traits are conferred by completely novel genes that do not originate in other bacterial genomes. With respect to the demise of these acquired genes, we find that strains of Shigella lose genes, both by disruption events and by complete removal, at accelerated rates.
Affiliation(s)
- Mark W J van Passel
  - Department of Biochemistry and Molecular Biophysics, University of Arizona, Tucson, Arizona, United States of America
15
van Passel MWJ, Ochman H. Selection on the genic location of disruptive elements. Trends Genet 2007; 23:601-4. [PMID: 17996324 DOI: 10.1016/j.tig.2007.08.017] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Received: 07/19/2007] [Revised: 07/26/2007] [Accepted: 08/01/2007] [Indexed: 11/19/2022]
Abstract
Analyses of nucleotide patterns in coding regions of prokaryotes have revealed that selection acts on DNA and RNA stability and on translational accuracy. Here we examine the positions of mononucleotide repeats within microbial genes and detect a pervasive bias in the locations of these disruptive elements that becomes more pronounced with increases in repeat length. We argue that, because these repeats are mutagenic, this pattern arose to minimize the costs associated with transcribing and translating nonfunctional genes, supporting a view that pseudogenes need not be evolving in a strictly neutral manner.
Affiliation(s)
- M W J van Passel
  - Department of Biochemistry and Molecular Biophysics, University of Arizona, 1007 East Lowell Street, Tucson, AZ 85721, USA