151
|
Sonnenschein N, Hütt MT, Stoyan H, Stoyan D. Ranges of control in the transcriptional regulation of Escherichia coli. BMC SYSTEMS BIOLOGY 2009; 3:119. [PMID: 20034377 PMCID: PMC2804738 DOI: 10.1186/1752-0509-3-119] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/12/2009] [Accepted: 12/24/2009] [Indexed: 11/10/2022]
Abstract
BACKGROUND The positioning of genes in the genome is an important evolutionary degree of freedom for organizing gene regulation. Statistical properties of these distributions have been studied particularly in relation to the transcriptional regulatory network. The systematics of gene-gene distances then become important sources of information on the control, which different biological mechanisms exert on gene expression. RESULTS Here we study a set of categories, which has to our knowledge not been analyzed before. We distinguish between genes that do not participate in the transcriptional regulatory network (i.e. that are according to current knowledge not producing transcription factors and do not possess binding sites for transcription factors in their regulatory region), and genes that via transcription factors either are regulated by or regulate other genes. We find that the two types of genes ("isolated" and "regulatory" genes) show a clear statistical repulsion and have different ranges of correlations. In particular we find that isolated genes have a preference for shorter intergenic distances. CONCLUSIONS These findings support previous evidence from gene expression patterns for two distinct logical types of control, namely digital control (i.e. network-based control mediated by dedicated transcription factors) and analog control (i.e. control based on genome structure and mediated by neighborhood on the genome).
Collapse
Affiliation(s)
- Nikolaus Sonnenschein
- School of Engineering and Science, Jacobs University Bremen, Campus Ring 1, 28759 Bremen, Germany.
| | | | | | | |
Collapse
|
152
|
Jackson AP, Thomas GH, Parkhill J, Thomson NR. Evolutionary diversification of an ancient gene family (rhs) through C-terminal displacement. BMC Genomics 2009; 10:584. [PMID: 19968874 PMCID: PMC2935791 DOI: 10.1186/1471-2164-10-584] [Citation(s) in RCA: 82] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2009] [Accepted: 12/07/2009] [Indexed: 12/29/2022] Open
Abstract
Background Rhs genes are prominent features of bacterial genomes that have previously been implicated in genomic rearrangements in E. coli. By comparing rhs repertoires across the Enterobacteriaceae, this study provides a robust explanation of rhs diversification and evolution, and a mechanistic model of how rhs diversity is gained and lost. Results Rhs genes are ubiquitous and comprise six structurally distinct lineages within the Enterobacteriaceae. There is considerable intergenomic variation in rhs repertoire; for instance, in Salmonella enterica, rhs are restricted to mobile elements, while in Escherichia coli one rhs lineage has diversified through transposition as older lineages have been deleted. Overall, comparative genomics reveals frequent, independent gene gains and losses, as well as occasional lateral gene transfer, in different genera. Furthermore, we demonstrate that Rhs 'core' domains and variable C-termini are evolutionarily decoupled, and propose that rhs diversity is driven by homologous recombination with circular intermediates. Existing C-termini are displaced by laterally acquired alternatives, creating long arrays of dissociated 'tips' that characterize the appearance of rhs loci. Conclusion Rhs repertoires are highly dynamic among Enterobacterial genomes, due to repeated gene gains and losses. In contrast, the primary structures of Rhs genes are evolutionarily conserved, indicating that rhs sequence diversity is driven, not by rapid mutation, but by the relatively slow evolution of novel core/tip combinations. Hence, we predict that a large pool of dissociated rhs C-terminal tips exists episomally and these are potentially transmitted across taxonomic boundaries.
Collapse
Affiliation(s)
- Andrew P Jackson
- The Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.
| | | | | | | |
Collapse
|
153
|
Draft genome sequences of Yersinia pestis isolates from natural foci of endemic plague in China. J Bacteriol 2009; 191:7628-9. [PMID: 19820101 DOI: 10.1128/jb.01227-09] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
To gain insights into the evolutionary origin, emergence, and pathogenicity of the etiologic agent of plague, we have sequenced the genomes of four Yersinia pestis strains isolated from the zoonotic rodent reservoir in foci of endemic plague in China. These resources enable in-depth studies of Y. pestis sequence variations and detailed whole-genome comparisons of very closely related genomes from the supposed site of the origin and the emergence of global pandemics of plague.
Collapse
|
154
|
Ragan MA, Beiko RG. Lateral genetic transfer: open issues. Philos Trans R Soc Lond B Biol Sci 2009; 364:2241-51. [PMID: 19571244 DOI: 10.1098/rstb.2009.0031] [Citation(s) in RCA: 73] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
Lateral genetic transfer (LGT) is an important adaptive force in evolution, contributing to metabolic, physiological and ecological innovation in most prokaryotes and some eukaryotes. Genomic sequences and other data have begun to illuminate the processes, mechanisms, quantitative extent and impact of LGT in diverse organisms, populations, taxa and environments; deep questions are being posed, and the provisional answers sometimes challenge existing paradigms. At the same time, there is an enhanced appreciation of the imperfections, biases and blind spots in the data and in analytical approaches. Here we identify and consider significant open questions concerning the role of LGT in genome evolution.
Collapse
Affiliation(s)
- Mark A Ragan
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland, Australia.
| | | |
Collapse
|
155
|
Abstract
The analysis of genome rearrangements provides a global view on the evolution of a set of related species. We present a new algorithm called EMRAE (efficient method to recover ancestral events) to reliably predict a wide-range of rearrangement events in the ancestry of a group of species. Using simulated data sets, we show that EMRAE achieves comparable sensitivity but significantly higher specificity when predicting evolutionary events relative to other tools to study genome rearrangements. We apply our approach to the synteny blocks of six mammalian genomes (human, chimpanzee, rhesus macaque, mouse, rat, and dog) and predict 1109 rearrangement events, including 831 inversions, 15 translocations, 237 transpositions, and 26 fusions/fissions. Studying the sequence features at the breakpoints of the primate rearrangement events, we demonstrate that they are not only enriched in segmental duplications (SDs), but that the enrichment of matching pairs of SDs is even stronger within the pairs of breakpoints associated with recovered events. We also show that pairs of L1 repeats are frequently associated with ancestral inversions across all studied lineages. Together, this substantiates the model that regions of high sequence identity have been associated with rearrangement events throughout the mammalian phylogeny.
Collapse
Affiliation(s)
- Hao Zhao
- Computational and Mathematical Biology, Genome Institute of Singapore, Singapore
| | | |
Collapse
|
156
|
Di Rienzi SC, Collingwood D, Raghuraman MK, Brewer BJ. Fragile genomic sites are associated with origins of replication. Genome Biol Evol 2009; 1:350-63. [PMID: 20333204 PMCID: PMC2817429 DOI: 10.1093/gbe/evp034] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/28/2009] [Indexed: 01/03/2023] Open
Abstract
Genome rearrangements are mediators of evolution and disease. Such rearrangements are frequently bounded by transfer RNAs (tRNAs), transposable elements, and other repeated elements, suggesting a functional role for these elements in creating or repairing breakpoints. Though not well explored, there is evidence that origins of replication also colocalize with breakpoints. To investigate a potential correlation between breakpoints and origins, we analyzed evolutionary breakpoints defined between Saccharomyces cerevisiae and Kluyveromyces waltii and S. cerevisiae and a hypothetical ancestor of both yeasts, as well as breakpoints reported in the experimental literature. We find that origins correlate strongly with both evolutionary breakpoints and those described in the literature. Specifically, we find that origins firing earlier in S phase are more strongly correlated with breakpoints than are later-firing origins. Despite origins being located in genomic regions also bearing tRNAs and Ty elements, the correlation we observe between origins and breakpoints appears to be independent of these genomic features. This study lays the groundwork for understanding the mechanisms by which origins of replication may impact genome architecture and disease.
Collapse
|
157
|
Maruyama F, Kobata M, Kurokawa K, Nishida K, Sakurai A, Nakano K, Nomura R, Kawabata S, Ooshima T, Nakai K, Hattori M, Hamada S, Nakagawa I. Comparative genomic analyses of Streptococcus mutans provide insights into chromosomal shuffling and species-specific content. BMC Genomics 2009; 10:358. [PMID: 19656368 PMCID: PMC2907686 DOI: 10.1186/1471-2164-10-358] [Citation(s) in RCA: 65] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2008] [Accepted: 08/05/2009] [Indexed: 11/20/2022] Open
Abstract
Background Streptococcus mutans is the major pathogen of dental caries, and it occasionally causes infective endocarditis. While the pathogenicity of this species is distinct from other human pathogenic streptococci, the species-specific evolution of the genus Streptococcus and its genomic diversity are poorly understood. Results We have sequenced the complete genome of S. mutans serotype c strain NN2025, and compared it with the genome of UA159. The NN2025 genome is composed of 2,013,587 bp, and the two strains show highly conserved core-genome. However, comparison of the two S. mutans strains showed a large genomic inversion across the replication axis producing an X-shaped symmetrical DNA dot plot. This phenomenon was also observed between other streptococcal species, indicating that streptococcal genetic rearrangements across the replication axis play an important role in Streptococcus genetic shuffling. We further confirmed the genomic diversity among 95 clinical isolates using long-PCR analysis. Genomic diversity in S. mutans appears to occur frequently between insertion sequence (IS) elements and transposons, and these diversity regions consist of restriction/modification systems, antimicrobial peptide synthesis systems, and transporters. S. mutans may preferentially reject the phage infection by clustered regularly interspaced short palindromic repeats (CRISPRs). In particular, the CRISPR-2 region, which is highly divergent between strains, in NN2025 has long repeated spacer sequences corresponding to the streptococcal phage genome. Conclusion These observations suggest that S. mutans strains evolve through chromosomal shuffling and that phage infection is not needed for gene acquisition. In contrast, S. pyogenes tolerates phage infection for acquisition of virulence determinants for niche adaptation.
Collapse
Affiliation(s)
- Fumito Maruyama
- Division of Bacteriology, Department of Infectious Diseases Control, International Research Center for Infectious Diseases, The Institute of Medical Science, The University of Tokyo, Tokyo 108-8639, Japan.
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
158
|
Miklós I, Darling AE. Efficient sampling of parsimonious inversion histories with application to genome rearrangement in Yersinia. Genome Biol Evol 2009; 1:153-64. [PMID: 20333186 PMCID: PMC2817410 DOI: 10.1093/gbe/evp015] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/14/2009] [Indexed: 11/13/2022] Open
Abstract
Inversions are among the most common mutations acting on the order and orientation of genes in a genome, and polynomial-time algorithms exist to obtain a minimal length series of inversions that transform one genome arrangement to another. However, the minimum length series of inversions (the optimal sorting path) is often not unique as many such optimal sorting paths exist. If we assume that all optimal sorting paths are equally likely, then statistical inference on genome arrangement history must account for all such sorting paths and not just a single estimate. No deterministic polynomial algorithm is known to count the number of optimal sorting paths nor sample from the uniform distribution of optimal sorting paths. Here, we propose a stochastic method that uniformly samples the set of all optimal sorting paths. Our method uses a novel formulation of parallel Markov chain Monte Carlo. In practice, our method can quickly estimate the total number of optimal sorting paths. We introduce a variant of our approach in which short inversions are modeled to be more likely, and we show how the method can be used to estimate the distribution of inversion lengths and breakpoint usage in pathogenic Yersinia pestis. The proposed method has been implemented in a program called "MC4Inversion." We draw comparison of MC4Inversion to the sampler implemented in BADGER and a previously described importance sampling (IS) technique. We find that on high-divergence data sets, MC4Inversion finds more optimal sorting paths per second than BADGER and the IS technique and simultaneously avoids bias inherent in the IS technique.
Collapse
Affiliation(s)
- István Miklós
- Bioinformatics group, Alfréd Rényi Institute of Mathematics, Hungarian Academy of Sciences, Budapest, Hungary.
| | | |
Collapse
|
159
|
Abstract
Neisseria meningitidis usually lives as a commensal bacterium in the upper airways of humans. However, occasionally some strains can also cause life-threatening diseases such as sepsis and bacterial meningitis. Comparative genomics demonstrates that only very subtle genetic differences between carriage and disease strains might be responsible for the observed virulence differences and that N. meningitidis is, evolutionarily, a very recent species. Comparative genome sequencing also revealed a panoply of genetic mechanisms underlying its enormous genomic flexibility which also might affect the virulence of particular strains. From these studies, N. meningitidis emerges as a paradigm for organisms that use genome variability as an adaptation to changing and thus challenging environments.
Collapse
|
160
|
Treangen TJ, Abraham AL, Touchon M, Rocha EPC. Genesis, effects and fates of repeats in prokaryotic genomes. FEMS Microbiol Rev 2009; 33:539-71. [PMID: 19396957 DOI: 10.1111/j.1574-6976.2009.00169.x] [Citation(s) in RCA: 113] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
DNA repeats are causes and consequences of genome plasticity. Repeats are created by intrachromosomal recombination or horizontal transfer. They are targeted by recombination processes leading to amplifications, deletions and rearrangements of genetic material. The identification and analysis of repeats in nearly 700 genomes of bacteria and archaea is facilitated by the existence of sequence data and adequate bioinformatic tools. These have revealed the immense diversity of repeats in genomes, from those created by selfish elements to the ones used for protection against selfish elements, from those arising from transient gene amplifications to the ones leading to stable duplications. Experimental works have shown that some repeats do not carry any adaptive value, while others allow functional diversification and increased expression. All repeats carry some potential to disorganize and destabilize genomes. Because recombination and selection for repeats vary between genomes, the number and types of repeats are also quite diverse and in line with ecological variables, such as host-dependent associations or population sizes, and with genetic variables, such as the recombination machinery. From an evolutionary point of view, repeats represent both opportunities and problems. We describe how repeats are created and how they can be found in genomes. We then focus on the functional and genomic consequences of repeats that dictate their fate.
Collapse
|
161
|
Abstract
SUMMARY Genome evolution underpins all of biology, yet its principles can be difficult to communicate to the non-specialist. To facilitate broader understanding of genome evolution, we have designed an interactive 3D environment that enables visualization of diverse genome evolution processes. The system can intuitively and interactively animate mutation histories involving genome rearrangement, point mutation, recombination, insertion and deletion. Multiple organisms related by a phylogeny can be visualized simultaneously. As methods to infer evolutionary histories of genomes become increasingly complex, visualization of the evolutionary process will not only be useful for communication, but will also serve as an exploratory tool for discovering new patterns of genome evolution. AVAILABILITY The software is licensed under the GNU GPL and available for download from http://seevolution.org.
Collapse
Affiliation(s)
- Andrés Esteban-Marcos
- Institute for Molecular Bioscience, The University of Queensland, St Lucia, QLD, Australia
| | | | | |
Collapse
|
162
|
Tannier E. Yeast Ancestral Genome Reconstructions: The Possibilities of Computational Methods. COMPARATIVE GENOMICS 2009. [DOI: 10.1007/978-3-642-04744-2_1] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
|
163
|
Novichkov PS, Ratnere I, Wolf YI, Koonin EV, Dubchak I. ATGC: a database of orthologous genes from closely related prokaryotic genomes and a research platform for microevolution of prokaryotes. Nucleic Acids Res 2008; 37:D448-54. [PMID: 18845571 PMCID: PMC2686458 DOI: 10.1093/nar/gkn684] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open
Abstract
The database of Alignable Tight Genomic Clusters (ATGCs) consists of closely related genomes of archaea and bacteria, and is a resource for research into prokaryotic microevolution. Construction of a data set with appropriate characteristics is a major hurdle for this type of studies. With the current rate of genome sequencing, it is difficult to follow the progress of the field and to determine which of the available genome sets meet the requirements of a given research project, in particular, with respect to the minimum and maximum levels of similarity between the included genomes. Additionally, extraction of specific content, such as genomic alignments or families of orthologs, from a selected set of genomes is a complicated and time-consuming process. The database addresses these problems by providing an intuitive and efficient web interface to browse precomputed ATGCs, select appropriate ones and access ATGC-derived data such as multiple alignments of orthologous proteins, matrices of pairwise intergenomic distances based on genome-wide analysis of synonymous and nonsynonymous substitution rates and others. The ATGC database will be regularly updated following new releases of the NCBI RefSeq. The database is hosted by the Genomics Division at Lawrence Berkeley National laboratory and is publicly available at http://atgc.lbl.gov
Collapse
Affiliation(s)
- Pavel S Novichkov
- Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA.
| | | | | | | | | |
Collapse
|
164
|
Rocha EPC. Evolutionary patterns in prokaryotic genomes. Curr Opin Microbiol 2008; 11:454-60. [DOI: 10.1016/j.mib.2008.09.007] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2008] [Revised: 09/08/2008] [Accepted: 09/09/2008] [Indexed: 10/21/2022]
|