Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Karlin S, Campbell AM, Mrázek J. Comparative DNA analysis across diverse genomes. Annu Rev Genet 1999;32:185-225. [PMID: 9928479 DOI: 10.1146/annurev.genet.32.1.185] [Citation(s) in RCA: 238] [Impact Index Per Article: 9.2] [Indexed: 11/09/2022]

Number

Cited by Other Article(s)

101

Bohlin J, Skjerve E, Ussery DW. Analysis of genomic signatures in prokaryotes using multinomial regression and hierarchical clustering. BMC Genomics 2009;10:487. [PMID: 19845945 PMCID: PMC2770534 DOI: 10.1186/1471-2164-10-487] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Received: 06/16/2009] [Accepted: 10/21/2009] [Indexed: 11/26/2022]

102

Yap VB, Lindsay H, Easteal S, Huttley G. Estimates of the effect of natural selection on protein-coding content. Mol Biol Evol 2009;27:726-34. [PMID: 19815689 PMCID: PMC2822286 DOI: 10.1093/molbev/msp232] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.4] [Indexed: 01/19/2023]

103

A web server for interactive and zoomable Chaos Game Representation images. Source Code Biol Med 2009;4:6. [PMID: 19761591 PMCID: PMC2753581 DOI: 10.1186/1751-0473-4-6] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Received: 08/05/2009] [Accepted: 09/17/2009] [Indexed: 11/10/2022]

104

The genome of Nectria haematococca: contribution of supernumerary chromosomes to gene expansion. PLoS Genet 2009;5:e1000618. [PMID: 19714214 PMCID: PMC2725324 DOI: 10.1371/journal.pgen.1000618] [Citation(s) in RCA: 317] [Impact Index Per Article: 19.8] [Received: 04/20/2009] [Accepted: 07/27/2009] [Indexed: 11/19/2022]

Abstract

The ascomycetous fungus Nectria haematococca (asexual name Fusarium solani) is a member of a group of >50 species known as the "Fusarium solani species complex". Members of this complex have diverse biological properties including the ability to cause disease on >100 genera of plants and opportunistic infections in humans. The current research analyzed the most extensively studied member of this complex, N. haematococca mating population VI (MPVI). Several genes controlling the ability of individual isolates of this species to colonize specific habitats are located on supernumerary chromosomes. Optical mapping revealed that the sequenced isolate has 17 chromosomes ranging from 530 kb to 6.52 Mb and that the physical size of the genome, 54.43 Mb, and the number of predicted genes, 15,707, are among the largest reported for ascomycetes. Two classes of genes have contributed to gene expansion: specific genes that are not found in other fungi including its closest sequenced relative, Fusarium graminearum; and genes that commonly occur as single copies in other fungi but are present as multiple copies in N. haematococca MPVI. Some of these additional genes appear to have resulted from gene duplication events, while others may have been acquired through horizontal gene transfer. The supernumerary nature of three chromosomes, 14, 15, and 17, was confirmed by their absence in pulsed field gel electrophoresis experiments of some isolates and by demonstrating that these isolates lacked chromosome-specific sequences found on the ends of these chromosomes. These supernumerary chromosomes contain more repeat sequences, are enriched in unique and duplicated genes, and have a lower G+C content in comparison to the other chromosomes. Although the origin(s) of the extra genes and the supernumerary chromosomes is not known, the gene expansion and its large genome size are consistent with this species' diverse range of habitats. Furthermore, the presence of unique genes on supernumerary chromosomes might account for individual isolates having different environmental niches.

105

Lichtenberg J, Jacox E, Welch JD, Kurz K, Liang X, Yang MQ, Drews F, Ecker K, Lee SS, Elnitski L, Welch LR. Word-based characterization of promoters involved in human DNA repair pathways. BMC Genomics 2009;10 Suppl 1:S18. [PMID: 19594877 PMCID: PMC2709261 DOI: 10.1186/1471-2164-10-s1-s18] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Indexed: 11/10/2022]

106

Suzuki H, Saito R, Tomita M. Measure of synonymous codon usage diversity among genes in bacteria. BMC Bioinformatics 2009;10:167. [PMID: 19480720 PMCID: PMC2697163 DOI: 10.1186/1471-2105-10-167] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Received: 10/27/2008] [Accepted: 06/01/2009] [Indexed: 11/10/2022]

107

Tzahor S, Man-Aharonovich D, Kirkup BC, Yogev T, Berman-Frank I, Polz MF, Béjà O, Mandel-Gutfreund Y. A supervised learning approach for taxonomic classification of core-photosystem-II genes and transcripts in the marine environment. BMC Genomics 2009;10:229. [PMID: 19445709 PMCID: PMC2696472 DOI: 10.1186/1471-2164-10-229] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Received: 09/18/2008] [Accepted: 05/16/2009] [Indexed: 11/10/2022]

108

Tzahor S, Man-Aharonovich D, Kirkup BC, Yogev T, Berman-Frank I, Polz MF, Béjà O, Mandel-Gutfreund Y. A supervised learning approach for taxonomic classification of core-photosystem-II genes and transcripts in the marine environment. BMC Genomics 2009. [PMID: 19445709 DOI: 10.1186/1471-2164-10-229] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Indexed: 11/10/2022]

109

Guo WJ, Ling J, Li P. Consensus features of microsatellite distribution: Microsatellite contents are universally correlated with recombination rates and are preferentially depressed by centromeres in multicellular eukaryotic genomes. Genomics 2009;93:323-31. [DOI: 10.1016/j.ygeno.2008.12.009] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Received: 08/23/2008] [Revised: 12/14/2008] [Accepted: 12/16/2008] [Indexed: 10/21/2022]

110

Ilatovskiy A, Petukhov M. Genome-Wide Search for Local DNA Segments with Anomalous GC-Content. J Comput Biol 2009;16:555-64. [DOI: 10.1089/cmb.2008.0159] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Indexed: 11/13/2022]

111

Willner D, Thurber RV, Rohwer F. Metagenomic signatures of 86 microbial and viral metagenomes. Environ Microbiol 2009;11:1752-66. [PMID: 19302541 DOI: 10.1111/j.1462-2920.2009.01901.x] [Citation(s) in RCA: 110] [Impact Index Per Article: 6.9] [Indexed: 12/01/2022]

Abstract

Previous studies have shown that dinucleotide abundances capture the majority of variation in genome signatures and are useful for quantifying lateral gene transfer and building molecular phylogenies. Metagenomes contain a mixture of individual genomes, and might be expected to lack compositional signatures. In many metagenomic data sets the majority of sequences have no significant similarities to known sequences and are effectively excluded from subsequent analyses. To circumvent this limitation, di-, tri- and tetranucleotide abundances of 86 microbial and viral metagenomes consisting of short pyrosequencing reads were analysed to provide a method which includes all sequences that can be used in combination with other analysis to increase our knowledge about microbial and viral communities. Both principal component analysis and hierarchical clustering showed definitive groupings of metagenomes drawn from similar environments. Together these analyses showed that dinucleotide composition, as opposed to tri- and tetranucleotides, defines a metagenomic signature which can explain up to 80% of the variance between biomes, which is comparable to that obtained by functional genomics. Metagenomes with anomalous content were also identified using dinucleotide abundances. Subsequent analyses determined that these metagenomes were contaminated with exogenous DNA, suggesting that this approach is a useful metric for quality control. The predictive strength of the dinucleotide composition also opens the possibility of assigning ecological classifications to unknown fragments. Environmental selection may be responsible for this dinucleotide signature through direct selection of specific compositional signals; however, simulations suggest that the environment may select indirectly by promoting the increased abundance of a few dominant taxa.
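As an illustration of the dinucleotide abundances discussed in this abstract, the following is a minimal sketch, not the authors' pipeline. It assumes the common odds-ratio normalization rho(xy) = f(xy) / (f(x) f(y)) computed on a single strand; the function names and the mean-absolute-difference distance are hypothetical choices made for this example.

```python
from collections import Counter
from itertools import product

BASES = "ACGT"

def dinucleotide_odds_ratios(seq):
    """Return rho[xy] = f(xy) / (f(x) * f(y)) for all 16 dinucleotides.

    Assumption: single-strand counts; characters other than A, C, G, T
    (for example N) are skipped.
    """
    seq = seq.upper()
    mono = Counter(b for b in seq if b in BASES)
    di = Counter(
        seq[i:i + 2]
        for i in range(len(seq) - 1)
        if seq[i] in BASES and seq[i + 1] in BASES
    )
    n_mono = sum(mono.values())
    n_di = sum(di.values())
    rho = {}
    for x, y in product(BASES, repeat=2):
        expected = (mono[x] / n_mono) * (mono[y] / n_mono) if n_mono else 0.0
        observed = di[x + y] / n_di if n_di else 0.0
        # Undefined when a base is absent from the sequence.
        rho[x + y] = observed / expected if expected > 0 else float("nan")
    return rho

def signature_distance(seq_a, seq_b):
    """Mean absolute difference between two odds-ratio profiles (hypothetical metric)."""
    ra = dinucleotide_odds_ratios(seq_a)
    rb = dinucleotide_odds_ratios(seq_b)
    # NaN is the only value not equal to itself, so this drops undefined entries.
    keys = [k for k in ra if ra[k] == ra[k] and rb[k] == rb[k]]
    if not keys:
        return float("nan")
    return sum(abs(ra[k] - rb[k]) for k in keys) / len(keys)
```

Profiles computed this way for many samples could then be passed to a clustering or principal component routine; that step is omitted here.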

112

Mrázek J. Phylogenetic Signals in DNA Composition: Limitations and Prospects. Mol Biol Evol 2009;26:1163-9. [DOI: 10.1093/molbev/msp032] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.9] [Indexed: 01/11/2023]

113

Takahashi M, Kryukov K, Saitou N. Estimation of bacterial species phylogeny through oligonucleotide frequency distances. Genomics 2009;93:525-33. [PMID: 19442633 DOI: 10.1016/j.ygeno.2009.01.009] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.6] [Received: 10/04/2008] [Revised: 01/30/2009] [Accepted: 01/30/2009] [Indexed: 10/21/2022]

114

Ahmed S, Saito A, Suzuki M, Nemoto N, Nishigaki K. Host-parasite relations of bacteria and phages can be unveiled by oligostickiness, a measure of relaxed sequence similarity. Bioinformatics 2009;25:563-70. [PMID: 19126576 DOI: 10.1093/bioinformatics/btp003] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Indexed: 11/14/2022]

115

Huttley G. Do genomic datasets resolve the correct relationship among the placental, marsupial and monotreme lineages? Aust J Zool 2009. [DOI: 10.1071/zo09049] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Indexed: 11/23/2022]

116

Kavanaugh LA, Ohler U. Predicting Non-coding RNA Transcripts. Bioinformatics 2009. [DOI: 10.1007/978-0-387-92738-1_4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Indexed: 01/07/2023]

117

van Passel MWJ, de Graaff LH. Mononucleotide repeats are asymmetrically distributed in fungal genes. BMC Genomics 2008;9:596. [PMID: 19077233 PMCID: PMC2621210 DOI: 10.1186/1471-2164-9-596] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Received: 09/24/2008] [Accepted: 12/11/2008] [Indexed: 11/10/2022]

118

Wang A, Ren L, Abenes G, Hai R. Genome sequence divergences and functional variations in human cytomegalovirus strains. FEMS Immunol Med Microbiol 2008;55:23-33. [PMID: 19076227 DOI: 10.1111/j.1574-695x.2008.00489.x] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Indexed: 12/01/2022]

119

Suzuki H, Sota M, Brown CJ, Top EM. Using Mahalanobis distance to compare genomic signatures between bacterial plasmids and chromosomes. Nucleic Acids Res 2008;36:e147. [PMID: 18953039 PMCID: PMC2602791 DOI: 10.1093/nar/gkn753] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.3] [Indexed: 11/24/2022]

120

Stabler RA, Dawson LF, Oyston PCF, Titball RW, Wade J, Hinds J, Witney AA, Wren BW. Development and application of the active surveillance of pathogens microarray to monitor bacterial gene flux. BMC Microbiol 2008;8:177. [PMID: 18844996 PMCID: PMC2607285 DOI: 10.1186/1471-2180-8-177] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Received: 06/25/2008] [Accepted: 10/09/2008] [Indexed: 11/23/2022]

121

Cutler RW, Chantawannakul P. Synonymous codon usage bias dependent on local nucleotide context in the class Deinococci. J Mol Evol 2008;67:301-14. [PMID: 18696025 DOI: 10.1007/s00239-008-9152-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Received: 06/15/2008] [Accepted: 07/14/2008] [Indexed: 11/25/2022]

122

Biro JC. Does codon bias have an evolutionary origin? Theor Biol Med Model 2008;5:16. [PMID: 18667081 PMCID: PMC2519059 DOI: 10.1186/1742-4682-5-16] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Received: 07/07/2008] [Accepted: 07/30/2008] [Indexed: 11/10/2022]

123

Fang X, Xu H, Zhang C, Chen H, Hu X, Gao X, Gu C, Yue W. Polymorphism in BMP4 gene and its association with growth traits in goats. Mol Biol Rep 2008;36:1339-44. [DOI: 10.1007/s11033-008-9317-1] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Received: 04/11/2008] [Accepted: 07/03/2008] [Indexed: 02/06/2023]

124

Genomic mid-range inhomogeneity correlates with an abundance of RNA secondary structures. BMC Genomics 2008;9:284. [PMID: 18549495 PMCID: PMC2442090 DOI: 10.1186/1471-2164-9-284] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Received: 02/21/2008] [Accepted: 06/12/2008] [Indexed: 11/10/2022]

125

Ishoey T, Woyke T, Stepanauskas R, Novotny M, Lasken RS. Genomic sequencing of single microbial cells from environmental samples. Curr Opin Microbiol 2008;11:198-204. [PMID: 18550420 DOI: 10.1016/j.mib.2008.05.006] [Citation(s) in RCA: 106] [Impact Index Per Article: 6.2] [Received: 03/13/2008] [Revised: 04/30/2008] [Accepted: 05/07/2008] [Indexed: 10/22/2022]

Abstract

Recently developed techniques allow genomic DNA sequencing from single microbial cells [Lasken RS: Single-cell genomic sequencing using multiple displacement amplification. Curr Opin Microbiol 2007, 10:510-516]. Here, we focus on research strategies for putting these methods into practice in the laboratory setting. An immediate consequence of single-cell sequencing is that it provides an alternative to culturing organisms as a prerequisite for genomic sequencing. The microgram amounts of DNA required as template are amplified from a single bacterium by a method called multiple displacement amplification (MDA) avoiding the need to grow cells. The ability to sequence DNA from individual cells will likely have an immense impact on microbiology considering the vast numbers of novel organisms, which have been inaccessible unless culture-independent methods could be used. However, special approaches have been necessary to work with amplified DNA. MDA may not recover the entire genome from the single copy present in most bacteria. Also, some sequence rearrangements can occur during the DNA amplification reaction. Over the past two years many research groups have begun to use MDA, and some practical approaches to single-cell sequencing have been developed. We review the consensus that is emerging on optimum methods, reliability of amplified template, and the proper interpretation of 'composite' genomes which result from the necessity of combining data from several single-cell MDA reactions in order to complete the assembly. Preferred laboratory methods are considered on the basis of experience at several large sequencing centers where >70% of genomes are now often recovered from single cells. Methods are reviewed for preparation of bacterial fractions from environmental samples, single-cell isolation, DNA amplification by MDA, and DNA sequencing.

126

The mosaic genome of Anaeromyxobacter dehalogenans strain 2CP-C suggests an aerobic common ancestor to the delta-proteobacteria. PLoS One 2008;3:e2103. [PMID: 18461135 PMCID: PMC2330069 DOI: 10.1371/journal.pone.0002103] [Citation(s) in RCA: 109] [Impact Index Per Article: 6.4] [Received: 01/16/2008] [Accepted: 03/19/2008] [Indexed: 11/29/2022]

Abstract

Anaeromyxobacter dehalogenans strain 2CP-C is a versaphilic delta-Proteobacterium distributed throughout many diverse soil and sediment environments. 16S rRNA gene phylogenetic analysis groups A. dehalogenans together with the myxobacteria, which have distinguishing characteristics including strictly aerobic metabolism, sporulation, fruiting body formation, and surface motility. Analysis of the 5.01 Mb strain 2CP-C genome substantiated that this organism is a myxobacterium but shares genotypic traits with the anaerobic majority of the delta-Proteobacteria (i.e., the Desulfuromonadales). Reflective of its respiratory versatility, strain 2CP-C possesses 68 genes coding for putative c-type cytochromes, including one gene with 40 heme binding motifs. Consistent with its relatedness to the myxobacteria, surface motility was observed in strain 2CP-C and multiple types of motility genes are present, including 28 genes for gliding, adventurous (A-) motility and 17 genes for type IV pilus-based motility (i.e., social (S-) motility) that all have homologs in Myxococcus xanthus. Although A. dehalogenans shares many metabolic traits with the anaerobic majority of the delta-Proteobacteria, strain 2CP-C grows under microaerophilic conditions and possesses detoxification systems for reactive oxygen species. Accordingly, two gene clusters coding for NADH dehydrogenase subunits and two cytochrome oxidase gene clusters in strain 2CP-C are similar to those in M. xanthus. Remarkably, strain 2CP-C possesses a third NADH dehydrogenase gene cluster and a cytochrome cbb₃ oxidase gene cluster, apparently acquired through ancient horizontal gene transfer from a strictly anaerobic green sulfur bacterium. The mosaic nature of the A. dehalogenans strain 2CP-C genome suggests that the metabolically versatile, anaerobic members of the delta-Proteobacteria may have descended from aerobic ancestors with complex lifestyles.

127

Kuo CH, Kissinger JC. Consistent and contrasting properties of lineage-specific genes in the apicomplexan parasites Plasmodium and Theileria. BMC Evol Biol 2008;8:108. [PMID: 18405380 PMCID: PMC2330040 DOI: 10.1186/1471-2148-8-108] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.3] [Received: 11/02/2007] [Accepted: 04/11/2008] [Indexed: 11/10/2022]

Abstract

BACKGROUND

Lineage-specific genes, the genes that are restricted to a limited subset of related organisms, may be important in adaptation. In parasitic organisms, lineage-specific gene products are possible targets for vaccine development or therapeutics when these genes are absent from the host genome.

RESULTS

In this study, we utilized comparative approaches based on a phylogenetic framework to characterize lineage-specific genes in the parasitic protozoan phylum Apicomplexa. Genes from species in two major apicomplexan genera, Plasmodium and Theileria, were categorized into six levels of lineage specificity based on a nine-species phylogeny. In both genera, lineage-specific genes tend to have a higher level of sequence divergence among sister species. In addition, species-specific genes possess a strong codon usage bias compared to other genes in the genome. We found that a large number of genus- or species-specific genes are putative surface antigens that may be involved in host-parasite interactions. Interestingly, the two parasite lineages exhibit several notable differences. In Plasmodium, the (G + C) content at the third codon position increases with lineage specificity while Theileria shows the opposite trend. Surface antigens in Plasmodium are species-specific and mainly located in sub-telomeric regions. In contrast, surface antigens in Theileria are conserved at the genus level and distributed across the entire lengths of chromosomes.

CONCLUSION

Our results provide further support for the model that gene duplication followed by rapid divergence is a major mechanism for generating lineage-specific genes. The result that many lineage-specific genes are putative surface antigens supports the hypothesis that lineage-specific genes could be important in parasite adaptation. The contrasting properties between the lineage-specific genes in two major apicomplexan genera indicate that the mechanisms of generating lineage-specific genes and the subsequent evolutionary fates can differ between related parasite lineages. Future studies that focus on improving functional annotation of parasite genomes and collection of genetic variation data at within- and between-species levels will be important in facilitating our understanding of parasite adaptation and natural selection.

128

Larsson P, Hinas A, Ardell DH, Kirsebom LA, Virtanen A, Söderbom F. De novo search for non-coding RNA genes in the AT-rich genome of Dictyostelium discoideum: performance of Markov-dependent genome feature scoring. Genome Res 2008;18:888-99. [PMID: 18347326 DOI: 10.1101/gr.069104.107] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Indexed: 11/24/2022]

129

Chantawannakul P, Cutler RW. Convergent host-parasite codon usage between honeybee and bee associated viral genomes. J Invertebr Pathol 2008;98:206-10. [PMID: 18397791 DOI: 10.1016/j.jip.2008.02.016] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Received: 10/09/2007] [Revised: 02/25/2008] [Accepted: 02/27/2008] [Indexed: 10/22/2022]

130

Arnau V, Gallach M, Marín I. Fast comparison of DNA sequences by oligonucleotide profiling. BMC Res Notes 2008;1:5. [PMID: 18710530 PMCID: PMC2518268 DOI: 10.1186/1756-0500-1-5] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Received: 01/31/2008] [Accepted: 02/28/2008] [Indexed: 11/24/2022]

131

Evans KJ. Genomic DNA from animals shows contrasting strand bias in large and small subsequences. BMC Genomics 2008;9:43. [PMID: 18221531 PMCID: PMC2267173 DOI: 10.1186/1471-2164-9-43] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Received: 03/07/2007] [Accepted: 01/25/2008] [Indexed: 01/09/2023]

Abstract

Background

For eukaryotes, there is almost no strand bias with regard to base composition, with exceptions for origins of replication and transcription start sites and transcribed regions. This paper revisits the question for subsequences of DNA taken at random from the genome.

Results

For a typical mammal, for example mouse or human, there is a small strand bias throughout the genomic DNA: there is a correlation between (G - C) and (A - T) on the same strand (that is, between the difference in the number of guanine and cytosine bases and the difference in the number of adenine and thymine bases). For small subsequences – up to 1 kb – this correlation is weak but positive; but for large windows – around 50 kb to 2 Mb – the correlation is strong and negative. This effect is largely independent of GC%. Transcribed and untranscribed regions give similar correlations both for small and large subsequences, but there is a difference in these regions for intermediate sized subsequences. An analysis of the human genome showed that position within the isochore structure did not affect these correlations. An analysis of available genomes of different species shows that this contrast between large and small windows is a general feature of mammals and birds. Further down the evolutionary tree, other organisms show a similar but smaller effect. Except for the nematode, all the animals analysed showed at least a small effect.

Conclusion

The correlations on the large scale may be explained by DNA replication. Transcription may be a modifier of these effects but is not the fundamental cause. These results cast light on how DNA mutations affect the genome over evolutionary time. At least for vertebrates, there is a broad relationship between body temperature and the size of the correlation. The genome of mammals and birds has a structure marked by strand bias segments.
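The correlation between (G - C) and (A - T) described in this abstract can be sketched as follows. This is only an illustrative reading, assuming consecutive non-overlapping windows and a Pearson correlation across windows; the abstract does not state the exact windowing or statistic, and the function names are hypothetical.

```python
import math

def window_skews(seq, window):
    """Yield (G - C, A - T) base-count differences for consecutive,
    non-overlapping windows of the given size (assumed windowing scheme)."""
    seq = seq.upper()
    for start in range(0, len(seq) - window + 1, window):
        chunk = seq[start:start + window]
        yield (chunk.count("G") - chunk.count("C"),
               chunk.count("A") - chunk.count("T"))

def pearson(xs, ys):
    """Pearson correlation coefficient; NaN when it is undefined."""
    n = len(xs)
    if n < 2:
        return float("nan")
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        return float("nan")
    return sxy / math.sqrt(sxx * syy)

def strand_bias_correlation(seq, window):
    """Correlate (G - C) with (A - T) across windows of one strand."""
    pairs = list(window_skews(seq, window))
    gc_diff = [p[0] for p in pairs]
    at_diff = [p[1] for p in pairs]
    return pearson(gc_diff, at_diff)
```

Calling `strand_bias_correlation` on the same sequence with a small window (for example 1,000 bases) and a large one (for example 100,000 bases) would give the two scales that the abstract contrasts.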

132

Evans KJ. Strand bias structure in mouse DNA gives a glimpse of how chromatin structure affects gene expression. BMC Genomics 2008;9:16. [PMID: 18194530 PMCID: PMC2266913 DOI: 10.1186/1471-2164-9-16] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Received: 08/16/2007] [Accepted: 01/14/2008] [Indexed: 12/20/2022]

133

Demirev PA, Fenselau C. Mass spectrometry for rapid characterization of microorganisms. Annu Rev Anal Chem (Palo Alto Calif) 2008;1:71-93. [PMID: 20636075 DOI: 10.1146/annurev.anchem.1.031207.112838] [Citation(s) in RCA: 96] [Impact Index Per Article: 5.6] [Indexed: 05/25/2023]

134

Demongeot J, Moreira A. A possible circular RNA at the origin of life. J Theor Biol 2007;249:314-24. [PMID: 17825325 DOI: 10.1016/j.jtbi.2007.07.010] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.7] [Received: 03/19/2007] [Revised: 07/04/2007] [Accepted: 07/05/2007] [Indexed: 11/24/2022]

Abstract

The increasing volume of sequenced genomes and the recent techniques for performing in vitro molecular evolution have rekindled interest in questions on the origin of life. Nevertheless, a gap continues to exist between the research on prebiotic chemistry and molecule generation, on one hand, and the study of molecular fossils preserved in genomes, on the other. Here we attempt to fill this gap by using some assumptions about the prebiotic scenario (including a strong stereochemical basis for the genetic code) to determine the RNA sequences more likely to appear and subsist. A set of minimal RNA rings is exhaustively determined; a subset of them is then selected through stability arguments, and a particular ring ("AL ring") is finally singled out as the most likely winner of this prebiotic game. The rings happen to have several structural and statistical properties of modern genes: a repeated AUG codon appears spontaneously (and is thus made available for becoming a start signal), the form AUG/STOP emerges, and frequency patterns resemble those of present genes. The whole set of rings was also compared to a database of tRNAs, considering the conserved positions (located in the free parts of the molecule, essentially the loops); the ring that most closely matched tRNA sequences (and matched, in fact, the consensus of tRNA at all the aligned positions) was AL, the same ring independently selected before. The unselected emergence of gene-like features through two simple selection steps and the close similarity between the finally selected ring and tRNA (including some remarkable features of the resulting alignment) suggest a possible link between the prebiotic world and the first biological molecules, which is amenable to experimental testing. Even if our scenario is partially wrong, the unlikely coincidences should provide useful hints for other efforts.

135

Shekar M, Karunasagar I, Karunasagar I. Abundance, composition and distribution of simple sequence repeats and dinucleotide compositional bias within WSSV genomes. J Genet 2007;86:69-73. [PMID: 17656852 DOI: 10.1007/s12041-007-0010-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Indexed: 11/30/2022]

136

Chen C, Chen CW. Quantitative analysis of mutation and selection pressures on base composition skews in bacterial chromosomes. BMC Genomics 2007;8:286. [PMID: 17711583 PMCID: PMC2031905 DOI: 10.1186/1471-2164-8-286] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Received: 04/24/2007] [Accepted: 08/21/2007] [Indexed: 11/24/2022]

Abstract

Background

Most bacterial chromosomes exhibit asymmetry of base composition with respect to leading vs. lagging strands (GC and AT skews). These skews reflect mainly those in protein coding sequences, which are driven by asymmetric mutation pressures during replication and transcription (notably asymmetric cytosine deamination) plus subsequent selection for preferred structures, signals, amino acids or codons. The transcription-associated effects but not the replication-associated effects contribute to the overall skews through the uneven distribution of the coding sequences on the leading and lagging strands.

Results

Analysis of 185 representative bacterial chromosomes showed diverse and characteristic patterns of skews among different clades. The base composition skews in the coding sequences were used to derive quantitatively the effect of replication-driven mutation plus subsequent selection ('replication-associated pressure', RAP), and the effect of transcription-driven mutation plus subsequent selection at translation level ('transcription-associated pressure', TAP). While different clades exhibit distinct patterns of RAP and TAP, RAP is absent or nearly absent in some bacteria, but TAP is present in all. The selection pressure at the translation level is evident in all bacteria based on the analysis of the skews at the three codon positions. The contribution of asymmetric cytosine deamination to TAP was found to be weak in most phyla; its contribution to RAP was strong in all the Proteobacteria but weak in most of the Firmicutes. This possibly reflects the differences in their chromosomal replication machineries. Strong negative correlations between TAP and G+C content and between TAP and chromosomal size were also revealed.

Conclusion

The study reveals the diverse mutation and selection forces associated with replication and transcription in various groups of bacteria that shape the distinct patterns of base composition skews in the chromosomes during evolution. Some closely related species with distinct base composition parameters are uncovered in this study, which also provides opportunities for comparative bioinformatic and genetic investigations to uncover the underlying principles of mutation and selection.
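
The GC and AT skews discussed in this abstract are conventionally defined as (G − C)/(G + C) and (A − T)/(A + T) on a single strand. The following is a minimal sketch of that conventional definition only; it does not reproduce the authors' derivation of RAP or TAP, and the function names `base_skews` and `windowed_gc_skew` and the window parameters are illustrative assumptions.

```python
from collections import Counter


def base_skews(seq: str) -> tuple[float, float]:
    """Return (GC skew, AT skew) for one DNA strand.

    GC skew = (G - C) / (G + C); AT skew = (A - T) / (A + T).
    A skew is reported as 0.0 when its denominator is zero.
    """
    counts = Counter(seq.upper())
    g, c, a, t = counts["G"], counts["C"], counts["A"], counts["T"]
    gc_skew = (g - c) / (g + c) if g + c else 0.0
    at_skew = (a - t) / (a + t) if a + t else 0.0
    return gc_skew, at_skew


def windowed_gc_skew(seq: str, window: int = 1000, step: int = 1000) -> list[float]:
    """GC skew in successive full-length windows along the sequence.

    The window and step sizes are arbitrary illustrative defaults.
    """
    return [
        base_skews(seq[i:i + window])[0]
        for i in range(0, len(seq) - window + 1, step)
    ]
```

Plotting the windowed values along a chromosome is a common way to visualize the change in skew sign between the leading and lagging strands.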

137

Bakkali M. Genome dynamics of short oligonucleotides: the example of bacterial DNA uptake enhancing sequences. PLoS One 2007;2:e741. [PMID: 17710141 PMCID: PMC1939737 DOI: 10.1371/journal.pone.0000741] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2007] [Accepted: 06/29/2007] [Indexed: 11/19/2022] Open

Abstract

Among the many bacteria naturally competent for transformation by DNA uptake (a phenomenon with significant clinical and financial implications), Pasteurellaceae and Neisseriaceae species preferentially take up DNA containing specific short sequences. The genomic overrepresentation of these DNA uptake enhancing sequences (DUES) causes preferential uptake of conspecific DNA, but the function(s) behind this overrepresentation and its evolution are still a matter for discovery. Here I analyze DUES genome dynamics and evolution and test whether the results extend to other selectively constrained oligonucleotides. I use statistical methods and computer simulations to examine DUES accumulation in Haemophilus influenzae and Neisseria gonorrhoeae genomes. I analyze DUES sequence and nucleotide frequencies, as well as those of all their mismatched forms, and prove the dependence of DUES genomic overrepresentation on their preferential uptake by quantifying and correlating both characteristics. I then argue that mutation, uptake bias, and weak selection against DUESs in less constrained parts of the genome are together sufficient to cause DUES accumulation in susceptible parts of the genome, with no need for another DUES function. The distribution of overrepresentation values across sequences with different mismatch loads compared to the DUES suggests a gradual yet nonlinear molecular drive of DNA sequences depending on their similarity to the DUES. Other genomically overrepresented sequences, both pro- and eukaryotic, show a similar distribution of frequencies, suggesting that the molecular drive reported above applies to other frequent oligonucleotides. Rare oligonucleotides, however, seem to be gradually drawn to genomic underrepresentation, suggesting a molecular drag. To my knowledge, this work provides the first clear evidence of the gradual evolution of selectively constrained oligonucleotides, including repeated, palindromic and protein/transcription factor-binding DNAs.
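
One simple way to quantify the genomic overrepresentation of a short oligonucleotide is the ratio of its observed count to the count expected if bases occurred independently at their overall frequencies. The sketch below illustrates that zero-order model only; it is an assumption made for illustration, not the statistical method or simulation used in the article, and the function name `overrepresentation` is hypothetical.

```python
from collections import Counter


def overrepresentation(seq: str, word: str) -> float:
    """Observed / expected count of `word` in `seq` on one strand.

    Expected count assumes independent bases drawn at the
    sequence's own base frequencies (a zero-order model).
    Overlapping occurrences are counted.
    """
    seq, word = seq.upper(), word.upper()
    k = len(word)
    positions = len(seq) - k + 1
    if k == 0 or positions <= 0:
        raise ValueError("word must be non-empty and no longer than seq")

    # Count every start position where the word matches.
    observed = sum(1 for i in range(positions) if seq[i:i + k] == word)

    # Expected count = number of positions * product of base frequencies.
    freqs = Counter(seq)
    expected = float(positions)
    for base in word:
        expected *= freqs[base] / len(seq)
    if expected == 0:
        raise ValueError("word contains a base absent from seq")
    return observed / expected
```

A ratio well above 1 indicates overrepresentation and a ratio well below 1 indicates underrepresentation under this simple model; higher-order models that condition on shorter oligonucleotide frequencies would give different expectations.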

138

Butler JE, He Q, Nevin KP, He Z, Zhou J, Lovley DR. Genomic and microarray analysis of aromatics degradation in Geobacter metallireducens and comparison to a Geobacter isolate from a contaminated field site. BMC Genomics 2007;8:180. [PMID: 17578578 PMCID: PMC1924859 DOI: 10.1186/1471-2164-8-180] [Citation(s) in RCA: 77] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2006] [Accepted: 06/19/2007] [Indexed: 12/03/2022] Open

Abstract

Background

Groundwater and subsurface environments contaminated with aromatic compounds can be remediated in situ by Geobacter species that couple oxidation of these compounds to reduction of Fe(III)-oxides. Geobacter metallireducens metabolizes many aromatic compounds, but the enzymes involved are not well known.

Results

The complete G. metallireducens genome contained a 300 kb island predicted to encode enzymes for the degradation of phenol, p-cresol, 4-hydroxybenzaldehyde, 4-hydroxybenzoate, benzyl alcohol, benzaldehyde, and benzoate. Toluene degradation genes were encoded in a separate region. None of these genes was found in closely related species that cannot degrade aromatic compounds. Abundant transposons and phage-like genes in the island suggest mobility, but nucleotide composition and lack of synteny with other species do not suggest a recent transfer. The inferred degradation pathways are similar to those in species that anaerobically oxidize aromatic compounds with nitrate as an electron acceptor. In these pathways the aromatic compounds are converted to benzoyl-CoA and then to 3-hydroxypimelyl-CoA. However, in G. metallireducens there were no genes for the energetically-expensive dearomatizing enzyme. Whole-genome changes in transcript levels were identified in cells oxidizing benzoate. These supported the predicted pathway, identified induced fatty-acid oxidation genes, and identified an apparent shift in the TCA cycle to a putative ATP-yielding succinyl-CoA synthase. Paralogs to several genes in the pathway were also induced, as were several putative molybdo-proteins. Comparison of the aromatics degradation pathway genes to the genome of an isolate from a contaminated field site showed very similar content, and suggested this strain degrades many of the same compounds. This strain also lacked a classical dearomatizing enzyme, but contained two copies of an eight-gene cluster encoding redox proteins that was 30-fold induced during benzoate oxidation.

Conclusion

G. metallireducens appears to convert aromatic compounds to benzoyl-CoA, then to acetyl-CoA via fatty acid oxidation, and then to carbon dioxide via the TCA cycle. The enzyme responsible for dearomatizing the aromatic ring may be novel, and energetic investments at this step may be offset by a change in succinate metabolism. Analysis of a field isolate suggests that the pathways inferred for G. metallireducens may be applicable to modeling in situ bioremediation.

139

Wang HF, Hou WR, Niu DK. Strand compositional asymmetries in vertebrate large genes. Mol Biol Rep 2007;35:163-9. [PMID: 17420956 DOI: 10.1007/s11033-007-9066-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2006] [Accepted: 02/26/2007] [Indexed: 10/23/2022]

140

Thakur V, Azad RK, Ramaswamy R. Markov models of genome segmentation. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2007;75:011915. [PMID: 17358192 DOI: 10.1103/physreve.75.011915] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/02/2006] [Revised: 06/19/2006] [Indexed: 05/14/2023]

141

Revisiting the directional mutation pressure theory: The analysis of a particular genomic structure in Leishmania major. Gene 2006;385:28-40. [DOI: 10.1016/j.gene.2006.04.031] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2005] [Accepted: 04/04/2006] [Indexed: 11/20/2022]

142

Janga SC, Lamboy WF, Huerta AM, Moreno-Hagelsieb G. The distinctive signatures of promoter regions and operon junctions across prokaryotes. Nucleic Acids Res 2006;34:3980-7. [PMID: 16914446 PMCID: PMC1557821 DOI: 10.1093/nar/gkl563] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open

143

Dehnert M, Helm WE, Hütt MT. Informational structure of two closely related eukaryotic genomes. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2006;74:021913. [PMID: 17025478 DOI: 10.1103/physreve.74.021913] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/23/2006] [Indexed: 05/12/2023]

144

Chang CH, Hsieh LC, Chen TY, Chen HD, Luo L, Lee HC. Shannon information in complete genomes. PROCEEDINGS. IEEE COMPUTATIONAL SYSTEMS BIOINFORMATICS CONFERENCE 2006:20-30. [PMID: 16447996 DOI: 10.1109/csb.2004.1332413] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]

145

Chen LL. Identification of genomic islands in six plant pathogens. Gene 2006;374:134-41. [PMID: 16581205 DOI: 10.1016/j.gene.2006.01.029] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2005] [Revised: 12/30/2005] [Accepted: 01/24/2006] [Indexed: 10/24/2022]

146

Foerstner KU, von Mering C, Bork P. Comparative analysis of environmental sequences: potential and challenges. Philos Trans R Soc Lond B Biol Sci 2006;361:519-23. [PMID: 16524840 PMCID: PMC1609345 DOI: 10.1098/rstb.2005.1809] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

147

Gao B, Paramanathan R, Gupta RS. Signature proteins that are distinctive characteristics of Actinobacteria and their subgroups. Antonie van Leeuwenhoek 2006;90:69-91. [PMID: 16670965 DOI: 10.1007/s10482-006-9061-2] [Citation(s) in RCA: 83] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/28/2005] [Accepted: 01/20/2006] [Indexed: 10/24/2022]

Abstract

The Actinobacteria constitute one of the main phyla of Bacteria. Presently, no morphological and very few molecular characteristics are known that can distinguish species of this highly diverse group. In this work, we have analyzed the genomes of four actinobacteria (viz. Mycobacterium leprae TN, Leifsonia xyli subsp. xyli str. CTCB07, Bifidobacterium longum NCC2705 and Thermobifida fusca YX) to search for proteins that are unique to Actinobacteria. Our analyses have identified 233 actinobacteria-specific proteins, homologues of which are generally not present in any other bacteria. These proteins can be grouped as follows: (i) 29 proteins uniquely present in most sequenced actinobacterial genomes; (ii) 6 proteins present in almost all actinobacteria except Bifidobacterium longum and another 37 proteins absent in B. longum and a few other species; (iii) 11 proteins which are mainly present in the Corynebacterium, Mycobacterium and Nocardia (CMN) subgroup as well as Streptomyces, T. fusca and Frankia sp., but are not found in Bifidobacterium and Micrococcineae; (iv) 8 proteins that are specific for T. fusca and Streptomyces species, plus 2 proteins also present in the Frankia species; (v) 13 proteins that are specific for the Corynebacterineae or the CMN group; (vi) 14 proteins only found in Mycobacterium and Nocardia; (vii) 24 proteins unique to different Mycobacterium species; (viii) 8 proteins specific to the Micrococcineae; (ix) 85 proteins which are distributed sporadically in actinobacterial species. Additionally, many examples of lateral gene transfer from Actinobacteria to Magnetospirillum magnetotacticum have also been identified. The identified proteins provide novel molecular means for defining and circumscribing the Actinobacteria phylum and a number of subgroups within it. The distribution of these proteins also provides useful information regarding interrelationships among the actinobacterial subgroups. Most of these proteins are of unknown function, and studies aimed at understanding their cellular functions should reveal common biochemical and physiological characteristics unique to either all actinobacteria or particular subgroups of them. The identified proteins also provide potential targets for development of drugs that are specific for actinobacteria.

148

Hou WR, Wang HF, Niu DK. Replication-associated strand asymmetries in vertebrate genomes and implications for replicon size, DNA replication origin, and termination. Biochem Biophys Res Commun 2006;344:1258-62. [PMID: 16650814 DOI: 10.1016/j.bbrc.2006.04.039] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2006] [Accepted: 04/17/2006] [Indexed: 11/16/2022]

149

Mrázek J. Analysis of distribution indicates diverse functions of simple sequence repeats in Mycoplasma genomes. Mol Biol Evol 2006;23:1370-85. [PMID: 16618962 DOI: 10.1093/molbev/msk023] [Citation(s) in RCA: 64] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Abstract

Simple sequence repeats (SSRs) composed of extensive tandem iterations of a single nucleotide or a short oligonucleotide are rare in most bacterial genomes, but they are common among Mycoplasma. Some of these repeats act as contingency loci in association with families of surface antigens. By contraction or expansion during replication, these SSRs increase genetic variance of the population and facilitate avoidance of the immune response of the host. Occurrence and distribution of SSRs are analyzed in complete genomes of 11 Mycoplasma and 3 related Mollicutes in order to gain insights into functional and evolutionary diversity of the SSRs in Mycoplasma. The results revealed an unexpected variety of SSRs with respect to their distribution and composition and suggest that it is unlikely that all SSRs function as contingency loci or recombination hot spots. Various types of SSRs are most abundant in Mycoplasma hyopneumoniae, whereas Mycoplasma penetrans, Mycoplasma mobile, and Mycoplasma synoviae do not contain unusually long SSRs. Mycoplasma hyopneumoniae and Mycoplasma pulmonis feature abundant short adenine and thymine runs periodically spaced at 11 and 12 bp, respectively, which likely affect the supercoiling propensities of the DNA molecule. Physiological roles of long adenine and thymine runs in M. hyopneumoniae appear independent of location upstream or downstream of genes, unlike contingency loci that are typically located in protein-coding regions or upstream regulatory regions. Comparisons among 3 M. hyopneumoniae strains suggest that the adenine and thymine runs are rarely involved in genome rearrangements. The results indicate that the SSRs in the Mycoplasma genomes play diverse roles, including modulating gene expression as contingency loci, facilitating genome rearrangements via recombination, affecting protein structure and possibly protein-protein interactions, and contributing to the organization of the DNA molecule in the cell.
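
The simplest SSRs mentioned in this abstract, runs of a single nucleotide such as adenine or thymine, can be located with a regular expression. The sketch below is an illustration only: the function name `find_runs`, the default bases, and the minimum run length of 5 are assumptions, not the criteria used in the article, and it does not handle SSRs built from longer repeat units.

```python
import re


def find_runs(seq: str, bases: str = "AT", min_len: int = 5) -> list[tuple[int, str, int]]:
    """Locate mononucleotide runs of at least `min_len` bases.

    Returns (0-based start, base, run length) for each run,
    in order of position along the sequence.
    """
    if min_len < 1:
        raise ValueError("min_len must be at least 1")
    # One base from `bases`, followed by at least min_len - 1 copies of it.
    pattern = re.compile(rf"([{re.escape(bases)}])\1{{{min_len - 1},}}")
    return [
        (m.start(), m.group(1), m.end() - m.start())
        for m in pattern.finditer(seq.upper())
    ]
```

The distances between consecutive run starts could then be tabulated to look for periodic spacing of the kind the abstract reports, although the article's own procedure is not reproduced here.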

150

Bukovska G, Klucar L, Vlcek C, Adamovic J, Turna J, Timko J. Complete nucleotide sequence and genome analysis of bacteriophage BFK20 — A lytic phage of the industrial producer Brevibacterium flavum. Virology 2006;348:57-71. [PMID: 16457869 DOI: 10.1016/j.virol.2005.12.010] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2005] [Revised: 11/14/2005] [Accepted: 12/11/2005] [Indexed: 10/25/2022]