Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Baran RH, Ko H. Detecting horizontally transferred and essential genes based on dinucleotide relative abundance. DNA Res 2008;15:267-76. [PMID: 18799480 PMCID: PMC2575891 DOI: 10.1093/dnares/dsn021] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2007] [Accepted: 08/05/2008] [Indexed: 11/20/2022] Open

For:	Baran RH, Ko H. Detecting horizontally transferred and essential genes based on dinucleotide relative abundance. DNA Res 2008;15:267-76. [PMID: 18799480 PMCID: PMC2575891 DOI: 10.1093/dnares/dsn021] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2007] [Accepted: 08/05/2008] [Indexed: 11/20/2022] Open

Number

Cited by Other Article(s)

de la Fuente R, Díaz-Villanueva W, Arnau V, Moya A. Genomic Signature in Evolutionary Biology: A Review. BIOLOGY 2023;12:biology12020322. [PMID: 36829597 PMCID: PMC9953303 DOI: 10.3390/biology12020322] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/08/2022] [Revised: 02/11/2023] [Accepted: 02/13/2023] [Indexed: 02/19/2023]

Bohlin J, Eldholm V, Pettersson JHO, Brynildsrud O, Snipen L. The nucleotide composition of microbial genomes indicates differential patterns of selection on core and accessory genomes. BMC Genomics 2017;18:151. [PMID: 28187704 PMCID: PMC5303225 DOI: 10.1186/s12864-017-3543-7] [Citation(s) in RCA: 37] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2016] [Accepted: 02/02/2017] [Indexed: 12/02/2022] Open

Abstract

Background

The core genome consists of genes shared by the vast majority of a species and is therefore assumed to have been subjected to substantially stronger purifying selection than the more mobile elements of the genome, also known as the accessory genome. Here we examine intragenic base composition differences in core genomes and corresponding accessory genomes in 36 species, represented by the genomes of 731 bacterial strains, to assess the impact of selective forces on base composition in microbes. We also explore, in turn, how these results compare with findings for whole genome intragenic regions.

Results

We found that GC content in coding regions is significantly higher in core genomes than accessory genomes and whole genomes. Likewise, GC content variation within coding regions was significantly lower in core genomes than in accessory genomes and whole genomes. Relative entropy in coding regions, measured as the difference between observed and expected trinucleotide frequencies estimated from mononucleotide frequencies, was significantly higher in the core genomes than in accessory and whole genomes. Relative entropy was positively associated with coding region GC content within the accessory genomes, but not within the corresponding coding regions of core or whole genomes.

Conclusion

The higher intragenic GC content and relative entropy, as well as the lower GC content variation, observed in the core genomes is most likely associated with selective constraints. It is unclear whether the positive association between GC content and relative entropy in the more mobile accessory genomes constitutes signatures of selection or selective neutral processes.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-017-3543-7) contains supplementary material, which is available to authorized users.

Collapse

Homology-independent metrics for comparative genomics. Comput Struct Biotechnol J 2015;13:352-7. [PMID: 26029354 PMCID: PMC4446528 DOI: 10.1016/j.csbj.2015.04.005] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2015] [Revised: 04/06/2015] [Accepted: 04/18/2015] [Indexed: 11/24/2022] Open

Abstract

A mainstream procedure to analyze the wealth of genomic data available nowadays is the detection of homologous regions shared across genomes, followed by the extraction of biological information from the patterns of conservation and variation observed in such regions. Although of pivotal importance, comparative genomic procedures that rely on homology inference are obviously not applicable if no homologous regions are detectable. This fact excludes a considerable portion of “genomic dark matter” with no significant similarity — and, consequently, no inferred homology to any other known sequence — from several downstream comparative genomic methods. In this review we compile several sequence metrics that do not rely on homology inference and can be used to compare nucleotide sequences and extract biologically meaningful information from them. These metrics comprise several compositional parameters calculated from sequence data alone, such as GC content, dinucleotide odds ratio, and several codon bias metrics. They also share other interesting properties, such as pervasiveness (patterns persist on smaller scales) and phylogenetic signal. We also cite examples where these homology-independent metrics have been successfully applied to support several bioinformatics challenges, such as taxonomic classification of biological sequences without homology inference. They where also used to detect higher-order patterns of interactions in biological systems, ranging from detecting coevolutionary trends between the genomes of viruses and their hosts to characterization of gene pools of entire microbial communities. We argue that, if correctly understood and applied, homology-independent metrics can add important layers of biological information in comparative genomic studies without prior homology inference.

Collapse

Necessary relations for nucleotide frequencies. J Theor Biol 2015;374:179-82. [PMID: 25843217 DOI: 10.1016/j.jtbi.2015.03.025] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2014] [Revised: 02/01/2015] [Accepted: 03/21/2015] [Indexed: 11/21/2022]

[Current status of theoretical studies on essential genes in microbes]. YI CHUAN = HEREDITAS 2012;34:420-30. [PMID: 22522159 DOI: 10.3724/sp.j.1005.2012.00420] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]

Bosi E, Fani R, Fondi M. The mosaicism of plasmids revealed by atypical genes detection and analysis. BMC Genomics 2011;12:403. [PMID: 21824433 PMCID: PMC3166947 DOI: 10.1186/1471-2164-12-403] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2011] [Accepted: 08/08/2011] [Indexed: 01/05/2023] Open

Abstract

BACKGROUND

From an evolutionary viewpoint, prokaryotic genomes are extremely plastic and dynamic, since large amounts of genetic material are continuously added and/or lost through promiscuous gene exchange. In this picture, plasmids play a key role, since they can be transferred between different cells and, through genetic rearrangement(s), undergo gene(s) load, leading, in turn, to the appearance of important metabolic innovations that might be relevant for cell life. Despite their central position in bacterial evolution, a massive analysis of newly acquired functional blocks [likely the result of horizontal gene transfer (HGT) events] residing on plasmids is still missing.

RESULTS

We have developed a computational, composition-based, pipeline to scan almost 2000 plasmids for genes that differ significantly from their hosting molecule. Plasmids atypical genes (PAGs) were about 6% of the total plasmids ORFs and, on average, each plasmid possessed 4.4 atypical genes. Nevertheless, conjugative plasmids were shown to possess an amount of atypical genes than that found in not mobilizable plasmids, providing strong support for the central role suggested for conjugative plasmids in the context of HGT. Part of the retrieved PAGs are organized into (mainly short) clusters and are involved in important biological processes (detoxification, antibiotic resistance, virulence), revealing the importance of HGT in the spreading of metabolic pathways within the whole microbial community. Lastly, our analysis revealed that PAGs mainly derive from other plasmid (rather than coming from phages and/or chromosomes), suggesting that plasmid-plasmid DNA exchange might be the primary source of metabolic innovations in this class of mobile genetic elements.

CONCLUSIONS

In this work we have performed the first large scale analysis of atypical genes that reside on plasmid molecules to date. Our findings on PAGs function, organization, distribution and spreading reveal the importance of plasmids-mediated HGT within the complex bacterial evolutionary network and in the dissemination of important biological traits.

Collapse

Yu JF, Sun X. Reannotation of protein-coding genes based on an improved graphical representation of DNA sequence. J Comput Chem 2010;31:2126-35. [PMID: 20175214 DOI: 10.1002/jcc.21500] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]

Bohlin J, Snipen L, Hardy SP, Kristoffersen AB, Lagesen K, Dønsvik T, Skjerve E, Ussery DW. Analysis of intra-genomic GC content homogeneity within prokaryotes. BMC Genomics 2010;11:464. [PMID: 20691090 PMCID: PMC3091660 DOI: 10.1186/1471-2164-11-464] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2010] [Accepted: 08/06/2010] [Indexed: 11/10/2022] Open

Examination of genome homogeneity in prokaryotes using genomic signatures. PLoS One 2009;4:e8113. [PMID: 19956556 PMCID: PMC2781299 DOI: 10.1371/journal.pone.0008113] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2009] [Accepted: 11/05/2009] [Indexed: 01/17/2023] Open

Abstract

BACKGROUND

DNA word frequencies, normalized for genomic AT content, are remarkably stable within prokaryotic genomes and are therefore said to reflect a "genomic signature." The genomic signatures can be used to phylogenetically classify organisms from arbitrary sampled DNA. Genomic signatures can also be used to search for horizontally transferred DNA or DNA regions subjected to special selection forces. Thus, the stability of the genomic signature can be used as a measure of genomic homogeneity. The factors associated with the stability of the genomic signatures are not known, and this motivated us to investigate further. We analyzed the intra-genomic variance of genomic signatures based on AT content normalization (0(th) order Markov model) as well as genomic signatures normalized by smaller DNA words (1(st) and 2(nd) order Markov models) for 636 sequenced prokaryotic genomes. Regression models were fitted, with intra-genomic signature variance as the response variable, to a set of factors representing genomic properties such as genomic AT content, genome size, habitat, phylum, oxygen requirement, optimal growth temperature and oligonucleotide usage variance (OUV, a measure of oligonucleotide usage bias), measured as the variance between genomic tetranucleotide frequencies and Markov chain approximated tetranucleotide frequencies, as predictors.

PRINCIPAL FINDINGS

Regression analysis revealed that OUV was the most important factor (p<0.001) determining intra-genomic homogeneity as measured using genomic signatures. This means that the less random the oligonucleotide usage is in the sense of higher OUV, the more homogeneous the genome is in terms of the genomic signature. The other factors influencing variance in the genomic signature (p<0.001) were genomic AT content, phylum and oxygen requirement.

CONCLUSIONS

Genomic homogeneity in prokaryotes is intimately linked to genomic GC content, oligonucleotide usage bias (OUV) and aerobiosis, while oligonucleotide usage bias (OUV) is associated with genomic GC content, aerobiosis and habitat.

Collapse