1
Ding H, Gao J, Yang J, Zhang S, Han S, Yi R, Ye Y, Kan X. Genome evolution of Buchnera aphidicola (Gammaproteobacteria): Insights into strand compositional asymmetry, codon usage bias, and phylogenetic implications. Int J Biol Macromol 2023; 253:126738. [PMID: 37690648 DOI: 10.1016/j.ijbiomac.2023.126738] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/02/2023] [Revised: 08/15/2023] [Accepted: 08/25/2023] [Indexed: 09/12/2023]
Abstract
Taxa of Buchnera aphidicola (hereafter "Buchnera") are mutualistic intracellular symbionts of aphids, known for their remarkable biological traits such as genome reduction, strand compositional asymmetry, and symbiont-host coevolution. With the growing availability of genomic data, we performed a comprehensive analysis of 103 genomes of Buchnera strains from 12 host subfamilies, focusing on genomic characteristics, codon usage patterns, and phylogenetic implications. Our findings revealed consistent features among all genomes, including small genome sizes, low GC contents, and gene losses. We also identified strong strand compositional asymmetries in all strains at the genome level. Further investigation suggested that mutation pressure may have played a crucial role in shaping codon usage of Buchnera. Moreover, the genomic asymmetries were reflected in asymmetric codon usage preferences within chromosomal genes. Notably, the levels of these asymmetries varied among strains and were significantly influenced by the degree of genome shrinkage. Lastly, our phylogenetic analyses presented an alternative topology of Aphididae, based on the Buchnera symbionts, providing robust confirmation of the paraphyly of Eriosomatinae and Macrosiphini. Our objectives are to further understand the strand compositional asymmetry and codon usage bias of Buchnera taxa, and to provide new perspectives for phylogenetic studies of Aphididae.
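Strand compositional asymmetry of the kind described in this abstract is commonly summarized with GC skew, (G - C)/(G + C), and AT skew, (A - T)/(A + T). The Python sketch below is illustrative only and is not the authors' pipeline; the function names, the window and step sizes, and the toy sequence are all assumptions made for the example.

```python
def skew(seq, plus, minus):
    """Return (plus - minus) / (plus + minus), or 0.0 if neither base occurs."""
    seq = seq.upper()
    p, m = seq.count(plus), seq.count(minus)
    return (p - m) / (p + m) if (p + m) else 0.0


def gc_skew(seq):
    """GC skew: (G - C) / (G + C)."""
    return skew(seq, "G", "C")


def at_skew(seq):
    """AT skew: (A - T) / (A + T)."""
    return skew(seq, "A", "T")


def windowed_gc_skew(seq, window=1000, step=1000):
    """GC skew in successive windows; window and step are arbitrary choices."""
    return [gc_skew(seq[i:i + window])
            for i in range(0, len(seq) - window + 1, step)]


if __name__ == "__main__":
    toy = "GGGCATAT" * 4  # hypothetical sequence, not real Buchnera data
    print(gc_skew(toy), at_skew(toy))
    print(windowed_gc_skew(toy, window=8, step=8))
```

On a real chromosome, a sign change in the windowed GC skew is often used as a rough indicator of the replication origin and terminus, which is one way leading and lagging strands are told apart.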
Affiliation(s)
- Hengwu Ding
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China; Key Laboratory of Development and Application of Rural Renewable Energy, Biogas Institute of Ministry of Agriculture and Rural Affairs, Chengdu 610041, China
- Jinming Gao
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China; The Institute of Bioinformatics, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
- Jianke Yang
- School of Basic Medical Sciences, Wannan Medical College, Wuhu 241000, China
- Sijia Zhang
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China; The Institute of Bioinformatics, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
- Shiyun Han
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China; The Institute of Bioinformatics, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
- Ran Yi
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China; The Institute of Bioinformatics, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
- Yuanxin Ye
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China; The Institute of Bioinformatics, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
- Xianzhao Kan
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China; The Institute of Bioinformatics, College of Life Sciences, Anhui Normal University, Wuhu 241000, China.
2
Bose D, Mukhopadhyay S. The hunt for a yet unknown: Common molecular signature in some genetically monomorphic enterobacteria. J Basic Microbiol 2021; 61:524-546. [PMID: 33991346 DOI: 10.1002/jobm.202000630] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/24/2020] [Revised: 04/04/2021] [Accepted: 04/22/2021] [Indexed: 11/09/2022]
Abstract
Mark Achtman introduced the term "genetically monomorphic bacteria" (GM bacteria) for some human and plant pathogens that display great uniformity in their genetic properties. This uniformity poses a challenge to microbiologists. To address this problem, we used CodonW and IslandViewer 3 as analytical tools and took Escherichia coli, Salmonella, and Shigella strains as model organisms. We hypothesized that GM bacteria share a common molecular signature. We found a significant correlation among the number of protein-coding genes, the number of predicted highly expressed genes, and the length of the longest gene. On the other hand, the correspondence analysis of pathogenicity-related genes identified by IslandViewer 3 displayed a somewhat unique pattern in GM bacteria. The probable pathogenic genes are clustered into two separate groups, which is a hallmark of some pattern. Similar genes of non-monomorphic pathogenic strains cluster in almost the same way, but their clusters are joined together rather than completely separated. In our considered view, these features may be regarded as codon usage signatures of these bacteria, and of E. coli in particular.
Affiliation(s)
- Debadin Bose
- Department of Botany, Kabi Nazrul College, Murarai, West Bengal, India
- Subhasis Mukhopadhyay
- Distributed Information Centre for Bioinformatics, Department of Biophysics, Molecular Biology and Bioinformatics, University of Calcutta, Calcutta, West Bengal, India
3
Bose D, Mukhopadhyay S. Comparative genomics of a few members of the family Aquificaceae on the basis of their codon usage profile. Gene Reports 2019. [DOI: 10.1016/j.genrep.2018.11.003] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Indexed: 01/30/2023]
4
The Impact of Selection at the Amino Acid Level on the Usage of Synonymous Codons. G3: Genes, Genomes, Genetics 2017; 7:967-981. [PMID: 28122952 PMCID: PMC5345726 DOI: 10.1534/g3.116.038125] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Indexed: 02/06/2023]
Abstract
There are two main forces that affect usage of synonymous codons: directional mutational pressure and selection. The effectiveness of protein translation is usually considered as the main selectional factor. However, biased codon usage can also be a byproduct of a general selection at the amino acid level interacting with nucleotide replacements. To evaluate the validity and strength of such an effect, we superimposed >3.5 billion unrestricted mutational processes on the selection of nonsynonymous substitutions based on the differences in physicochemical properties of the coded amino acids. Using a modified evolutionary optimization algorithm, we determined the conditions in which the effect on the relative codon usage is maximized. We found that the effect is enhanced by mutational processes generating more adenine and thymine than guanine and cytosine, as well as more purines than pyrimidines. Interestingly, this effect is observed only under an unrestricted model of nucleotide substitution, and disappears when the mutational process is time-reversible. Comparison of the simulation results with data for real protein coding sequences indicates that the impact of selection at the amino acid level on synonymous codon usage cannot be neglected. Furthermore, it can considerably interfere, especially in AT-rich genomes, with other selections on codon usage, e.g., translational efficiency. It may also lead to difficulties in the recognition of other effects influencing codon bias, and an overestimation of protein coding sequences whose codon usage is subjected to adaptational selection.
5
Characterization of PAS domains in Frankia and selected Actinobacteria and their possible interaction with other co-domains for environmental adaptation. Symbiosis 2016. [DOI: 10.1007/s13199-016-0413-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Indexed: 10/21/2022]
6
Banerjee R, Roy A, Ahmad F, Das S, Basak S. Evolutionary patterning of hemagglutinin gene sequence of 2009 H1N1 pandemic. J Biomol Struct Dyn 2016; 29:733-42. [PMID: 22208275 DOI: 10.1080/07391102.2012.10507411] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Indexed: 10/28/2022]
Abstract
The 2009 H1N1 swine flu is the first pandemic in decades. Infectivity of the influenza virus for the human host depends largely on its ability to evade antibodies specific for the viral protein hemagglutinin (HA), which mediates attachment to the host. In the present study we analysed a large number of HA gene sequences available in the Flu Database maintained at NCBI. Our sequence-based analysis clearly demonstrates that the amino acid usage pattern may change dramatically during the course of evolution, and that there is a clear link between a particular pattern of amino acid usage in HA genes and their potential to become infectious. Structural studies revealed how binding efficiency between HA and sialic acid may alter the pandemic potential of infection. Our work highlights the evolutionary significance and biochemical basis of the selective advantage of certain amino acids of HA in 2009 and provides a link between the characteristic changes in the HA protein and their potential to pose a global menace to public health.
Affiliation(s)
- Rachana Banerjee
- Department of Bio-Physics, Molecular Biology and Bioinformatics, University of Calcutta, 92, Kolkata, India
7
Banerjee R, Roy A, Das S, Basak S. Similarity of currently circulating H1N1 virus with the 2009 pandemic clone: Viability of an imminent pandemic. Infection, Genetics and Evolution 2015; 32:107-12. [DOI: 10.1016/j.meegid.2015.02.023] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Received: 12/23/2014] [Revised: 02/22/2015] [Accepted: 02/23/2015] [Indexed: 11/25/2022]
8
Iriarte A, Baraibar JD, Diana L, Castro-Sowinski S, Romero H, Musto H. Trends in amino acid usage across the class Mollicutes. J Biomol Struct Dyn 2014; 32:65-74. [DOI: 10.1080/07391102.2012.748636] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/27/2022]
9
Selva Kumar C, Nair RR, Sivaramakrishnan KG, Ganesh D, Janarthanan S, Arunachalam M, Sivaruban T. Influence of certain forces on evolution of synonymous codon usage bias in certain species of three basal orders of aquatic insects. Mitochondrial DNA 2012; 23:447-60. [PMID: 22943112 DOI: 10.3109/19401736.2012.710203] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Indexed: 11/13/2022]
Abstract
Forces that influence the evolution of synonymous codon usage bias are analyzed in six species of three basal orders of aquatic insects. The rationale behind choosing six species of aquatic insects (three from Ephemeroptera, one from Plecoptera, and two from Odonata) for the present analysis is based on phylogenetic position at the basal clades of the Order Insecta facilitating the understanding of the evolution of codon bias and of factors shaping codon usage patterns in primitive clades of insect lineages and their subtle differences in some of their ecological and environmental requirements in terms of habitat-microhabitat requirements, altitudinal preferences, temperature tolerance ranges, and consequent responses to climate change impacts. The present analysis focuses on open reading frames of the 13 protein-coding genes in the mitochondrial genome of six carefully chosen insect species to get a comprehensive picture of the evolutionary intricacies of codon bias. In all the six species, A and T contents are observed to be significantly higher than G and C, and are used roughly equally. Since transcription hypothesis on codon usage demands A richness and T poorness, it is quite likely that mutation pressure may be the key factor associated with synonymous codon usage (SCU) variations in these species because the mutation hypothesis predicts AT richness and GC poorness in the mitochondrial DNA. Thus, AT-biased mutation pressure seems to be an important factor in framing the SCU variation in all the selected species of aquatic insects, which in turn explains the predominance of A and T ending codons in these species. This study does not find any association between microhabitats and codon usage variations in the mitochondria of selected aquatic insects. However, this study has identified major forces, such as compositional constraints and mutation pressure, which shape patterns of codon usage in mitochondrial genes in the primitive clades of insect lineages.
Affiliation(s)
- C Selva Kumar
- Department of Zoology, University of Madras, Chennai 600 025, Tamil Nadu, India
10
Dasgupta A, Banerjee R, Das S, Basak S. Evolutionary perspective on the origin of Haitian cholera outbreak strain. J Biomol Struct Dyn 2012; 30:338-46. [PMID: 22693991 DOI: 10.1080/07391102.2012.680033] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Indexed: 10/28/2022]
Abstract
No cholera epidemic had been reported in Haiti for at least 100 years, although cholera has been present in Latin America since 1991. Surprisingly, the recent cholera epidemic in Haiti (October 2010) recorded more than 250,000 cases and 4000 deaths in the first 6 months and became one of the most explosive and deadly cholera outbreaks in recent history. In the present study, we conducted genomic analyses of pathogenicity islands of three Haitian Vibrio cholerae strains and compared them with nine different V. cholerae O1 El Tor genomes. Although CIRS101 is evolutionarily most similar to the Haitian strains, our study also reveals some important differences in the genetic organization of pathogenicity islands between the Haitian strains and CIRS101. Evolutionary analysis suggests that unusual functional constraints have been imposed on the Haitian strains, and we hypothesize that amino acid substitution is more deleterious in Haitian strains than in non-Haitian strains.
Affiliation(s)
- Anirban Dasgupta
- Department of Bioinformatics, West Bengal University of Technology, Salt Lake, Kolkata, India
11
Supek F, Škunca N, Repar J, Vlahoviček K, Šmuc T. Translational selection is ubiquitous in prokaryotes. PLoS Genet 2010; 6:e1001004. [PMID: 20585573 PMCID: PMC2891978 DOI: 10.1371/journal.pgen.1001004] [Citation(s) in RCA: 69] [Impact Index Per Article: 4.9] [Received: 12/11/2009] [Accepted: 05/26/2010] [Indexed: 11/29/2022] Open
Abstract
Codon usage bias in prokaryotic genomes is largely a consequence of background substitution patterns in DNA, but highly expressed genes may show a preference towards codons that enable more efficient and/or accurate translation. We introduce a novel approach based on supervised machine learning that detects effects of translational selection on genes, while controlling for local variation in nucleotide substitution patterns represented as sequence composition of intergenic DNA. A cornerstone of our method is a Random Forest classifier that outperformed previous distance measure-based approaches, such as the codon adaptation index, in the task of discerning the (highly expressed) ribosomal protein genes by their codon frequencies. Unlike previous reports, we show evidence that translational selection in prokaryotes is practically universal: in 460 of 461 examined microbial genomes, we find that a subset of genes shows a higher codon usage similarity to the ribosomal proteins than would be expected from the local sequence composition. These genes constitute a substantial part of the genome—between 5% and 33%, depending on genome size—while also exhibiting higher experimentally measured mRNA abundances and tending toward codons that match tRNA anticodons by canonical base pairing. Certain gene functional categories are generally enriched with, or depleted of codon-optimized genes, the trends of enrichment/depletion being conserved between Archaea and Bacteria. Prominent exceptions from these trends might indicate genes with alternative physiological roles; we speculate on specific examples related to detoxication of oxygen radicals and ammonia and to possible misannotations of asparaginyl–tRNA synthetases. 
Since the presence of codon optimizations on genes is a valid proxy for expression levels in fully sequenced genomes, we provide an example of an “adaptome” by highlighting gene functions with expression levels elevated specifically in thermophilic Bacteria and Archaea.
Synonymous codons are not equally common in genomes. The main causes of unequal codon usage are varying nucleotide substitution patterns, as manifested in the wide range of genomic nucleotide compositions. However, since the first E. coli and yeast genes were sequenced, it became evident that there was also a bias towards codons that can be translated to protein faster and more accurately. This bias was stronger in highly expressed genes, and its driving force was termed translational selection. Researchers sought for effects of translational selection in microbial genomes as they became available, employing a flurry of mathematical approaches which sometimes led to contradictory conclusions. We introduce a sensitive and accurate machine learning-based methodology and find that highly expressed genes have a recognizable codon usage pattern in almost every bacterial and archaeal genome analyzed, even after accounting for large differences in background nucleotide composition. We also show that the gene functional category has a great bearing on whether that gene is subject to translational selection. Since presence of codon optimizations can be used as a purely sequence-derived proxy for expression levels, we can delineate “adaptomes” by relating predicted gene activity to organisms' phenotypes, which we demonstrate on genomes of temperature-resistant Bacteria and Archaea.
Affiliation(s)
- Fran Supek
- Division of Electronics, Rudjer Boskovic Institute, Zagreb, Croatia
- Nives Škunca
- Division of Electronics, Rudjer Boskovic Institute, Zagreb, Croatia
- Jelena Repar
- Division of Molecular Biology, Rudjer Boskovic Institute, Zagreb, Croatia
- Kristian Vlahoviček
- Division of Biology, Faculty of Science, University of Zagreb, Zagreb, Croatia
- Department of Informatics, University of Oslo, Oslo, Norway
- Tomislav Šmuc
- Division of Electronics, Rudjer Boskovic Institute, Zagreb, Croatia
12
Synonymous codon usage analysis of thirty two mycobacteriophage genomes. Adv Bioinformatics 2010:316936. [PMID: 20150956 PMCID: PMC2817497 DOI: 10.1155/2009/316936] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.1] [Received: 07/30/2009] [Accepted: 10/27/2009] [Indexed: 11/17/2022] Open
Abstract
Synonymous codon usage of protein coding genes of thirty two completely sequenced mycobacteriophage genomes was studied using multivariate statistical analysis. One of the major factors influencing codon usage is identified to be compositional bias. Codons ending with either C or G are preferred in highly expressed genes, among which C-ending codons are highly preferred over G-ending codons. A strong negative correlation between the effective number of codons (Nc) and GC3s content was also observed, showing that codon usage was affected by gene nucleotide composition. Translational selection is also identified to play a role in shaping codon usage, operating at the level of translational accuracy. A high level of heterogeneity is seen among and between the genomes. Gene length is also identified to influence codon usage in 11 out of 32 phage genomes. Mycobacteriophage Cooper is identified to be the highly biased genome with better translation efficiency comparing well with the host specific tRNA genes.
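GC3s, referred to in this abstract, is the G+C fraction at the third position of synonymous codons. The Python sketch below is a hedged illustration rather than the method used in the paper: it assumes the standard genetic code and follows the common convention of excluding Met, Trp, and stop codons, and the function name `gc3s` and the toy sequence are hypothetical. It does not compute Nc.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (an assumption of this sketch).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Codons with no synonymous alternative (Met, Trp) and stop codons are skipped.
EXCLUDED = {"M", "W", "*"}


def gc3s(cds):
    """Fraction of synonymous codons in an in-frame CDS whose third base is G or C."""
    cds = cds.upper()
    usable = gc = 0
    for i in range(0, len(cds) - len(cds) % 3, 3):
        codon = cds[i:i + 3]
        aa = CODON_TABLE.get(codon)  # None for codons with ambiguous bases
        if aa is None or aa in EXCLUDED:
            continue
        usable += 1
        gc += codon[2] in "GC"
    return gc / usable if usable else 0.0


if __name__ == "__main__":
    # Toy in-frame sequence: GCT GCC ATG TGG (Ala, Ala, Met, Trp)
    print(gc3s("GCTGCCATGTGG"))
```

Plotting per-gene Nc against GC3s is the usual way to check whether composition alone accounts for the observed codon bias.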
13
Guo FB, Yuan JB. Codon usages of genes on chromosome, and surprisingly, genes in plasmid are primarily affected by strand-specific mutational biases in Lawsonia intracellularis. DNA Res 2009; 16:91-104. [PMID: 19221094 PMCID: PMC2671203 DOI: 10.1093/dnares/dsp001] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.9] [Indexed: 11/12/2022] Open
Abstract
In this study, the factors driving genome-wide patterns of codon usage in the Lawsonia intracellularis genome are determined. For genes on the chromosome of the bacterium, it is found that the most important source of variation results from strand-specific mutational biases. A lesser trend of variation is attributable to genes that are presumed to be horizontally transferred. These putative alien genes are unusually GC-richer than the other genes, whereas horizontally transferred genes have been observed to be AT rich in bacteria with medium and relatively low G + C contents. Hydropathy of the encoded protein and expression level are also found to influence codon usage. Therefore, codon usage in the L. intracellularis chromosome is the result of a complex balance among the different mutational and selectional factors. When analyzing genes in the largest plasmid, it is found for the first time that strand-specific mutational biases are responsible for the primary variation of codon usage in a plasmid. Genes, particularly highly expressed genes of this plasmid, are mainly located on the leading strands, and this is supposed to be the effect of replicational-transcriptional selection. These facts suggest that this plasmid adopts a replication mechanism similar to that of the chromosome in L. intracellularis. Common characters among the 10 bacteria in whose genomes strand-specific mutational biases are the primary source of variation of codon usage are also investigated. For example, it is found that the genes dnaT and fis, which are involved in DNA replication initiation and re-initiation pathways, are absent in all of the 10 bacteria.
Affiliation(s)
- Feng-Biao Guo
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, People's Republic of China.
14
Analysis of synonymous codon usage in the UL24 gene of duck enteritis virus. Virus Genes 2008; 38:96-103. [PMID: 18958612 DOI: 10.1007/s11262-008-0295-0] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.7] [Received: 05/26/2008] [Accepted: 10/09/2008] [Indexed: 10/21/2022]
Abstract
Analysis of the codon usage bias of the UL24 gene of duck enteritis virus (DEV) may improve our understanding of the evolution and pathogenesis of DEV and provide a basis for understanding the mechanism behind biased usage of synonymous codons and for selecting appropriate expression systems to improve the expression of target genes. The codon usage bias of the UL24 genes of DEV and 27 reference herpesviruses was analyzed. The results showed that codon usage in the UL24 gene of DEV was strongly biased toward synonymous codons with A and T at the third codon position. A high level of diversity in codon usage bias existed, and the plot of the effective number of codons used in a gene revealed that the genetic heterogeneity in the UL24 gene of herpesviruses was constrained by the G + C content. The phylogenetic analysis suggested that DEV was evolutionarily closer to Alphaherpesvirinae and that there was no significant deviation in codon usage in different virus strains. There were 20 codons showing distinct usage differences between DEV and Escherichia coli, 23 between DEV and Homo sapiens, but only 16 codons between DEV and yeast. Therefore the yeast expression system may be more suitable for the expression of DEV genes.
15
Comparative analysis of codon usage patterns and identification of predicted highly expressed genes in five Salmonella genomes. Indian J Med Microbiol 2008. [DOI: 10.1016/s0255-0857(21)01806-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Indexed: 11/18/2022]
16
The implication of life style on codon usage patterns and predicted highly expressed genes for three Frankia genomes. Antonie van Leeuwenhoek 2008; 93:335-46. [DOI: 10.1007/s10482-007-9211-1] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.1] [Received: 08/14/2007] [Accepted: 11/12/2007] [Indexed: 11/27/2022]
17
Sau K, Gupta SK, Sau S, Mandal SC, Ghosh TC. Factors influencing synonymous codon and amino acid usage biases in Mimivirus. Biosystems 2006; 85:107-13. [PMID: 16442213 DOI: 10.1016/j.biosystems.2005.12.004] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.4] [Received: 08/18/2005] [Revised: 12/05/2005] [Accepted: 12/17/2005] [Indexed: 10/25/2022]
Abstract
Synonymous codon and amino acid usage biases have been investigated in 903 Mimivirus protein-coding genes in order to understand the architecture and evolution of the Mimivirus genome. As expected for an AT-rich genome, third codon positions of the synonymous codons of Mimivirus carry mostly A or T bases. It was found that codon usage bias in Mimivirus genes is dictated both by mutational pressure and translational selection. Evidence shows that four factors, namely mean molecular weight (MMW), hydropathy, aromaticity and cysteine content, are mostly responsible for the variation of amino acid usage in Mimivirus proteins. Based on our observations, we suggest that genes involved in translation, DNA repair, protein folding, etc., were laterally transferred to Mimivirus long ago from living organisms, and that over time these genes acquired the codon usage pattern of other Mimivirus genes under selection pressure.
Affiliation(s)
- K Sau
- Department of Biotechnology, Haldia Institute of Technology, Haldia, India
18
Banerjee T, Ghosh TC. Gene expression level shapes the amino acid usages in Prochlorococcus marinus MED4. J Biomol Struct Dyn 2006; 23:547-54. [PMID: 16494504 DOI: 10.1080/07391102.2006.10507079] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Indexed: 10/28/2022]
Abstract
Prochlorococcus species are the first example of free-living bacteria with a reduced genome. Codon and amino acid usage biases of Prochlorococcus marinus MED4 were investigated using all protein-coding genes of at least 100 amino acids in length. Correspondence analysis on relative synonymous codon usage (RSCU) values shows no influence of translational selection in shaping the codon usage variation among the genes in this organism. However, amino acid usage was markedly different between the highly and lowly expressed genes in this organism; in particular, GC-rich amino acids were found to occur significantly more often in highly expressed genes than in lowly expressed genes. Comparative analysis of the homologous genes of Synechococcus sp. WH8102 and Prochlorococcus marinus MED4 shows that amino acid conservation in highly expressed genes is significantly higher than in lowly expressed genes. Based on our results, we concluded that conservation of GC-rich amino acids in the highly expressed genes relative to the ancestor is the major source of variation in amino acid usage in the organism.
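The RSCU values analyzed in this abstract are conventionally defined as the observed count of a codon divided by the count expected if all synonymous codons for the same amino acid were used equally. The Python sketch below illustrates that definition only; it assumes the standard genetic code, the function name `rscu` and the toy sequence are hypothetical, and it is not the authors' workflow (which feeds such values into correspondence analysis).

```python
from collections import Counter
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (an assumption of this sketch).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def rscu(cds):
    """Return {codon: RSCU} for amino acids with at least two codons that occur in cds."""
    cds = cds.upper()
    codons = [cds[i:i + 3] for i in range(0, len(cds) - len(cds) % 3, 3)]
    counts = Counter(c for c in codons if c in CODON_TABLE)

    # Group sense codons into synonymous families.
    families = {}
    for codon, aa in CODON_TABLE.items():
        if aa != "*":
            families.setdefault(aa, []).append(codon)

    values = {}
    for family in families.values():
        total = sum(counts[c] for c in family)
        if len(family) < 2 or total == 0:
            continue  # Met/Trp have no synonyms; unused amino acids are undefined
        expected = total / len(family)  # count under equal usage
        for c in family:
            values[c] = counts[c] / expected
    return values


if __name__ == "__main__":
    # Toy in-frame sequence with four alanine codons: GCT x3, GCC x1
    for codon, value in sorted(rscu("GCTGCTGCTGCC").items()):
        print(codon, value)
```

An RSCU above 1 marks a codon used more often than expected under equal usage, and a value below 1 marks one used less often.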
Affiliation(s)
- T Banerjee
- Bioinformatics Centre, Bose Institute, P 1/12, C.I.T. Scheme VII M, Kolkata 700 054, India
19
Sällström B, Arnaout RA, Davids W, Bjelkmar P, Andersson SGE. Protein evolutionary rates correlate with expression independently of synonymous substitutions in Helicobacter pylori. J Mol Evol 2006; 62:600-14. [PMID: 16586017 DOI: 10.1007/s00239-005-0104-5] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Received: 04/25/2005] [Accepted: 12/20/2005] [Indexed: 11/29/2022]
Abstract
In free-living microorganisms, such as Escherichia coli and Saccharomyces cerevisiae, both synonymous and nonsynonymous substitution frequencies correlate with expression levels. Here, we have tested the hypothesis that the correlation between amino acid substitution rates and expression is a by-product of selection for codon bias and translational efficiency in highly expressed genes. To this end, we have examined the correlation between protein evolutionary rates and expression in the human gastric pathogen Helicobacter pylori, where the absence of selection on synonymous sites enables the two types of substitutions to be uncoupled. The results revealed a statistically significant negative correlation between expression levels and nonsynonymous substitutions in both H. pylori and E. coli. We also found that neighboring genes located on the same, but not on opposite strands, evolve at significantly more similar rates than random gene pairs, as expected by co-expression of genes located in the same operon. However, the two species differ in that synonymous substitutions show a strand-specific pattern in E. coli, whereas the weak similarity in synonymous substitutions for neighbors in H. pylori is independent of gene orientation. These results suggest a direct influence of expression levels on nonsynonymous substitution frequencies independent of codon bias and selective constraints on synonymous sites.
Affiliation(s)
- Björn Sällström
- Program of Molecular Evolution, Department of Evolution, Genomics and Systematics, Evolutionary Biology Center, Uppsala University, 752 36 Uppsala, Sweden
20
Degnan PH, Lazarus AB, Wernegreen JJ. Genome sequence of Blochmannia pennsylvanicus indicates parallel evolutionary trends among bacterial mutualists of insects. Genome Res 2005; 15:1023-33. [PMID: 16077009 PMCID: PMC1182215 DOI: 10.1101/gr.3771305] [Citation(s) in RCA: 155] [Impact Index Per Article: 8.2] [Indexed: 11/24/2022]
Abstract
The distinct lifestyle of obligately intracellular bacteria can alter fundamental forces that drive and constrain genome change. In this study, sequencing the 792-kb genome of Blochmannia pennsylvanicus, an obligate endosymbiont of Camponotus pennsylvanicus, enabled us to trace evolutionary changes that occurred in the context of a bacterial-ant association. Comparison to the genome of Blochmannia floridanus reveals differential loss of genes involved in cofactor biosynthesis, the composition and structure of the cell wall and membrane, gene regulation, and DNA replication. However, the two Blochmannia species show complete conservation in the order and strand orientation of shared genes. This finding of extreme stasis in genome architecture, also reported previously for the aphid endosymbiont Buchnera, suggests that genome stability characterizes long-term bacterial mutualists of insects and constrains their evolutionary potential. Genome-wide analyses of protein divergences reveal 10- to 50-fold faster amino acid substitution rates in Blochmannia compared to related bacteria. Despite these varying features of genome evolution, a striking correlation in the relative divergences of proteins indicates parallel functional constraints on gene functions across ecologically distinct bacterial groups. Furthermore, the increased rates of amino acid substitution and gene loss in Blochmannia have occurred in a lineage-specific fashion, which may reflect life history differences of their ant hosts.
Affiliation(s)
- Patrick H Degnan
- Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, Woods Hole, Massachusetts 02543, USA
|
21
|
Sau K, Sau S, Mandal SC, Ghosh TC. Factors influencing the synonymous codon and amino acid usage bias in AT-rich Pseudomonas aeruginosa phage PhiKZ. Acta Biochim Biophys Sin (Shanghai) 2005; 37:625-33. [PMID: 16143818 PMCID: PMC7109957 DOI: 10.1111/j.1745-7270.2005.00089.x] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.7] [Indexed: 11/28/2022]
Abstract
To reveal how the AT-rich genome of bacteriophage PhiKZ has been shaped in order to carry out its growth in the GC-rich host Pseudomonas aeruginosa, synonymous codon and amino acid usage bias of PhiKZ was investigated and the data were compared with those of P. aeruginosa. It was found that synonymous codon and amino acid usage of PhiKZ was distinct from that of P. aeruginosa. In contrast to P. aeruginosa, the third codon position of the synonymous codons of PhiKZ carries mostly A or T base; codon usage bias in PhiKZ is dictated mainly by mutational bias and, to a lesser extent, by translational selection. A cluster analysis of the relative synonymous codon usage values of 16 myoviruses including PhiKZ shows that PhiKZ is evolutionarily much closer to Escherichia coli phage T4. Further analysis reveals that the three factors of mean molecular weight, aromaticity and cysteine content are mostly responsible for the variation of amino acid usage in PhiKZ proteins, whereas amino acid usage of P. aeruginosa proteins is mainly governed by grand average of hydropathicity, aromaticity and cysteine content. Based on these observations, we suggest that codons of the phage-like PhiKZ have evolved to preferentially incorporate the smaller amino acid residues into their proteins during translation, thereby economizing the cost of its development in GC-rich P. aeruginosa.
Affiliation(s)
- K. Sau
- Department of Mathematics, Jadavpur University, Calcutta 700 032, India
- S. Sau
- Department of Biochemistry, Bose Institute, P1/12-CIT Scheme VII M, Calcutta 700 054, India
- S. C. Mandal
- Department of Mathematics, Jadavpur University, Calcutta 700 032, India
- Corresponding authors: S. C. MANDAL: E-mail,
- T. C. Ghosh
- Bioinformatics Centre, Bose Institute, P1/12-CIT Scheme VII M, Calcutta 700 054, India
- T. C. GHOSH: Tel, +91-33-2334 6626; Fax, +91-33-2334 3886; E-mail,
|
22
|
Schaber J, Rispe C, Wernegreen J, Buness A, Delmotte F, Silva FJ, Moya A. Gene expression levels influence amino acid usage and evolutionary rates in endosymbiotic bacteria. Gene 2005; 352:109-17. [PMID: 15935576 DOI: 10.1016/j.gene.2005.04.003] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.1] [Received: 09/21/2004] [Revised: 01/25/2005] [Accepted: 04/01/2005] [Indexed: 02/07/2023]
Abstract
Most endosymbiotic bacteria have extremely reduced genomes, accelerated evolutionary rates, and strong AT base compositional bias thought to reflect reduced efficacy of selection and increased mutational pressure. Here, we present a comparative study of evolutionary forces shaping five fully sequenced bacterial endosymbionts of insects. The results of this study were three-fold: (i) Stronger conservation of high expression genes at not just nonsynonymous, but also synonymous, sites. (ii) Variation in amino acid usage strongly correlates with GC content and expression level of genes. This pattern is largely explained by greater conservation of high expression genes, leading to their higher GC content. However, we also found indication of selection favoring GC-rich amino acids that contrasts with former studies. (iii) Although the specific nutritional requirements of the insect host are known to affect gene content of endosymbionts, we found no detectable influence on substitution rates, amino acid usage, or codon usage of bacterial genes involved in host nutrition.
Affiliation(s)
- Jörg Schaber
- Institut Cavanilles de Biodiversitat i Biologia Evolutiva, Universitat de Valencia, A.C. 22085, 46071 Valencia, Spain.
|