1
|
Lamolle G, Iriarte A, Simón D, Musto H. Amino acid usage and protein expression levels in the flatworm Schistosoma mansoni. Mol Biochem Parasitol 2023; 255:111581. [PMID: 37478919 DOI: 10.1016/j.molbiopara.2023.111581] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Revised: 07/10/2023] [Accepted: 07/17/2023] [Indexed: 07/23/2023]
Abstract
Schistosoma mansoni is a parasitic flatworm that causes a human disease called schistosomiasis, or bilharzia. At the genomic level, S. mansoni is AT-rich, but has some compositional heterogeneity. Indeed, some regions of its genome are GC-rich, mainly in the regions located near the extreme ends of the chromosomes. Recently, we showed that, despite the strong bias towards A/T ending codons, highly expressed genes tend to use GC-rich codons. Here, we address the following question: are highly expressed sequences biased in their amino acid frequencies? Our analyses show that these sequences in S. mansoni, as in species ranging from bacteria to human, are strongly biased in nucleotide composition. Highly expressed genes tend to use GC-rich codons (in the first and second codon positions), which code the energetically cheapest amino acids. Therefore, we conclude that amino acid usage, at least in highly expressed genes, is strongly shaped by natural selection to avoid energetically expensive residues. Whether this is an adaptation to the parasitic way of life of S. mansoni, is unclear since the same pattern occurs in free-living species.
Collapse
Affiliation(s)
- Guillermo Lamolle
- Unidad de Genómica Evolutiva, Facultad de Ciencias, Universidad de la República, Iguá 4225, 11400 Montevideo, Uruguay
| | - Andrés Iriarte
- Unidad de Genómica Evolutiva, Facultad de Ciencias, Universidad de la República, Iguá 4225, 11400 Montevideo, Uruguay; Laboratorio de Biología Computacional, Departamento de Desarrollo Biotecnológico, Instituto de Higiene, Facultad de Medicina, Universidad de la República, Avenida A. Navarro 3051, 11600 Montevideo, Uruguay
| | - Diego Simón
- Unidad de Genómica Evolutiva, Facultad de Ciencias, Universidad de la República, Iguá 4225, 11400 Montevideo, Uruguay; Laboratorio de Virología Molecular, Centro de Investigaciones Nucleares, Universidad de la República, Mataojo 2055, 11400 Montevideo, Uruguay; Laboratorio de Evolución Experimental de Virus, Institut Pasteur de Montevideo, Mataojo 2020, 11400 Montevideo, Uruguay
| | - Héctor Musto
- Unidad de Genómica Evolutiva, Facultad de Ciencias, Universidad de la República, Iguá 4225, 11400 Montevideo, Uruguay.
| |
Collapse
|
2
|
Lamolle G, Iriarte A, Musto H. Codon usage in the flatworm Schistosoma mansoni is shaped by the mutational bias towards A+T and translational selection, which increases GC-ending codons in highly expressed genes. Mol Biochem Parasitol 2021; 247:111445. [PMID: 34942292 DOI: 10.1016/j.molbiopara.2021.111445] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2021] [Revised: 12/14/2021] [Accepted: 12/17/2021] [Indexed: 11/30/2022]
Abstract
Schistosoma mansoni is a trematode flatworm that parasitizes humans and produces a disease called bilharzia. At the genomic level, it is characterized by a low genomic GC content and an "isochore-like" structure, where GC-richest regions, mainly placed at the extremes of the chromosomes, are interspersed with low GC-regions. Furthermore, the GC-richest regions are at the same time the gene-richest, and where the most heavily expressed genes are placed. Taking these features into account, we decided to reanalyze the codon usage of this flatworm. Our results show that a) when all genes are considered together, the strong mutational bias towards A + T leads to a predominance of A/T-ending codons, b) a multivariate analysis discriminates between highly and lowly expressed genes, c) the sequences expressed at highest levels display a significant increase in G/C-ending codons, d) when comparing the molecular distances with a closely related species the synonymous distance in highly expressed genes is significantly lower than in lowly expressed sequences. Therefore, we conclude that despite previous results, which were performed with a small sample of genes, codon usage in S. mansoni is the result of two forces that operate in opposite directions: while mutational bias leads to a predominance of A/T codons, translational selection, working at the level of speed, increment G/C ending triplets.
Collapse
Affiliation(s)
- Guillermo Lamolle
- Unidad de Genómica Evolutiva, Facultad de Ciencias, Universidad de la República, Iguá 4225, 11400 Montevideo, Uruguay
| | - Andrés Iriarte
- Laboratorio de Biología Computacional, Departamento de Desarrollo Biotecnológico, Instituto de Higiene, Facultad de Medicina, Universidad de la República, Avenida A. Navarro 3051, 11600 Montevideo, Uruguay.
| | - Héctor Musto
- Unidad de Genómica Evolutiva, Facultad de Ciencias, Universidad de la República, Iguá 4225, 11400 Montevideo, Uruguay.
| |
Collapse
|
3
|
Analysis of Codon Usage Patterns in Giardia duodenalis Based on Transcriptome Data from GiardiaDB. Genes (Basel) 2021; 12:genes12081169. [PMID: 34440343 PMCID: PMC8393687 DOI: 10.3390/genes12081169] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2021] [Revised: 07/24/2021] [Accepted: 07/27/2021] [Indexed: 12/03/2022] Open
Abstract
Giardia duodenalis, a flagellated parasitic protozoan, the most common cause of parasite-induced diarrheal diseases worldwide. Codon usage bias (CUB) is an important evolutionary character in most species. However, G. duodenalis CUB remains unclear. Thus, this study analyzes codon usage patterns to assess the restriction factors and obtain useful information in shaping G. duodenalis CUB. The neutrality analysis result indicates that G. duodenalis has a wide GC3 distribution, which significantly correlates with GC12. ENC-plot result—suggesting that most genes were close to the expected curve with only a few strayed away points. This indicates that mutational pressure and natural selection played an important role in the development of CUB. The Parity Rule 2 plot (PR2) result demonstrates that the usage of GC and AT was out of proportion. Interestingly, we identified 26 optimal codons in the G. duodenalis genome, ending with G or C. In addition, GC content, gene expression, and protein size also influence G. duodenalis CUB formation. This study systematically analyzes G. duodenalis codon usage pattern and clarifies the mechanisms of G. duodenalis CUB. These results will be very useful to identify new genes, molecular genetic manipulation, and study of G. duodenalis evolution.
Collapse
|
4
|
Huo X, Liu S, Li Y, Wei H, Gao J, Yan Y, Zhang G, Liu M. Analysis of synonymous codon usage of transcriptome database in Rheum palmatum. PeerJ 2021; 9:e10450. [PMID: 33505783 PMCID: PMC7789865 DOI: 10.7717/peerj.10450] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2020] [Accepted: 11/09/2020] [Indexed: 12/21/2022] Open
Abstract
Background Rheum palmatum is an endangered and important medicinal plant in Asian countries, especially in China. However, there is little knowledge about the codon usage bias for R. palmatum CDSs. In this project, codon usage bias was determined based on the R. palmatum 2,626 predicted CDSs from R. palmatum transcriptome. Methods In this study, all codon usage bias parameters and nucleotide compositions were calculated by Python script, Codon W, DNA Star, CUSP of EMBOSS. Results The average GC and GC3 content are 46.57% and 46.6%, respectively, the results suggested that there exists a little more AT than GC in the R. palmatum genes, and the codon bias of R. palmatum genes preferred to end with A/T. We concluded that the codon bias in R. palmatum was affect by nucleotide composition, mutation pressure, natural selection, gene expression levels, and the mutation pressure is the prominent factor. In addition, we figured out 28 optimal codons and most of them ended with A or U. The project here can offer important information for further studies on enhancing the gene expression using codon optimization in heterogeneous expression system, predicting the genetic and evolutionary mechanisms in R. palmatum.
Collapse
Affiliation(s)
- Xiaowei Huo
- College of Pharmaceutical Science, Institute of Life Science and Green Development, Hebei University, Baoding, China
| | - Sisi Liu
- Hunan Academy of Forestry, Changsha, China
| | - Yimin Li
- College of Pharmacy and Shaanxi Provincial Key Laboratory for Chinese Medicine Basis & New Drugs Research, Shaanxi University of Chinese Medicine, Xi'an, China
| | - Hao Wei
- College of Pharmacy and Shaanxi Provincial Key Laboratory for Chinese Medicine Basis & New Drugs Research, Shaanxi University of Chinese Medicine, Xi'an, China
| | - Jing Gao
- College of Pharmacy and Shaanxi Provincial Key Laboratory for Chinese Medicine Basis & New Drugs Research, Shaanxi University of Chinese Medicine, Xi'an, China
| | - Yonggang Yan
- College of Pharmacy and Shaanxi Provincial Key Laboratory for Chinese Medicine Basis & New Drugs Research, Shaanxi University of Chinese Medicine, Xi'an, China
| | - Gang Zhang
- College of Pharmacy and Shaanxi Provincial Key Laboratory for Chinese Medicine Basis & New Drugs Research, Shaanxi University of Chinese Medicine, Xi'an, China
| | - Mengmeng Liu
- College of Traditional Chinese Medicine, Hebei University, Baoding, China
| |
Collapse
|
5
|
Uddin A, Mazumder TH, Barbhuiya PA, Chakraborty S. Similarities and dissimilarities of codon usage in mitochondrial ATP genes among fishes, aves, and mammals. IUBMB Life 2020; 72:899-914. [DOI: 10.1002/iub.2231] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2019] [Accepted: 01/05/2020] [Indexed: 11/06/2022]
Affiliation(s)
- Arif Uddin
- Department of ZoologyMoinul Hoque Choudhury Memorial Science College Hailakandi Assam India
| | | | | | | |
Collapse
|
6
|
Lee DH, Chu KB, Kang HJ, Lee SH, Chopra M, Choi HJ, Moon EK, Inn KS, Quan FS. Protection induced by malaria virus-like particles containing codon-optimized AMA-1 of Plasmodium berghei. Malar J 2019; 18:394. [PMID: 31796032 PMCID: PMC6888966 DOI: 10.1186/s12936-019-3017-2] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2019] [Accepted: 11/20/2019] [Indexed: 12/27/2022] Open
Abstract
Background Despite the extensive endeavours, developing an effective malaria vaccine remains as a great challenge. Apical membrane antigen 1 (AMA-1) located on the merozoite surface of parasites belonging to the genus Plasmodium is involved in red blood cell invasion. Methods Influenza virus-like particle (VLP) vaccines containing codon-optimized or native (non-codon optimized) AMA-1 from Plasmodium berghei were generated. VLP-induced protective immunity was evaluated in a mouse model. Results Mice immunized with VLP vaccine containing the codon-optimized AMA-1 elicited higher levels of P. berghei-specific IgG and IgG2a antibody responses compared to VLPs containing non-codon optimized AMA-1 before and after challenge infection. Codon-optimized AMA-1 VLP vaccination induced higher levels of CD4+ T cells, CD8+ T cells, B cells, and germinal centre cell responses compared to non-codon optimized AMA-1 VLPs. Importantly, the codon-optimized AMA-1 VLP vaccination showed lower body weight loss, longer survival and a significant decrease in parasitaemia compared to non-codon optimized VLP vaccination. Conclusion Overall, VLP vaccine expressing codon-optimized AMA-1 induced better protective efficacy than VLPs expressing the non-codon optimized AMA-1. Current findings highlight the importance of codon-optimization for vaccine use and its potential involvement in future malaria vaccine design strategies.
Collapse
Affiliation(s)
- Dong-Hun Lee
- Department of Biomedical Science, Graduate School, Kyung Hee University, Seoul, Korea
| | - Ki-Back Chu
- Department of Biomedical Science, Graduate School, Kyung Hee University, Seoul, Korea
| | - Hae-Ji Kang
- Department of Biomedical Science, Graduate School, Kyung Hee University, Seoul, Korea
| | - Su-Hwa Lee
- Department of Biomedical Science, Graduate School, Kyung Hee University, Seoul, Korea
| | - Manika Chopra
- Department of Chemical and Materials Engineering, University of Alberta, Edmonton, AB, T6G 1H9, Canada
| | - Hyo-Jick Choi
- Department of Chemical and Materials Engineering, University of Alberta, Edmonton, AB, T6G 1H9, Canada
| | - Eun-Kyung Moon
- Department of Medical Zoology, Kyung Hee University School of Medicine, Seoul, Republic of Korea
| | - Kyung-Soo Inn
- Department of Pharmaceutical Science, College of Pharmacy, Kyung Hee University, Seoul, 02447, Republic of Korea
| | - Fu-Shi Quan
- Department of Medical Zoology, Kyung Hee University School of Medicine, Seoul, Republic of Korea. .,Medical Research Center for Bioreaction to Reactive Oxygen Species and Biomedical Science Institute, School of Medicine, Graduate School, Kyung Hee University, Seoul, Republic of Korea.
| |
Collapse
|
7
|
Barbhuiya RI, Uddin A, Chakraborty S. Compositional properties and codon usage pattern of mitochondrial ATP gene in different classes of Arthropoda. Genetica 2019; 147:231-248. [PMID: 31152294 DOI: 10.1007/s10709-019-00067-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2018] [Accepted: 05/22/2019] [Indexed: 12/17/2022]
Abstract
Codon usage bias (CUB) is defined as the usage of synonymous codons unequally for an amino acid in a gene transcript. It is influenced by both mutation pressure and natural selection and is a species-specific property. In our current study, we used bioinformatic methods to investigate the coding sequences of mitochondrial adenosine triphosphate gene (MT-ATP) in different classes of arthropoda to know the codon usage pattern of the gene as no work was described earlier. The analysis of compositional properties suggested that the gene is AT rich. The effective number of codons revealed the CUB of both ATP6 and ATP8 gene was moderate. Heat map showed that the codons ending with AT were negatively associated with GC3 while the codons ending with GC were positively associated with GC3 in all the classes of arthropoda. Correspondence study revealed that the pattern of codon usage of ATP6 and ATP8 genes differed across classes. Neutrality plot suggested the codon usage bias of these two genes in phylum arthropoda was influenced by both mutation pressure and natural selection.
Collapse
Affiliation(s)
| | - Arif Uddin
- Department of Zoology, Moinul Hoque Choudhury Science College, Algapur, Hailakandi, Assam, 788150, India
| | - Supriyo Chakraborty
- Department of Biotechnology, Assam University, Silchar, Assam, 788011, India.
| |
Collapse
|
8
|
Barbhuiya PA, Uddin A, Chakraborty S. Genome‐wide comparison of codon usage dynamics in mitochondrial genes across different species of amphibian genus
Bombina. JOURNAL OF EXPERIMENTAL ZOOLOGY PART B-MOLECULAR AND DEVELOPMENTAL EVOLUTION 2019; 332:99-112. [DOI: 10.1002/jez.b.22852] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/10/2018] [Revised: 03/10/2019] [Accepted: 03/20/2019] [Indexed: 01/16/2023]
Affiliation(s)
| | - Arif Uddin
- Department of ZoologyMoinul Hoque Choudhury Memorial Science CollegeHailakandi Assam India
| | | |
Collapse
|
9
|
Uddin A, Mazumder TH, Chakraborty S. Understanding molecular biology of codon usage in mitochondrial complex IV genes of electron transport system: Relevance to mitochondrial diseases. J Cell Physiol 2018; 234:6397-6413. [DOI: 10.1002/jcp.27375] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2018] [Accepted: 08/17/2018] [Indexed: 12/17/2022]
Affiliation(s)
- Arif Uddin
- Department of Zoology Moinul Hoque Choudhury Memorial Science College Hailakandi Assam India
| | | | | |
Collapse
|
10
|
Sinha I, Woodrow CJ. Forces acting on codon bias in malaria parasites. Sci Rep 2018; 8:15984. [PMID: 30374097 PMCID: PMC6206010 DOI: 10.1038/s41598-018-34404-9] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2017] [Accepted: 10/16/2018] [Indexed: 11/09/2022] Open
Abstract
Malaria parasite genomes have a range of codon biases, with Plasmodium falciparum one of the most AT-biased genomes known. We examined the make up of synonymous coding sites and stop codons in the core genomes of representative malaria parasites, showing first that local DNA context influences codon bias similarly across P. falciparum, P. vivax and P. berghei, with suppression of CpG dinucleotides and enhancement of CpC dinucleotides, both within and aross codons. Intense asexual phase gene expression in P. falciparum and P. berghei is associated with increased A3:G3 bias but reduced T3:C3 bias at 2-fold sites, consistent with adaptation of codons to tRNA pools and avoidance of wobble tRNA interactions that potentially slow down translation. In highly expressed genes, the A3:G3 ratio can exceed 30-fold while the T3:C3 ratio can be less than 1, according to the encoded amino acid and subsequent base. Lysine codons (AAA/G) show distinctive behaviour with substantially reduced A3:G3 bias in highly expressed genes, perhaps because of selection against frameshifting when the AAA codon is followed by another adenine. Intense expression is also associated with a strong bias towards TAA stop codons (found in 94% and 89% of highly expressed P. falciparum and P. berghei genes respectively) and a proportional rise in the TAAA stop ‘tetranucleotide’. The presence of these expression-linked effects in the relatively AT-rich malaria parasite species adds weight to the suggestion that AT-richness in the Plasmodium genus might be a fitness adaptation. Potential explanations for the relative lack of codon bias in P. vivax include the distinct features of its lifecycle and its effective population size over evolutionary time.
Collapse
Affiliation(s)
- I Sinha
- Mahidol-Oxford Tropical Medicine Research Unit (MORU), Mahidol University, Bangkok, Thailand.,Centre for Tropical Medicine and Global Health, University of Oxford, Oxford, UK
| | - C J Woodrow
- Mahidol-Oxford Tropical Medicine Research Unit (MORU), Mahidol University, Bangkok, Thailand. .,Centre for Tropical Medicine and Global Health, University of Oxford, Oxford, UK.
| |
Collapse
|
11
|
Gajbhiye S, Patra P, Yadav MK. New insights into the factors affecting synonymous codon usage in human infecting Plasmodium species. Acta Trop 2017; 176:29-33. [PMID: 28751162 DOI: 10.1016/j.actatropica.2017.07.025] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2017] [Revised: 07/18/2017] [Accepted: 07/21/2017] [Indexed: 02/07/2023]
Abstract
Codon usage bias is due to the non-random usage of synonymous codons for coding amino acids. The synonymous sites are under weak selection, and codon usage bias is maintained by the equilibrium in mutational bias, genetic drift and selection pressure. The differential codon usage choices are also relevant to human infecting Plasmodium species. Recently, P. knowlesi switches its natural host, long-tailed macaques, and starts infecting humans. This review focuses on the comparative analysis of codon usage choices among human infecting P. falciparum and P. vivax along with P. knowlesi species taking their coding sequence data. The variation in GC content, amino acid frequencies, effective number of codons and other factors plays a crucial role in determining synonymous codon choices. Within species codon choices are more similar for P. vivax and P. knowlesi in comparison with P. falciparum species. This study suggests that synonymous codon choice modulates the gene expression level, mRNA stability, ribosome speed, protein folding, translation efficiency and its accuracy in Plasmodium species, and provides a valuable information regarding the codon usage pattern to facilitate gene cloning as well as expression and transfection studies for malaria causing species.
Collapse
|
12
|
Nath Choudhury M, Uddin A, Chakraborty S. Codon usage bias and its influencing factors for Y-linked genes in human. Comput Biol Chem 2017; 69:77-86. [DOI: 10.1016/j.compbiolchem.2017.05.005] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2016] [Revised: 05/04/2017] [Accepted: 05/20/2017] [Indexed: 11/30/2022]
|
13
|
Uddin A, Choudhury MN, Chakraborty S. Factors influencing codon usage of mitochondrial ND1 gene in pisces, aves and mammals. Mitochondrion 2017; 37:17-26. [PMID: 28668667 DOI: 10.1016/j.mito.2017.06.004] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2016] [Revised: 05/19/2017] [Accepted: 06/26/2017] [Indexed: 01/05/2023]
Abstract
Animal mitochondrial genome harbours 13 protein coding genes which regulate the process of respiration. The mitochondrial NADH dehydrogenase 1 (MT-ND1) gene, one of the 13 protein-coding genes, encodes the NADH dehydrogenase 1 enzyme of the respiratory chain. Analysis of codon usage bias (CUB) acquires importance for better understanding of the molecular biology, new gene discovery, design of transgenes and gene evolution. The MT-ND1 gene seems to be a good candidate for analyzing codon usage pattern, since no work has yet been reported. Moreover, it is still not clear which factors significantly influence the codon usage pattern. In the present study, comparative analysis of codon usage pattern, expression level and influencing factors for MT-ND1 gene from 100 different species each of pisces, aves and mammals were used for CUB analysis. Our result suggests that the gene is AT rich in pisces, aves and mammals and most of the nucleotides significantly differ among them as revealed from t-test. CUB was not remarkable as reflected by high value of effective number of codons and it also significantly differs among pisces, aves and mammals. Although we found that CUB is mainly influenced by natural selection and mutation pressure for MT-ND1 gene as suggested by correlation and correspondence analysis but neutrality plot further revealed that natural selection played a major role and mutation pressure played a minor role in codon usage pattern. Additionally, t-test analysis showed that the MT-ND1 gene has a wide significant discrepancy in codon choices in pisces, aves and mammals. This study has contributed to boost our understanding about the mechanism of distribution of the codons and the factors that may influence the evolution of the MT-ND1 gene.
Collapse
Affiliation(s)
- Arif Uddin
- Department of Zoology, Moinul Hoque Choudhury Memorial Science College, Algapur, Hailakandi 788150, Assam, India.
| | | | - Supriyo Chakraborty
- Department of Biotechnology, Assam University, Silchar 788011, Assam, India.
| |
Collapse
|
14
|
Frequent GU wobble pairings reduce translation efficiency in Plasmodium falciparum. Sci Rep 2017; 7:723. [PMID: 28389662 PMCID: PMC5429705 DOI: 10.1038/s41598-017-00801-9] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2016] [Accepted: 03/13/2017] [Indexed: 11/08/2022] Open
Abstract
Plasmodium falciparum genome has 81% A+T content. This nucleotide bias leads to extreme codon usage bias and culminates in frequent insertion of asparagine homorepeats in the proteome. Using recodonized GFP sequences, we show that codons decoded via G:U wobble pairing are suboptimal codons that are negatively associated to protein translation efficiency. Despite this, one third of all codons in the genome are GU wobble codons, suggesting that codon usage in P. falciparum has not been driven to maximize translation efficiency, but may have evolved as translational regulatory mechanism. Particularly, asparagine homorepeats are generally encoded by locally clustered GU wobble AAT codons, we demonstrated that this GU wobble-rich codon context is the determining factor that causes reduction of protein level. Moreover, insertion of clustered AAT codons also causes destabilization of the transcripts. Interestingly, more frequent asparagine homorepeats insertion is seen in single-exon genes, suggesting transcripts of these genes may have been programmed for rapid mRNA decay to compensate for the inefficiency of mRNA surveillance regulation on intronless genes. To our knowledge, this is the first study that addresses P. falciparum codon usage in vitro and provides new insights on translational regulation and genome evolution of this parasite.
Collapse
|
15
|
Zhao Y, Zheng H, Xu A, Yan D, Jiang Z, Qi Q, Sun J. Analysis of codon usage bias of envelope glycoprotein genes in nuclear polyhedrosis virus (NPV) and its relation to evolution. BMC Genomics 2016; 17:677. [PMID: 27558469 PMCID: PMC4997668 DOI: 10.1186/s12864-016-3021-7] [Citation(s) in RCA: 42] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2016] [Accepted: 08/16/2016] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Analysis of codon usage bias is an extremely versatile method using in furthering understanding of the genetic and evolutionary paths of species. Codon usage bias of envelope glycoprotein genes in nuclear polyhedrosis virus (NPV) has remained largely unexplored at present. Hence, the codon usage bias of NPV envelope glycoprotein was analyzed here to reveal the genetic and evolutionary relationships between different viral species in baculovirus genus. RESULTS A total of 9236 codons from 18 different species of NPV of the baculovirus genera were used to perform this analysis. Glycoprotein of NPV exhibits weaker codon usage bias. Neutrality plot analysis and correlation analysis of effective number of codons (ENC) values indicate that natural selection is the main factor influencing codon usage bias, and that the impact of mutation pressure is relatively smaller. Another cluster analysis shows that the kinship or evolutionary relationships of these viral species can be divided into two broad categories despite all of these 18 species are from the same baculovirus genus. CONCLUSIONS There are many elements that can affect codon bias, such as the composition of amino acids, mutation pressure, natural selection, gene expression level, and etc. In the meantime, cluster analysis also illustrates that codon usage bias of virus envelope glycoprotein can serve as an effective means of evolutionary classification in baculovirus genus.
Collapse
Affiliation(s)
- Yongchao Zhao
- Subtropical Sericulture and Mulberry Resources Protection and Safety Engineering Research Center, Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, People's Republic of China
| | - Hao Zheng
- Subtropical Sericulture and Mulberry Resources Protection and Safety Engineering Research Center, Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, People's Republic of China
| | - Anying Xu
- Sericultural Research Institute, Chinese Academy of Agricultural Sciences, Zhenjiang Jiangsu, 212018, People's Republic of China
| | - Donghua Yan
- Subtropical Sericulture and Mulberry Resources Protection and Safety Engineering Research Center, Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, People's Republic of China
| | - Zijian Jiang
- Subtropical Sericulture and Mulberry Resources Protection and Safety Engineering Research Center, Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, People's Republic of China
| | - Qi Qi
- Subtropical Sericulture and Mulberry Resources Protection and Safety Engineering Research Center, Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, People's Republic of China
| | - Jingchen Sun
- Subtropical Sericulture and Mulberry Resources Protection and Safety Engineering Research Center, Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, People's Republic of China.
| |
Collapse
|
16
|
Uddin A, Chakraborty S. Codon usage trend in mitochondrial CYB gene. Gene 2016; 586:105-14. [DOI: 10.1016/j.gene.2016.04.005] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2015] [Revised: 03/11/2016] [Accepted: 04/02/2016] [Indexed: 11/25/2022]
|
17
|
Expression levels and codon usage patterns in nuclear genes of the filarial nematode Wucheraria bancrofti and the blood fluke Schistosoma haematobium. J Helminthol 2016; 91:72-79. [PMID: 27048929 DOI: 10.1017/s0022149x16000092] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Synonymous codons are used with different frequencies, a phenomenon known as codon bias, which exists in many genomes and is mainly resolute by mutation and selection. To elucidate the genetic characteristics and evolutionary relationship of Wucheraria bancrofti and Schistosoma haematobium we examined the pattern of synonymous codon usage in nuclear genes of both the species. The mean overall GC contents of W. bancrofti and S. haematobium were 43.41 and 36.37%, respectively, which suggests that genes in both the species were AT rich. The value of the High Effective Number of Codons in both species suggests that codon usage bias was weak. Both species had a wide range of P3 distribution in the neutrality plot, with a significant correlation between P12 and P3. The codons were closer to the axes in correspondence analysis, suggesting that mutation pressure influenced the codon usage pattern in these species. We have identified the more frequently used codons in these species, most codons ending with an A or T. The nucleotides A/T and C/G were not proportionally used at the third position of codons, which reveals that natural selection might influence the codon usage patterns. The regression equation of P12 on P3 suggests that natural selection might have played a major role, while mutational pressure played a minor role in codon usage pattern in both species. These results form the basis of exploring the evolutionary mechanisms and the heterologous expression of medically important proteins of W. bancrofti and S. haematobium.
Collapse
|
18
|
Luo Y, Huang C, Ye J, Fang W, Gu W, Chen Z, Li H, Wang X, Jin D. Genome Sequence and Analysis of Peptoclostridium difficile Strain ZJCDC-S82. Evol Bioinform Online 2016; 12:41-9. [PMID: 26823648 PMCID: PMC4727486 DOI: 10.4137/ebo.s32476] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2015] [Revised: 11/26/2015] [Accepted: 12/01/2015] [Indexed: 12/18/2022] Open
Abstract
Peptoclostridium difficile (Clostridium difficile) is the major pathogen associated with infectious diarrhea in humans. Concomitant with the increased incidence of C. difficile infection worldwide, there is an increasing concern regarding this infection type. This study reports a draft assembly and detailed sequence analysis of C. difficile strain ZJCDC-S82. The de novo assembled genome was 4.19 Mb in size, which includes 4,013 protein-coding genes, 41 rRNA genes, and 84 tRNA genes. Along with the nuclear genome, we also assembled sequencing information for a single plasmid consisting of 11,930 nucleotides. Comparative genomic analysis of C. difficile ZJCDC-S82 and two other previously published strains, such as M120 and CD630, showed extensive similarity. Phylogenetic analysis revealed that genetic diversity among C. difficile strains was not influenced by geographic location. Evolutionary analysis suggested that four genes encoding surface proteins exhibited positive selection in C. difficile ZJCDC-S82. Codon usage analysis indicated that C. difficile ZJCDC-S82 had high codon usage bias toward A/U-ended codons. Furthermore, codon usage patterns in C. difficile ZJCDC-S82 were predominantly affected by mutation pressure. Our results provide detailed information pertaining to the C. difficile genome associated with a strain from mainland China. This analysis will facilitate the understanding of genomic diversity and evolution of C. difficile strains in this region.
Collapse
Affiliation(s)
- Yun Luo
- Department of Microbiology, Zhejiang Provincial Center for Disease Control and Prevention, Hangzhou, Zhejiang, China
| | - Chen Huang
- Department of Microbiology, Zhejiang Provincial Center for Disease Control and Prevention, Hangzhou, Zhejiang, China
| | - Julian Ye
- Department of Microbiology, Zhejiang Provincial Center for Disease Control and Prevention, Hangzhou, Zhejiang, China
| | - Weijia Fang
- Department of Medical Oncology, The First Affiliated Hospital, Zhejiang University, Hangzhou, Zhejiang, China
| | - Wanjun Gu
- Research Center for Learning Science, Southeast University, Nanjing, Jiangsu, China
| | - Zhiping Chen
- Department of Microbiology, Zhejiang Provincial Center for Disease Control and Prevention, Hangzhou, Zhejiang, China
| | - Hui Li
- Shanghai Huirui Biotechnology Co., Ltd., Shanghai, China
| | - XianJun Wang
- Department of Laboratory Medicine, Hangzhou First People's Hospital, Hangzhou, Zhejiang, China
| | - Dazhi Jin
- Department of Microbiology, Zhejiang Provincial Center for Disease Control and Prevention, Hangzhou, Zhejiang, China
| |
Collapse
|
19
|
Uddin A, Mazumder TH, Choudhury MN, Chakraborty S. Codon bias and gene expression of mitochondrial ND2 gene in chordates. Bioinformation 2015; 11:407-12. [PMID: 26420922 PMCID: PMC4574124 DOI: 10.6026/97320630011407] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2015] [Accepted: 08/24/2015] [Indexed: 11/23/2022] Open
Abstract
Background: Mitochondrial ND gene, which encodes NADH dehydrogenase, is the first enzyme of the mitochondrial electron
transport chain. Leigh syndrome, a neurodegenerative disease caused by mutation in the ND2 gene (T4681C), is associated with
bilateral symmetric lesions in basal ganglia and subcortical brain regions. Therefore, it is of interest to analyze mitochondrial DNA
to glean information for evolutionary relationship. This study highlights on the analysis of compositional dynamics and selection
pressure in shaping the codon usage patterns in the coding sequence of MT-ND2 gene across pisces, aves and mammals by using
bioinformatics tools like effective number of codons (ENC), codon adaptation index (CAI), relative synonymous codon usage
(RSCU) etc. Results: We observed a low codon usage bias as reflected by high ENC values in MT-ND2 gene among pisces, aves
and mammals. The most frequently used codons were ending with A/C at the 3rd position of codon and the gene was AT rich in
all the three classes. The codons TCA, CTA, CGA and TGA were over represented in all three classes. The F1 correspondence
showed significant positive correlation with G, T3 and CAI while the F2 axis showed significant negative correlation with A and T
but significant positive correlation with G, C, G3, C3, ENC, GC, GC1, GC2 and GC3. Conclusions: The codon usage bias in MTND2
gene is not associated with expression level. Mutation pressure and natural selection affect the codon usage pattern in MT-ND
2 gene.
Collapse
Affiliation(s)
- Arif Uddin
- Department of Biotechnology, Assam University, Silchar-788011, Assam, India
| | | | | | | |
Collapse
|
20
|
Jia X, Liu S, Zheng H, Li B, Qi Q, Wei L, Zhao T, He J, Sun J. Non-uniqueness of factors constraint on the codon usage in Bombyx mori. BMC Genomics 2015; 16:356. [PMID: 25943559 PMCID: PMC4422305 DOI: 10.1186/s12864-015-1596-z] [Citation(s) in RCA: 69] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2015] [Accepted: 04/28/2015] [Indexed: 11/10/2022] Open
Abstract
Background The analysis of codon usage is a good way to understand the genetic and evolutionary characteristics of an organism. However, there are only a few reports related with the codon usage of the domesticated silkworm, Bombyx mori (B. mori). Hence, the codon usage of B. mori was analyzed here to reveal the constraint factors and it could be helpful to improve the bioreactor based on B. mori. Results A total of 1,097 annotated mRNA sequences from B. mori were analyzed, revealing there is only a weak codon bias. It also shows that the gene expression level is related to the GC content, and the amino acids with higher general average hydropathicity (GRAVY) and aromaticity (Aromo). And the genes on the primary axis are strongly positively correlated with the GC content, and GC3s. Meanwhile, the effective number of codons (ENc) is strongly correlated with codon adaptation index (CAI), gene length, and Aromo values. However, the ENc values are correlated with the second axis, which indicates that the codon usage in B. mori is affected by not only mutation pressure and natural selection, but also nucleotide composition and the gene expression level. It is also associated with Aromo values, and gene length. Additionally, B. mori has a greater relative discrepancy in codon preferences with Drosophila melanogaster (D. melanogaster) or Saccharomyces cerevisiae (S. cerevisiae) than with Arabidopsis thaliana (A. thaliana), Escherichia coli (E. coli), or Caenorhabditis elegans (C. elegans). Conclusions The codon usage bias in B. mori is relatively weak, and many influence factors are found here, such as nucleotide composition, mutation pressure, natural selection, and expression level. Additionally, it is also associated with Aromo values, and gene length. Among them, natural selection might play a major role. Moreover, the “optimal codons” of B. mori are all encoded by G and C, which provides useful information for enhancing the gene expression in B. mori through codon optimization. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-1596-z) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Xian Jia
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, and Subtropical Sericulture and Mulberry Resources Protection and Safety Engineering Research Centre, College of Animal Science, South China Agricultural University, Guangzhou, 510642, People's Republic of China.
| | - Shuyu Liu
- Guangzhou East Campus Lab Centre, Sun Yat-sen University, Guangzhou, 510006, People's Republic of China.
| | - Hao Zheng
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, and Subtropical Sericulture and Mulberry Resources Protection and Safety Engineering Research Centre, College of Animal Science, South China Agricultural University, Guangzhou, 510642, People's Republic of China.
| | - Bo Li
- Innovative Drug Research Centre, Chongqing University, Chongqing, 401331, People's Republic of China.
| | - Qi Qi
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, and Subtropical Sericulture and Mulberry Resources Protection and Safety Engineering Research Centre, College of Animal Science, South China Agricultural University, Guangzhou, 510642, People's Republic of China.
| | - Lei Wei
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, and Subtropical Sericulture and Mulberry Resources Protection and Safety Engineering Research Centre, College of Animal Science, South China Agricultural University, Guangzhou, 510642, People's Republic of China.
| | - Taiyi Zhao
- Wageningen University, Wageningen, 6708 PG, Netherlands.
| | - Jian He
- Guangzhou East Campus Lab Centre, Sun Yat-sen University, Guangzhou, 510006, People's Republic of China.
| | - Jingchen Sun
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, and Subtropical Sericulture and Mulberry Resources Protection and Safety Engineering Research Centre, College of Animal Science, South China Agricultural University, Guangzhou, 510642, People's Republic of China.
| |
Collapse
|
21
|
Ponce de Leon M, de Miranda AB, Alvarez-Valin F, Carels N. The Purine Bias of Coding Sequences is Determined by Physicochemical Constraints on Proteins. Bioinform Biol Insights 2014; 8:93-108. [PMID: 24899802 PMCID: PMC4039185 DOI: 10.4137/bbi.s13161] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2013] [Revised: 11/24/2013] [Accepted: 11/24/2013] [Indexed: 01/02/2023] Open
Abstract
For this report, we analyzed protein secondary structures in relation to the statistics of three nucleotide codon positions. The purpose of this investigation was to find which properties of the ribosome, tRNA or protein level, could explain the purine bias (Rrr) as it is observed in coding DNA. We found that the Rrr pattern is the consequence of a regularity (the codon structure) resulting from physicochemical constraints on proteins and thermodynamic constraints on ribosomal machinery. The physicochemical constraints on proteins mainly come from the hydropathy and molecular weight (MW) of secondary structures as well as the energy cost of amino acid synthesis. These constraints appear through a network of statistical correlations, such as (i) the cost of amino acid synthesis, which is in favor of a higher level of guanine in the first codon position, (ii) the constructive contribution of hydropathy alternation in proteins, (iii) the spatial organization of secondary structure in proteins according to solvent accessibility, (iv) the spatial organization of secondary structure according to amino acid hydropathy, (v) the statistical correlation of MW with protein secondary structures and their overall hydropathy, (vi) the statistical correlation of thymine in the second codon position with hydropathy and the energy cost of amino acid synthesis, and (vii) the statistical correlation of adenine in the second codon position with amino acid complexity and the MW of secondary protein structures. Amino acid physicochemical properties and functional constraints on proteins constitute a code that is translated into a purine bias within the coding DNA via tRNAs. In that sense, the Rrr pattern within coding DNA is the effect of information transfer on nucleotide composition from protein to DNA by selection according to the codon positions. Thus, coding DNA structure and ribosomal machinery co-evolved to minimize the energy cost of protein coding given the functional constraints on proteins.
Collapse
Affiliation(s)
- Miguel Ponce de Leon
- Sección Biomatemática, Facultad de Ciencias, Universidad de la República, Iguá, Montevideo, Uruguay
| | - Antonio Basilio de Miranda
- Fundação Oswaldo Cruz (FIOCRUZ), Instituto Oswaldo Cruz (IOC), Laboratório de Genômica Funcional e Bioinformática, Rio de Janeiro, RJ, Brazil
| | - Fernando Alvarez-Valin
- Sección Biomatemática, Facultad de Ciencias, Universidad de la República, Iguá, Montevideo, Uruguay
| | - Nicolas Carels
- Fundação Oswaldo Cruz (FIOCRUZ), Instituto Oswaldo Cruz (IOC), Laboratório de Genômica Funcional e Bioinformática, Rio de Janeiro, RJ, Brazil
| |
Collapse
|
22
|
Ross LS, Gamo FJ, Lafuente-Monasterio MJ, Singh OMP, Rowland P, Wiegand RC, Wirth DF. In vitro resistance selections for Plasmodium falciparum dihydroorotate dehydrogenase inhibitors give mutants with multiple point mutations in the drug-binding site and altered growth. J Biol Chem 2014; 289:17980-95. [PMID: 24782313 PMCID: PMC4140291 DOI: 10.1074/jbc.m114.558353] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
Malaria is a preventable and treatable disease; yet half of the world's population lives at risk of infection, and an estimated 660,000 people die of malaria-related causes every year. Rising drug resistance threatens to make malaria untreatable, necessitating both the discovery of new antimalarial agents and the development of strategies to identify and suppress the emergence and spread of drug resistance. We focused on in-development dihydroorotate dehydrogenase (DHODH) inhibitors. Characterizing resistance pathways for antimalarial agents not yet in clinical use will increase our understanding of the potential for resistance. We identified resistance mechanisms of Plasmodium falciparum (Pf) DHODH inhibitors via in vitro resistance selections. We found 11 point mutations in the PfDHODH target. Target gene amplification and unknown mechanisms also contributed to resistance, albeit to a lesser extent. These mutant parasites were often hypersensitive to other PfDHODH inhibitors, which immediately suggested a novel combination therapy approach to preventing resistance. Indeed, a combination of wild-type and mutant-type selective inhibitors led to resistance far less often than either drug alone. The effects of point mutations in PfDHODH were corroborated with purified recombinant wild-type and mutant-type PfDHODH proteins, which showed the same trends in drug response as the cognate cell lines. Comparative growth assays demonstrated that two mutant parasites grew less robustly than their wild-type parent, and the purified protein of those mutants showed a decrease in catalytic efficiency, thereby suggesting a reason for the diminished growth rate. Co-crystallography of PfDHODH with three inhibitors suggested that hydrophobic interactions are important for drug binding and selectivity.
Collapse
Affiliation(s)
- Leila S Ross
- From the Department of Immunology and Infectious Diseases, Harvard School of Public Health, Boston, Massachusetts 02115
| | - Francisco Javier Gamo
- the Tres Cantos Medicines Development Campus, GlaxoSmithKline, 28760 Tres Cantos, Madrid, Spain
| | | | - Onkar M P Singh
- the Computational and Structural Chemistry, GlaxoSmithKline, Stevenage, Hertfordshire SG1 2NY, United Kingdom, and
| | - Paul Rowland
- the Computational and Structural Chemistry, GlaxoSmithKline, Stevenage, Hertfordshire SG1 2NY, United Kingdom, and
| | - Roger C Wiegand
- the Infectious Disease Initiative, Broad Institute, Cambridge, Massachusetts 02142
| | - Dyann F Wirth
- From the Department of Immunology and Infectious Diseases, Harvard School of Public Health, Boston, Massachusetts 02115, the Infectious Disease Initiative, Broad Institute, Cambridge, Massachusetts 02142
| |
Collapse
|
23
|
Raiford DW, Heizer EM, Miller RV, Doom TE, Raymer ML, Krane DE. Metabolic and translational efficiency in microbial organisms. J Mol Evol 2012; 74:206-16. [PMID: 22538926 DOI: 10.1007/s00239-012-9500-9] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2011] [Accepted: 04/05/2012] [Indexed: 11/25/2022]
Abstract
Metabolic efficiency, as a selective force shaping proteomes, has been shown to exist in Escherichia coli and Bacillus subtilis and in a small number of organisms with photoautotrophic and thermophilic lifestyles. Earlier attempts at larger-scale analyses have utilized proxies (such as molecular weight) for biosynthetic cost, and did not consider lifestyle or auxotrophy. This study extends the analysis to all currently sequenced microbial organisms that are amenable to these analyses while utilizing lifestyle specific amino acid biosynthesis pathways (where possible) to determine protein production costs and compensating for auxotrophy. The tendency for highly expressed proteins (with adherence to codon usage bias as a proxy for expressivity) to utilize less biosynthetically expensive amino acids is taken as evidence of cost selection. A comprehensive analysis of sequenced genomes to identify those that exhibit strong translational efficiency bias (389 out of 1,700 sequenced organisms) is also presented.
Collapse
Affiliation(s)
- Douglas W Raiford
- Department of Computer Science, University of Montana, Missoula, MT, USA.
| | | | | | | | | | | |
Collapse
|
24
|
Explaining complex codon usage patterns with selection for translational efficiency, mutation bias, and genetic drift. Proc Natl Acad Sci U S A 2011; 108:10231-6. [PMID: 21646514 DOI: 10.1073/pnas.1016719108] [Citation(s) in RCA: 104] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The genetic code is redundant with most amino acids using multiple codons. In many organisms, codon usage is biased toward particular codons. Understanding the adaptive and nonadaptive forces driving the evolution of codon usage bias (CUB) has been an area of intense focus and debate in the fields of molecular and evolutionary biology. However, their relative importance in shaping genomic patterns of CUB remains unsolved. Using a nested model of protein translation and population genetics, we show that observed gene level variation of CUB in Saccharomyces cerevisiae can be explained almost entirely by selection for efficient ribosomal usage, genetic drift, and biased mutation. The correlation between observed codon counts within individual genes and our model predictions is 0.96. Although a variety of factors shape patterns of CUB at the level of individual sites within genes, our results suggest that selection for efficient ribosome usage is a central force in shaping codon usage at the genomic scale. In addition, our model allows direct estimation of codon-specific mutation rates and elongation times and can be readily applied to any organism with high-throughput expression datasets. More generally, we have developed a natural framework for integrating models of molecular processes to population genetics models to quantitatively estimate parameters underlying fundamental biological processes, such a protein translation.
Collapse
|
25
|
Selected codon usage bias in members of the class Mollicutes. Gene 2010; 473:110-8. [PMID: 21147204 DOI: 10.1016/j.gene.2010.11.010] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2010] [Revised: 11/20/2010] [Accepted: 11/22/2010] [Indexed: 11/24/2022]
Abstract
Mollicutes are parasitic microorganisms mainly characterized by small cell sizes, reduced genomes and great A and T mutational bias. We analyzed the codon usage patterns of the completely sequenced genomes of bacteria that belong to this class. We found that for many organisms not only mutational bias but also selection has a major effect on codon usage. Through a comparative perspective and based on three widely used criteria we were able to classify Mollicutes according to the effect of selection on codon usage. We found conserved optimal codons in many species and study the tRNA gene pool in each genome. Previous results are reinforced by the fact that, when selection is operative, the putative optimal codons found match the respective cognate tRNA. Finally, we trace selection effect backwards to the common ancestor of the class and estimate the phylogenetic inertia associated with this character. We discuss the possible scenarios that explain the observed evolutionary patterns.
Collapse
|
26
|
Imamura H, Persampieri JH, Chuang JH. Sequences conserved by selection across mouse and human malaria species. BMC Genomics 2007; 8:372. [PMID: 17937810 PMCID: PMC2174483 DOI: 10.1186/1471-2164-8-372] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2007] [Accepted: 10/15/2007] [Indexed: 11/22/2022] Open
Abstract
Background Little is known, either experimentally or computationally, about the genomic sequence features that regulate malaria genes. A sequence conservation analysis of the malaria species P. falciparum, P. berghei, P. yoelii, and P. chabaudi could significantly advance knowledge of malaria gene regulation. Results We computationally identify intergenic sequences conserved beyond neutral expectations, using a conservation algorithm that accounts for the strong compositional biases in malaria genomes. We first quantify the composition-specific divergence at silent positions in coding sequence. Using this as a background, we examine gene 5' regions, identifying 610 blocks conserved far beyond neutral expectations across the three mouse malariae, and 81 blocks conserved as strongly across all four species (p < 10-6). Detailed analysis of these blocks indicates that only a minor fraction are likely to be previously unknown coding sequences. Analogous noncoding conserved blocks have been shown to regulate adjacent genes in other phylogenies, making the predicted blocks excellent candidates for novel regulatory functions. We also find three potential transcription factor binding motifs which exhibit strong conservation and overrepresentation among the rodent malariae. Conclusion A broader finding of our analysis is that less malaria intergenic sequence has been conserved by selection than in yeast or vertebrate genomes. This supports the hypothesis that transcriptional regulation is simpler in malaria than other eukaryotic species. We have built a public database containing all sequence alignments and functional predictions, and we expect this to be a valuable resource to the malaria research community.
Collapse
Affiliation(s)
- Hideo Imamura
- Boston College - Department of Biology, 140 Commonwealth Avenue, Chestnut Hill, MA 02467, USA.
| | | | | |
Collapse
|
27
|
Swire J. Selection on synthesis cost affects interprotein amino acid usage in all three domains of life. J Mol Evol 2007; 64:558-71. [PMID: 17476453 DOI: 10.1007/s00239-006-0206-8] [Citation(s) in RCA: 58] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2006] [Accepted: 01/02/2007] [Indexed: 11/27/2022]
Abstract
Most investigations of the forces shaping protein evolution have focused on protein function. However, cells are typically 50%-75% protein by dry weight, with protein expression levels distributed over five orders of magnitude. Cells may, therefore, be under considerable selection pressure to incorporate amino acids that are cheap to synthesize into proteins that are highly expressed. Such selection pressure has been demonstrated to alter amino acid usage in a few organisms, but whether "cost selection" is a general phenomenon remains unknown. One reason for this is that reliable protein expression level data is not available for most organisms. Accordingly, I have developed a new method for detecting cost selection. This method depends solely on interprotein gradients in amino acid usage. Applying it to an analysis of 43 whole genomes from all three domains of life, I show that selection on the synthesis cost of amino acids is a pervasive force in shaping the composition of proteins. Moreover, some amino acids have different price tags for different organisms--the cost of amino acids is changed for organisms living in hydrothermal vents compared with those living at the sea surface or for organisms that have difficulty acquiring elements such as nitrogen compared with those that do not--so I also investigated whether differences between organisms in amino acid usage might reflect differences in synthesis or acquisition costs. The results suggest that organisms evolve to alter amino acid usage in response to environmental conditions.
Collapse
Affiliation(s)
- Jonathan Swire
- Centre for Bioinformatics, Division of Molecular Biosciences, Faculty of Life Sciences, Imperial College London, London, SW7 2AZ, UK.
| |
Collapse
|
28
|
Das S, Paul S, Bag SK, Dutta C. Analysis of Nanoarchaeum equitans genome and proteome composition: indications for hyperthermophilic and parasitic adaptation. BMC Genomics 2006; 7:186. [PMID: 16869956 PMCID: PMC1574309 DOI: 10.1186/1471-2164-7-186] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2006] [Accepted: 07/25/2006] [Indexed: 11/24/2022] Open
Abstract
Background Nanoarchaeum equitans, the only known hyperthermophilic archaeon exhibiting parasitic life style, has raised some new questions about the evolution of the Archaea and provided a model of choice to study the genome landmarks correlated with thermo-parasitic adaptation. In this context, we have analyzed the genome and proteome composition of N. equitans and compared the same with those of other mesophiles, hyperthermophiles and obligatory host-associated organisms. Results Analysis of nucleotide, codon and amino acid usage patterns in N. equitans indicates the presence of distinct selective constraints, probably due to its adaptation to a thermo-parasitic life-style. Among the conspicuous characteristics featuring its hyperthermophilic adaptation are overrepresentation of purine bases in protein coding sequences, higher GC-content in tRNA/rRNA sequences, distinct synonymous codon usage, enhanced usage of aromatic and positively charged residues, and decreased frequencies of polar uncharged residues, as compared to those in mesophilic organisms. Positively charged amino acid residues are relatively abundant in the encoded gene-products of N. equitans and other hyperthermophiles, which is reflected in their isoelectric point distribution. Pairwise comparison of 105 orthologous protein sequences shows a strong bias towards replacement of uncharged polar residues of mesophilic proteins by Lys/Arg, Tyr and some hydrophobic residues in their Nanoarchaeal orthologs. The traits potentially attributable to the symbiotic/parasitic life-style of the organism include the presence of apparently weak translational selection in synonymous codon usage and a marked heterogeneity in membrane-associated proteins, which may be important for N. equitans to interact with the host and hence, may help the organism to adapt to the strictly host-associated life style. Despite being strictly host-dependent, N. equitans follows cost minimization hypothesis. Conclusion The present study reveals that the genome and proteome composition of N. equitans are marked with the signatures of dual adaptation – one to high temperature and the other to obligatory parasitism. While the analysis of nucleotide/amino acid preferences in N. equitans offers an insight into the molecular strategies taken by the archaeon for thermo-parasitic adaptation, the comparative study of the compositional characteristics of mesophiles, hyperthermophiles and obligatory host-associated organisms demonstrates the generality of such strategies in the microbial world.
Collapse
Affiliation(s)
- Sabyasachi Das
- Bioinformatics Centre, Indian Institute of Chemical Biology, Kolkata–700032, India
| | - Sandip Paul
- Bioinformatics Centre, Indian Institute of Chemical Biology, Kolkata–700032, India
| | - Sumit K Bag
- Bioinformatics Centre, Indian Institute of Chemical Biology, Kolkata–700032, India
| | - Chitra Dutta
- Bioinformatics Centre, Indian Institute of Chemical Biology, Kolkata–700032, India
- Human Genetics & Genomics Division, Indian Institute of Chemical Biology, Kolkata–700032, India
| |
Collapse
|
29
|
Patron NJ, Rogers MB, Keeling PJ. Comparative rates of evolution in endosymbiotic nuclear genomes. BMC Evol Biol 2006; 6:46. [PMID: 16772046 PMCID: PMC1523203 DOI: 10.1186/1471-2148-6-46] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2006] [Accepted: 06/14/2006] [Indexed: 11/20/2022] Open
Abstract
BACKGROUND The nucleomorphs associated with secondary plastids of cryptomonads and chlorarachniophytes are the sole examples of organelles with eukaryotic nuclear genomes. Although not as widespread as their prokaryotic equivalents in mitochondria and plastids, nucleomorph genomes share similarities in terms of reduction and compaction. They also differ in several aspects, not least in that they encode proteins that target to the plastid, and so function in a different compartment from that in which they are encoded. RESULTS Here, we test whether the phylogenetically distinct nucleomorph genomes of the cryptomonad, Guillardia theta, and the chlorarachniophyte, Bigelowiella natans, have experienced similar evolutionary pressures during their transformation to reduced organelles. We compared the evolutionary rates of genes from nuclear, nucleomorph, and plastid genomes, all of which encode proteins that function in the same cellular compartment, the plastid, and are thus subject to similar selection pressures. Furthermore, we investigated the divergence of nucleomorphs within cryptomonads by comparing G. theta and Rhodomonas salina. CONCLUSION Chlorarachniophyte nucleomorph genes have accumulated errors at a faster rate than other genomes within the same cell, regardless of the compartment where the gene product functions. In contrast, most nucleomorph genes in cryptomonads have evolved faster than genes in other genomes on average, but genes for plastid-targeted proteins are not overly divergent, and it appears that cryptomonad nucleomorphs are not presently evolving rapidly and have therefore stabilized. Overall, these analyses suggest that the forces at work in the two lineages are different, despite the similarities between the structures of their genomes.
Collapse
Affiliation(s)
- Nicola J Patron
- Canadian Institute for Advanced Research, Department of Botany, University of British Columbia, Vancouver, V6T 1Z4, Canada
| | - Matthew B Rogers
- Canadian Institute for Advanced Research, Department of Botany, University of British Columbia, Vancouver, V6T 1Z4, Canada
| | - Patrick J Keeling
- Canadian Institute for Advanced Research, Department of Botany, University of British Columbia, Vancouver, V6T 1Z4, Canada
| |
Collapse
|
30
|
Das S, Paul S, Dutta C. Evolutionary constraints on codon and amino acid usage in two strains of human pathogenic actinobacteria Tropheryma whipplei. J Mol Evol 2006; 62:645-58. [PMID: 16557339 DOI: 10.1007/s00239-005-0164-6] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2005] [Accepted: 12/20/2005] [Indexed: 12/13/2022]
Abstract
The factors governing codon and amino acid usages in the predicted protein-coding sequences of Tropheryma whipplei TW08/27 and Twist genomes have been analyzed. Multivariate analysis identifies the replicational-transcriptional selection coupled with DNA strand-specific asymmetric mutational bias as a major driving force behind the significant interstrand variations in synonymous codon usage patterns in T. whipplei genes, while a residual intrastrand synonymous codon bias is imparted by a selection force operating at the level of translation. The strand-specific mutational pressure has little influence on the amino acid usage, for which the mean hydropathy level and aromaticity are the major sources of variation, both having nearly equal impact. In spite of the intracellular lifestyle, the amino acid usage in highly expressed gene products of T. whipplei follows the cost-minimization hypothesis. The products of the highly expressed genes of these relatively A + T-rich actinobacteria prefer to use the residues encoded by GC-rich codons, probably due to greater conservation of a GC-rich ancestral state in the highly expressed genes, as suggested by the lower values of the rate of nonsynonymous divergences between orthologous sequences of highly expressed genes from the two strains of T. whipplei. Both the genomes under study are characterized by the presence of two distinct groups of membrane-associated genes, products of which exhibit significant differences in primary and potential secondary structures as well as in the propensity of protein disorder.
Collapse
Affiliation(s)
- Sabyasachi Das
- Bioinformatics Centre, Indian Institute of Chemical Biology, 4 Raja S. C. Mullick Road, Kolkata 700 032, India
| | | | | |
Collapse
|
31
|
Chanda I, Pan A, Dutta C. Proteome composition in Plasmodium falciparum: higher usage of GC-rich nonsynonymous codons in highly expressed genes. J Mol Evol 2005; 61:513-23. [PMID: 16044241 DOI: 10.1007/s00239-005-0023-5] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2005] [Accepted: 04/19/2005] [Indexed: 10/25/2022]
Abstract
The parasite Plasmodium falciparum, responsible for the most deadly form of human malaria, is one of the extremely AT-rich genomes sequenced so far and known to possess many atypical characteristics. Using multivariate statistical approaches, the present study analyzes the amino acid usage pattern in 5038 annotated protein-coding sequences in P. falciparum clone 3D7. The amino acid composition of individual proteins, though dominated by the directional mutational pressure, exhibits wide variation across the proteome. The Asn content, expression level, mean molecular weight, hydropathy, and aromaticity are found to be the major sources of variation in amino acid usage. At all stages of development, frequencies of residues encoded by GC-rich codons such as Gly, Ala, Arg, and Pro increase significantly in the products of the highly expressed genes. Investigation of nucleotide substitution patterns in P. falciparum and other Plasmodium species reveals that the nonsynonymous sites of highly expressed genes are more conserved than those of the lowly expressed ones, though for synonymous sites, the reverse is true. The highly expressed genes are, therefore, expected to be closer to their putative ancestral state in amino acid composition, and a plausible reason for their sequences being GC-rich at nonsynonymous codon positions could be that their ancestral state was less AT-biased. Negative correlation of the expression level of proteins with respective molecular weights supports the notion that P. falciparum, in spite of its intracellular parasitic lifestyle, follows the principle of cost minimization.
Collapse
Affiliation(s)
- Ipsita Chanda
- Human Genetics & Genomics Group, Indian Institute of Chemical Biology, Kolkata 700032, India
| | | | | |
Collapse
|
32
|
Jana S, Deb JK. Strategies for efficient production of heterologous proteins in Escherichia coli. Appl Microbiol Biotechnol 2005; 67:289-98. [PMID: 15635462 DOI: 10.1007/s00253-004-1814-0] [Citation(s) in RCA: 257] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2004] [Revised: 10/22/2004] [Accepted: 10/23/2004] [Indexed: 11/24/2022]
Abstract
In recent years, the number of recombinant proteins used for therapeutic applications has increased dramatically. Production of these proteins has a remarkable demand in the market. Escherichia coli offers a means for the rapid and economical production of recombinant proteins. These advantages, coupled with a wealth of biochemical and genetic knowledge, have enabled the production of such economically therapeutic proteins such as insulin and bovine growth hormone. These demands have driven the development of a variety of strategies for achieving high-level expression of protein, particularly involving several aspects such as expression vectors design, gene dosage, promoter strength (transcriptional regulation), mRNA stability, translation initiation and termination (translational regulation), host design considerations, codon usage, and fermentation factors available for manipulating the expression conditions, which are the major challenges is obtaining the high yield of protein at low cost.
Collapse
Affiliation(s)
- S Jana
- Department of Biochemical Engineering and Biotechnology, Indian Institute of Technology-Delhi, Hauz Khas, New Delhi 110016, India
| | | |
Collapse
|
33
|
Abstract
Volatility of a codon is defined as the probability that a random point mutation in the codon generates a nonsynonymous change. It has been proposed that higher-than-expected mean codon volatility of a gene indicates that positive selection for nonsynonymous changes has acted on the gene in the recent past. I show that strong frequency-dependent selection (minority advantage) in large populations can increase codon volatility slightly, whereas directional positive selection has no effect on volatility. Factors unrelated to positive selection, such as expression-related or GC-content-related codon usage bias, also affect volatility. These and other considerations suggest that codon volatility has only limited utility for detecting positive selection at the DNA sequence level.
Collapse
Affiliation(s)
- Jianzhi Zhang
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, Michigan 48109, USA.
| |
Collapse
|