1
|
Zabolotskii AI, Kozlovskiy SV, Katrukha AG. The Influence of the Nucleotide Composition of Genes and Gene Regulatory Elements on the Efficiency of Protein Expression in Escherichia coli. BIOCHEMISTRY (MOSCOW) 2023; 88:S176-S191. [PMID: 37069120 DOI: 10.1134/s0006297923140109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/22/2023]
Abstract
Recombinant proteins expressed in Escherichia coli are widely used in biochemical research and industrial processes. At the same time, achieving higher protein expression levels and correct protein folding still remains the key problem, since optimization of nutrient media, growth conditions, and methods for induction of protein synthesis do not always lead to the desired result. Often, low protein expression is determined by the sequences of the expressed genes and their regulatory regions. The genetic code is degenerated; 18 out of 20 amino acids are encoded by more than one codon. Choosing between synonymous codons in the coding sequence can significantly affect the level of protein expression and protein folding due to the influence of the gene nucleotide composition on the probability of formation of secondary mRNA structures that affect the ribosome binding at the translation initiation phase, as well as the ribosome movement along the mRNA during elongation, which, in turn, influences the mRNA degradation and the folding of the nascent protein. The nucleotide composition of the mRNA untranslated regions, in particular the promoter and Shine-Dalgarno sequences, also affects the efficiency of mRNA transcription, translation, and degradation. In this review, we describe the genetic principles that determine the efficiency of protein production in Escherichia coli.
Collapse
Affiliation(s)
- Artur I Zabolotskii
- Faculty of Biology, Lomonosov Moscow State University, Moscow, 119991, Russia.
| | | | - Alexey G Katrukha
- Faculty of Biology, Lomonosov Moscow State University, Moscow, 119991, Russia
| |
Collapse
|
2
|
Miller JB, Meurs TE, Hodgman MW, Song B, Miller KN, Ebbert MTW, Kauwe JSK, Ridge PG. The Ramp Atlas: facilitating tissue and cell-specific ramp sequence analyses through an intuitive web interface. NAR Genom Bioinform 2022; 4:lqac039. [PMID: 35664804 PMCID: PMC9155233 DOI: 10.1093/nargab/lqac039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Revised: 03/01/2022] [Accepted: 05/24/2022] [Indexed: 11/14/2022] Open
Abstract
Ramp sequences occur when the average translational efficiency of codons near the 5′ end of highly expressed genes is significantly lower than the rest of the gene sequence, which counterintuitively increases translational efficiency by decreasing downstream ribosomal collisions. Here, we show that the relative codon adaptiveness within different tissues changes the existence of a ramp sequence without altering the underlying genetic code. We present the first comprehensive analysis of tissue and cell type-specific ramp sequences and report 3108 genes with ramp sequences that change between tissues and cell types, which corresponds with increased gene expression within those tissues and cells. The Ramp Atlas (https://ramps.byu.edu/) allows researchers to query precomputed ramp sequences in 18 388 genes across 62 tissues and 66 cell types and calculate tissue-specific ramp sequences from user-uploaded FASTA files through an intuitive web interface. We used The Ramp Atlas to identify seven SARS-CoV-2 genes and seven human SARS-CoV-2 entry factor genes with tissue-specific ramp sequences that may help explain viral proliferation within those tissues. We anticipate that The Ramp Atlas will facilitate personalized and creative tissue-specific ramp sequence analyses for both human and viral genes that will increase our ability to utilize this often-overlooked regulatory region.
Collapse
Affiliation(s)
- Justin B Miller
- Sanders-Brown Center on Aging, University of Kentucky, Lexington, KY 40504, USA
| | - Taylor E Meurs
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| | - Matthew W Hodgman
- Sanders-Brown Center on Aging, University of Kentucky, Lexington, KY 40504, USA
| | - Benjamin Song
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| | - Kyle N Miller
- Department of Computer Science, Utah Valley University, Orem, UT 84058, USA
| | - Mark T W Ebbert
- Sanders-Brown Center on Aging, University of Kentucky, Lexington, KY 40504, USA
| | - John S K Kauwe
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| | - Perry G Ridge
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| |
Collapse
|
3
|
Zeng Z, Aptekmann AA, Bromberg Y. Decoding the effects of synonymous variants. Nucleic Acids Res 2021; 49:12673-12691. [PMID: 34850938 PMCID: PMC8682775 DOI: 10.1093/nar/gkab1159] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2021] [Revised: 11/02/2021] [Accepted: 11/08/2021] [Indexed: 12/12/2022] Open
Abstract
Synonymous single nucleotide variants (sSNVs) are common in the human genome but are often overlooked. However, sSNVs can have significant biological impact and may lead to disease. Existing computational methods for evaluating the effect of sSNVs suffer from the lack of gold-standard training/evaluation data and exhibit over-reliance on sequence conservation signals. We developed synVep (synonymous Variant effect predictor), a machine learning-based method that overcomes both of these limitations. Our training data was a combination of variants reported by gnomAD (observed) and those unreported, but possible in the human genome (generated). We used positive-unlabeled learning to purify the generated variant set of any likely unobservable variants. We then trained two sequential extreme gradient boosting models to identify subsets of the remaining variants putatively enriched and depleted in effect. Our method attained 90% precision/recall on a previously unseen set of variants. Furthermore, although synVep does not explicitly use conservation, its scores correlated with evolutionary distances between orthologs in cross-species variation analysis. synVep was also able to differentiate pathogenic vs. benign variants, as well as splice-site disrupting variants (SDV) vs. non-SDVs. Thus, synVep provides an important improvement in annotation of sSNVs, allowing users to focus on variants that most likely harbor effects.
Collapse
Affiliation(s)
- Zishuo Zeng
- Department of Biochemistry and Microbiology, Rutgers University, New Brunswick, NJ 08873, USA
| | - Ariel A Aptekmann
- Department of Biochemistry and Microbiology, Rutgers University, New Brunswick, NJ 08873, USA
| | - Yana Bromberg
- Department of Biochemistry and Microbiology, Rutgers University, New Brunswick, NJ 08873, USA
- Department of Genetics, Rutgers University, Piscataway, NJ 08854, USA
| |
Collapse
|
4
|
Wang P, Mao Y, Su Y, Wang J. Comparative analysis of transcriptomic data shows the effects of multiple evolutionary selection processes on codon usage in Marsupenaeus japonicus and Marsupenaeus pulchricaudatus. BMC Genomics 2021; 22:781. [PMID: 34717552 PMCID: PMC8557549 DOI: 10.1186/s12864-021-08106-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2020] [Accepted: 10/19/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Kuruma shrimp, a major commercial shrimp species in the world, has two cryptic or sibling species, Marsupenaeus japonicus and Marsupenaeus pulchricaudatus. Codon usage analysis would contribute to our understanding of the genetic and evolutionary characteristics of the two Marsupenaeus species. In this study, we analyzed codon usage and related indices using coding sequences (CDSs) from RNA-seq data. RESULTS Using CodonW 1.4.2 software, we performed the codon bias analysis of transcriptomes obtained from hepatopancreas tissues, which indicated weak codon bias. Almost all parameters had similar correlations for both species. The gene expression level (FPKM) was negatively correlated with A/T3s. We determined 12 and 14 optimal codons for M. japonicus and M. pulchricaudatus, respectively, and all optimal codons have a C/G-ending. The two Marsupenaeus species had different usage frequencies of codon pairs, which contributed to further analysis of transcriptional differences between them. Orthologous genes that underwent positive selection (ω > 1) had a higher correlation coefficient than that of experienced purifying selection (ω < 1). Parity Rule 2 (PR2) and effective number of codons (ENc) plot analysis showed that the codon usage patterns of both species were influenced by both mutations and selection. Moreover, the average observed ENc value was lower than the expected value for both species, suggesting that factors other than GC may play roles in these phenomena. The results of multispecies clustering based on codon preference were consistent with traditional classification. CONCLUSIONS This study provides a relatively comprehensive understanding of the correlations among codon usage bias, gene expression, and selection pressures of CDSs for M. japonicus and M. pulchricaudatus. The genetic evolution was driven by mutations and selection pressure. Moreover, the results point out new insights into the specificities and evolutionary characteristics of the two Marsupenaeus species.
Collapse
Affiliation(s)
- Panpan Wang
- Jiangsu Key Laboratory of Marine Bioresources and Environment/ Jiangsu Key Laboratory of Marine Biotechnology, Jiangsu Ocean University, Lianyungang, 222005, China
- Co-Innovation Center of Jiangsu Marine Bio-Industry Technology, Jiangsu Ocean University, Lianyungang, 222005, China
- The Jiangsu Provincial Infrastructure for Conservation and Utilization of Agricultural Germplasm, Nanjing, 210014, China
- State Key Laboratory of Marine Environmental Science, College of Ocean and Earth Sciences, Xiamen University, Xiamen, 361102, Fujian, China
| | - Yong Mao
- State Key Laboratory of Marine Environmental Science, College of Ocean and Earth Sciences, Xiamen University, Xiamen, 361102, Fujian, China.
- Fujian Key Laboratory of Genetics and Breeding of Marine Organisms, Xiamen University, Xiamen, 361102, China.
| | - Yongquan Su
- State Key Laboratory of Marine Environmental Science, College of Ocean and Earth Sciences, Xiamen University, Xiamen, 361102, Fujian, China
| | - Jun Wang
- State Key Laboratory of Marine Environmental Science, College of Ocean and Earth Sciences, Xiamen University, Xiamen, 361102, Fujian, China
| |
Collapse
|
5
|
Mehrshad M, Lopez-Fernandez M, Sundh J, Bell E, Simone D, Buck M, Bernier-Latmani R, Bertilsson S, Dopson M. Energy efficiency and biological interactions define the core microbiome of deep oligotrophic groundwater. Nat Commun 2021; 12:4253. [PMID: 34253732 PMCID: PMC8275790 DOI: 10.1038/s41467-021-24549-z] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2020] [Accepted: 06/23/2021] [Indexed: 02/06/2023] Open
Abstract
While oligotrophic deep groundwaters host active microbes attuned to the low-end of the bioenergetics spectrum, the ecological constraints on microbial niches in these ecosystems and their consequences for microbiome convergence are unknown. Here, we provide a genome-resolved, integrated omics analysis comparing archaeal and bacterial communities in disconnected fracture fluids of the Fennoscandian Shield in Europe. Leveraging a dataset that combines metagenomes, single cell genomes, and metatranscriptomes, we show that groundwaters flowing in similar lithologies offer fixed niches that are occupied by a common core microbiome. Functional expression analysis highlights that these deep groundwater ecosystems foster diverse, yet cooperative communities adapted to this setting. We suggest that these communities stimulate cooperation by expression of functions related to ecological traits, such as aggregate or biofilm formation, while alleviating the burden on microorganisms producing compounds or functions that provide a collective benefit by facilitating reciprocal promiscuous metabolic partnerships with other members of the community. We hypothesize that an episodic lifestyle enabled by reversible bacteriostatic functions ensures the subsistence of the oligotrophic deep groundwater microbiome.
Collapse
Affiliation(s)
- Maliheh Mehrshad
- grid.8993.b0000 0004 1936 9457Department of Ecology and Genetics, Limnology and Science for Life Laboratory, Uppsala University, Uppsala, Sweden ,grid.6341.00000 0000 8578 2742Department of Aquatic Sciences and Assessment, Swedish University of Agricultural Sciences, Uppsala, Sweden
| | - Margarita Lopez-Fernandez
- grid.8148.50000 0001 2174 3522Centre for Ecology and Evolution in Microbial Model Systems (EEMiS), Linnaeus University, Kalmar, Sweden ,grid.4489.10000000121678994Present Address: Department of Microbiology, University of Granada, Granada, Spain
| | - John Sundh
- grid.10548.380000 0004 1936 9377Dept of Biochemistry and Biophysics, National Bioinformatics Infrastructure Sweden, Science for Life Laboratory, Stockholm University, Solna, Sweden
| | - Emma Bell
- grid.5333.60000000121839049Environmental Microbiology Laboratory, Environmental Engineering Institute, School of Architecture, Civil and Environmental Engineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland ,grid.22072.350000 0004 1936 7697Present Address: Department of Biological Sciences, University of Calgary, Calgary, Alberta Canada
| | - Domenico Simone
- grid.8148.50000 0001 2174 3522Centre for Ecology and Evolution in Microbial Model Systems (EEMiS), Linnaeus University, Kalmar, Sweden ,grid.6341.00000 0000 8578 2742SLU Bioinformatics Infrastructure, Swedish University of Agricultural Sciences, Uppsala, Sweden
| | - Moritz Buck
- grid.6341.00000 0000 8578 2742Department of Aquatic Sciences and Assessment, Swedish University of Agricultural Sciences, Uppsala, Sweden
| | - Rizlan Bernier-Latmani
- grid.5333.60000000121839049Environmental Microbiology Laboratory, Environmental Engineering Institute, School of Architecture, Civil and Environmental Engineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| | - Stefan Bertilsson
- grid.8993.b0000 0004 1936 9457Department of Ecology and Genetics, Limnology and Science for Life Laboratory, Uppsala University, Uppsala, Sweden ,grid.6341.00000 0000 8578 2742Department of Aquatic Sciences and Assessment, Swedish University of Agricultural Sciences, Uppsala, Sweden
| | - Mark Dopson
- grid.8148.50000 0001 2174 3522Centre for Ecology and Evolution in Microbial Model Systems (EEMiS), Linnaeus University, Kalmar, Sweden
| |
Collapse
|
6
|
Chu D, Wei L. Context-dependent and -independent selection on synonymous mutations revealed by 1,135 genomes of Arabidopsis thaliana. BMC Ecol Evol 2021; 21:68. [PMID: 33910528 PMCID: PMC8079846 DOI: 10.1186/s12862-021-01792-y] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2020] [Accepted: 04/11/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Synonymous mutations do not alter the amino acids and therefore are regarded as neutral for a long time. However, they do change the tRNA adaptation index (tAI) of a particular codon (independent of its context), affecting the tRNA availability during translation. They could also change the isoaccepting relationship with its neighboring synonymous codons in particular context, which again affects the local translation process. Evidence of selection pressure on synonymous mutations has emerged. RESULTS The proposed selection patterns on synonymous mutations are never formally and systematically tested in plant species. We fully take advantage of the SNP data from 1,135 A. thaliana lines, and found that the synonymous mutations that increase tAI or the isoaccepting mutations in isoaccepting codon context tend to have higher derived allele frequencies (DAF) compared to other synonymous mutations of the opposite effects. CONCLUSIONS Synonymous mutations are not strictly neutral. The synonymous mutations that increase tAI or the isoaccepting mutations in isoaccepting codon context are likely to be positively selected. We propose the concept of context-dependent and -independent selection on synonymous mutations. These concepts broaden our knowledge of the functional consequences of synonymous mutations, and should be appealing to phytologists and evolutionary biologists.
Collapse
Affiliation(s)
- Duan Chu
- College of Life Sciences, Beijing Normal University, No. 19 Xinjiekouwai Street, Haidian, Beijing, China
| | - Lai Wei
- College of Life Sciences, Beijing Normal University, No. 19 Xinjiekouwai Street, Haidian, Beijing, China.
| |
Collapse
|
7
|
Samatova E, Daberger J, Liutkute M, Rodnina MV. Translational Control by Ribosome Pausing in Bacteria: How a Non-uniform Pace of Translation Affects Protein Production and Folding. Front Microbiol 2021; 11:619430. [PMID: 33505387 PMCID: PMC7829197 DOI: 10.3389/fmicb.2020.619430] [Citation(s) in RCA: 36] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2020] [Accepted: 12/11/2020] [Indexed: 11/23/2022] Open
Abstract
Protein homeostasis of bacterial cells is maintained by coordinated processes of protein production, folding, and degradation. Translational efficiency of a given mRNA depends on how often the ribosomes initiate synthesis of a new polypeptide and how quickly they read the coding sequence to produce a full-length protein. The pace of ribosomes along the mRNA is not uniform: periods of rapid synthesis are separated by pauses. Here, we summarize recent evidence on how ribosome pausing affects translational efficiency and protein folding. We discuss the factors that slow down translation elongation and affect the quality of the newly synthesized protein. Ribosome pausing emerges as important factor contributing to the regulatory programs that ensure the quality of the proteome and integrate the cellular and environmental cues into regulatory circuits of the cell.
Collapse
Affiliation(s)
- Ekaterina Samatova
- Department of Physical Biochemistry, Max Planck Institute for Biophysical Chemistry, Göttingen, Germany
| | - Jan Daberger
- Department of Physical Biochemistry, Max Planck Institute for Biophysical Chemistry, Göttingen, Germany
| | - Marija Liutkute
- Department of Physical Biochemistry, Max Planck Institute for Biophysical Chemistry, Göttingen, Germany
| | - Marina V Rodnina
- Department of Physical Biochemistry, Max Planck Institute for Biophysical Chemistry, Göttingen, Germany
| |
Collapse
|
8
|
Hodgman MW, Miller JB, Meurs TE, Kauwe JSK. CUBAP: an interactive web portal for analyzing codon usage biases across populations. Nucleic Acids Res 2020; 48:11030-11039. [PMID: 33045750 PMCID: PMC7641757 DOI: 10.1093/nar/gkaa863] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2020] [Revised: 08/18/2020] [Accepted: 09/22/2020] [Indexed: 12/19/2022] Open
Abstract
Synonymous codon usage significantly impacts translational and transcriptional efficiency, gene expression, the secondary structure of both mRNA and proteins, and has been implicated in various diseases. However, population-specific differences in codon usage biases remain largely unexplored. Here, we present a web server, https://cubap.byu.edu, to facilitate analyses of codon usage biases across populations (CUBAP). Using the 1000 Genomes Project, we calculated and visually depict population-specific differences in codon frequencies, codon aversion, identical codon pairing, co-tRNA codon pairing, ramp sequences, and nucleotide composition in 17,634 genes. We found that codon pairing significantly differs between populations in 35.8% of genes, allowing us to successfully predict the place of origin for African and East Asian individuals with 98.8% and 100% accuracy, respectively. We also used CUBAP to identify a significant bias toward decreased CTG pairing in the immunity related GTPase M (IRGM) gene in East Asian and African populations, which may contribute to the decreased association of rs10065172 with Crohn's disease in those populations. CUBAP facilitates in-depth gene-specific and codon-specific visualization that will aid in analyzing candidate genes identified in genome-wide association studies, identifying functional implications of synonymous variants, predicting population-specific impacts of synonymous variants and categorizing genetic biases unique to certain populations.
Collapse
Affiliation(s)
- Matthew W Hodgman
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| | - Justin B Miller
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| | - Taylor E Meurs
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| | - John S K Kauwe
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| |
Collapse
|
9
|
Miller JB, McKinnon LM, Whiting MF, Kauwe JSK, Ridge PG. Codon Pairs are Phylogenetically Conserved: A comprehensive analysis of codon pairing conservation across the Tree of Life. PLoS One 2020; 15:e0232260. [PMID: 32401752 PMCID: PMC7219770 DOI: 10.1371/journal.pone.0232260] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2019] [Accepted: 04/10/2020] [Indexed: 11/27/2022] Open
Abstract
Identical codon pairing and co-tRNA codon pairing increase translational efficiency within genes when two codons that encode the same amino acid are translated by the same tRNA before it diffuses from the ribosome. We examine the phylogenetic signal in both identical and co-tRNA codon pairing across 23 428 species using alignment-free and parsimony methods. We determined that conserved codon pairing typically has a smaller window size than the length of a ribosome, and codon pairing tracks phylogenies across various taxonomic groups. We report a comprehensive analysis of codon pairing, including the extent to which each codon pairs. Our parsimony method generally recovers phylogenies that are more congruent with the established phylogenies than our alignment-free method. However, four of the ten taxonomic groups did not have sufficient orthologous codon pairings and were therefore analyzed using only the alignment-free methods. Since the recovered phylogenies using only codon pairing largely match phylogenies from the Open Tree of Life and the NCBI taxonomy, and are comparable to trees recovered by other algorithms, we propose that codon pairing biases are phylogenetically conserved and should be considered in conjunction with other phylogenomic techniques.
Collapse
Affiliation(s)
- Justin B. Miller
- Department of Biology, Brigham Young University, Provo, UT, United States of America
| | - Lauren M. McKinnon
- Department of Biology, Brigham Young University, Provo, UT, United States of America
| | - Michael F. Whiting
- Department of Biology, Brigham Young University, Provo, UT, United States of America
- M.L. Bean Museum, Brigham Young University, Provo, UT, United States of America
| | - John S. K. Kauwe
- Department of Biology, Brigham Young University, Provo, UT, United States of America
| | - Perry G. Ridge
- Department of Biology, Brigham Young University, Provo, UT, United States of America
| |
Collapse
|
10
|
Kulmala A, Huovinen T, Lamminmäki U. Improvement of Fab expression by screening combinatorial synonymous signal sequence libraries. Microb Cell Fact 2019; 18:157. [PMID: 31526395 PMCID: PMC6745802 DOI: 10.1186/s12934-019-1210-1] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2019] [Accepted: 09/06/2019] [Indexed: 01/01/2023] Open
Abstract
BACKGROUND Antibody fragments can be expressed in Escherichia coli, where they are commonly directed to the periplasm via Sec pathway to enable disulphide bridge formations and correct folding. In order to transport antibody fragments to the periplasmic space via Sec pathway, they are equipped with N-terminal signal sequence. Periplasmic expression has many benefits but it's also subjected to many hurdles like inefficient translocation across the inner membrane and insufficient capacity of the translocation system. One solution to overcome these hurdles is a modulation of codon usage of signal sequence which has proved to be an efficient way of tuning the translocation process. Modulation of codon usage of signal sequences has been successfully employed also in improving the expression levels of antibody fragments, but unfortunately the effect of codon usage on the expression has not been thoroughly analyzed. RESULTS In the present study we established three synonymous PelB signal sequence libraries by modulating codon usage of light chain and heavy chain PelB signal sequences of a Fab fragment. Each region (n-region, hydrophobic region and c-region) of the PelB signal sequence in the both chains of the Fab fragment in a bicistronic expression vector was mutated separately. We then screened for clones with improved expression profile. The best source for improved clones was the n-region library but in general, improved clones were obtained from all of the three libraries. After screening, we analyzed the effects of codon usage and mRNA secondary structures of chosen clones on the expression levels of the Fab fragment. When it comes to codon usage based factors, it was discovered that especially codon usage of fifth leucine position of the light chain PelB affects the expression levels of Fab fragment. In addition, we observed that mRNA secondary structures in the translation initiation regions of the light and heavy chain have an effect on expression levels as well. CONCLUSIONS In conclusion, the established synonymous signal sequence libraries are good sources for discovering Fab fragments with improved expression profile and obtaining new codon usage related information.
Collapse
Affiliation(s)
- Antti Kulmala
- Department of Biochemistry/Biotechnology, University of Turku, Kiinamyllynkatu 10, 20520, Turku, Finland
| | - Tuomas Huovinen
- Department of Biochemistry/Biotechnology, University of Turku, Kiinamyllynkatu 10, 20520, Turku, Finland
| | - Urpo Lamminmäki
- Department of Biochemistry/Biotechnology, University of Turku, Kiinamyllynkatu 10, 20520, Turku, Finland.
| |
Collapse
|
11
|
Saito Y, Kitagawa W, Kumagai T, Tajima N, Nishimiya Y, Tamano K, Yasutake Y, Tamura T, Kameda T. Developing a codon optimization method for improved expression of recombinant proteins in actinobacteria. Sci Rep 2019; 9:8338. [PMID: 31171855 PMCID: PMC6554278 DOI: 10.1038/s41598-019-44500-z] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2018] [Accepted: 05/03/2019] [Indexed: 02/05/2023] Open
Abstract
Codon optimization by synonymous substitution is widely used for recombinant protein expression. Recent studies have investigated sequence features for codon optimization based on large-scale expression analyses. However, these studies have been limited to common host organisms such as Escherichia coli. Here, we develop a codon optimization method for Rhodococcus erythropolis, a gram-positive GC-rich actinobacterium attracting attention as an alternative host organism. We evaluate the recombinant protein expression of 204 genes in R. erythropolis with the same plasmid vector. The statistical analysis of these expression data reveals that the mRNA folding energy at 5’ regions as well as the codon frequency are important sequence features for codon optimization. Intriguingly, other sequence features such as the codon repetition rate show a different tendency from the previous study on E. coli. We optimize the coding sequences of 12 genes regarding these sequence features, and confirm that 9 of them (75%) achieve increased expression levels compared with wild-type sequences. Especially, for 5 genes whose expression levels for wild-type sequences are small or not detectable, all of them are improved by optimized sequences. These results demonstrate the effectiveness of our codon optimization method in R. erythropolis, and possibly in other actinobacteria.
Collapse
Affiliation(s)
- Yutaka Saito
- Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology (AIST), 2-4-7 Aomi, Koto-ku, Tokyo, 135-0064, Japan.,Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), National Institute of Advanced Industrial Science and Technology (AIST), 3-4-1 Okubo, Shinjuku-ku, Tokyo, 169-8555, Japan
| | - Wataru Kitagawa
- Bioproduction Research Institute, National Institute of Advanced Industrial Science and Technology (AIST), 2-17-2-1 Tsukisamu-Higashi, Toyohira-ku, Sapporo, 062-8517, Japan.,Graduate School of Agriculture, Hokkaido University, Kita 9-Nishi 9, Kita-ku, Sapporo, 060-8589, Japan
| | | | - Naoyuki Tajima
- Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology (AIST), 2-4-7 Aomi, Koto-ku, Tokyo, 135-0064, Japan
| | - Yoshiyuki Nishimiya
- Bioproduction Research Institute, National Institute of Advanced Industrial Science and Technology (AIST), 2-17-2-1 Tsukisamu-Higashi, Toyohira-ku, Sapporo, 062-8517, Japan
| | - Koichi Tamano
- Bioproduction Research Institute, National Institute of Advanced Industrial Science and Technology (AIST), 2-17-2-1 Tsukisamu-Higashi, Toyohira-ku, Sapporo, 062-8517, Japan
| | - Yoshiaki Yasutake
- Bioproduction Research Institute, National Institute of Advanced Industrial Science and Technology (AIST), 2-17-2-1 Tsukisamu-Higashi, Toyohira-ku, Sapporo, 062-8517, Japan
| | - Tomohiro Tamura
- Bioproduction Research Institute, National Institute of Advanced Industrial Science and Technology (AIST), 2-17-2-1 Tsukisamu-Higashi, Toyohira-ku, Sapporo, 062-8517, Japan. .,Graduate School of Agriculture, Hokkaido University, Kita 9-Nishi 9, Kita-ku, Sapporo, 060-8589, Japan.
| | - Tomoshi Kameda
- Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology (AIST), 2-4-7 Aomi, Koto-ku, Tokyo, 135-0064, Japan.
| |
Collapse
|
12
|
Wu Y, Wu P, Wang B, Shao ZQ. Genome-Wide Analysis Reveals Ancestral Lack of Seventeen Different tRNAs and Clade-Specific Loss of tRNA-CNNs in Archaea. Front Microbiol 2018; 9:1245. [PMID: 29930548 PMCID: PMC6000648 DOI: 10.3389/fmicb.2018.01245] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2017] [Accepted: 05/23/2018] [Indexed: 12/02/2022] Open
Abstract
Transfer RNA (tRNA) is a category of RNAs that specifically decode messenger RNAs (mRNAs) into proteins by recognizing a set of 61 codons commonly adopted by different life domains. The composition and abundance of tRNAs play critical roles in shaping codon usage and pairing bias, which subsequently modulate mRNA translation efficiency and accuracy. Over the past few decades, effort has been concentrated on evaluating the specificity and redundancy of different tRNA families. However, the mechanism and processes underlying tRNA evolution have only rarely been investigated. In this study, by surveying tRNA genes in 167 completely sequenced genomes, we systematically investigated the composition and evolution of tRNAs in Archaea from a phylogenetic perspective. Our data revealed that archaeal genomes are compact in both tRNA types and copy number. Generally, no more than 44 different types of tRNA are present in archaeal genomes to decode the 61 canonical codons, and most of them have only one gene copy per genome. Among them, tRNA-Met was significantly overrepresented, with an average of three copies per genome. In contrast, the tRNA-UAU and 16 tRNAs with A-starting anticodons (tRNA-ANNs) were rarely detected in all archaeal genomes. The conspicuous absence of these tRNAs across the archaeal phylogeny suggests they might have not been evolved in the common ancestor of Archaea, rather than have lost independently from different clades. Furthermore, widespread absence of tRNA-CNNs in the Methanococcales and Methanobacteriales genomes indicates convergent loss of these tRNAs in the two clades. This clade-specific tRNA loss may be attributing to the reductive evolution of their genomes. Our data suggest that the current tRNA profiles in Archaea are contributed not only by the ancestral tRNA composition, but also by differential maintenance and loss of redundant tRNAs.
Collapse
Affiliation(s)
- Yue Wu
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, China.,Institute of Bioinformatics, University of Georgia, Athens, GA, United States
| | - Ping Wu
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, China
| | - Bin Wang
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, China
| | - Zhu-Qing Shao
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, China
| |
Collapse
|
13
|
Lück A, Klimmasch L, Großmann P, Germerodt S, Kaleta C. Computational Investigation of Environment-Noise Interaction in Single-Cell Organisms: The Merit of Expression Stochasticity Depends on the Quality of Environmental Fluctuations. Sci Rep 2018; 8:333. [PMID: 29321537 PMCID: PMC5762857 DOI: 10.1038/s41598-017-17441-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2017] [Accepted: 11/27/2017] [Indexed: 11/23/2022] Open
Abstract
Organisms need to adapt to changing environments and they do so by using a broad spectrum of strategies. These strategies include finding the right balance between expressing genes before or when they are needed, and adjusting the degree of noise inherent in gene expression. We investigated the interplay between different nutritional environments and the inhabiting organisms’ metabolic and genetic adaptations by applying an evolutionary algorithm to an agent-based model of a concise bacterial metabolism. Our results show that constant environments and rapidly fluctuating environments produce similar adaptations in the organisms, making the predictability of the environment a major factor in determining optimal adaptation. We show that exploitation of expression noise occurs only in some types of fluctuating environment and is strongly dependent on the quality and availability of nutrients: stochasticity is generally detrimental in fluctuating environments and beneficial only at equal periods of nutrient availability and above a threshold environmental richness. Moreover, depending on the availability and nutritional value of nutrients, nutrient-dependent and stochastic expression are both strategies used to deal with environmental changes. Overall, we comprehensively characterize the interplay between the quality and periodicity of an environment and the resulting optimal deterministic and stochastic regulation strategies of nutrient-catabolizing pathways.
Collapse
Affiliation(s)
- Anja Lück
- Department of Bioinformatics, Friedrich Schiller University, Jena, 07743, Germany
| | - Lukas Klimmasch
- Research Group Theoretical Systems Biology, Friedrich Schiller University, Jena, 07743, Germany
| | - Peter Großmann
- Department of Bioinformatics, Friedrich Schiller University, Jena, 07743, Germany
| | - Sebastian Germerodt
- Department of Bioinformatics, Friedrich Schiller University, Jena, 07743, Germany
| | - Christoph Kaleta
- Research Group Medical Systems Biology, Institute for Experimental Medicine, Christian-Albrechts-University, Kiel, 24105, Germany.
| |
Collapse
|
14
|
The Impact of Selection at the Amino Acid Level on the Usage of Synonymous Codons. G3-GENES GENOMES GENETICS 2017; 7:967-981. [PMID: 28122952 PMCID: PMC5345726 DOI: 10.1534/g3.116.038125] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
There are two main forces that affect usage of synonymous codons: directional mutational pressure and selection. The effectiveness of protein translation is usually considered as the main selectional factor. However, biased codon usage can also be a byproduct of a general selection at the amino acid level interacting with nucleotide replacements. To evaluate the validity and strength of such an effect, we superimposed >3.5 billion unrestricted mutational processes on the selection of nonsynonymous substitutions based on the differences in physicochemical properties of the coded amino acids. Using a modified evolutionary optimization algorithm, we determined the conditions in which the effect on the relative codon usage is maximized. We found that the effect is enhanced by mutational processes generating more adenine and thymine than guanine and cytosine, as well as more purines than pyrimidines. Interestingly, this effect is observed only under an unrestricted model of nucleotide substitution, and disappears when the mutational process is time-reversible. Comparison of the simulation results with data for real protein coding sequences indicates that the impact of selection at the amino acid level on synonymous codon usage cannot be neglected. Furthermore, it can considerably interfere, especially in AT-rich genomes, with other selections on codon usage, e.g., translational efficiency. It may also lead to difficulties in the recognition of other effects influencing codon bias, and an overestimation of protein coding sequences whose codon usage is subjected to adaptational selection.
Collapse
|
15
|
Quax TEF, Claassens NJ, Söll D, van der Oost J. Codon Bias as a Means to Fine-Tune Gene Expression. Mol Cell 2015; 59:149-61. [PMID: 26186290 DOI: 10.1016/j.molcel.2015.05.035] [Citation(s) in RCA: 405] [Impact Index Per Article: 45.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
The redundancy of the genetic code implies that most amino acids are encoded by multiple synonymous codons. In all domains of life, a biased frequency of synonymous codons is observed at the genome level, in functionally related genes (e.g., in operons), and within single genes. Other codon bias variants include biased codon pairs and codon co-occurrence. Although translation initiation is the key step in protein synthesis, it is generally accepted that codon bias contributes to translation efficiency by tuning the elongation rate of the process. Moreover, codon bias plays an important role in controlling a multitude of cellular processes, ranging from differential protein production to protein folding. Here we review currently known types of codon bias and how they may influence translation. We discuss how understanding the principles of codon bias and translation can contribute to improved protein production and developments in synthetic biology.
Collapse
Affiliation(s)
- Tessa E F Quax
- Laboratory of Microbiology, Wageningen University, Dreijenplein 10, 6703 HB Wageningen, the Netherlands; Institut für Biologie II, Albert Ludwig Universität Freiburg, Schänzlestrasse 1, 79104 Freiburg, Germany
| | - Nico J Claassens
- Laboratory of Microbiology, Wageningen University, Dreijenplein 10, 6703 HB Wageningen, the Netherlands
| | - Dieter Söll
- Department of Molecular Biophysics and Biochemistry, Yale University, 266 Whitney Avenue, New Haven, CT 06520-8114, USA
| | - John van der Oost
- Laboratory of Microbiology, Wageningen University, Dreijenplein 10, 6703 HB Wageningen, the Netherlands.
| |
Collapse
|
16
|
Decoding mechanisms by which silent codon changes influence protein biogenesis and function. Int J Biochem Cell Biol 2015; 64:58-74. [PMID: 25817479 DOI: 10.1016/j.biocel.2015.03.011] [Citation(s) in RCA: 90] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2014] [Revised: 03/02/2015] [Accepted: 03/14/2015] [Indexed: 02/07/2023]
Abstract
SCOPE Synonymous codon usage has been a focus of investigation since the discovery of the genetic code and its redundancy. The occurrences of synonymous codons vary between species and within genes of the same genome, known as codon usage bias. Today, bioinformatics and experimental data allow us to compose a global view of the mechanisms by which the redundancy of the genetic code contributes to the complexity of biological systems from affecting survival in prokaryotes, to fine tuning the structure and function of proteins in higher eukaryotes. Studies analyzing the consequences of synonymous codon changes in different organisms have revealed that they impact nucleic acid stability, protein levels, structure and function without altering amino acid sequence. As such, synonymous mutations inevitably contribute to the pathogenesis of complex human diseases. Yet, fundamental questions remain unresolved regarding the impact of silent mutations in human disorders. In the present review we describe developments in this area concentrating on mechanisms by which synonymous mutations may affect protein function and human health. PURPOSE This synopsis illustrates the significance of synonymous mutations in disease pathogenesis. We review the different steps of gene expression affected by silent mutations, and assess the benefits and possible harmful effects of codon optimization applied in the development of therapeutic biologics. PHYSIOLOGICAL AND MEDICAL RELEVANCE Understanding mechanisms by which synonymous mutations contribute to complex diseases such as cancer, neurodegeneration and genetic disorders, including the limitations of codon-optimized biologics, provides insight concerning interpretation of silent variants and future molecular therapies.
Collapse
|
17
|
Oßwald C, Zipf G, Schmidt G, Maier J, Bernauer HS, Müller R, Wenzel SC. Modular construction of a functional artificial epothilone polyketide pathway. ACS Synth Biol 2014; 3:759-72. [PMID: 23654254 DOI: 10.1021/sb300080t] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
Natural products of microbial origin continue to be an important source of pharmaceuticals and agrochemicals exhibiting potent activities and often novel modes of action. Due to their inherent structural complexity chemical synthesis is often hardly possible, leaving fermentation as the only viable production route. In addition, the pharmaceutical properties of natural products often need to be optimized for application by sophisticated medicinal chemistry and/or biosynthetic engineering. The latter requires a detailed understanding of the biosynthetic process and genetic tools to modify the producing organism that are often unavailable. Consequently, heterologous expression of complex natural product pathways has been in the focus of development over recent years. However, piecing together existing DNA cloned from natural sources and achieving efficient expression in heterologous circuits represent several limitations that can be addressed by synthetic biology. In this work we have redesigned and reassembled the 56 kb epothilone biosynthetic gene cluster from Sorangium cellulosum for expression in the high GC host Myxococcus xanthus. The codon composition was adapted to a modified codon table for M. xanthus, and unique restriction sites were simultaneously introduced and others eliminated from the sequence in order to permit pathway assembly and future interchangeability of modular building blocks from the epothilone megasynthetase. The functionality of the artificial pathway was demonstrated by successful heterologous epothilone production in M. xanthus at significant yields that have to be improved in upcoming work. Our study sets the stage for future engineering of epothilone biosynthesis and production optimization using a highly flexible assembly strategy.
Collapse
Affiliation(s)
- Corina Oßwald
- Department
of Microbial Natural Products, Helmholtz Institute for Pharmaceutical
Research Saarland, Helmholtz Centre for Infection Research and Pharmaceutical
Biotechnology, Saarland University, Saarbrücken,
Germany
| | | | | | - Josef Maier
- IStLS, Information Services to Life Science, Oberndorf a.N., Germany
| | | | - Rolf Müller
- Department
of Microbial Natural Products, Helmholtz Institute for Pharmaceutical
Research Saarland, Helmholtz Centre for Infection Research and Pharmaceutical
Biotechnology, Saarland University, Saarbrücken,
Germany
| | - Silke C. Wenzel
- Department
of Microbial Natural Products, Helmholtz Institute for Pharmaceutical
Research Saarland, Helmholtz Centre for Infection Research and Pharmaceutical
Biotechnology, Saarland University, Saarbrücken,
Germany
| |
Collapse
|
18
|
Wald N, Margalit H. Auxiliary tRNAs: large-scale analysis of tRNA genes reveals patterns of tRNA repertoire dynamics. Nucleic Acids Res 2014; 42:6552-66. [PMID: 24782525 PMCID: PMC4041420 DOI: 10.1093/nar/gku245] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Decoding of all codons can be achieved by a subset of tRNAs. In bacteria, certain tRNA species are mandatory, while others are auxiliary and are variably used. It is currently unknown how this variability has evolved and whether it provides an adaptive advantage. Here we shed light on the subset of auxiliary tRNAs, using genomic data from 319 bacteria. By reconstructing the evolution of tRNAs we show that the auxiliary tRNAs are highly dynamic, being frequently gained and lost along the phylogenetic tree, with a clear dominance of loss events for most auxiliary tRNA species. We reveal distinct co-gain and co-loss patterns for subsets of the auxiliary tRNAs, suggesting that they are subjected to the same selection forces. Controlling for phylogenetic dependencies, we find that the usage of these tRNA species is positively correlated with GC content and may derive directly from nucleotide bias or from preference of Watson-Crick codon-anticodon interactions. Our results highlight the highly dynamic nature of these tRNAs and their complicated balance with codon usage.
Collapse
Affiliation(s)
- Naama Wald
- Department of Microbiology and Molecular Genetics, IMRIC, Faculty of Medicine, The Hebrew University of Jerusalem, Jerusalem 91120, Israel
| | - Hanah Margalit
- Department of Microbiology and Molecular Genetics, IMRIC, Faculty of Medicine, The Hebrew University of Jerusalem, Jerusalem 91120, Israel
| |
Collapse
|
19
|
Godinic-Mikulcic V, Jaric J, Greber BJ, Franke V, Hodnik V, Anderluh G, Ban N, Weygand-Durasevic I. Archaeal aminoacyl-tRNA synthetases interact with the ribosome to recycle tRNAs. Nucleic Acids Res 2014; 42:5191-201. [PMID: 24569352 PMCID: PMC4005694 DOI: 10.1093/nar/gku164] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022] Open
Abstract
Aminoacyl-tRNA synthetases (aaRS) are essential enzymes catalyzing the formation of aminoacyl-tRNAs, the immediate precursors for encoded peptides in ribosomal protein synthesis. Previous studies have suggested a link between tRNA aminoacylation and high-molecular-weight cellular complexes such as the cytoskeleton or ribosomes. However, the structural basis of these interactions and potential mechanistic implications are not well understood. To biochemically characterize these interactions we have used a system of two interacting archaeal aaRSs: an atypical methanogenic-type seryl-tRNA synthetase and an archaeal ArgRS. More specifically, we have shown by thermophoresis and surface plasmon resonance that these two aaRSs bind to the large ribosomal subunit with micromolar affinities. We have identified the L7/L12 stalk and the proteins located near the stalk base as the main sites for aaRS binding. Finally, we have performed a bioinformatics analysis of synonymous codons in the Methanothermobacter thermautotrophicus genome that supports a mechanism in which the deacylated tRNAs may be recharged by aaRSs bound to the ribosome and reused at the next occurrence of a codon encoding the same amino acid. These results suggest a mechanism of tRNA recycling in which aaRSs associate with the L7/L12 stalk region to recapture the tRNAs released from the preceding ribosome in polysomes.
Collapse
Affiliation(s)
- Vlatka Godinic-Mikulcic
- Department of Chemistry, Faculty of Science, University of Zagreb, Horvatovac 102A, HR-10000 Zagreb, Croatia, Institute of Molecular Biology and Biophysics, ETH Zurich, Otto-Stern-Weg 5, 8093 Zurich, Switzerland, Department of Molecular Biology, Faculty of Science, University of Zagreb, Horvatovac 102A, HR-10000 Zagreb, Croatia, Department of Biology, Biotechnical Faculty, University of Ljubljana, Večna pot 111, 1000 Ljubljana, Slovenia and Laboratory for Molecular Biology and Nanobiotechnology, National Institute of Chemistry, Hajdrihova 19, 1000 Ljubljana, Slovenia
| | | | | | | | | | | | | | | |
Collapse
|
20
|
Feng C, Xu CJ, Wang Y, Liu WL, Yin XR, Li X, Chen M, Chen KS. Codon usage patterns in Chinese bayberry (Myrica rubra) based on RNA-Seq data. BMC Genomics 2013; 14:732. [PMID: 24160180 PMCID: PMC4008310 DOI: 10.1186/1471-2164-14-732] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2013] [Accepted: 10/21/2013] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Codon usage analysis has been a classical topic for decades and has significances for studies of evolution, mRNA translation, and new gene discovery, etc. While the codon usage varies among different members of the plant kingdom, indicating the necessity for species-specific study, this work has mostly been limited to model organisms. Recently, the development of deep sequencing, especial RNA-Seq, has made it possible to carry out studies in non-model species. RESULT RNA-Seq data of Chinese bayberry was analyzed to investigate the bias of codon usage and codon pairs. High frequency codons (AGG, GCU, AAG and GAU), as well as low frequency ones (NCG and NUA codons) were identified, and 397 high frequency codon pairs were observed. Meanwhile, 26 preferred and 141 avoided neighboring codon pairs were also identified, which showed more significant bias than the same pairs with one or more intervening codons. Codon patterns were also analyzed at the plant kingdom, organism and gene levels. Changes during plant evolution were evident using RSCU (relative synonymous codon usage), which was even more significant than GC3s (GC content of 3rd synonymous codons). Nine GO categories were differentially and independently influenced by CAI (codon adaptation index) or GC3s, especially in 'Molecular function' category. Within a gene, the average CAI increased from 0.720 to 0.785 in the first 50 codons, and then more slowly thereafter. Furthermore, the preferred as well as avoided codons at the position just following the start codon AUG were identified and discussed in relation to the key positions in Kozak sequences. CONCLUSION A comprehensive codon usage Table and number of high-frequency codon pairs were established. Bias in codon usage as well as in neighboring codon pairs was observed, and the significance of this in avoiding DNA mutation, increasing protein production and regulating protein synthesis rate was proposed. Codon usage patterns at three levels were revealed and the significance in plant evolution analysis, gene function classification, and protein translation start site predication were discussed. This work promotes the study of codon biology, and provides some reference for analysis and comprehensive application of RNA-Seq data from other non-model species.
Collapse
Affiliation(s)
- Chao Feng
- Laboratory of Fruit Quality Biology / The State Agriculture Ministry Laboratory of Horticultural Plant Growth, Development and Quality Improvement, Zhejiang University, Hangzhou, 310058, China
| | - Chang-jie Xu
- Laboratory of Fruit Quality Biology / The State Agriculture Ministry Laboratory of Horticultural Plant Growth, Development and Quality Improvement, Zhejiang University, Hangzhou, 310058, China
| | - Yue Wang
- Department of Bioinformatics / The State Key Laboratory of Plant Physiology and Biochemistry, College of Life Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Wen-li Liu
- Department of Mathematics, Zhejiang University, Hangzhou, 310027, China
| | - Xue-ren Yin
- Laboratory of Fruit Quality Biology / The State Agriculture Ministry Laboratory of Horticultural Plant Growth, Development and Quality Improvement, Zhejiang University, Hangzhou, 310058, China
| | - Xian Li
- Laboratory of Fruit Quality Biology / The State Agriculture Ministry Laboratory of Horticultural Plant Growth, Development and Quality Improvement, Zhejiang University, Hangzhou, 310058, China
| | - Ming Chen
- Department of Bioinformatics / The State Key Laboratory of Plant Physiology and Biochemistry, College of Life Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Kun-song Chen
- Laboratory of Fruit Quality Biology / The State Agriculture Ministry Laboratory of Horticultural Plant Growth, Development and Quality Improvement, Zhejiang University, Hangzhou, 310058, China
| |
Collapse
|
21
|
Quax TEF, Wolf YI, Koehorst JJ, Wurtzel O, van der Oost R, Ran W, Blombach F, Makarova KS, Brouns SJJ, Forster AC, Wagner EGH, Sorek R, Koonin EV, van der Oost J. Differential translation tunes uneven production of operon-encoded proteins. Cell Rep 2013; 4:938-44. [PMID: 24012761 DOI: 10.1016/j.celrep.2013.07.049] [Citation(s) in RCA: 52] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2012] [Revised: 07/15/2013] [Accepted: 07/31/2013] [Indexed: 12/27/2022] Open
Abstract
Clustering of functionally related genes in operons allows for coregulated gene expression in prokaryotes. This is advantageous when equal amounts of gene products are required. Production of protein complexes with an uneven stoichiometry, however, requires tuning mechanisms to generate subunits in appropriate relative quantities. Using comparative genomic analysis, we show that differential translation is a key determinant of modulated expression of genes clustered in operons and that codon bias generally is the best in silico indicator of unequal protein production. Variable ribosome density profiles of polycistronic transcripts correlate strongly with differential translation patterns. In addition, we provide experimental evidence that de novo initiation of translation can occur at intercistronic sites, allowing for differential translation of any gene irrespective of its position on a polycistronic messenger. Thus, modulation of translation efficiency appears to be a universal mode of control in bacteria and archaea that allows for differential production of operon-encoded proteins.
Collapse
Affiliation(s)
- Tessa E F Quax
- Laboratory of Microbiology, Wageningen University, Dreijenplein 10, 6703 HB Wageningen, The Netherlands.
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
22
|
Zhang YM, Shao ZQ, Yang LT, Sun XQ, Mao YF, Chen JQ, Wang B. Non-random arrangement of synonymous codons in archaea coding sequences. Genomics 2013; 101:362-7. [PMID: 23603537 DOI: 10.1016/j.ygeno.2013.04.008] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2012] [Revised: 04/08/2013] [Accepted: 04/09/2013] [Indexed: 01/28/2023]
Abstract
Non-random arrangement of synonymous codons in coding sequences has been recently reported in eukaryotic and bacterial genomes, but the case in archaeal genomes is largely undetermined. Here, we systematically investigated 122 archaeal genomes for their synonymous codon co-occurrence patterns. We found that in most archaeal coding sequences, the order of synonymous codons is not arranged randomly, but rather some successive codon pairs appear significantly more often than expected. Importantly, such codon pairing bias (CPB) pattern in archaea does not seem to completely follow the co-tRNA codon pairing (CCP) rule previously reported for eukaryotes, but largely obeys an identical codon pairing (ICP) rule. Further, synonymous codon permutation test demonstrated that in many archaeal genomes, random mutation alone is unable to cause the observed high level of ICP bias, which strongly indicates that selection force has been involved to shape synonymous codon orders, potentially meeting a global requirement to optimize translation rate.
Collapse
Affiliation(s)
- Yan-Mei Zhang
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, Jiangsu Province, 210093, China.
| | | | | | | | | | | | | |
Collapse
|
23
|
Shao ZQ, Zhang YM, Pan XZ, Wang B, Chen JQ. Insight into the evolution of the histidine triad protein (HTP) family in Streptococcus. PLoS One 2013; 8:e60116. [PMID: 23527301 PMCID: PMC3603884 DOI: 10.1371/journal.pone.0060116] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2012] [Accepted: 02/21/2013] [Indexed: 12/19/2022] Open
Abstract
The Histidine Triad Proteins (HTPs), also known as Pht proteins in Streptococcus pneumoniae, constitute a family of surface-exposed proteins that exist in many pathogenic streptococcal species. Although many studies have revealed the importance of HTPs in streptococcal physiology and pathogenicity, little is known about their origin and evolution. In this study, after identifying all htp homologs from 105 streptococcal genomes representing 38 different species/subspecies, we analyzed their domain structures, positions in genome, and most importantly, their evolutionary histories. By further projecting this information onto the streptococcal phylogeny, we made several major findings. First, htp genes originated earlier than the Streptococcus genus and gene-loss events have occurred among three streptococcal groups, resulting in the absence of the htp gene in the Bovis, Mutans and Salivarius groups. Second, the copy number of htp genes in other groups of Streptococcus is variable, ranging from one to four functional copies. Third, both phylogenetic evidence and domain structure analyses support the division of two htp subfamilies, designated as htp I and htp II. Although present mainly in the pyogenic group and in Streptococcus suis, htp II members are distinct from htp I due to the presence of an additional leucine-rich-repeat domain at the C-terminus. Finally, htp genes exhibit a faster nucleotide substitution rate than do housekeeping genes. Specifically, the regions outside the HTP domains are under strong positive selection. This distinct evolutionary pattern likely helped Streptococcus to easily escape from recognition by host immunity.
Collapse
Affiliation(s)
- Zhu-Qing Shao
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, Jiangsu Province, China
| | - Yan-Mei Zhang
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, Jiangsu Province, China
| | - Xiu-Zhen Pan
- Department of Epidemiology, Research Institute for Medicine of Nanjing Command, Nanjing, China
| | - Bin Wang
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, Jiangsu Province, China
- * E-mail: (BW); (JQC)
| | - Jian-Qun Chen
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, Jiangsu Province, China
- * E-mail: (BW); (JQC)
| |
Collapse
|