1
İncir İ, Kaplan Ö. Escherichia coli as a versatile cell factory: Advances and challenges in recombinant protein production. Protein Expr Purif 2024; 219:106463. [PMID: 38479588 DOI: 10.1016/j.pep.2024.106463] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/03/2024] [Revised: 02/25/2024] [Accepted: 03/11/2024] [Indexed: 05/08/2024]
Abstract
E. coli plays a substantial role in recombinant protein production. Its importance increased with the discovery of recombinant DNA technology and the subsequent production of the first recombinant insulin in E. coli. E. coli is a widely used and cost-effective host to produce recombinant proteins. It is also noteworthy that a significant portion of the approved therapeutic proteins have been produced in this organism. Despite these advantages, it has some disadvantages, such as toxicity and lack of eukaryotic post-translational modifications that can lead to the production of misfolded, insoluble, or dysfunctional proteins. This study focused on the challenges and engineering approaches for improved expression and solubility in recombinant protein production in E. coli. In this context, solution strategies such as strain and vector selection, codon usage, mRNA stability, expression conditions, translocation to the periplasmic region and addition of fusion tags in E. coli were discussed.
Affiliation(s)
- İbrahim İncir
- Karamanoğlu Mehmetbey University, Kazım Karabekir Vocational School, Department of Medical Services and Techniques, Environmental Health Program, Karaman, Turkey.
- Özlem Kaplan
- Alanya Alaaddin Keykubat University, Rafet Kayış Faculty of Engineering, Department of Genetics and Bioengineering, Antalya, Turkey.
2
Moss MJ, Chamness LM, Clark PL. The Effects of Codon Usage on Protein Structure and Folding. Annu Rev Biophys 2024; 53:87-108. [PMID: 38134335 PMCID: PMC11227313 DOI: 10.1146/annurev-biophys-030722-020555] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 12/24/2023]
Abstract
The rate of protein synthesis is slower than many folding reactions and varies depending on the synonymous codons encoding the protein sequence. Synonymous codon substitutions thus have the potential to regulate cotranslational protein folding mechanisms, and a growing number of proteins have been identified with folding mechanisms sensitive to codon usage. Typically, these proteins have complex folding pathways and kinetically stable native structures. Kinetically stable proteins may fold only once over their lifetime, and thus, codon-mediated regulation of the pioneer round of protein folding can have a lasting impact. Supporting an important role for codon usage in folding, conserved patterns of codon usage appear in homologous gene families, hinting at selection. Despite these exciting developments, there remain few experimental methods capable of quantifying translation elongation rates and cotranslational folding mechanisms in the cell, which challenges the development of a predictive understanding of how biology uses codons to regulate protein folding.
Affiliation(s)
- McKenze J Moss
- Department of Chemistry & Biochemistry, University of Notre Dame, Notre Dame, Indiana, USA
- Laura M Chamness
- Department of Chemistry & Biochemistry, University of Notre Dame, Notre Dame, Indiana, USA
- Patricia L Clark
- Department of Chemistry & Biochemistry, University of Notre Dame, Notre Dame, Indiana, USA
3
Davyt M, Bharti N, Ignatova Z. Effect of mRNA/tRNA mutations on translation speed: Implications for human diseases. J Biol Chem 2023; 299:105089. [PMID: 37495112 PMCID: PMC10470029 DOI: 10.1016/j.jbc.2023.105089] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Received: 03/05/2023] [Revised: 07/18/2023] [Accepted: 07/20/2023] [Indexed: 07/28/2023] Open
Abstract
Recent discoveries establish tRNAs as central regulators of mRNA translation dynamics, and therefore cotranslational folding and function of the encoded protein. The tRNA pool, whose composition and abundance change in a cell- and tissue-dependent manner, is the main factor which determines mRNA translation velocity. In this review, we discuss a group of pathogenic mutations, in the coding sequences of either protein-coding genes or in tRNA genes, that alter mRNA translation dynamics. We also summarize advances in tRNA biology that have uncovered how variations in tRNA levels on account of genetic mutations affect protein folding and function, and thereby contribute to phenotypic diversity in clinical manifestations.
Affiliation(s)
- Marcos Davyt
- Institute of Biochemistry and Molecular Biology, University of Hamburg, Hamburg, Germany
- Nikhil Bharti
- Institute of Biochemistry and Molecular Biology, University of Hamburg, Hamburg, Germany
- Zoya Ignatova
- Institute of Biochemistry and Molecular Biology, University of Hamburg, Hamburg, Germany.
4
Gurjar P, Karuvantevida N, Rzhepakovsky IV, Khan AA, Khandia R. A Synthetic Biology Approach for Vaccine Candidate Design against Delta Strain of SARS-CoV-2 Revealed Disruption of Favored Codon Pair as a Better Strategy over Using Rare Codons. Vaccines (Basel) 2023; 11:487. [PMID: 36851364 PMCID: PMC9967482 DOI: 10.3390/vaccines11020487] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 12/25/2022] [Revised: 02/13/2023] [Accepted: 02/17/2023] [Indexed: 02/22/2023] Open
Abstract
The SARS-CoV-2 delta variant (B.1.617.2) appeared for the first time in December 2020 and later spread worldwide. Currently available vaccines are not sufficiently efficacious in curbing the viral pathogenesis of the delta strain of COVID; therefore, the development of a safe and effective vaccine is required. In the present study, we examined molecular patterns in the structural genes (spike, nucleoprotein, membrane, and envelope) of the SARS-CoV-2 delta variant. The study was based on determining compositional features, dinucleotide odds ratio, synonymous codon usage, positive and negative codon contexts, rare codons, and insight into relatedness between the human host isoacceptor tRNA and preferred codons from the structural genes. We found specific patterns, including a significant abundance of the T nucleotide over the other three nucleotides. The underrepresentation of GpA, GpG, CpC, and CpG dinucleotides and the overrepresentation of TpT, ApA, CpT, and TpG were observed. A preference towards ACT- (Thr), AAT- (Asn), TTT- (Phe), and TTG- (Leu) initiated codons and aversion towards CGG (Arg), CCG (Pro), and CAC (His) was present in the structural genes of the delta strain. The interaction between the host tRNA pool and preferred codons of the examined structural genes revealed that the virus preferred codons for which suboptimal numbers of isoacceptor tRNAs were present. We see this as a strategy adopted by the virus to keep the translation rate low to facilitate the correct folding of viral proteins. The information generated in the study helps in designing an attenuated vaccine candidate against the SARS-CoV-2 delta variant using a synthetic biology approach. Three strategies were tested: changing TpT to TpA, introducing rare codons, and disrupting favored codons. It was found that disrupting favored codons is a better approach to reducing virus fitness and attenuating the SARS-CoV-2 delta strain using structural genes.
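The dinucleotide odds ratio named in this abstract can be illustrated with a short sketch. It assumes the commonly used definition, the observed frequency of a dinucleotide divided by the product of its two mononucleotide frequencies; the abstract does not state which formula or software the authors used, and the function name `dinucleotide_odds_ratios` is hypothetical.

```python
from collections import Counter


def dinucleotide_odds_ratios(seq: str) -> dict[str, float]:
    """Return f_xy / (f_x * f_y) for each dinucleotide observed in seq.

    Assumption: single-strand, overlapping dinucleotides; this is an
    illustrative definition, not necessarily the one used in the study.
    """
    seq = seq.upper()
    if len(seq) < 2:
        raise ValueError("need at least two nucleotides")
    mono = Counter(seq)
    # Overlapping dinucleotides: positions (0,1), (1,2), ...
    di = Counter(seq[i:i + 2] for i in range(len(seq) - 1))
    n_mono = len(seq)
    n_di = len(seq) - 1
    ratios = {}
    for pair, count in di.items():
        f_xy = count / n_di
        f_x = mono[pair[0]] / n_mono
        f_y = mono[pair[1]] / n_mono
        ratios[pair] = f_xy / (f_x * f_y)
    return ratios
```

A ratio above 1 means the dinucleotide occurs more often than its base composition alone would predict, and a ratio below 1 means it occurs less often; the cutoffs used to call over- or underrepresentation are not given in the abstract.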
Affiliation(s)
- Pankaj Gurjar
- Department of Science and Engineering, Novel Global Community Educational Foundation, Hebersham, NSW 2770, Australia
- Noushad Karuvantevida
- College of Medicine, Mohammed Bin Rashid University of Medicine and Health Sciences, Dubai P.O. Box 505055, United Arab Emirates
- Azmat Ali Khan
- Pharmaceutical Biotechnology Laboratory, Department of Pharmaceutical Chemistry, College of Pharmacy, King Saud University, Riyadh 11451, Saudi Arabia
- Rekha Khandia
- Department of Biochemistry and Genetics, Barkatullah University, Bhopal 462026, India
5
Fages‐Lartaud M, Hundvin K, Hohmann‐Marriott MF. Mechanisms governing codon usage bias and the implications for protein expression in the chloroplast of Chlamydomonas reinhardtii. Plant J 2022; 112:919-945. [PMID: 36071273 PMCID: PMC9828097 DOI: 10.1111/tpj.15970] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 03/08/2022] [Revised: 08/29/2022] [Accepted: 09/01/2022] [Indexed: 05/30/2023]
Abstract
Chloroplasts possess a considerably reduced genome that is decoded via an almost minimal set of tRNAs. These features make an excellent platform for gaining insights into fundamental mechanisms that govern protein expression. Here, we present a comprehensive and revised perspective of the mechanisms that drive codon selection in the chloroplast of Chlamydomonas reinhardtii and the functional consequences for protein expression. In order to extract this information, we applied several codon usage descriptors to genes with different expression levels. We show that highly expressed genes strongly favor translationally optimal codons, while genes with lower functional importance are rather affected by directional mutational bias. We demonstrate that codon optimality can be deduced from codon-anticodon pairing affinity and, for a small number of amino acids (leucine, arginine, serine, and isoleucine), tRNA concentrations. Finally, we review, analyze, and expand on the impact of codon usage on protein yield, secondary structures of mRNA, translation initiation and termination, and amino acid composition of proteins, as well as cotranslational protein folding. The comprehensive analysis of codon choice provides crucial insights into heterologous gene expression in the chloroplast of C. reinhardtii, which may also be applicable to other chloroplast-containing organisms and bacteria.
Affiliation(s)
- Maxime Fages‐Lartaud
- Department of Biotechnology, Norwegian University of Science and Technology, Trondheim N‐7491, Norway
- Kristoffer Hundvin
- Department of Biotechnology, Norwegian University of Science and Technology, Trondheim N‐7491, Norway
6
Abstract
In the past 20 years, the mRNA vaccine technology has evolved from the first proof of concept to the first licensed vaccine against emerging pandemics such as SARS-CoV-2. Two mRNA vaccines targeting SARS-CoV-2 have received emergency use authorization by US FDA, conditional marketing authorization by EMA, as well as multiple additional national regulatory authorities. The simple composition of an mRNA encoding the antigen formulated in a lipid nanoparticle enables a fast adaptation to new emerging pathogens. This can speed up vaccine development in pandemics from antigen and sequence selection to clinical trial to only a few months. mRNA vaccines are well tolerated and efficacious in animal models for multiple pathogens and will further contribute to the development of vaccines for other unaddressed diseases. Here, we give an overview of the mRNA vaccine design and factors for further optimization of this new promising technology and discuss current knowledge on the mode of action of mRNA vaccines interacting with the innate and adaptive immune system.
7
Wang S, Cheng Y, Liu S, Xu Y, Gao Y, Wang C, Wang Z, Feng T, Lu G, Song J, Xia P, Hao L. A synonymous mutation in IGF-1 impacts the transcription and translation process of gene expression. Mol Ther Nucleic Acids 2021; 26:1446-1465. [PMID: 34938600 PMCID: PMC8655398 DOI: 10.1016/j.omtn.2021.08.007] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Received: 03/25/2021] [Accepted: 08/10/2021] [Indexed: 11/18/2022]
Abstract
Insulin-like growth factor 1 (IGF-1) is considered to be a crucial gene in the animal development of bone and body size. In this study, a unique synonymous mutation (c.258 A > G) of the IGF-1 gene was modified with an adenine base editor to observe the growth and developmental situation of mutant mice. Significant expression differences and molecular mechanisms among vectors with different alanine synonymous codons were explored. Although modification of a single synonymous codon rarely interferes with animal phenotypes, we observed that the expression and secretion of IGF-1 were different between 8-week-old homozygous (Ho) and wild-type (WT) mice. In addition, the IGF-1 with optimal codon combinations showed a higher expression content than other codon combination modes at both transcription and translation levels and performed proliferation promotion. The gene stability and translation initiation efficiency also changed significantly. Our findings illustrated that the synonymous mutation altered the IGF-1 gene expression in individual mice and suggested that the synonymous mutation affected the IGF-1 expression and biological function through the transcription and translation processes.
Affiliation(s)
- S.Y. Wang
- College of Animal Science, Jilin University, Changchun 130062, China
- Beijing Advanced Innovation Centre for Biomedical Engineering, School of Biological Science and Medical Engineering, Beihang University, Beijing, China
- Y.Y. Cheng
- Ministry of Health Key Laboratory of Radiobiology, College of Public Health, Jilin University, Changchun, Jilin 130021, China
- S.C. Liu
- College of Animal Science, Jilin University, Changchun 130062, China
- Y.X. Xu
- College of Animal Science, Jilin University, Changchun 130062, China
- Y. Gao
- College of Animal Science, Jilin University, Changchun 130062, China
- C.L. Wang
- College of Animal Science, Jilin University, Changchun 130062, China
- Z.G. Wang
- College of Animal Science, Jilin University, Changchun 130062, China
- T.Q. Feng
- College of Animal Science, Jilin University, Changchun 130062, China
- G.H. Lu
- College of Animal Science, Jilin University, Changchun 130062, China
- J. Song
- College of Animal Science, Jilin University, Changchun 130062, China
- P.J. Xia
- College of Animal Science, Jilin University, Changchun 130062, China
- L.L. Hao
- College of Animal Science, Jilin University, Changchun 130062, China
- Corresponding author: Linlin Hao, College of Animal Science, Jilin University, Changchun 130062, China.
8
Abstract
Codon usage bias is the preferential or non-random use of synonymous codons, a ubiquitous phenomenon observed in bacteria, plants and animals. Different species have consistent and characteristic codon biases. Codon bias varies not only with species, family or group within a kingdom, but also between the genes within an organism. Codon usage bias has evolved through mutation, natural selection, and genetic drift in various organisms. Genome composition, GC content, expression level and length of genes, position and context of codons in the genes, recombination rates, mRNA folding, and tRNA abundance and interactions are some of the factors influencing codon bias. The factors shaping codon bias may also be involved in the evolution of the universal genetic code. Codon usage bias is a critical factor determining gene expression and cellular function by influencing diverse processes such as RNA processing, protein translation and protein folding. Codon usage bias reflects the origin, mutation patterns and evolution of the species or genes. Investigations of codon bias patterns in genomes can reveal phylogenetic relationships between organisms, horizontal gene transfers and the molecular evolution of genes, and can identify the selective forces that drive their evolution. The most important application of codon bias analysis is in the design of transgenes, to increase gene expression levels through codon optimization, for the development of transgenic crops. The review gives an overview of deviations of the genetic code, factors influencing codon usage or bias, codon usage bias of nuclear and organellar genes, computational methods to determine codon usage, and the significance as well as applications of codon usage analysis in biological research, with emphasis on plants.
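As one concrete instance of a computational measure of codon usage, the sketch below computes relative synonymous codon usage (RSCU): the observed count of a codon divided by the count expected if all synonymous codons for that amino acid were used equally. The abstract does not name the specific methods the review covers, so the choice of RSCU, the standard genetic code table and the function name `rscu` are assumptions made for illustration.

```python
from collections import Counter
from itertools import product

# Standard genetic code, built in TCAG order (assumption: nuclear standard code;
# organellar codes with deviations would need a different table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def rscu(cds: str) -> dict[str, float]:
    """Return RSCU values for every sense codon of each amino acid seen in cds."""
    cds = cds.upper().replace("U", "T")
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    counts = Counter(cds[i:i + 3] for i in range(0, usable, 3))

    # Group sense codons into synonymous families, skipping stop codons.
    families: dict[str, list[str]] = {}
    for codon, aa in CODON_TABLE.items():
        if aa != "*":
            families.setdefault(aa, []).append(codon)

    values = {}
    for family in families.values():
        total = sum(counts[c] for c in family)
        if total == 0:
            continue  # amino acid absent from this sequence
        for codon in family:
            values[codon] = counts[codon] * len(family) / total
    return values
```

An RSCU of 1 indicates no bias within a synonymous family; values above 1 mark codons used more often than expected under equal usage.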
Affiliation(s)
- Varatharajalu Udayasuriyan
- Department of Biotechnology, Centre for Plant Molecular Biology and Biotechnology, Tamil Nadu Agricultural University, Coimbatore, 641003, India
- Vijaipal Bhadana
- ICAR-Indian Institute of Agricultural Biotechnology, Ranchi, Jharkhand, 834010, India
9
Liu Y, Yang Q, Zhao F. Synonymous but Not Silent: The Codon Usage Code for Gene Expression and Protein Folding. Annu Rev Biochem 2021; 90:375-401. [PMID: 33441035 DOI: 10.1146/annurev-biochem-071320-112701] [Citation(s) in RCA: 104] [Impact Index Per Article: 26.0] [Indexed: 12/11/2022]
Abstract
Codon usage bias, the preference for certain synonymous codons, is found in all genomes. Although synonymous mutations were previously thought to be silent, a large body of evidence has demonstrated that codon usage can play major roles in determining gene expression levels and protein structures. Codon usage influences translation elongation speed and regulates translation efficiency and accuracy. Adaptation of codon usage to tRNA expression determines the proteome landscape. In addition, codon usage biases result in nonuniform ribosome decoding rates on mRNAs, which in turn influence the cotranslational protein folding process that is critical for protein function in diverse biological processes. Conserved genome-wide correlations have also been found between codon usage and protein structures. Furthermore, codon usage is a major determinant of mRNA levels through translation-dependent effects on mRNA decay and translation-independent effects on transcriptional and posttranscriptional processes. Here, we discuss the multifaceted roles and mechanisms of codon usage in different gene regulatory processes.
Affiliation(s)
- Yi Liu
- Department of Physiology, University of Texas Southwestern Medical Center, Dallas, Texas 75390-9040, USA
- Qian Yang
- Department of Physiology, University of Texas Southwestern Medical Center, Dallas, Texas 75390-9040, USA
- Fangzhou Zhao
- Department of Physiology, University of Texas Southwestern Medical Center, Dallas, Texas 75390-9040, USA
10
Fields FR, Suresh N, Hiller M, Freed SD, Haldar K, Lee SW. Algorithmic assessment of missense mutation severity in the Von-Hippel Lindau protein. PLoS One 2020; 15:e0234100. [PMID: 33151962 PMCID: PMC7644048 DOI: 10.1371/journal.pone.0234100] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Received: 05/15/2020] [Accepted: 10/10/2020] [Indexed: 11/19/2022] Open
Abstract
Von Hippel-Lindau disease (VHL) is an autosomal dominant rare disease that causes the formation of angiogenic tumors. When functional, pVHL acts as an E3 ubiquitin ligase that negatively regulates hypoxia inducible factor (HIF). Genetic mutations that perturb the structure of pVHL result in dysregulation of HIF, causing a wide array of tumor pathologies including retinal angioma, pheochromocytoma, central nervous system hemangioblastoma, and clear cell renal carcinoma. These VHL-related cancers occur throughout the lifetime of the patient, requiring frequent intervention procedures, such as surgery, to remove the tumors. Although VHL is classified as a rare disease (1 in 39,000 to 1 in 91,000 affected) there is a large heterogeneity in genetic mutations listed for observed pathologies. Understanding how these specific mutations correlate with the myriad of observed pathologies for VHL could provide clinicians insight into the potential severity and onset of disease. Using a select set of 285 ClinVar mutations in VHL, we developed a multiparametric scoring algorithm to evaluate the overall clinical severity of missense mutations in pVHL. The mutations were assessed according to eight weighted parameters as a comprehensive evaluation of protein misfolding and malfunction. Higher mutation scores were strongly associated with pathogenicity. Our approach establishes a novel in silico method by which VHL-specific mutations can be assessed for their severity and effect on the biophysical functions of the VHL protein.
Affiliation(s)
- Francisco R. Fields
- Boler-Parseghian Center for Rare and Neglected Diseases, University of Notre Dame, Notre Dame, Indiana, United States of America
- Department of Biological Sciences, University of Notre Dame, Notre Dame, Indiana, United States of America
- Niraja Suresh
- Boler-Parseghian Center for Rare and Neglected Diseases, University of Notre Dame, Notre Dame, Indiana, United States of America
- Department of Biological Sciences, University of Notre Dame, Notre Dame, Indiana, United States of America
- Morgan Hiller
- Boler-Parseghian Center for Rare and Neglected Diseases, University of Notre Dame, Notre Dame, Indiana, United States of America
- Department of Biological Sciences, University of Notre Dame, Notre Dame, Indiana, United States of America
- Stefan D. Freed
- Boler-Parseghian Center for Rare and Neglected Diseases, University of Notre Dame, Notre Dame, Indiana, United States of America
- Department of Biological Sciences, University of Notre Dame, Notre Dame, Indiana, United States of America
- Chemistry-Biology-Biochemistry Interfaces, University of Notre Dame, Notre Dame, Indiana, United States of America
- Kasturi Haldar
- Boler-Parseghian Center for Rare and Neglected Diseases, University of Notre Dame, Notre Dame, Indiana, United States of America
- Department of Biological Sciences, University of Notre Dame, Notre Dame, Indiana, United States of America
- Shaun W. Lee
- Boler-Parseghian Center for Rare and Neglected Diseases, University of Notre Dame, Notre Dame, Indiana, United States of America
- Department of Biological Sciences, University of Notre Dame, Notre Dame, Indiana, United States of America
- Chemistry-Biology-Biochemistry Interfaces, University of Notre Dame, Notre Dame, Indiana, United States of America
- Eck Institute for Global Health, University of Notre Dame, Notre Dame, Indiana, United States of America
11
Dewi KS, Fuad AM. Improving the Expression of Human Granulocyte Colony Stimulating Factor in Escherichia coli by Reducing the GC-content and Increasing mRNA Folding Free Energy at 5'-Terminal End. Adv Pharm Bull 2020; 10:610-616. [PMID: 33072539 PMCID: PMC7539321 DOI: 10.34172/apb.2020.073] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Received: 12/06/2019] [Revised: 12/24/2019] [Accepted: 01/27/2020] [Indexed: 01/16/2023] Open
Abstract
Purpose: Strategies for improving the production of biopharmaceutical proteins continue to develop due to increasing market demand. Human granulocyte colony stimulating factor (hG-CSF) is a biopharmaceutical protein that has many applications and is easily produced in the Escherichia coli expression system. Previous studies reported that codon usage, rare codons, mRNA folding and GC-content at the 5’-terminal end were crucial for protein production in E. coli. In the present study, the effect of reducing the GC-content and increasing the mRNA folding free energy at the 5’-terminal end on the expression level of hG-CSF proteins was investigated. Methods: Synonymous codon substitutions were performed to generate mutant variants of the open reading frame (ORF) with lower GC-content at the 5’-terminal end. The Oligoanalyzer tool was used to calculate the GC content of the eight-codon sequence after ATG, whereas mRNA folding free energy was predicted using the KineFold and RNAfold tools. The template DNA was amplified using three variant forward primers and one common reverse primer. The DNA fragments were individually cloned into the pJexpress414 expression vector and confirmed by restriction and DNA sequencing analyses. The confirmed constructs were transformed into E. coli NiCo21(DE3) host cells and the recombinant protein was expressed using IPTG induction. The total protein obtained was characterized using SDS-PAGE, Western blot and ImageJ software analyses. Results: The mutant variant with lower GC-content and higher mRNA folding free energy near the translation initiation region (TIR) produced a higher amount of hG-CSF protein than the original gene sequence. Conclusion: This study emphasizes the important role of the nucleotide composition immediately downstream of the start codon in achieving high-yield protein production in heterologous expression in E. coli.
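The GC-content measurement described in the Methods (the eight codons following ATG) can be sketched as a simple count. This is a stand-in for illustration only: it does not reproduce the Oligoanalyzer tool the authors used, and the function name `gc_content_after_start` and its `n_codons` parameter are hypothetical.

```python
def gc_content_after_start(orf: str, n_codons: int = 8) -> float:
    """Return the GC percentage of the n_codons codons that follow the ATG start.

    Assumption: the ORF is given 5'->3' on the coding strand and begins with ATG.
    """
    orf = orf.upper().replace("U", "T")
    if not orf.startswith("ATG"):
        raise ValueError("ORF must start with ATG")
    window = orf[3:3 + 3 * n_codons]  # skip the start codon itself
    if len(window) < 3 * n_codons:
        raise ValueError("ORF is shorter than the requested window")
    gc = sum(base in "GC" for base in window)
    return 100.0 * gc / len(window)
```

Comparing this value between the original sequence and a synonymous variant shows how much a set of codon substitutions lowers GC content in the region the abstract identifies as important; the free-energy prediction step would still require a folding tool such as those named in the Methods.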
Affiliation(s)
- Kartika Sari Dewi
- Research Center for Biotechnology, Indonesian Institute of Sciences, Cibinong, Bogor, Indonesia, 16911
- Asrul Muhamad Fuad
- Research Center for Biotechnology, Indonesian Institute of Sciences, Cibinong, Bogor, Indonesia, 16911
12
Nonoptimal Codon Usage Is Critical for Protein Structure and Function of the Master General Amino Acid Control Regulator CPC-1. mBio 2020; 11:mBio.02605-20. [PMID: 33051373 PMCID: PMC7554675 DOI: 10.1128/mbio.02605-20] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Indexed: 02/06/2023] Open
Abstract
Under amino acid starvation conditions, eukaryotic organisms activate a general amino acid control response. In Neurospora crassa, Cross Pathway Control Protein 1 (CPC-1), the ortholog of the Saccharomyces cerevisiae bZIP transcription factor GCN4, functions as the master regulator of the general amino acid control response. Codon usage biases are a universal feature of eukaryotic genomes and are critical for regulation of gene expression. Although codon usage has also been implicated in the regulation of protein structure and function, genetic evidence supporting this conclusion is very limited. Here, we show that Neurospora cpc-1 has a nonoptimal NNU-rich codon usage profile that contrasts with the strong NNC codon preference in the genome. Although substitution of the cpc-1 NNU codons with synonymous NNC codons elevated CPC-1 expression in Neurospora, it altered the CPC-1 degradation rate and abolished its amino acid starvation-induced protein stabilization. The codon-manipulated CPC-1 protein also exhibited different sensitivity to limited protease digestion. Furthermore, CPC-1 functions in rescuing the cell growth of the cpc-1 deletion mutant and activation of the expression of its target genes were impaired by the synonymous codon changes. Together, these results reveal the critical role of codon usage in regulation of CPC-1 expression and function and establish a genetic example of the importance of codon usage in protein folding.
IMPORTANCE: The general amino acid control response is critical for adaptation of organisms to amino acid starvation conditions. The preference to use certain synonymous codons is a universal feature of all genomes. Synonymous codon changes were previously thought to be silent mutations. In this study, we showed that the Neurospora cpc-1 gene has an unusual codon usage profile compared to other genes in the genome. We found that codon optimization of the cpc-1 gene without changing its amino acid sequence resulted in elevated CPC-1 expression, an altered protein degradation rate, and impaired protein functions due to changes in protein structure. Together, these results reveal the critical role of synonymous codon usage in regulation of CPC-1 expression and function and establish a genetic example of the importance of codon usage in protein structure.
13
Liu Y. A code within the genetic code: codon usage regulates co-translational protein folding. Cell Commun Signal 2020; 18:145. [PMID: 32907610 PMCID: PMC7488015 DOI: 10.1186/s12964-020-00642-6] [Citation(s) in RCA: 131] [Impact Index Per Article: 26.2] [Received: 03/13/2020] [Accepted: 08/10/2020] [Indexed: 01/05/2023] Open
Abstract
The genetic code is degenerate, and most amino acids are encoded by two to six synonymous codons. Codon usage bias, the preference for certain synonymous codons, is a universal feature of all genomes examined. Synonymous codon mutations were previously thought to be silent; however, a growing body of evidence now shows that codon usage regulates protein structure and gene expression through effects on co-translational protein folding, translation efficiency and accuracy, mRNA stability, and transcription. Codon usage regulates the speed of translation elongation, resulting in non-uniform ribosome decoding rates on mRNAs during translation that are adapted to the co-translational protein folding process. Biochemical and genetic evidence demonstrates that codon usage plays an important role in regulating protein folding and function in both prokaryotic and eukaryotic organisms. Certain protein structural types are more sensitive than others to the effects of codon usage on protein folding, and predicted intrinsically disordered domains are more prone to misfolding caused by codon usage changes than other domain types. Bioinformatic analyses revealed that gene codon usage correlates with different protein structures in diverse organisms, indicating the existence of a codon usage code for co-translational protein folding. This review focuses on recent literature on the role and mechanism of codon usage in regulating translation kinetics and co-translational protein folding.
Affiliation(s)
- Yi Liu
- Department of Physiology, ND13.214A, UT Southwestern Medical Center, 5323 Harry Hines Blvd., Dallas, TX, 75390-9040, USA.
14
15
Komar AA. [Synonymous Codon Usage-a Guide for Co-Translational Protein Folding in the Cell]. Mol Biol (Mosk) 2019; 53:883-898. [PMID: 31876270 PMCID: PMC8462064 DOI: 10.1134/s0026898419060090] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Received: 04/21/2019] [Accepted: 05/14/2019] [Indexed: 06/10/2023]
Abstract
In the cell, protein folding begins during protein synthesis/translation and thus is a co-translational process. Co-translational protein folding is tightly linked to translation elongation, which is not a uniform process. While there are many reasons for translation non-uniformity, it is generally believed that non-uniform synonymous codon usage is one of the key factors modulating translation elongation rates. As a rule, frequent/optimal codons are translated more rapidly than infrequently used ones. Over 30 years ago, it was hypothesized that changes in synonymous codon usage affecting translation elongation rates could impinge on co-translational protein folding and that many synonymous codons are strategically placed within mRNA to ensure particular translation kinetics that facilitate productive step-by-step co-translational folding of proteins. It was suggested that these particular translation kinetics (and, specifically, translation pause sites) may define the window of opportunity for the protein parts to fold locally, particularly at the critical points where folding is far from equilibrium. It was thus hypothesized that synonymous codons may provide a secondary code for protein folding in the cell. Although mostly accepted now, this hypothesis proved difficult to test, and many convincing results were obtained only relatively recently. Here, I review the progress in the field and explain why this simple idea has been so challenging to prove.
Affiliation(s)
- A A Komar
- Center for Gene Regulation in Health and Disease and Department of Biological, Geological and Environmental Sciences, Cleveland State University, Cleveland, Ohio, 44115 USA
- Department of Biochemistry and Center for RNA Science and Therapeutics, Case Western Reserve University, Cleveland, Ohio, 44106 USA
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, Ohio, 44195 USA
- DAPCEL, Inc., Cleveland, Ohio, 44106 USA
|
16
|
Konczal J, Bower J, Gray CH. Re-introducing non-optimal synonymous codons into codon-optimized constructs enhances soluble recovery of recombinant proteins from Escherichia coli. PLoS One 2019; 14:e0215892. [PMID: 31013332 PMCID: PMC6478350 DOI: 10.1371/journal.pone.0215892] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Received: 03/05/2019] [Accepted: 04/10/2019] [Indexed: 12/14/2022] Open
Abstract
Gene synthesis services have largely superseded traditional PCR methods for the generation of cDNAs destined for bacterial expression vectors. This, in turn, has increased the application of codon-optimized cDNAs where codons rarely used by Escherichia coli are replaced with common synonymous codons to accelerate translation of the target. A markedly accelerated rate of expression often results in a significant uplift in the levels of target protein but a substantial proportion of the enhanced yield can partition to the insoluble fraction rendering a significant portion of the gains unavailable for native purification. We have assessed several expression attenuation strategies for their utility in the manipulation of the soluble fraction towards higher levels of soluble target recovery from codon optimized systems. Using a set of human small GTPases as a case study, we compare the degeneration of the T7 promoter sequence, the use of alternative translational start codons and the manipulation of synonymous codon usage. Degeneration of both the T7 promoter and the translational start codon merely depressed overall expression and did not increase the percentage of product recovered in native purification of the soluble fraction. However, the selective introduction of rare non-optimal codons back into the codon-optimized sequence resulted in significantly elevated recovery of soluble targets. We propose that slowing the rate of extension during translation using a small number of rare codons allows more time for the co-translational folding of the nascent polypeptide. This increases the proportion of the target recovered in the soluble fraction by immobilized metal affinity chromatography (IMAC). Thus, a "de-optimization" of codon-optimized cDNAs, to attenuate or pause the translation process, may prove a useful strategy for improved recombinant protein production.
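The "de-optimization" idea in the abstract above can be illustrated with a minimal Python sketch. It assumes, purely for illustration, that the non-optimal codons are copied from the native cDNA at user-chosen codon positions. The function name `reintroduce_native_codons`, its parameters, and the toy sequences are hypothetical and are not taken from the study.

```python
def reintroduce_native_codons(optimized, native, positions):
    """Copy native codons back into a codon-optimized sequence.

    optimized, native: in-frame coding sequences of equal length that are
    assumed to encode the same protein (this is not verified here).
    positions: 0-based codon indices at which the native codon is restored.
    """
    if len(optimized) != len(native) or len(optimized) % 3 != 0:
        raise ValueError("sequences must be in frame and of equal length")
    # Split the optimized sequence into codons so single codons can be swapped.
    codons = [optimized[i:i + 3] for i in range(0, len(optimized), 3)]
    for pos in positions:
        codons[pos] = native[3 * pos:3 * pos + 3]
    return "".join(codons)


# Toy example (hypothetical sequences): restore the native codon at index 1.
hybrid = reintroduce_native_codons("ATGCGCCTG", "ATGAGACTA", [1])
print(hybrid)
```

Which positions to de-optimize, and how many, is a design decision that the sketch leaves entirely to the user.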
Affiliation(s)
- Jennifer Konczal
- Drug Discovery Program, CRUK Beatson Institute, Glasgow, United Kingdom
- Justin Bower
- Drug Discovery Program, CRUK Beatson Institute, Glasgow, United Kingdom
|
17
|
Natural tuning of restriction endonuclease synthesis by cluster of rare arginine codons. Sci Rep 2019; 9:5808. [PMID: 30967604 PMCID: PMC6456624 DOI: 10.1038/s41598-019-42311-w] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Received: 11/09/2018] [Accepted: 03/28/2019] [Indexed: 01/21/2023] Open
Abstract
Restriction–modification (R-M) systems are highly widespread among bacteria and archaea, and they appear to play a pivotal role in modulating horizontal gene transfer, as well as in protecting the host organism against viruses and other invasive DNA particles. Type II R-M systems specify two independent enzymes: a restriction endonuclease (REase) and a protective DNA methyltransferase (MTase). If the cell is to survive, their counteracting activities, as toxin and antitoxin, must be finely balanced in vivo. The molecular basis of this regulatory process remains unclear, and current searches for regulatory elements in R-M modules are focused mainly on the transcription step. In this report, we show new aspects of REase control that are linked to translation. We used the EcoVIII R-M system as a model. Both the REase and MTase genes for this R-M system contain an unusually high number of rare arginine codons (AGA and AGG) when compared to the rest of the E. coli K-12 genome. Clusters of these codons near the N-terminus of the REase greatly affect the translational efficiency. Changing these to higher frequency codons for E. coli (CGC) improves the REase synthesis, making the R-M system more potent to defend its host against bacteriophages. However, this improved efficiency in synthesis reduces host fitness due to increased autorestriction. We hypothesize that expression of the endonuclease gene can be modulated depending on the host genetic context and we propose a novel post-transcriptional mode of R–M system regulation that alleviates the potential lethal action of the restriction enzyme.
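As a rough illustration of the codon changes described in this abstract, the following Python sketch locates windows that contain several rare arginine codons (AGA, AGG) and recodes them to CGC. Only the three codons come from the abstract; the window size, the `min_count` threshold, the function names, and the toy sequence are assumptions made for this example.

```python
RARE_ARG = {"AGA", "AGG"}  # rare arginine codons named in the abstract


def split_codons(cds):
    """Split an in-frame coding sequence (DNA or RNA) into DNA codons."""
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return [cds[i:i + 3] for i in range(0, len(cds), 3)]


def rare_arg_windows(cds, window=10, min_count=2):
    """Return 0-based codon indices of windows with >= min_count rare Arg codons.

    window and min_count are hypothetical thresholds chosen for illustration.
    """
    flags = [codon in RARE_ARG for codon in split_codons(cds)]
    last_start = max(len(flags) - window, 0)
    return [i for i in range(last_start + 1)
            if sum(flags[i:i + window]) >= min_count]


def recode_rare_arg(cds, replacement="CGC"):
    """Replace every AGA/AGG codon with a more frequent synonymous codon."""
    return "".join(replacement if codon in RARE_ARG else codon
                   for codon in split_codons(cds))


toy = "ATGAGAAGGAAACGC"  # hypothetical sequence: ATG AGA AGG AAA CGC
print(rare_arg_windows(toy, window=3))
print(recode_rare_arg(toy))
```

The recoding step changes only synonymous arginine codons, so the encoded protein sequence is unchanged.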
|
18
|
Bhattacharyya S, Jacobs WM, Adkar BV, Yan J, Zhang W, Shakhnovich EI. Accessibility of the Shine-Dalgarno Sequence Dictates N-Terminal Codon Bias in E. coli. Mol Cell 2018; 70:894-905.e5. [PMID: 29883608 PMCID: PMC6311106 DOI: 10.1016/j.molcel.2018.05.008] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.9] [Received: 10/09/2017] [Revised: 02/14/2018] [Accepted: 05/03/2018] [Indexed: 10/14/2022]
Abstract
Despite considerable efforts, no physical mechanism has been shown to explain N-terminal codon bias in prokaryotic genomes. Using a systematic study of synonymous substitutions in two endogenous E. coli genes, we show that interactions between the coding region and the upstream Shine-Dalgarno (SD) sequence modulate the efficiency of translation initiation, affecting both intracellular mRNA and protein levels due to the inherent coupling of transcription and translation in E. coli. We further demonstrate that far-downstream mutations can also modulate mRNA levels by occluding the SD sequence through the formation of non-equilibrium secondary structures. By contrast, a non-endogenous RNA polymerase that decouples transcription and translation largely alleviates the effects of synonymous substitutions on mRNA levels. Finally, a complementary statistical analysis of the E. coli genome specifically implicates avoidance of intra-molecular base pairing with the SD sequence. Our results provide general physical insights into the coding-level features that optimize protein expression in prokaryotes.
Affiliation(s)
- Sanchari Bhattacharyya
- Department of Chemistry and Chemical Biology, Harvard University, 12 Oxford Street, Cambridge, MA, USA
- William M Jacobs
- Department of Chemistry and Chemical Biology, Harvard University, 12 Oxford Street, Cambridge, MA, USA
- Bharat V Adkar
- Department of Chemistry and Chemical Biology, Harvard University, 12 Oxford Street, Cambridge, MA, USA
- Jin Yan
- Department of Chemistry and Chemical Biology, Harvard University, 12 Oxford Street, Cambridge, MA, USA; College of Chemical Engineering, Sichuan University, Chengdu 610065, Sichuan, China
- Wenli Zhang
- Department of Chemistry and Chemical Biology, Harvard University, 12 Oxford Street, Cambridge, MA, USA; State Key Laboratory of Food Science and Technology, Jiangnan University, Wuxi 214122, China
- Eugene I Shakhnovich
- Department of Chemistry and Chemical Biology, Harvard University, 12 Oxford Street, Cambridge, MA, USA.
|
19
|
Im EH, Choi SS. Synonymous Codon Usage Controls Various Molecular Aspects. Genomics Inform 2017; 15:123-127. [PMID: 29307137 PMCID: PMC5769864 DOI: 10.5808/gi.2017.15.4.123] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Received: 08/18/2017] [Accepted: 09/25/2017] [Indexed: 12/19/2022] Open
Abstract
Synonymous sites are generally considered to be functionally neutral. However, there are recent contradictory findings suggesting that synonymous alleles might have functional roles in various molecular aspects. For instance, a recent study demonstrated that synonymous single nucleotide polymorphisms have a similar effect size as nonsynonymous single nucleotide polymorphisms in human disease association studies. Researchers have recognized synonymous codon usage bias (SCUB) in the genomes of almost all species and have investigated whether SCUB is due to random nucleotide compositional bias or to natural selection of any functional exposure generated by synonymous mutations. One of the most prominent observations on the non-neutrality of synonymous codons is the correlation between SCUB and levels of gene expression, such that highly expressed genes tend to have a higher preference toward so-called optimal codons than lowly expressed genes. In relation, it is known that amounts of cognate tRNAs that bind to optimal codons are significantly higher than the amounts of cognate tRNAs that bind to non-optimal codons in genomes. In the present paper, we review various functions that synonymous codons might have other than regulating expression levels.
Affiliation(s)
- Eu-Hyun Im
- Division of Biomedical Convergence, College of Biomedical Science, and Institute of Bioscience & Biotechnology, Kangwon National University, Chuncheon 24341, Korea
- Sun Shim Choi
- Division of Biomedical Convergence, College of Biomedical Science, and Institute of Bioscience & Biotechnology, Kangwon National University, Chuncheon 24341, Korea
|
20
|
Rodriguez A, Wright G, Emrich S, Clark PL. %MinMax: A versatile tool for calculating and comparing synonymous codon usage and its impact on protein folding. Protein Sci 2017; 27:356-362. [PMID: 29090506 DOI: 10.1002/pro.3336] [Citation(s) in RCA: 44] [Impact Index Per Article: 5.5] [Received: 10/06/2017] [Revised: 10/27/2017] [Accepted: 10/30/2017] [Indexed: 11/09/2022]
Abstract
Most amino acids can be encoded by more than one synonymous codon, but these are rarely used with equal frequency. In many coding sequences the usage patterns of rare versus common synonymous codons are nonrandom and under selection. Moreover, synonymous substitutions that alter these patterns can have a substantial impact on the folding efficiency of the encoded protein. This has ignited broad interest in exploring synonymous codon usage patterns. For many protein chemists, biophysicists and structural biologists, the primary motivation for codon analysis is identifying and preserving usage patterns most likely to impact high-yield production of functional proteins. Here we describe the core functions and new features of %MinMax, a codon usage calculator freely available as a web-based portal and downloadable script (http://www.codons.org). %MinMax evaluates the relative usage frequencies of the synonymous codons used to encode a protein sequence of interest and compares these results to a rigorous null model. Crucially, for analyzing codon usage in common host organisms %MinMax requires only the coding sequence as input; with a user-input codon frequency table, %MinMax can be used to evaluate synonymous codon usage patterns for any coding sequence from any fully sequenced genome. %MinMax makes no assumptions regarding the impact of transfer ribonucleic acid concentrations or other molecular-level interactions on translation rates, yet its output is sufficient to predict the effects of synonymous codon substitutions on cotranslational folding mechanisms. A simple calculation included within %MinMax can be used to harmonize codon usage frequencies for heterologous gene expression.
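A minimal Python sketch of a sliding-window, %MinMax-style calculation follows. It assumes the commonly described definition, in which the mean usage frequency of the codons in a window is compared with the maximum, minimum, and average frequencies attainable with synonymous codons for the same amino acids. The toy frequency table, the window size, and the function name `percent_minmax` are hypothetical, and this is not the script distributed at codons.org.

```python
from statistics import mean

# Hypothetical toy table: codon -> (amino acid, usage frequency).
# The numbers are illustrative only and do not describe a real organism.
TOY_USAGE = {
    "GCT": ("A", 10.0), "GCC": ("A", 30.0),
    "AAA": ("K", 35.0), "AAG": ("K", 5.0),
    "CGC": ("R", 25.0), "AGA": ("R", 2.0), "AGG": ("R", 1.0),
}


def percent_minmax(codons, usage, window=3):
    """Return one score per full window: positive toward common codons,
    negative toward rare codons, on a scale of -100 to +100."""
    # Collect the frequencies of all synonymous codons for each amino acid.
    by_aa = {}
    for aa, freq in usage.values():
        by_aa.setdefault(aa, []).append(freq)

    scores = []
    for start in range(len(codons) - window + 1):
        win = codons[start:start + window]
        actual = mean([usage[c][1] for c in win])
        syn = [by_aa[usage[c][0]] for c in win]
        hi = mean([max(freqs) for freqs in syn])
        lo = mean([min(freqs) for freqs in syn])
        avg = mean([mean(freqs) for freqs in syn])
        if actual >= avg:
            # Guard against windows where no synonymous choice exists.
            scores.append(100.0 * (actual - avg) / (hi - avg) if hi > avg else 0.0)
        else:
            scores.append(-100.0 * (avg - actual) / (avg - lo))
    return scores


print(percent_minmax(["GCC", "AAA", "CGC", "AGG"], TOY_USAGE))
```

Under these assumptions, a window built only from the most common codons scores +100 and a window built only from the rarest codons scores -100.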
Affiliation(s)
- Anabel Rodriguez
- Department of Chemistry & Biochemistry, University of Notre Dame, Notre Dame, Indiana, 46556
- Gabriel Wright
- Department of Computer Science & Engineering, University of Notre Dame, Notre Dame, Indiana, 46556
- Scott Emrich
- Department of Computer Science & Engineering, University of Notre Dame, Notre Dame, Indiana, 46556
- Patricia L Clark
- Department of Chemistry & Biochemistry, University of Notre Dame, Notre Dame, Indiana, 46556; Department of Chemical & Biomolecular Engineering, University of Notre Dame, Notre Dame, Indiana, 46556
|
21
|
Abstract
Recent experiments and simulations have demonstrated that proteins can fold on the ribosome. However, the extent and generality of fitness effects resulting from cotranslational folding remain open questions. Here we report a genome-wide analysis that uncovers evidence of evolutionary selection for cotranslational folding. We describe a robust statistical approach to identify loci within genes that are both significantly enriched in slowly translated codons and evolutionarily conserved. Surprisingly, we find that domain boundaries can explain only a small fraction of these conserved loci. Instead, we propose that regions enriched in slowly translated codons are associated with cotranslational folding intermediates, which may be smaller than a single domain. We show that the intermediates predicted by a native-centric model of cotranslational folding account for the majority of these loci across more than 500 Escherichia coli proteins. By making a direct connection to protein folding, this analysis provides strong evidence that many synonymous substitutions have been selected to optimize translation rates at specific locations within genes. More generally, our results indicate that kinetics, and not just thermodynamics, can significantly alter the efficiency of self-assembly in a biological context.
|
22
|
Villada JC, Brustolini OJB, Batista da Silveira W. Integrated analysis of individual codon contribution to protein biosynthesis reveals a new approach to improving the basis of rational gene design. DNA Res 2017; 24:419-434. [PMID: 28449100 PMCID: PMC5737324 DOI: 10.1093/dnares/dsx014] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Received: 07/24/2016] [Revised: 03/22/2017] [Accepted: 03/23/2017] [Indexed: 01/21/2023] Open
Abstract
Gene codon optimization may be impaired by the misinterpretation of frequency and optimality of codons. Although recent studies have revealed the effects of codon usage bias (CUB) on protein biosynthesis, an integrated perspective of the biological role of individual codons remains unknown. Unlike previous studies, we show, through an integrated framework, that attributes of codons such as frequency, optimality and positional dependency should be combined to unveil individual codon contribution for protein biosynthesis. We designed a codon quantification method for assessing CUB as a function of position within genes with a novel constraint: the relativity of position-dependent codon usage shaped by coding sequence length. Thus, we propose a new way of identifying the enrichment, depletion and non-uniform positional distribution of codons in different regions of yeast genes. We clustered codons that shared attributes of frequency and optimality. The cluster of non-optimal codons with rare occurrence displayed two remarkable characteristics: higher codon decoding time than the frequent-non-optimal cluster and enrichment at the 5'-end region, where optimal codons with the highest frequency are depleted. Interestingly, frequent codons with non-optimal adaptation to tRNAs are uniformly distributed in the Saccharomyces cerevisiae genes, suggesting their determinant role as a speed regulator in protein elongation.
Affiliation(s)
- Juan C. Villada
- Department of Microbiology, Universidade Federal de Viçosa, Viçosa 36570-900, Brazil
|
23
|
Liu H, Rahman SU, Mao Y, Xu X, Tao S. Codon usage bias in 5' terminal coding sequences reveals distinct enrichment of gene functions. Genomics 2017; 109:506-513. [PMID: 28778539 DOI: 10.1016/j.ygeno.2017.07.008] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Received: 03/17/2017] [Revised: 06/24/2017] [Accepted: 07/30/2017] [Indexed: 11/16/2022]
Abstract
Codon bias at the 5' terminal of coding sequence (CDS) is known to be distinct from the rest of the CDS. A number of events occur in this short region to regulate early translation elongation and co-translational translocation. In the genes encoding secretory proteins, there is a special signal sequence which has a higher occurrence of rare codons. In this study, we analyzed codon bias of secretory genes in several eukaryotes. The results showed that secretory genes in the species except mammals had a higher occurrence of rare codons in the 5' terminal of CDS, and the bias was greater than the same region of non-secretory genes. GO analysis revealed that secretory genes containing rare codon clusters in different regions were responsible for various roles in gene functions. Moreover, codon bias in the region encoding the hydrophobic region of protein is similar in secretory and non-secretory genes, indicating that codon bias in secretory genes was partly influenced by amino acid bias. Rare codon clusters are found more frequently in specific regions, and continuous rare codons are not favoured probably because they will increase the probability of ribosome collision and drop-off. Based on ribosome profiling data, there is no significant difference in the average translation efficiencies between rare and optimal codons. Higher ribosomal density in the 5' terminal may result from ribosome pausing which could be involved in different translation events. These findings collectively provided rich information on codon bias in secretory genes, which may shed light on the co-effect of codon bias, mRNA structure and tRNA abundance in translational regulations.
Affiliation(s)
- Huiling Liu
- College of Life Sciences and State Key Laboratory of Crop Stress Biology in Arid Areas, Northwest A&F University, Yangling, Shaanxi, China
- Siddiq Ur Rahman
- College of Life Sciences and State Key Laboratory of Crop Stress Biology in Arid Areas, Northwest A&F University, Yangling, Shaanxi, China
- Yuanhui Mao
- College of Life Sciences and State Key Laboratory of Crop Stress Biology in Arid Areas, Northwest A&F University, Yangling, Shaanxi, China
- Xiaodong Xu
- College of Life Sciences and State Key Laboratory of Crop Stress Biology in Arid Areas, Northwest A&F University, Yangling, Shaanxi, China.
- Shiheng Tao
- College of Life Sciences and State Key Laboratory of Crop Stress Biology in Arid Areas, Northwest A&F University, Yangling, Shaanxi, China.
|
24
|
Chaney JL, Steele A, Carmichael R, Rodriguez A, Specht AT, Ngo K, Li J, Emrich S, Clark PL. Widespread position-specific conservation of synonymous rare codons within coding sequences. PLoS Comput Biol 2017; 13:e1005531. [PMID: 28475588 PMCID: PMC5438181 DOI: 10.1371/journal.pcbi.1005531] [Citation(s) in RCA: 67] [Impact Index Per Article: 8.4] [Received: 08/03/2016] [Revised: 05/19/2017] [Accepted: 04/21/2017] [Indexed: 02/01/2023] Open
Abstract
Synonymous rare codons are considered to be sub-optimal for gene expression because they are translated more slowly than common codons. Yet surprisingly, many protein coding sequences include large clusters of synonymous rare codons. Rare codons at the 5’ terminus of coding sequences have been shown to increase translational efficiency. Although a general functional role for synonymous rare codons farther within coding sequences has not yet been established, several recent reports have identified rare-to-common synonymous codon substitutions that impair folding of the encoded protein. Here we test the hypothesis that although the usage frequencies of synonymous codons change from organism to organism, codon rarity will be conserved at specific positions in a set of homologous coding sequences, for example to tune translation rate without altering a protein sequence. Such conservation of rarity–rather than specific codon identity–could coordinate co-translational folding of the encoded protein. We demonstrate that many rare codon cluster positions are indeed conserved within homologous coding sequences across diverse eukaryotic, bacterial, and archaeal species, suggesting they result from positive selection and have a functional role. Most conserved rare codon clusters occur within rather than between conserved protein domains, challenging the view that their primary function is to facilitate co-translational folding after synthesis of an autonomous structural unit. Instead, many conserved rare codon clusters separate smaller protein structural motifs within structural domains. These smaller motifs typically fold faster than an entire domain, on a time scale more consistent with translation rate modulation by synonymous codon usage. While proteins with conserved rare codon clusters are structurally and functionally diverse, they are enriched in functions associated with organism growth and development, suggesting an important role for synonymous codon usage in organism physiology. The identification of conserved rare codon clusters advances our understanding of distinct, functional roles for otherwise synonymous codons and enables experimental testing of the impact of synonymous codon usage on the production of functional proteins.

Proteins are long linear polymers that must fold into complex three-dimensional shapes in order to carry out their cellular functions. Every protein is synthesized by the ribosome, which decodes each trinucleotide codon in an mRNA coding sequence in order to select the amino acid residue that will occupy each position in the protein sequence. Most amino acids can be encoded by more than one codon, but these synonymous codons are not used with equal frequency. Rare codons are associated with generally slower rates for protein synthesis, and for this reason have traditionally been considered mildly deleterious for efficient protein production. However, because synonymous codon substitutions do not change the sequence of the encoded protein, the majority view is that they merely reflect genomic ‘background noise’. To the contrary, here we show that the positions of many synonymous rare codons are conserved in mRNA sequences that encode structurally similar proteins from a diverse range of organisms. These results suggest that rare codons have a functional role related to the production of functional proteins, potentially to regulate the rate of protein synthesis and the earliest steps of protein folding, while synthesis is still underway.
Affiliation(s)
- Julie L. Chaney
- Department of Chemistry & Biochemistry, University of Notre Dame, Notre Dame, Indiana, United States of America
- Aaron Steele
- Department of Computer Science & Engineering, University of Notre Dame, Notre Dame, Indiana, United States of America
- Rory Carmichael
- Department of Computer Science & Engineering, University of Notre Dame, Notre Dame, Indiana, United States of America
- Anabel Rodriguez
- Department of Chemistry & Biochemistry, University of Notre Dame, Notre Dame, Indiana, United States of America
- Alicia T. Specht
- Department of Applied and Computational Mathematics & Statistics, University of Notre Dame, Notre Dame, Indiana, United States of America
- Kim Ngo
- Department of Chemistry & Biochemistry, University of Notre Dame, Notre Dame, Indiana, United States of America
- Department of Computer Science & Engineering, University of Notre Dame, Notre Dame, Indiana, United States of America
- Jun Li
- Department of Applied and Computational Mathematics & Statistics, University of Notre Dame, Notre Dame, Indiana, United States of America
- Scott Emrich
- Department of Computer Science & Engineering, University of Notre Dame, Notre Dame, Indiana, United States of America
- * E-mail: (PLC); (SE)
- Patricia L. Clark
- Department of Chemistry & Biochemistry, University of Notre Dame, Notre Dame, Indiana, United States of America
- Department of Chemical & Biomolecular Engineering, University of Notre Dame, Notre Dame, Indiana, United States of America
- * E-mail: (PLC); (SE)
|
25
|
The Impact of Selection at the Amino Acid Level on the Usage of Synonymous Codons. G3-GENES GENOMES GENETICS 2017; 7:967-981. [PMID: 28122952 PMCID: PMC5345726 DOI: 10.1534/g3.116.038125] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Indexed: 02/06/2023]
Abstract
There are two main forces that affect usage of synonymous codons: directional mutational pressure and selection. The effectiveness of protein translation is usually considered as the main selectional factor. However, biased codon usage can also be a byproduct of a general selection at the amino acid level interacting with nucleotide replacements. To evaluate the validity and strength of such an effect, we superimposed >3.5 billion unrestricted mutational processes on the selection of nonsynonymous substitutions based on the differences in physicochemical properties of the coded amino acids. Using a modified evolutionary optimization algorithm, we determined the conditions in which the effect on the relative codon usage is maximized. We found that the effect is enhanced by mutational processes generating more adenine and thymine than guanine and cytosine, as well as more purines than pyrimidines. Interestingly, this effect is observed only under an unrestricted model of nucleotide substitution, and disappears when the mutational process is time-reversible. Comparison of the simulation results with data for real protein coding sequences indicates that the impact of selection at the amino acid level on synonymous codon usage cannot be neglected. Furthermore, it can considerably interfere, especially in AT-rich genomes, with other selections on codon usage, e.g., translational efficiency. It may also lead to difficulties in the recognition of other effects influencing codon bias, and an overestimation of protein coding sequences whose codon usage is subjected to adaptational selection.
|
26
|
Mortazavi M, Zarenezhad M, Gholamzadeh S, Alavian SM, Ghorbani M, Dehghani R, Malekpour A, Meshkibaf M, Fakhrzad A. Bioinformatic Identification of Rare Codon Clusters (RCCs) in HBV Genome and Evaluation of RCCs in Proteins Structure of Hepatitis B Virus. HEPATITIS MONTHLY 2016; 16:e39909. [PMID: 27882067 PMCID: PMC5116127 DOI: 10.5812/hepatmon.39909] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Received: 06/11/2016] [Revised: 08/10/2016] [Accepted: 09/24/2016] [Indexed: 12/11/2022]
Abstract
BACKGROUND Hepatitis B virus (HBV) is an infectious agent that has nine genotypes (A - I) and a 'putative' genotype J. OBJECTIVES The aim of this study was to identify the rare codon clusters (RCC) in the HBV genome and to evaluate these RCCs in the HBV proteins structure. METHODS For detection of protein family accession numbers (Pfam) in HBV proteins, the UniProt database and Pfam search tool were used. Protein family accession numbers is a comprehensive and accurate collection of protein domains and families. It contains annotation of each family in the form of textual descriptions, links to other resources and literature references. Genome projects have used Pfam extensively for large-scale functional annotation of genomic data; Pfam database is a large collection of protein families, each represented by multiple sequence alignments and hidden Markov models (HMMs). The Pfam search tools are databases that identify Pfam of proteins. These Pfam IDs were analyzed in Sherlocc program and the location of RCCs in HBV genome and proteins were detected and reported as translated EMBL nucleotide sequence data library (TrEMBL) entries. The TrEMBL is a computer-annotated supplement of SWISS-PROT that contains all the translations of European molecular biology laboratory (EMBL) nucleotide sequence entries not yet integrated in SWISS-PROT. Furthermore, the structures of TrEMBL entries proteins were studied in the PDB database and 3D structures of the HBV proteins and locations of RCCs were visualized and studied using Swiss PDB Viewer software®. RESULTS The Pfam search tool found nine protein families in three frames. Results of Pfams studies in the Sherlocc program showed that this program has not identified RCCs in the external core antigen (PF08290) and truncated HBeAg gene (PF08290) of HBV. By contrast, the RCCs were identified in gene of hepatitis core antigen (PF00906 and the residues 224 - 234 and 251 - 255), large envelope protein S (PF00695 and the residues 53 - 56 and 70 - 84), X protein (PF00739 and the residues 10 - 24, 29 - 83, 95 - 99, 122 - 129, 139 - 143), DNA polymerase (viral) N-terminal domain (PF00242 and the residues 59 - 62, 214 - 217, 407 - 413) and protein P (PF00336 and the residues 225 - 228). In HBV genome, seven RCCs were identified in the gene area of hepatitis core antigen, large envelope protein S and DNA polymerase, while protein structures of TrEMBL entries sequences found in Sherlocc program outputs were not complete. CONCLUSIONS Based on the location of detected RCCs in the structure of HBV proteins, it was found that these RCCs may have a critical role in correct folding of HBV proteins and can be considered as drug targets. The results of this study provide new and deep perspectives about structure of HBV proteins for further research and designing new drugs for treatment of HBV.
Collapse
Affiliation(s)
- Mojtaba Mortazavi
- Department of Biotechnology, Institute of Science and High Technology and Environmental Sciences, Graduate University of Advanced Technology, Kerman, IR Iran
| | - Mohammad Zarenezhad
- Gastroenterohepatology Research Center, Shiraz University of Medical Sciences, Shiraz, IR Iran
- Legal Medicine Research Center, Legal Medicine Organization of Iran, Tehran, IR Iran
| | - Saeid Gholamzadeh
- Legal Medicine Research Center, Legal Medicine Organization of Iran, Tehran, IR Iran
| | - Seyed Moayed Alavian
- Baqiyatallah Research Center for Gastroenterology and Liver Diseases, Middle East Liver Disease Center, Baqiyatallah University of Medical Sciences, Tehran, IR Iran
| | - Mohammad Ghorbani
- Department of Pathology, School of Medicine, Fasa University of Medical Sciences, Fasa, IR Iran
| | - Reza Dehghani
- Pharmacology Department, School of Medicine, Shiraz University of Medical Sciences, Shiraz, IR Iran
| | - Abdorrasoul Malekpour
- Legal Medicine Research Center, Legal Medicine Organization of Iran, Tehran, IR Iran
| | - Mohammadhasan Meshkibaf
- Department of Biochemistry, School of Medicine, Fasa University of Medical Sciences, Fasa, IR Iran
| | - Ali Fakhrzad
- Gastroenterohepatology Research Center, Shiraz University of Medical Sciences, Shiraz, IR Iran
| |
|
27
|
Komar AA. The Yin and Yang of codon usage. Hum Mol Genet 2016; 25:R77-R85. [PMID: 27354349 DOI: 10.1093/hmg/ddw207] [Citation(s) in RCA: 106] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2016] [Accepted: 06/24/2016] [Indexed: 01/07/2023] Open
Abstract
The genetic code is degenerate. With the exception of two amino acids (Met and Trp), all other amino acid residues are each encoded by multiple, so-called synonymous codons. Synonymous codons were initially presumed to have entirely equivalent functions; however, the finding that synonymous codons are not present at equal frequencies in genes/genomes suggested that codon choice might have functional implications beyond amino acid coding. The pattern of non-uniform codon use (known as codon usage bias) varies between organisms and represents a unique feature of an organism. Organism-specific codon choice is related to organism-specific differences in populations of cognate tRNAs. This implies that, in a given organism, frequently used codons will be translated more rapidly than infrequently used ones and vice versa. A theory of codon-tRNA co-evolution (necessary to balance accurate and efficient protein production) was put forward to explain the existence of codon usage bias. This model suggests that selection favours preferred (frequent) over un-preferred (rare) codons in order to sustain efficient protein production in cells and that a given un-preferred codon will have the same effect on an organism's fitness regardless of its position within an mRNA's open reading frame. However, many recent studies refute this prediction. Un-preferred codons have been found to have important functional roles, and their effects appear to be position-dependent. Synonymous codon usage affects the efficiency/stringency of mRNA decoding, mRNA biogenesis/stability, and protein secretion and folding. This review summarizes recent developments in the field that have identified novel functions of synonymous codons and their usage.
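As a rough illustration of the codon usage bias described in this abstract, the sketch below computes relative synonymous codon usage (RSCU) for one coding sequence. RSCU is a commonly used summary statistic that this abstract does not name; the function name `rscu`, the toy sequence, and the deliberately partial codon table are assumptions made for illustration only.

```python
from collections import Counter, defaultdict

# Hand-written SUBSET of the standard genetic code (illustrative, not complete).
CODON_TO_AA = {
    "CTT": "Leu", "CTC": "Leu", "CTA": "Leu", "CTG": "Leu", "TTA": "Leu", "TTG": "Leu",
    "AAA": "Lys", "AAG": "Lys",
    "ATG": "Met",  # single-codon amino acid
    "TGG": "Trp",  # single-codon amino acid
}


def rscu(cds):
    """Return {codon: RSCU} for the codons of amino acids present in `cds`.

    RSCU = observed count * (number of synonymous codons) / (total count for
    that amino acid). A value of 1.0 means no bias within the codon family.
    """
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    counts = Counter(c for c in codons if c in CODON_TO_AA)

    families = defaultdict(list)
    for codon, aa in CODON_TO_AA.items():
        families[aa].append(codon)

    result = {}
    for family in families.values():
        total = sum(counts[c] for c in family)
        if total == 0:
            continue  # amino acid absent from this sequence
        for c in family:
            result[c] = counts[c] * len(family) / total
    return result


# Hypothetical toy sequence: Leu is encoded three times by CTG and once by CTA.
values = rscu("CTGCTGCTGCTAAAAAAGATG")
print(values["CTG"], values["CTA"], values["AAA"])
```

In this toy input the Leu family is strongly skewed toward CTG, while the two Lys codons are used equally. A real analysis would use the full codon table and many genes rather than a single short sequence.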
Affiliation(s)
- Anton A Komar
- Center for Gene Regulation in Health and Disease and Department of Biological, Geological and Environmental Sciences, Cleveland State University, Cleveland, OH, USA; Department of Biochemistry and Center for RNA Molecular Biology, Case Western Reserve University, Cleveland, Ohio, USA; Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH, USA
| |
|
28
|
Jacobson GN, Clark PL. Quality over quantity: optimizing co-translational protein folding with non-'optimal' synonymous codons. Curr Opin Struct Biol 2016; 38:102-10. [PMID: 27318814 PMCID: PMC5010456 DOI: 10.1016/j.sbi.2016.06.002] [Citation(s) in RCA: 52] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2016] [Revised: 05/31/2016] [Accepted: 06/01/2016] [Indexed: 12/28/2022]
Abstract
Protein folding occurs on a time scale similar to peptide bond formation by the ribosome, which has long sparked speculation that altering translation rate could alter the folding mechanism or even the final folded structure of a protein in vivo. Recent results have provided strong support for this model: synonymous substitutions to codons with different usage frequency, which are often translated at different rates, have been shown to significantly alter the co-translational folding mechanism of some proteins, leading to altered cell function. Here we review recent progress towards understanding the connections between synonymous codon usage, translation rate and co-translational protein folding mechanisms.
Affiliation(s)
- Giselle N Jacobson
- Department of Chemistry & Biochemistry, University of Notre Dame, Notre Dame, IN 46556, USA
| | - Patricia L Clark
- Department of Chemistry & Biochemistry, University of Notre Dame, Notre Dame, IN 46556, USA; Department of Chemical & Biomolecular Engineering, University of Notre Dame, Notre Dame, IN 46556, USA.
| |
|
29
|
Critical reflections on synthetic gene design for recombinant protein expression. Curr Opin Struct Biol 2016; 38:155-62. [DOI: 10.1016/j.sbi.2016.07.004] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2016] [Revised: 06/29/2016] [Accepted: 07/06/2016] [Indexed: 11/17/2022]
|
30
|
The Art of Gene Redesign and Recombinant Protein Production: Approaches and Perspectives. TOPICS IN MEDICINAL CHEMISTRY 2016. [DOI: 10.1007/7355_2016_2] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
|
31
|
Ugrinov KG, Freed SD, Thomas CL, Lee SW. A multiparametric computational algorithm for comprehensive assessment of genetic mutations in mucopolysaccharidosis type IIIA (Sanfilippo syndrome). PLoS One 2015; 10:e0121511. [PMID: 25807448 PMCID: PMC4373678 DOI: 10.1371/journal.pone.0121511] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2014] [Accepted: 02/12/2015] [Indexed: 12/22/2022] Open
Abstract
Mucopolysaccharidosis type IIIA (MPS-IIIA, Sanfilippo syndrome) is a Lysosomal Storage Disease caused by cellular deficiency of N-sulfoglucosamine sulfohydrolase (SGSH). Given the large heterogeneity of genetic mutations responsible for the disease, a comprehensive understanding of the mechanisms by which these mutations affect enzyme function is needed to guide effective therapies. We developed a multiparametric computational algorithm to assess how patient genetic mutations in SGSH affect overall enzyme biogenesis, stability, and function. 107 patient mutations for the SGSH gene were obtained from the Human Gene Mutation Database representing all of the clinical mutations documented for Sanfilippo syndrome. We assessed each mutation individually using ten distinct parameters to give a comprehensive predictive score of the stability and misfolding capacity of the SGSH enzyme resulting from each of these mutations. The predictive score generated by our multiparametric algorithm yielded a standardized quantitative assessment of the severity of a given SGSH genetic mutation toward overall enzyme activity. Application of our algorithm has identified SGSH mutations in which enzymatic malfunction of the gene product is specifically due to impairments in protein folding. These scores provide an assessment of the degree to which a particular mutation could be treated using approaches such as chaperone therapies. Our multiparametric protein biogenesis algorithm advances a key understanding in the overall biochemical mechanism underlying Sanfilippo syndrome. Importantly, the design of our multiparametric algorithm can be tailored to many other diseases of genetic heterogeneity for which protein misfolding phenotypes may constitute a major component of disease manifestation.
Affiliation(s)
- Krastyu G Ugrinov
- Department of Biological Sciences, University of Notre Dame, Notre Dame, Indiana, 46556, United States of America; Center for Rare and Neglected Diseases, University of Notre Dame, Notre Dame, Indiana, 46556, United States of America
| | - Stefan D Freed
- Department of Biological Sciences, University of Notre Dame, Notre Dame, Indiana, 46556, United States of America; Center for Rare and Neglected Diseases, University of Notre Dame, Notre Dame, Indiana, 46556, United States of America
| | - Clayton L Thomas
- Department of Biological Sciences, University of Notre Dame, Notre Dame, Indiana, 46556, United States of America; Center for Rare and Neglected Diseases, University of Notre Dame, Notre Dame, Indiana, 46556, United States of America
| | - Shaun W Lee
- Department of Biological Sciences, University of Notre Dame, Notre Dame, Indiana, 46556, United States of America; Center for Rare and Neglected Diseases, University of Notre Dame, Notre Dame, Indiana, 46556, United States of America
| |
|
32
|
Abstract
Owing to the degeneracy of the genetic code, a protein sequence can be encoded by many different synonymous mRNA coding sequences. Synonymous codon usage was once thought to be functionally neutral, but evidence now indicates it is shaped by evolutionary selection and affects other aspects of protein biogenesis beyond specifying the amino acid sequence of the protein. Synonymous rare codons, once thought to have only negative impacts on the speed and accuracy of translation, are now known to play an important role in diverse functions, including regulation of cotranslational folding, covalent modifications, secretion, and expression level. Mutations altering synonymous codon usage are linked to human diseases. However, much remains unknown about the molecular mechanisms connecting synonymous codon usage to efficient protein biogenesis and proper cell physiology. Here we review recent literature on the functional effects of codon usage, including bioinformatics approaches aimed at identifying general roles for synonymous codon usage.
|
33
|
Nissley DA, O'Brien EP. Timing is everything: unifying codon translation rates and nascent proteome behavior. J Am Chem Soc 2014; 136:17892-8. [PMID: 25486504 DOI: 10.1021/ja510082j] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Experiments have demonstrated that changing the rate at which the ribosome translates a codon position in an mRNA molecule's open reading frame can alter the behavior of the newly synthesized protein. That is, codon translation rates can govern nascent proteome behavior. We emphasize that this phenomenon is a manifestation of the nonequilibrium nature of cotranslational processes, and as such, there exist theoretical tools that offer a potential means to quantitatively predict the influence of codon translation rates on the broad spectrum of nascent protein behaviors including cotranslational folding, aggregation, and translocation. We provide a review of the experimental evidence for the impact that codon translation rates can have, followed by a discussion of theoretical methods that can describe this phenomenon. The development and application of these tools are likely to provide fundamental insights into protein maturation and homeostasis, codon usage bias in organisms, the origins of translation related diseases, and new rational design methods for biotechnology and biopharmaceutical applications.
Affiliation(s)
- Daniel A Nissley
- Department of Chemistry, Pennsylvania State University, University Park, Pennsylvania 16802, United States
| | | |
|
34
|
Tuller T, Zur H. Multiple roles of the coding sequence 5' end in gene expression regulation. Nucleic Acids Res 2014; 43:13-28. [PMID: 25505165 PMCID: PMC4288200 DOI: 10.1093/nar/gku1313] [Citation(s) in RCA: 148] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
The codon composition of the first few dozen codons at the 5′ end of the coding sequence (ORF) is known to be distinct from that of the rest of the ORF. Various explanations for the unusual codon distribution in this region have been proposed in recent years, and include, among others, novel regulatory mechanisms of translation initiation and elongation. However, because many overlapping regulatory signals are suggested to be associated with this relatively short region, studying it is challenging. Here, we review the currently known signals that appear in this region, the theories related to the way they regulate translation and affect organismal fitness, and the debates they provoke.
Affiliation(s)
- Tamir Tuller
- Department of Biomedical Engineering, the Engineering Faculty, Tel Aviv University, Tel Aviv, Israel; The Sagol School of Neuroscience, Tel Aviv University, Tel Aviv 69978, Israel
| | - Hadas Zur
- Department of Biomedical Engineering, the Engineering Faculty, Tel Aviv University, Tel Aviv, Israel
| |
|
35
|
Speed controls in translating secretory proteins in eukaryotes--an evolutionary perspective. PLoS Comput Biol 2014; 10:e1003294. [PMID: 24391480 PMCID: PMC3879104 DOI: 10.1371/journal.pcbi.1003294] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2013] [Accepted: 09/04/2013] [Indexed: 11/19/2022] Open
Abstract
Protein translation is the most expensive operation in dividing cells from bacteria to humans. Therefore, managing the speed and allocation of resources is subject to tight control. From bacteria to humans, clusters of relatively rare tRNA codons at the N'-terminal of mRNAs have been implicated in attenuating the process of ribosome allocation, and consequently the translation rate in a broad range of organisms. The current interpretation of "slow" tRNA codons does not distinguish between protein translations mediated by free- or endoplasmic reticulum (ER)-bound ribosomes. We demonstrate that proteins translated by free- or ER-bound ribosomes exhibit different overall properties in terms of their translation efficiency and speed in yeast, fly, plant, worm, bovine and human. We note that only secreted or membranous proteins with a signal peptide (SP) are specified by segments of "slow" tRNA at the N'-terminal, followed by abundant codons that are considered "fast." Such profiles apply to 3100 proteins of the human proteome that are composed of secreted and signal peptide (SP)-assisted membranous proteins. Remarkably, the bulk of the proteins (12,000), or membranous proteins lacking SP (3400), do not have such a pattern. Alternation of "fast" and "slow" codons was also found in proteins that translocate to mitochondria through transit peptides (TP). The differential clusters of tRNA-adapted codons are not restricted to the N'-terminal of transcripts. Specifically, glycosylphosphatidylinositol (GPI)-anchored proteins are unified by clusters of low-adapted tRNA codons at the C'-termini. Furthermore, selection of amino acid types and specific codons was shown to be the driving force that establishes the translation demands for the secretory proteome. We postulate that "hard-coded" signals within the secretory proteome assist the steps of protein maturation and folding. Specifically, "speed control" signals for delaying the translation of a nascent protein fulfill the co- and post-translational stages such as membrane translocation, protein processing and folding.
|
36
|
Mao Y, Li Q, Wang W, Liang P, Tao S. Number variation of high stability regions is correlated with gene functions. Genome Biol Evol 2013; 5:484-93. [PMID: 23407773 PMCID: PMC3622296 DOI: 10.1093/gbe/evt020] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
Various regulatory elements in messenger RNAs (mRNAs) carrying secondary structure play important roles in a wide range of expression processes. Numerous recent works have focused on the discovery of these functional elements that contain conserved mRNA structures. However, to date, regions with high structural stability have been largely overlooked. In this study, we defined high stability regions (HSRs) in the coding sequences (CDSs) in bacteria based on the normalized folding free energy. We found that CDSs had a high number of HSRs, and these HSRs showed high structural context robustness compared with random sequences, indicating a direct selective constraint imposed on HSRs. A reduced ribosome speed was detected near the start position of HSRs, implying a possibility that an HSR acts as an obstacle that drives translational pausing to coordinate protein synthesis. Interestingly, we found that genes with high HSR density were enriched in the processes of translation, protein folding, and cell division. In addition, essential genes exhibited higher HSR density than nonessential genes. Overall, our study presented the previously unappreciated correlation between the number variation of HSRs and cellular processes.
Affiliation(s)
- Yuanhui Mao
- College of life Sciences and State Key Laboratory of Crop Stress Biology in Arid Areas, Northwest A&F University, Yangling, Shaanxi, China
| | | | | | | | | |
|
37
|
Mao Y, Wang W, Cheng N, Li Q, Tao S. Universally increased mRNA stability downstream of the translation initiation site in eukaryotes and prokaryotes. Gene 2013; 517:230-5. [PMID: 23313297 DOI: 10.1016/j.gene.2012.12.062] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2012] [Accepted: 12/03/2012] [Indexed: 11/26/2022]
Abstract
Local secondary structures in coding sequences have important functions across various translational processes. To date, however, the local structures and their functions in the early stage of translation elongation remain poorly understood. Here, we surveyed the structural stability in the first 180 nucleotides of the coding sequence of 27 species using a computational method. We found that the structural stability in the 30-80 nucleotide interval was significantly higher than that in other regions in eukaryotes and most prokaryotes. No significant correlation between local translation efficiency and structural stability was observed, suggesting that this structural region has undergone selection pressure directly to maintain high stability. Furthermore, the ribosome was blocked by this region, providing an opportunity for co-translational regulation. Remarkably, in eukaryotes, we found that mRNAs with higher structural stability in the 30-80 nucleotide interval tended to encode secreted proteins. Overall, our results revealed a previously unappreciated correlation between structural stability and protein localization.
Affiliation(s)
- Yuanhui Mao
- State Key Laboratory of Crop Stress Biology in Arid Areas and College of Life Sciences, Northwest A&F University, Yangling, Shaanxi 712100, China
| | | | | | | | | |
|
38
|
In vivo translation rates can substantially delay the cotranslational folding of the Escherichia coli cytosolic proteome. Proc Natl Acad Sci U S A 2012; 110:E132-40. [PMID: 23256155 DOI: 10.1073/pnas.1213624110] [Citation(s) in RCA: 71] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
A question of fundamental importance concerning protein folding in vivo is whether the kinetics of translation or the thermodynamics of the ribosome nascent chain (RNC) complex is the major determinant of cotranslational folding behavior. This is because translation rates can reduce the probability of cotranslational folding below that associated with arrested ribosomes, whose behavior is determined by the equilibrium thermodynamics of the RNC complex. Here, we combine a chemical kinetic equation with genomic and proteomic data to predict domain folding probabilities as a function of nascent chain length for Escherichia coli cytosolic proteins synthesized on both arrested and continuously translating ribosomes. Our results indicate that, at in vivo translation rates, about one-third of the Escherichia coli cytosolic proteins exhibit cotranslational folding, with at least one domain in each of these proteins folding into its stable native structure before the full-length protein is released from the ribosome. The majority of these cotranslational folding domains are influenced by translation kinetics which reduces their probability of cotranslational folding and consequently increases the nascent chain length at which they fold into their native structures. For about 20% of all cytosolic proteins this delay in folding can exceed the length of the completely synthesized protein, causing one or more of their domains to switch from co- to posttranslational folding solely as a result of the in vivo translation rates. These kinetic effects arise from the difference in time scales of folding and amino-acid addition, and they represent a source of metastability in Escherichia coli's proteome.
|
39
|
Chartier M, Gaudreault F, Najmanovich R. Large-scale analysis of conserved rare codon clusters suggests an involvement in co-translational molecular recognition events. Bioinformatics 2012; 28:1438-45. [PMID: 22467916 DOI: 10.1093/bioinformatics/bts149] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
Abstract
MOTIVATION An increasing amount of evidence from experimental and computational analysis suggests that rare codon clusters are functionally important for protein activity. Most of the studies on rare codon clusters were performed on a limited number of proteins or protein families. In the present study, we present the Sherlocc program and how it can be used for large-scale protein family analysis of evolutionarily conserved rare codon clusters and their relation to protein function and structure. This large-scale analysis was performed using the whole Pfam database, covering over 70% of the known protein sequence universe. Our program, Sherlocc, detects statistically relevant conserved rare codon clusters and produces a user-friendly HTML output. RESULTS Statistically significant rare codon clusters were detected in a multitude of Pfam protein families. The most statistically significant rare codon clusters were predominantly identified in N-terminal Pfam families. Many of the longest rare codon clusters are found in membrane-related proteins which are required to interact with other proteins as part of their function, for example in targeting or insertion. We identified some cases where rare codon clusters can play a regulating role in the folding of catalytically important domains. Our results support the existence of a widespread functional role for rare codon clusters across species. Finally, we developed an online filter-based search interface that provides access to Sherlocc results for all Pfam families. AVAILABILITY The Sherlocc program and search interface are open access and are available at http://bcb.med.usherbrooke.ca
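The idea of a rare codon cluster can be sketched with a simple sliding window over per-codon usage frequencies. This is not the Sherlocc algorithm, which works on Pfam family alignments and applies statistical tests not detailed in this abstract. The function name `rare_codon_windows`, the window length, the threshold, and the input frequencies are all hypothetical choices for illustration.

```python
def rare_codon_windows(codon_freqs, window=7, threshold=0.1):
    """Return (start_index, mean_frequency) for every window of `window`
    consecutive codons whose mean usage frequency is below `threshold`.

    `codon_freqs` is a list with one usage frequency per codon position,
    e.g. looked up from a genome-wide codon usage table (assumed input).
    """
    if window <= 0 or window > len(codon_freqs):
        raise ValueError("window must be between 1 and the number of codons")
    hits = []
    for start in range(len(codon_freqs) - window + 1):
        mean = sum(codon_freqs[start:start + window]) / window
        if mean < threshold:
            hits.append((start, mean))
    return hits


# Hypothetical profile: four rare codons flanked by common ones.
profile = [0.3] * 5 + [0.05] * 4 + [0.3] * 5
for start, mean in rare_codon_windows(profile, window=4, threshold=0.1):
    print(f"candidate cluster at codons {start}-{start + 3}")
```

A single-sequence scan like this cannot tell a conserved cluster from chance. Conservation across aligned family members, which the abstract emphasizes, would require repeating the scan over an alignment and testing significance.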
Affiliation(s)
- Matthieu Chartier
- Department of Biochemistry, Faculty of Medicine and Health Sciences, Université de Sherbrooke, 12e Avenue Nord, Sherbrooke, Québec, Canada
| | | | | |
|
40
|
Angov E. Codon usage: nature's roadmap to expression and folding of proteins. Biotechnol J 2011; 6:650-9. [PMID: 21567958 PMCID: PMC3166658 DOI: 10.1002/biot.201000332] [Citation(s) in RCA: 157] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2011] [Revised: 04/11/2011] [Accepted: 04/13/2011] [Indexed: 02/06/2023]
Abstract
Biomedical and biotechnological research relies on processes leading to the successful expression and production of key biological products. High-quality proteins are required for many purposes, including protein structural and functional studies. Protein expression is the culmination of multistep processes involving regulation at the level of transcription, mRNA turnover, protein translation, and post-translational modifications leading to the formation of a stable product. Although significant strides have been achieved over the past decade, advances toward integrating genomic and proteomic information are essential, and until such time, many target genes and their products may not be fully realized. Thus, the focus of this review is to provide some experimental support and a brief overview of how codon usage bias has evolved relative to regulating gene expression levels.
Affiliation(s)
- Evelina Angov
- Division of Malaria Vaccine Development, Walter Reed Army Institute of Research, Silver Spring, MD 20910, USA.
| |
|
41
|
Gershenson A, Gierasch LM. Protein folding in the cell: challenges and progress. Curr Opin Struct Biol 2011; 21:32-41. [PMID: 21112769 PMCID: PMC3072030 DOI: 10.1016/j.sbi.2010.11.001] [Citation(s) in RCA: 133] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2010] [Revised: 11/01/2010] [Accepted: 11/02/2010] [Indexed: 01/02/2023]
Abstract
It is hard to imagine a more extreme contrast than that between the dilute solutions used for in vitro studies of protein folding and the crowded, compartmentalized, sticky, spatially inhomogeneous interior of a cell. This review highlights recent research exploring protein folding in the cell with a focus on issues that are generally not relevant to in vitro studies of protein folding, such as macromolecular crowding, hindered diffusion, cotranslational folding, molecular chaperones, and evolutionary pressures. The technical obstacles that must be overcome to characterize protein folding in the cell are driving methodological advances, and we draw attention to several examples, such as fluorescence imaging of folding in cells and genetic screens for in-cell stability.
Affiliation(s)
- Anne Gershenson
- Department of Biochemistry & Molecular Biology, University of Massachusetts, Amherst, MA 01003, USA.
| | | |
|
42
|
Navon S, Pilpel Y. The role of codon selection in regulation of translation efficiency deduced from synthetic libraries. Genome Biol 2011; 12:R12. [PMID: 21284851 PMCID: PMC3188794 DOI: 10.1186/gb-2011-12-2-r12] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2010] [Revised: 11/18/2010] [Accepted: 02/01/2011] [Indexed: 12/03/2022] Open
Abstract
Background Translation efficiency is affected by a diversity of parameters, including secondary structure of the transcript and its codon usage. Here we examine the effects of codon usage on translation efficiency by re-analysis of previously constructed synthetic expression libraries in Escherichia coli. Results We define the region in a gene that takes the longest time to translate as the bottleneck. We found that localization of the bottleneck at the beginning of a transcript promoted a high level of expression, especially if the computed dwell time of the ribosome within this region was sufficiently long. The location and translation time of the bottleneck were not correlated with the cost of expression, approximated by the fitness of the host cell, yet utilization of specific codons was. Particularly, enhanced usage of the codons UCA and CAU was correlated with increased cost of production, potentially due to sequestration of their corresponding rare tRNAs. Conclusions The distribution of codons along the genes appears to affect translation efficiency, consistent with analysis of natural genes. This study demonstrates how synthetic biology complements bioinformatics by providing a set-up for well controlled experiments in biology.
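The bottleneck defined in this abstract, the region of a gene that takes the longest to translate, can be sketched as the window with the largest summed per-codon dwell time. The abstract does not give the window size or say how dwell times are computed, so the function name `find_bottleneck`, the window length, and the dwell-time values below are assumptions rather than the authors' procedure.

```python
def find_bottleneck(dwell_times, window=3):
    """Return (start_index, total_time) of the slowest window.

    `dwell_times` holds one estimated ribosome dwell time per codon
    (arbitrary units, assumed to be supplied by the caller). Ties are
    resolved in favour of the earliest window.
    """
    if window <= 0 or window > len(dwell_times):
        raise ValueError("window must be between 1 and the number of codons")
    best_start, best_time = 0, float("-inf")
    for start in range(len(dwell_times) - window + 1):
        total = sum(dwell_times[start:start + window])
        if total > best_time:
            best_start, best_time = start, total
    return best_start, best_time


# Hypothetical dwell times for an eight-codon stretch.
start, total = find_bottleneck([1, 1, 5, 6, 4, 1, 1, 2], window=3)
print(start, total)
```

Comparing the returned start index with the gene length gives a simple measure of whether the bottleneck lies near the beginning of the transcript, which is the property the abstract relates to high expression.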
Affiliation(s)
- Sivan Navon
- Department of Molecular Genetics, Weizmann Institute of Science, PO Box 26, Rehovot, 76100, Israel
| | | |
|