1
|
Cao G, Li Z, Wang Z, Yang Y, Li J, Qi H. High-throughput optimization of peptide-linker for fusing function protein with GFP. Protein Expr Purif 2025; 231:106718. [PMID: 40239936 DOI: 10.1016/j.pep.2025.106718] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2025] [Revised: 04/06/2025] [Accepted: 04/13/2025] [Indexed: 04/18/2025]
Abstract
Fusion proteins are pivotal in bioengineering, with applications in purification, delivery, and imaging. However, the development of specialized peptide linkers tailored for target fusion proteins remains an unmet challenge. In this study, we demonstrate the optimization of fusing a functional protein with green fluorescent protein (GFP) through the screening of peptide linker sequences. Using seamless cloning methodology, a nanobody protein was fused to the N-terminus of GFP via a randomized 18-amino acid peptide linker library. Initial screening of fusion protein clones was conducted on solid plates to identify those expressing robust GFP fluorescence. A total of 153 clones with unique linker sequences were identified using Sanger sequencing. A wide range of normalized fluorescence signals was observed, revealing significant variability in linker performance. Among the screened linkers, one exhibited high fluorescence activity, outperforming commonly used flexible and rigid linkers. This finding underscores the necessity of optimize linker sequences for specific fusion proteins. Furthermore, the results demonstrated that the screened linker is compatible with diverse N-terminal proteins while maintaining GFP functionality. Additionally, to investigate the effect of linker on the function of target protein, we determined the reverse transcription efficiency of the murine leukemia virus reverse transcriptase (MLV-RT) in the fusion proteins by a two-step RT-qPCR method. In conclusion, this study presents an efficient optimization of peptide linkers, offering a novel methodology for the engineering and application of specialized linkers for fusion proteins.
Collapse
Affiliation(s)
- Gaili Cao
- School of Chemical Engineering and Technology, Tianjin University, Tianjin, China; State Key Laboratory of Synthetic Biology, Tianjin University, Tianjin, China
| | - Zhong Li
- School of Chemical Engineering and Technology, Tianjin University, Tianjin, China; State Key Laboratory of Synthetic Biology, Tianjin University, Tianjin, China
| | - Zhaoguan Wang
- School of Chemical Engineering and Technology, Tianjin University, Tianjin, China; State Key Laboratory of Synthetic Biology, Tianjin University, Tianjin, China
| | - Youhui Yang
- School of Chemical Engineering and Technology, Tianjin University, Tianjin, China; State Key Laboratory of Synthetic Biology, Tianjin University, Tianjin, China
| | - Jiawei Li
- School of Chemical Engineering and Technology, Tianjin University, Tianjin, China; State Key Laboratory of Synthetic Biology, Tianjin University, Tianjin, China
| | - Hao Qi
- School of Chemical Engineering and Technology, Tianjin University, Tianjin, China; State Key Laboratory of Synthetic Biology, Tianjin University, Tianjin, China.
| |
Collapse
|
2
|
Gan T, Liu Y, Qiao Y, Dong Y, Feng J, Chen X, Zhu L. Translation regulation in Bacillus subtilis and its applications in heterologous protein expression: A review. Int J Biol Macromol 2025; 311:143653. [PMID: 40311986 DOI: 10.1016/j.ijbiomac.2025.143653] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2025] [Revised: 04/11/2025] [Accepted: 04/28/2025] [Indexed: 05/03/2025]
Abstract
Bacillus subtilis is widely used for industrial enzyme production due to its food safety and good capability of protein synthesis and secretion. However, the production of heterologous proteins is often inefficient, partly due to poor compatibility and versatility of genetic elements in B. subtilis. Recent study shows that transcription and translation is uncoupled in B. subtilis, which is quite different from general knowledge about the transcription-translation coupling mechanism in bacteria. The uncoupling mechanism in B. subtilis shows that the transcription rate is much faster than translation rate. Therefore, the translation regulation will play an important role in highly-effective synthesis of heterologous protein. To better understanding the different regulation strategies at the translation level in B. subtilis, this review will summarize the translation process in B. subtilis cell and its regulatory mechanisms as well as the differences in comparison to other bacteria. Besides, the genetic engineering strategies for engineering the translation regulatory elements are also summarized.
Collapse
Affiliation(s)
- Tian Gan
- Institute of Fermentation Engineering, Zhejiang University of Technology, Hangzhou 310014, China; College of Biotechnology and Bioengineering, Zhejiang University of Technology, Hangzhou 310014, China
| | - Yidi Liu
- Institute of Fermentation Engineering, Zhejiang University of Technology, Hangzhou 310014, China; College of Biotechnology and Bioengineering, Zhejiang University of Technology, Hangzhou 310014, China
| | - Ying Qiao
- Institute of Fermentation Engineering, Zhejiang University of Technology, Hangzhou 310014, China; College of Biotechnology and Bioengineering, Zhejiang University of Technology, Hangzhou 310014, China
| | - Yangyang Dong
- Institute of Fermentation Engineering, Zhejiang University of Technology, Hangzhou 310014, China; College of Biotechnology and Bioengineering, Zhejiang University of Technology, Hangzhou 310014, China
| | - Jiacheng Feng
- Institute of Fermentation Engineering, Zhejiang University of Technology, Hangzhou 310014, China; College of Biotechnology and Bioengineering, Zhejiang University of Technology, Hangzhou 310014, China
| | - Xiaolong Chen
- Institute of Fermentation Engineering, Zhejiang University of Technology, Hangzhou 310014, China; College of Biotechnology and Bioengineering, Zhejiang University of Technology, Hangzhou 310014, China.
| | - Linjiang Zhu
- Institute of Fermentation Engineering, Zhejiang University of Technology, Hangzhou 310014, China; College of Biotechnology and Bioengineering, Zhejiang University of Technology, Hangzhou 310014, China.
| |
Collapse
|
3
|
Zhang J, Qian W. Functional synonymous mutations and their evolutionary consequences. Nat Rev Genet 2025:10.1038/s41576-025-00850-1. [PMID: 40394196 DOI: 10.1038/s41576-025-00850-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/23/2025] [Indexed: 05/22/2025]
Abstract
Synonymous mutations are coding mutations that do not alter protein sequences. Commonly thought to have little to no functional consequence, synonymous mutations have been widely used in evolutionary analyses that require neutral markers, including those foundational for the neutral theory. However, recent studies suggest that synonymous mutations can influence nearly every step in the expression of genetic information and may often be strongly non-neutral. We review the extent and mechanisms of these phenotypic and fitness effects and discuss the implications of the functionality and non-neutrality of synonymous mutations for various analyses and conclusions pertinent to genetics, evolution, conservation and disease.
Collapse
Affiliation(s)
- Jianzhi Zhang
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI, USA.
| | - Wenfeng Qian
- State Key Laboratory of Seed Innovation, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, China.
- University of Chinese Academy of Sciences, Beijing, China.
| |
Collapse
|
4
|
Kim D, Sung D, Lee JW. Expanding the genetic code: Strategies for noncanonical amino acid incorporation in biopolymer. BIORESOURCE TECHNOLOGY 2025; 432:132691. [PMID: 40381810 DOI: 10.1016/j.biortech.2025.132691] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/19/2025] [Revised: 05/07/2025] [Accepted: 05/15/2025] [Indexed: 05/20/2025]
Abstract
Codon expansion has become a powerful tool for overcoming the limitations of the standard genetic code system, which restricts the building block of proteins to canonical amino acids. The incorporation of non-canonical amino acids (ncAAs) into proteins presents a significant opportunity to expand their functional diversity. The precise incorporation of ncAAs in vivo requires an orthogonal tRNA/aminoacyl-tRNA synthetase pair and a blank codon to assign them. Studies have focused on the biosynthesis of proteins with novel chemical properties alongside biotechnological advancements in codon expansion research. The three principal strategies for codon expansion are: stop codon utilization, quadruplet codon generation, and sense codon compression. Although using stop codons as blank codons remains an effective approach, the need for additional blank codons has expanded research into quadruplet codons and sense-codon compression. This review presents an overview of each strategy by integrating recent advances in research. We discuss the advantages and limitations of these strategies, as well as the challenges encountered. Subsequently, we propose potential approaches to enhance the efficiency and fidelity of ncAA incorporation. The insights presented in this review provide perspectives for future research and facilitate the advancement of codon expansion and its applications in biotechnology.
Collapse
Affiliation(s)
- Donghyeon Kim
- Department of Chemical Engineering, Pohang University of Science and Technology (POSTECH), Pohang 37673, Republic of Korea
| | - Doeon Sung
- Department of Chemical Engineering, Pohang University of Science and Technology (POSTECH), Pohang 37673, Republic of Korea
| | - Jeong Wook Lee
- Department of Chemical Engineering, Pohang University of Science and Technology (POSTECH), Pohang 37673, Republic of Korea; Division of Interdisciplinary Bioscience and Bioengineering (I-Bio), Pohang University of Science and Technology (POSTECH), Pohang 37673, Republic of Korea.
| |
Collapse
|
5
|
Miller ST, Henzler-Wildman KA, Raman S. Energetic and structural control of polyspecificity in a multidrug transporter. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.04.09.647630. [PMID: 40291682 PMCID: PMC12027068 DOI: 10.1101/2025.04.09.647630] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/30/2025]
Abstract
Multidrug efflux pumps are dynamic molecular machines that drive antibiotic resistance by harnessing ion gradients to export chemically diverse substrates. Despite their clinical importance, the molecular principles underlying multidrug promiscuity and energy efficiency remain poorly understood. Using multiparametric deep mutational scanning across eight substrates and two energy conditions, we deconvolute the contributions of substrate recognition, energetic coupling, and protein stability, providing an integrated, high-resolution view of multidrug transport. We find that substrate specificity arises from a distributed network of residues extending beyond the binding site, with mutations that reshape binding, coupling, conformational flexibility, and membrane interactions. Further, we apply a pH-based selection scheme to measure the effect of mutation on pH-dependent transport efficiency. By integrating these data, we reveal a fundamental relationship between efficiency and promiscuity: highly efficient variants exhibit broad substrate profiles, while inefficient variants are narrower. These findings establish a direct link between energy coupling and polyspecificity, uncovering the biochemical logic underlying multidrug transport.
Collapse
|
6
|
Demissie EA, Park SY, Moon JH, Lee DY. Comparative Analysis of Codon Optimization Tools: Advancing toward a Multi-Criteria Framework for Synthetic Gene Design. J Microbiol Biotechnol 2025; 35:e2411066. [PMID: 40223268 PMCID: PMC12010093 DOI: 10.4014/jmb.2411.11066] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2024] [Revised: 02/13/2025] [Accepted: 02/24/2025] [Indexed: 04/15/2025]
Abstract
Codon optimization is an essential technique in synthetic biology and biopharmaceutical production, enhancing recombinant protein expression by fine-tuning genetic sequences to match the translational machinery and codon usage preferences of specific host organisms. This study presents a comprehensive comparative analysis of widely used codon optimization tools, focusing on their capacity to reflect host-specific codon biases, design principles, and parameters. Industrially relevant target proteins were evaluated in Escherichia coli, Saccharomyces cerevisiae, and CHO cells, uncovering significant variability in sequence design and clustering patterns across tools. Tools such as JCat, OPTIMIZER, ATGme, and GeneOptimizer demonstrated strong alignment with genome-wide and highly expressed gene-level codon usage, achieving high codon adaptation index (CAI) values and efficient codon-pair utilization. Conversely, tools like TISIGNER and IDT employed different optimization strategies that frequently produced divergent results. Other key parameters, including GC content, mRNA secondary structure stability (ΔG), and codon-pair bias (CPB), were analyzed to elucidate their influence on translational efficiency. While increased GC content enhanced mRNA stability in E. coli, A/T-rich codons in S. cerevisiae minimized secondary structure formation, and moderate GC content in CHO cells balanced mRNA stability and translation efficiency. Our findings highlight the limitations of single-metric approaches and advocate for a multi-criteria framework that integrates CAI, GC content, mRNA folding energy, and codon-pair considerations. This integrative strategy enables the design of tailored genetic sequences that meet host-specific requirements, advancing synthetic gene design for biotechnological innovation and precision biopharmaceutical applications.
Collapse
Affiliation(s)
- Eden A. Demissie
- School of Chemical Engineering, Sungkyunkwan University, Suwon 16419, Republic of Korea
| | - Seo-Young Park
- School of Chemical Engineering, Sungkyunkwan University, Suwon 16419, Republic of Korea
- School of Medicine, Kyungpook National University, Daegu 41944, Republic of Korea
| | - Je Hun Moon
- School of Chemical Engineering, Sungkyunkwan University, Suwon 16419, Republic of Korea
| | - Dong-Yup Lee
- School of Chemical Engineering, Sungkyunkwan University, Suwon 16419, Republic of Korea
| |
Collapse
|
7
|
Herynek Š, Svoboda J, Huličiak M, Peleg Y, Škultétyová Ľ, Mikulecký P, Schneider B. Increasing recombinant protein production in E. coli via FACS-based selection of N-terminal coding DNA libraries. FEBS J 2025; 292:1070-1085. [PMID: 39726159 PMCID: PMC11880969 DOI: 10.1111/febs.17376] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2024] [Revised: 08/28/2024] [Accepted: 11/25/2024] [Indexed: 12/28/2024]
Abstract
Here, we present a previously undescribed approach to modify N-terminal sequences of recombinant proteins to increase their production yield in Escherichia coli. Prior research has demonstrated that the nucleotides immediately following the start codon can significantly influence protein expression. However, the impact of these sequences is construct-specific and is not universally applicable to all proteins. Most of the previous research has been limited to selecting from a few rationally designed sequences. In contrast, we used a directed evolution-based methodology, screening large numbers of diversified sequences derived from DNA libraries coding for the N-termini of investigated proteins. To facilitate the identification of cells with increased expression of the target construct, we cloned a GFP gene at the C-terminus of the expressed genes and used fluorescent activated cell sorting (FACS) to separate cells based on their fluorescence. By following this systematic workflow, we successfully elevated the yield of soluble recombinant proteins of multiple constructs up to over 30-fold.
Collapse
Affiliation(s)
- Štěpán Herynek
- Institute of Biotechnology, Czech Academy of Sciences, BIOCEVPragueCzech Republic
| | - Jakub Svoboda
- Institute of Biotechnology, Czech Academy of Sciences, BIOCEVPragueCzech Republic
| | - Maroš Huličiak
- Institute of Biotechnology, Czech Academy of Sciences, BIOCEVPragueCzech Republic
| | - Yoav Peleg
- Structural Proteomics Unit (SPU), Department of Life Sciences Core Facilities (LSCF)Weizmann Institute of ScienceRehovotIsrael
| | - Ľubica Škultétyová
- Institute of Biotechnology, Czech Academy of Sciences, BIOCEVPragueCzech Republic
| | - Pavel Mikulecký
- Institute of Biotechnology, Czech Academy of Sciences, BIOCEVPragueCzech Republic
| | - Bohdan Schneider
- Institute of Biotechnology, Czech Academy of Sciences, BIOCEVPragueCzech Republic
| |
Collapse
|
8
|
Wong DPH, Wong KH, Park S, Boël G, Hunt JF, Aalberts DP. OPT: Codon optimize gene sequences for E. coli protein overexpression. J Mol Biol 2025:168965. [PMID: 40133777 DOI: 10.1016/j.jmb.2025.168965] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2024] [Revised: 01/23/2025] [Accepted: 01/23/2025] [Indexed: 03/27/2025]
Abstract
The ability to overexpress proteins is valuable for biotechnology, but not all sequences are compatible with high yield. We previously analyzed the sequence features and mRNA folding stability of a large data set of 6,384 distinct gene constructs, and developed a model for protein yield. Our OPT.williams.edu server (1) predicts the probability an input sequence will produce protein at a high level when overexpressed in E. coli, and (2) returns optimized synonymous sequences designed to boost protein expression. Here we also present experimental evidence of the high yields of our OPT constructs for eight commercially produced proteins.
Collapse
Affiliation(s)
- Daniel P H Wong
- Physics Department, Williams College, Williamstown, MA 01267, USA
| | - Kam-Ho Wong
- Department of Biological Sciences, Columbia University, New York, NY 10027, USA
| | - Sunjae Park
- Department of Biological Sciences, Columbia University, New York, NY 10027, USA
| | - Grégory Boël
- Expression Génétique Microbienne, CNRS, Universite Paris Cite, Institut de Biologie Physio-Chimique, F-75005 Paris, France.
| | - John F Hunt
- Department of Biological Sciences, Columbia University, New York, NY 10027, USA.
| | | |
Collapse
|
9
|
Sidi T, Bahiri-Elitzur S, Tuller T, Kolodny R. Predicting gene sequences with AI to study codon usage patterns. Proc Natl Acad Sci U S A 2025; 122:e2410003121. [PMID: 39739812 PMCID: PMC11725940 DOI: 10.1073/pnas.2410003121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2024] [Accepted: 11/27/2024] [Indexed: 01/02/2025] Open
Abstract
Selective pressure acts on the codon use, optimizing multiple, overlapping signals that are only partially understood. We trained AI models to predict codons given their amino acid sequence in the eukaryotes Saccharomyces cerevisiae and Schizosaccharomyces pombe and the bacteria Escherichia coli and Bacillus subtilis to study the extent to which we can learn patterns in naturally occurring codons to improve predictions. We trained our models on a subset of the proteins and evaluated their predictions on large, separate sets of proteins of varying lengths and expression levels. Our models significantly outperformed naïve frequency-based approaches, demonstrating that there are learnable dependencies in evolutionary-selected codon usage. The prediction accuracy advantage of our models is greater for highly expressed genes and is greater in bacteria than eukaryotes, supporting the hypothesis that there is a monotonic relationship between selective pressure for complex codon patterns and effective population size. In S. cerevisiae and bacteria, our models were more accurate for longer proteins, suggesting that the learned patterns may be related to cotranslational folding. Gene functionality and conservation were also important determinants that affect the performance of our models. Finally, we showed that using information encoded in homologous proteins has only a minor effect on prediction accuracy, perhaps due to complex codon-usage codes in genes undergoing rapid evolution. Our study employing contemporary AI methods offers a unique perspective and a deep-learning-based prediction tool for evolutionary-selected codons. We hope that these can be useful to optimize codon usage in endogenous and heterologous proteins.
Collapse
Affiliation(s)
- Tomer Sidi
- Department of Computer Science, University of Haifa, Haifa3303221, Israel
| | - Shir Bahiri-Elitzur
- Department of Biomedical Engineering, Tel-Aviv University, Tel Aviv6139001, Israel
| | - Tamir Tuller
- Department of Biomedical Engineering, Tel-Aviv University, Tel Aviv6139001, Israel
- The Sagol School of Neuroscience, Tel-Aviv University, Tel Aviv6139001, Israel
| | - Rachel Kolodny
- Department of Computer Science, University of Haifa, Haifa3303221, Israel
| |
Collapse
|
10
|
Buric F, Viknander S, Fu X, Lemke O, Carmona OG, Zrimec J, Szyrwiel L, Mülleder M, Ralser M, Zelezniak A. Amino acid sequence encodes protein abundance shaped by protein stability at reduced synthesis cost. Protein Sci 2025; 34:e5239. [PMID: 39665261 PMCID: PMC11635393 DOI: 10.1002/pro.5239] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2024] [Revised: 10/11/2024] [Accepted: 11/14/2024] [Indexed: 12/13/2024]
Abstract
Understanding what drives protein abundance is essential to biology, medicine, and biotechnology. Driven by evolutionary selection, an amino acid sequence is tailored to meet the required abundance of a proteome, underscoring the intricate relationship between sequence and functional demand. Yet, the specific role of amino acid sequences in determining proteome abundance remains elusive. Here we show that the amino acid sequence alone encodes over half of protein abundance variation across all domains of life, ranging from bacteria to mouse and human. With an attempt to go beyond predictions, we trained a manageable-size Transformer model to interpret latent factors predictive of protein abundances. Intuitively, the model's attention focused on the protein's structural features linked to stability and metabolic costs related to protein synthesis. To probe these relationships, we introduce MGEM (Mutation Guided by an Embedded Manifold), a methodology for guiding protein abundance through sequence modifications. We find that mutations which increase predicted abundance have significantly altered protein polarity and hydrophobicity, underscoring a connection between protein structural features and abundance. Through molecular dynamics simulations we revealed that abundance-enhancing mutations possibly contribute to protein thermostability by increasing rigidity, which occurs at a lower synthesis cost.
Collapse
Affiliation(s)
- Filip Buric
- Department of Biology and Biological EngineeringChalmers University of TechnologyGothenburgSweden
| | - Sandra Viknander
- Department of Biology and Biological EngineeringChalmers University of TechnologyGothenburgSweden
| | - Xiaozhi Fu
- Department of Biology and Biological EngineeringChalmers University of TechnologyGothenburgSweden
| | - Oliver Lemke
- Department of BiochemistryCharité – Universitätsmedizin BerlinBerlinGermany
| | - Oriol Gracia Carmona
- Randall Centre for Cell & Molecular BiophysicsKing's College LondonLondonUK
- Institute of Structural and Molecular BiologyUniversity College LondonLondonUK
| | - Jan Zrimec
- Department of Biology and Biological EngineeringChalmers University of TechnologyGothenburgSweden
- Department of Biotechnology and Systems BiologyNational Institute of BiologyLjubljanaSlovenia
| | - Lukasz Szyrwiel
- Department of BiochemistryCharité – Universitätsmedizin BerlinBerlinGermany
| | - Michael Mülleder
- Core Facility High Throughput Mass SpectrometryCharité – Universitätsmedizin BerlinBerlinGermany
| | - Markus Ralser
- Department of BiochemistryCharité – Universitätsmedizin BerlinBerlinGermany
| | - Aleksej Zelezniak
- Department of Biology and Biological EngineeringChalmers University of TechnologyGothenburgSweden
- Randall Centre for Cell & Molecular BiophysicsKing's College LondonLondonUK
- Institute of Biotechnology, Life Sciences CentreVilnius UniversityVilniusLithuania
| |
Collapse
|
11
|
Ojima-Kato T. Advances in recombinant protein production in microorganisms and functional peptide tags. Biosci Biotechnol Biochem 2024; 89:1-10. [PMID: 39479788 DOI: 10.1093/bbb/zbae147] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2024] [Accepted: 10/13/2024] [Indexed: 12/24/2024]
Abstract
Recombinant protein production in prokaryotic and eukaryotic cells is a fundamental technology for both research and industry. Achieving efficient protein synthesis is key to accelerating the discovery, characterization, and practical application of proteins. This review focuses on recent advances in recombinant protein production and strategies for more efficient protein production, especially using Escherichia coli and Saccharomyces cerevisiae. Additionally, this review summarizes the development of various functional peptide tags that can be employed for protein production, modification, and purification, including translation-enhancing peptide tags developed by our research group.
Collapse
Affiliation(s)
- Teruyo Ojima-Kato
- Laboratory of Molecular Biotechnology, Graduate School of Bioagricultural Sciences, Nagoya University, Nagoya, Japan
| |
Collapse
|
12
|
Wen X, Lin J, Yang C, Li Y, Cheng H, Liu Y, Zhang Y, Ma H, Mao Y, Liao X, Wang M. Automated characterization and analysis of expression compatibility between regulatory sequences and metabolic genes in Escherichia coli. Synth Syst Biotechnol 2024; 9:647-657. [PMID: 38817827 PMCID: PMC11137365 DOI: 10.1016/j.synbio.2024.05.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2024] [Revised: 05/11/2024] [Accepted: 05/16/2024] [Indexed: 06/01/2024] Open
Abstract
Utilizing standardized artificial regulatory sequences to fine-tuning the expression of multiple metabolic pathways/genes is a key strategy in the creation of efficient microbial cell factories. However, when regulatory sequence expression strengths are characterized using only a few reporter genes, they may not be applicable across diverse genes. This introduces great uncertainty into the precise regulation of multiple genes at multiple expression levels. To address this, our study adopted a fluorescent protein fusion strategy for a more accurate assessment of target protein expression levels. We combined 41 commonly-used metabolic genes with 15 regulatory sequences, yielding an expression dataset encompassing 520 unique combinations. This dataset highlighted substantial variation in protein expression level under identical regulatory sequences, with relative expression levels ranging from 2.8 to 176-fold. It also demonstrated that improving the strength of regulatory sequences does not necessarily lead to significant improvements in the expression levels of target proteins. Utilizing this dataset, we have developed various machine learning models and discovered that the integration of promoter regions, ribosome binding sites, and coding sequences significantly improves the accuracy of predicting protein expression levels, with a Spearman correlation coefficient of 0.72, where the promoter sequence exerts a predominant influence. Our study aims not only to provide a detailed guide for fine-tuning gene expression in the metabolic engineering of Escherichia coli but also to deepen our understanding of the compatibility issues between regulatory sequences and target genes.
Collapse
Affiliation(s)
- Xiao Wen
- School of Life Sciences, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, 230026, China
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, 300308, China
- Key Laboratory of Engineering Biology for Low-Carbon Manufacturing, Tianjin, 300308, China
| | - Jiawei Lin
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, 300308, China
- School of Biological Engineering, Tianjin University of Science and Technology, Tianjin, 300457, China
| | - Chunhe Yang
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, 300308, China
- School of Biological Engineering, Tianjin University of Science and Technology, Tianjin, 300457, China
| | - Ying Li
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, 300308, China
- School of Biological Engineering, Tianjin University of Science and Technology, Tianjin, 300457, China
| | - Haijiao Cheng
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, 300308, China
- Key Laboratory of Engineering Biology for Low-Carbon Manufacturing, Tianjin, 300308, China
| | - Ye Liu
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, 300308, China
- Key Laboratory of Engineering Biology for Low-Carbon Manufacturing, Tianjin, 300308, China
| | - Yue Zhang
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, 300308, China
- Key Laboratory of Engineering Biology for Low-Carbon Manufacturing, Tianjin, 300308, China
| | - Hongwu Ma
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, 300308, China
- Key Laboratory of Engineering Biology for Low-Carbon Manufacturing, Tianjin, 300308, China
| | - Yufeng Mao
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, 300308, China
- Key Laboratory of Engineering Biology for Low-Carbon Manufacturing, Tianjin, 300308, China
| | - Xiaoping Liao
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, 300308, China
- Key Laboratory of Engineering Biology for Low-Carbon Manufacturing, Tianjin, 300308, China
| | - Meng Wang
- School of Life Sciences, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, 230026, China
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, 300308, China
- Key Laboratory of Engineering Biology for Low-Carbon Manufacturing, Tianjin, 300308, China
| |
Collapse
|
13
|
Costello A, Peterson AA, Chen PH, Bagirzadeh R, Lanster DL, Badran AH. Genetic Code Expansion History and Modern Innovations. Chem Rev 2024; 124:11962-12005. [PMID: 39466033 DOI: 10.1021/acs.chemrev.4c00275] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/29/2024]
Abstract
The genetic code is the foundation for all life. With few exceptions, the translation of nucleic acid messages into proteins follows conserved rules, which are defined by codons that specify each of the 20 proteinogenic amino acids. For decades, leading research groups have developed a catalogue of innovative approaches to extend nature's amino acid repertoire to include one or more noncanonical building blocks in a single protein. In this review, we summarize advances in the history of in vitro and in vivo genetic code expansion, and highlight recent innovations that increase the scope of biochemically accessible monomers and codons. We further summarize state-of-the-art knowledge in engineered cellular translation, as well as alterations to regulatory mechanisms that improve overall genetic code expansion. Finally, we distill existing limitations of these technologies into must-have improvements for the next generation of technologies, and speculate on future strategies that may be capable of overcoming current gaps in knowledge.
Collapse
Affiliation(s)
- Alan Costello
- Department of Chemistry The Scripps Research Institute; La Jolla, California 92037, United States
- Department of Integrative Structural and Computational Biology The Scripps Research Institute; La Jolla, California 92037, United States
| | - Alexander A Peterson
- Department of Chemistry The Scripps Research Institute; La Jolla, California 92037, United States
- Department of Integrative Structural and Computational Biology The Scripps Research Institute; La Jolla, California 92037, United States
| | - Pei-Hsin Chen
- Department of Chemistry The Scripps Research Institute; La Jolla, California 92037, United States
- Department of Integrative Structural and Computational Biology The Scripps Research Institute; La Jolla, California 92037, United States
- Doctoral Program in Chemical and Biological Sciences The Scripps Research Institute; La Jolla, California 92037, United States
| | - Rustam Bagirzadeh
- Department of Chemistry The Scripps Research Institute; La Jolla, California 92037, United States
- Department of Integrative Structural and Computational Biology The Scripps Research Institute; La Jolla, California 92037, United States
| | - David L Lanster
- Department of Chemistry The Scripps Research Institute; La Jolla, California 92037, United States
- Department of Integrative Structural and Computational Biology The Scripps Research Institute; La Jolla, California 92037, United States
- Doctoral Program in Chemical and Biological Sciences The Scripps Research Institute; La Jolla, California 92037, United States
| | - Ahmed H Badran
- Department of Chemistry The Scripps Research Institute; La Jolla, California 92037, United States
- Department of Integrative Structural and Computational Biology The Scripps Research Institute; La Jolla, California 92037, United States
| |
Collapse
|
14
|
He L, Wang W, Wang X, Zhang D, Zhang Y, Zhao Y, Zhao L, Li X, Cheng J, Xu D, Ma Z, Yang X, Huang Z, Cai Y, Liu X, Chen Z, Weng X, Lin C, Gong P, Zhang X. Identification of the FGB gene polymorphism and analysis of its association with fat deposition traits in Hu sheep. Anim Biotechnol 2024; 35:2344207. [PMID: 38669223 DOI: 10.1080/10495398.2024.2344207] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/28/2024]
Abstract
As a crucial economic trait, fat deposition is directly related to carcass quality and feed efficiency in sheep. The purpose of this study was to investigate the polymorphisms of the FGB gene related to fat deposition and detect the expression features of the FGB gene in different adipose tissues of sheep by using Sanger sequencing, MassARRAY® SNP technique, and quantitative real-time PCR (qRT-PCR). Results showed that in the intron region of the FGB gene, a SNP g. 3378953 A > T has been identified, and significant association was found between perirenal fat weight, perirenal fat relative weight, mesenteric fat weight, and mesenteric fat relative weight (P < 0.05). Moreover, qRT-PCR analysis showed that FGB was expressed in all three adipose tissues, and FGB gene expression level in the AA genotype was significantly lower than that in the AT or TT genotypes (P < 0.05). Therefore, the FGB gene can be used as a candidate gene to reduce fat deposition in Hu sheep breeding, and the selection of the AA genotype in Hu sheep in production practice is more conducive to improving production efficiency.
Collapse
Affiliation(s)
- Lijuan He
- College of Animal Science and Technology, Gansu Agricultural University, Lanzhou, Gansu, China
| | - Weimin Wang
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystems, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, Gansu, China
| | - Xiaojuan Wang
- College of Animal Science and Technology, Gansu Agricultural University, Lanzhou, Gansu, China
| | - Deyin Zhang
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystems, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, Gansu, China
| | - Yukun Zhang
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystems, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, Gansu, China
| | - Yuan Zhao
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystems, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, Gansu, China
| | - Liming Zhao
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystems, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, Gansu, China
| | - Xiaolong Li
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystems, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, Gansu, China
| | - Jiangbo Cheng
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystems, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, Gansu, China
| | - Dan Xu
- College of Animal Science and Technology, Gansu Agricultural University, Lanzhou, Gansu, China
| | - Zongwu Ma
- College of Animal Science and Technology, Gansu Agricultural University, Lanzhou, Gansu, China
| | - Xiaobin Yang
- College of Animal Science and Technology, Gansu Agricultural University, Lanzhou, Gansu, China
| | - Zhiqiang Huang
- College of Animal Science and Technology, Gansu Agricultural University, Lanzhou, Gansu, China
| | - Youxin Cai
- College of Animal Science and Technology, Gansu Agricultural University, Lanzhou, Gansu, China
| | - Xiaoqiang Liu
- College of Animal Science and Technology, Gansu Agricultural University, Lanzhou, Gansu, China
| | - Zhanyu Chen
- College of Animal Science and Technology, Gansu Agricultural University, Lanzhou, Gansu, China
| | - Xiuxiu Weng
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystems, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, Gansu, China
| | - Changchun Lin
- Institute of Animal Husbandry Quality Standards, Xinjiang Academy of Animal Science, Urumqi, China
| | - Ping Gong
- Institute of Animal Husbandry Quality Standards, Xinjiang Academy of Animal Science, Urumqi, China
| | - Xiaoxue Zhang
- College of Animal Science and Technology, Gansu Agricultural University, Lanzhou, Gansu, China
| |
Collapse
|
15
|
Ma Z, Wang W, Zhang D, Wang X, Li S, Zhao L, Zhang Y, Zhao Y, Li X, Lin C, Wang J, Cheng J, Xu D, Yang X, Huang Y, Cui P, Liu J, Zeng X, Zhai R, Huang Z, Weng X, Zhang X. Polymorphism in IGFALS gene and its association with scrotal circumference in Hu lambs. Anim Biotechnol 2024; 35:2295928. [PMID: 38174897 DOI: 10.1080/10495398.2023.2295928] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2024]
Abstract
Scrotal circumference is an important reproductive index of breeding rams, which has a high genetic correlation with ejaculation volume and semen quality. In this study, the scrotal circumference of 1353 male Hu sheep at different stages of development was measured and descriptive statistical analysis was performed. The results showed that the coefficient of variation of scrotal circumference at each stage was greater than 10%, and its heritability were moderately to high, ranging from 0.318 to 0.719. We used PCR amplification and Sanger sequencing to scan the polymorphisms of the IGFALS gene, and performed association analysis with the circumference of the scrotum at different stages. We identified a synonymous mutation g.918 G > C in exon 1 of the IGFALS gene, and this mutation was significantly associated with scrotal circumference at 100, 120, 140, 160 and 180 days (p < 0.05). Therefore, IGFALS gene polymorphism can be used as a molecular marker affecting scrotal circumference of Hu sheep, which can provide a reference for future molecular marker-assisted selection of scrotal circumference in sheep.
Collapse
Affiliation(s)
- Zongwu Ma
- College of Animal Science and Technology, Gansu Agricultural University, Lanzhou, Gansu, China
| | - Weimin Wang
- The State Key Laboratory of Grassland Agro-Ecosystems, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, Gansu, China
| | - Deyin Zhang
- The State Key Laboratory of Grassland Agro-Ecosystems, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, Gansu, China
| | - Xinji Wang
- Minqin County Animal Husbandry and Veterinary Workstation, Minqin, Gansu, China
| | - Shirong Li
- Minqin County Animal Husbandry and Veterinary Workstation, Minqin, Gansu, China
| | - Liming Zhao
- College of Animal Science and Technology, Gansu Agricultural University, Lanzhou, Gansu, China
| | - Yukun Zhang
- The State Key Laboratory of Grassland Agro-Ecosystems, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, Gansu, China
| | - Yuan Zhao
- The State Key Laboratory of Grassland Agro-Ecosystems, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, Gansu, China
| | - Xiaolong Li
- College of Animal Science and Technology, Gansu Agricultural University, Lanzhou, Gansu, China
| | - Changchun Lin
- College of Animal Science and Technology, Gansu Agricultural University, Lanzhou, Gansu, China
| | - Jianghui Wang
- College of Animal Science and Technology, Gansu Agricultural University, Lanzhou, Gansu, China
| | - Jiangbo Cheng
- College of Animal Science and Technology, Gansu Agricultural University, Lanzhou, Gansu, China
| | - Dan Xu
- College of Animal Science and Technology, Gansu Agricultural University, Lanzhou, Gansu, China
| | - Xiaobin Yang
- College of Animal Science and Technology, Gansu Agricultural University, Lanzhou, Gansu, China
| | - Yongliang Huang
- College of Animal Science and Technology, Gansu Agricultural University, Lanzhou, Gansu, China
| | - Panpan Cui
- College of Animal Science and Technology, Gansu Agricultural University, Lanzhou, Gansu, China
| | - Jia Liu
- College of Animal Science and Technology, Gansu Agricultural University, Lanzhou, Gansu, China
| | - Xiwen Zeng
- College of Animal Science and Technology, Gansu Agricultural University, Lanzhou, Gansu, China
| | - Rui Zhai
- College of Animal Science and Technology, Gansu Agricultural University, Lanzhou, Gansu, China
| | - Zhiqiang Huang
- College of Animal Science and Technology, Gansu Agricultural University, Lanzhou, Gansu, China
| | - Xiuxiu Weng
- The State Key Laboratory of Grassland Agro-Ecosystems, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, Gansu, China
| | - Xiaoxue Zhang
- College of Animal Science and Technology, Gansu Agricultural University, Lanzhou, Gansu, China
| |
Collapse
|
16
|
Yang F, Sun X, Lu J, Zhang L, Du G, Li J, Xu R, Kang Z. Immobilized high-performance heparin lyase III for efficient preparation of low molecular weight heparin. Int J Biol Macromol 2024; 280:135833. [PMID: 39306163 DOI: 10.1016/j.ijbiomac.2024.135833] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2024] [Revised: 09/09/2024] [Accepted: 09/18/2024] [Indexed: 09/30/2024]
Abstract
Heparin lyase III has garnered widespread attention due to its high specificity and minimal loss of anticoagulant activity during the preparation of low molecular weight heparin (LMWH), a crucial anticoagulant drug in clinical practice. However, low expression levels and complex preparation processes limit its practical application. To address these challenges, high-performance Bacteroides thetaiotaomicron heparin lyase III (Bhep III) variants were engineered and immobilized for LMWH preparation. First, we enhanced enzyme expression by adding a solubility-enhancing tag and optimizing the N-terminal coding sequence, which resulted in a Bhep III activity level of 2.9 × 103 U/L with 8-fold increase. After evolution guided the design of rational mutations, the variant Bhep III K85A/Q95F/S471T generated higher activity (5.4 × 104 U/L in 5-L fermenter), which is, to our knowledge, the highest reported to date in the literature, being 1.7-fold that of the wild type and demonstrating 2-fold increase of the thermal stability. By screening and optimizing the C-terminal self-assembling tag, we successfully immobilized Bhep III, further increasing its thermal stability by 12-fold, and allowing for the multi-batch preparation of LMWH with simple centrifugation. The immobilized heparin lyase III demonstrated sufficient reusability in enzymatic reactions, facilitating efficient industrial-scale production of LMWH.
Collapse
Affiliation(s)
- Fengling Yang
- State Key Laboratory of Food Science and Resources, Jiangnan University, Wuxi, China; The Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi, China; The Science Center for Future Foods, Jiangnan University, Wuxi 214122, China; Jiaxing Institute of Future Food, Jiaxing 314050, China
| | - Xiaoyuan Sun
- The Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi, China; The Science Center for Future Foods, Jiangnan University, Wuxi 214122, China
| | - Jie Lu
- The Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi, China; The Science Center for Future Foods, Jiangnan University, Wuxi 214122, China
| | - Linpei Zhang
- The Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi, China
| | - Guocheng Du
- The Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi, China; The Science Center for Future Foods, Jiangnan University, Wuxi 214122, China
| | - Jianghua Li
- The Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi, China; The Science Center for Future Foods, Jiangnan University, Wuxi 214122, China
| | - Ruirui Xu
- The Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi, China; The Science Center for Future Foods, Jiangnan University, Wuxi 214122, China.
| | - Zhen Kang
- State Key Laboratory of Food Science and Resources, Jiangnan University, Wuxi, China; The Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi, China; The Science Center for Future Foods, Jiangnan University, Wuxi 214122, China.
| |
Collapse
|
17
|
Jiang R, Yuan S, Zhou Y, Wei Y, Li F, Wang M, Chen B, Yu H. Strategies to overcome the challenges of low or no expression of heterologous proteins in Escherichia coli. Biotechnol Adv 2024; 75:108417. [PMID: 39038691 DOI: 10.1016/j.biotechadv.2024.108417] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2024] [Revised: 07/18/2024] [Accepted: 07/19/2024] [Indexed: 07/24/2024]
Abstract
Protein expression is a critical process in diverse biological systems. For Escherichia coli, a widely employed microbial host in industrial catalysis and healthcare, researchers often face significant challenges in constructing recombinant expression systems. To maximize the potential of E. coli expression systems, it is essential to address problems regarding the low or absent production of certain target proteins. This article presents viable solutions to the main factors posing challenges to heterologous protein expression in E. coli, which includes protein toxicity, the intrinsic influence of gene sequences, and mRNA structure. These strategies include specialized approaches for managing toxic protein expression, addressing issues related to mRNA structure and codon bias, advanced codon optimization methodologies that consider multiple factors, and emerging optimization techniques facilitated by big data and machine learning.
Collapse
Affiliation(s)
- Ruizhao Jiang
- Department of Chemical Engineering, Tsinghua University, Beijing 100084, China; Key Laboratory of Industrial Biocatalysis (Tsinghua University), the Ministry of Education, Beijing 100084, China
| | - Shuting Yuan
- Department of Chemical Engineering, Tsinghua University, Beijing 100084, China; Key Laboratory of Industrial Biocatalysis (Tsinghua University), the Ministry of Education, Beijing 100084, China
| | - Yilong Zhou
- Tanwei College, Tsinghua University, Beijing 100084, China
| | - Yuwen Wei
- Department of Chemical Engineering, Tsinghua University, Beijing 100084, China; Key Laboratory of Industrial Biocatalysis (Tsinghua University), the Ministry of Education, Beijing 100084, China
| | - Fulong Li
- Beijing Evolyzer Co.,Ltd., 100176, China
| | | | - Bo Chen
- Beijing Evolyzer Co.,Ltd., 100176, China
| | - Huimin Yu
- Department of Chemical Engineering, Tsinghua University, Beijing 100084, China; Key Laboratory of Industrial Biocatalysis (Tsinghua University), the Ministry of Education, Beijing 100084, China; Center for Synthetic and Systems Biology, Tsinghua University, Beijing 100084, China.
| |
Collapse
|
18
|
Cao X, Zhang Y, Ding Y, Wan Y. Identification of RNA structures and their roles in RNA functions. Nat Rev Mol Cell Biol 2024; 25:784-801. [PMID: 38926530 DOI: 10.1038/s41580-024-00748-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/28/2024] [Indexed: 06/28/2024]
Abstract
The development of high-throughput RNA structure profiling methods in the past decade has greatly facilitated our ability to map and characterize different aspects of RNA structures transcriptome-wide in cell populations, single cells and single molecules. The resulting high-resolution data have provided insights into the static and dynamic nature of RNA structures, revealing their complexity as they perform their respective functions in the cell. In this Review, we discuss recent technical advances in the determination of RNA structures, and the roles of RNA structures in RNA biogenesis and functions, including in transcription, processing, translation, degradation, localization and RNA structure-dependent condensates. We also discuss the current understanding of how RNA structures could guide drug design for treating genetic diseases and battling pathogenic viruses, and highlight existing challenges and future directions in RNA structure research.
Collapse
Affiliation(s)
- Xinang Cao
- Stem Cell and Regenerative Biology, Genome Institute of Singapore, Singapore, Singapore
| | - Yueying Zhang
- Department of Cell and Developmental Biology, John Innes Centre, Norwich, UK
| | - Yiliang Ding
- Department of Cell and Developmental Biology, John Innes Centre, Norwich, UK.
| | - Yue Wan
- Stem Cell and Regenerative Biology, Genome Institute of Singapore, Singapore, Singapore.
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore.
| |
Collapse
|
19
|
Yan Z, Chu W, Sheng Y, Tang K, Wang S, Liu Y, Wong WF. Integrating Deep Learning and Synthetic Biology: A Co-Design Approach for Enhancing Gene Expression via N-Terminal Coding Sequences. ACS Synth Biol 2024; 13:2960-2968. [PMID: 39229974 DOI: 10.1021/acssynbio.4c00371] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/05/2024]
Abstract
N-terminal coding sequence (NCS) influences gene expression by impacting the translation initiation rate. The NCS optimization problem is to find an NCS that maximizes gene expression. The problem is important in genetic engineering. However, current methods for NCS optimization such as rational design and statistics-guided approaches are labor-intensive yield only relatively small improvements. This paper introduces a deep learning/synthetic biology codesigned few-shot training workflow for NCS optimization. Our method utilizes k-nearest encoding followed by word2vec to encode the NCS, then performs feature extraction using attention mechanisms, before constructing a time-series network for predicting gene expression intensity, and finally a direct search algorithm identifies the optimal NCS with limited training data. We took green fluorescent protein (GFP) expressed by Bacillus subtilis as a reporting protein of NCSs, and employed the fluorescence enhancement factor as the metric of NCS optimization. Within just six iterative experiments, our model generated an NCS (MLD62) that increased average GFP expression by 5.41-fold, outperforming the state-of-the-art NCS designs. Extending our findings beyond GFP, we showed that our engineered NCS (MLD62) can effectively boost the production of N-acetylneuraminic acid by enhancing the expression of the crucial rate-limiting GNA1 gene, demonstrating its practical utility. We have open-sourced our NCS expression database and experimental procedures for public use.
Collapse
Affiliation(s)
- Zhanglu Yan
- School of Computing, National University of Singapore, Singapore 117417, Singapore
| | - Weiran Chu
- Science Center for Future Foods, Jiangnan University, Wuxi 214122, PR China
| | - Yuhua Sheng
- Science Center for Future Foods, Jiangnan University, Wuxi 214122, PR China
| | - Kaiwen Tang
- School of Computing, National University of Singapore, Singapore 117417, Singapore
| | - Shida Wang
- Department of Mathematics, National University of Singapore, Singapore 119077, Singapore
| | - Yanfeng Liu
- Science Center for Future Foods, Jiangnan University, Wuxi 214122, PR China
| | - Weng-Fai Wong
- School of Computing, National University of Singapore, Singapore 117417, Singapore
| |
Collapse
|
20
|
Sun H, Vargas-Blanco D, Zhou Y, Masiello C, Kelly J, Moy J, Korkin D, Shell S. Diverse intrinsic properties shape transcript stability and stabilization in Mycolicibacterium smegmatis. NAR Genom Bioinform 2024; 6:lqae147. [PMID: 39498432 PMCID: PMC11532794 DOI: 10.1093/nargab/lqae147] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2024] [Revised: 08/23/2024] [Accepted: 10/17/2024] [Indexed: 11/07/2024] Open
Abstract
Mycobacteria regulate transcript degradation to facilitate adaptation to environmental stress. However, the mechanisms underlying this regulation are unknown. Here we sought to gain understanding of the mechanisms controlling mRNA stability by investigating the transcript properties associated with variance in transcript stability and stress-induced transcript stabilization. We measured mRNA half-lives transcriptome-wide in Mycolicibacterium smegmatis in log phase growth and hypoxia-induced growth arrest. The transcriptome was globally stabilized in response to hypoxia, but transcripts of essential genes were generally stabilized more than those of non-essential genes. We then developed machine learning models that enabled us to identify the non-linear collective effect of a compendium of transcript properties on transcript stability and stabilization. We identified properties that were more predictive of half-life in log phase as well as properties that were more predictive in hypoxia, and many of these varied between leadered and leaderless transcripts. In summary, we found that transcript properties are differentially associated with transcript stability depending on both the transcript type and the growth condition. Our results reveal the complex interplay between transcript features and microenvironment that shapes transcript stability in mycobacteria.
Collapse
Affiliation(s)
- Huaming Sun
- Program in Bioinformatics and Computational Biology, Worcester Polytechnic Institute, Worcester, MA 01609, USA
| | - Diego A Vargas-Blanco
- Department of Biology and Biotechnology, Worcester Polytechnic Institute, Worcester, MA 01609, USA
| | - Ying Zhou
- Department of Biology and Biotechnology, Worcester Polytechnic Institute, Worcester, MA 01609, USA
| | - Catherine S Masiello
- Department of Biology and Biotechnology, Worcester Polytechnic Institute, Worcester, MA 01609, USA
| | - Jessica M Kelly
- Program in Bioinformatics and Computational Biology, Worcester Polytechnic Institute, Worcester, MA 01609, USA
- Department of Biology and Biotechnology, Worcester Polytechnic Institute, Worcester, MA 01609, USA
| | - Justin K Moy
- Program in Bioinformatics and Computational Biology, Worcester Polytechnic Institute, Worcester, MA 01609, USA
| | - Dmitry Korkin
- Program in Bioinformatics and Computational Biology, Worcester Polytechnic Institute, Worcester, MA 01609, USA
| | - Scarlet S Shell
- Program in Bioinformatics and Computational Biology, Worcester Polytechnic Institute, Worcester, MA 01609, USA
- Department of Biology and Biotechnology, Worcester Polytechnic Institute, Worcester, MA 01609, USA
| |
Collapse
|
21
|
Gilliot PA, Gorochowski TE. Transfer learning for cross-context prediction of protein expression from 5'UTR sequence. Nucleic Acids Res 2024; 52:e58. [PMID: 38864396 PMCID: PMC11260469 DOI: 10.1093/nar/gkae491] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2023] [Revised: 04/28/2024] [Accepted: 05/28/2024] [Indexed: 06/13/2024] Open
Abstract
Model-guided DNA sequence design can accelerate the reprogramming of living cells. It allows us to engineer more complex biological systems by removing the need to physically assemble and test each potential design. While mechanistic models of gene expression have seen some success in supporting this goal, data-centric, deep learning-based approaches often provide more accurate predictions. This accuracy, however, comes at a cost - a lack of generalization across genetic and experimental contexts that has limited their wider use outside the context in which they were trained. Here, we address this issue by demonstrating how a simple transfer learning procedure can effectively tune a pre-trained deep learning model to predict protein translation rate from 5' untranslated region (5'UTR) sequence for diverse contexts in Escherichia coli using a small number of new measurements. This allows for important model features learnt from expensive massively parallel reporter assays to be easily transferred to new settings. By releasing our trained deep learning model and complementary calibration procedure, this study acts as a starting point for continually refined model-based sequence design that builds on previous knowledge and future experimental efforts.
Collapse
Affiliation(s)
- Pierre-Aurélien Gilliot
- School of Biological Sciences, University of Bristol, 24 Tyndall Avenue, Bristol BS8 1TQ, UK
| | - Thomas E Gorochowski
- School of Biological Sciences, University of Bristol, 24 Tyndall Avenue, Bristol BS8 1TQ, UK
- BrisEngBio, School of Chemistry, University of Bristol, Cantock’s Close, Bristol BS8 1TS, UK
| |
Collapse
|
22
|
Wolf ME, Eltis LD. Preparation of reductases for multicomponent oxygenases. Methods Enzymol 2024; 703:65-85. [PMID: 39261004 DOI: 10.1016/bs.mie.2024.05.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/13/2024]
Abstract
Oxygenases catalyze crucial reactions throughout all domains of life, cleaving molecular oxygen (O2) and inserting one or two of its atoms into organic substrates. Many oxygenases, including those in the cytochrome P450 (P450) and Rieske oxygenase enzyme families, function as multicomponent systems, which require one or more redox partners to transfer electrons to the catalytic center. As the identity of the reductase can change the reactivity of the oxygenase, characterization of the latter with its cognate redox partners is critical. However, the isolation of the native redox partner or partners is often challenging. Here, we report the preparation and characterization of PbdB, the native reductase partner of PbdA, a bacterial P450 enzyme that catalyzes the O-demethylation of para-methoxylated benzoates. Through production in a rhodoccocal host, codon optimization, and anaerobic purification, this procedure overcomes conventional challenges in redox partner production and allows for robust oxygenase characterization with its native redox partner. Key lessons learned here, including the value of production in a related host and rare codon effects are applicable to a broad range of Fe-dependent oxygenases and their components.
Collapse
Affiliation(s)
- Megan E Wolf
- Microbiology and Immunology, The University of British Columbia, Vancouver, BC, Canada
| | - Lindsay D Eltis
- Microbiology and Immunology, The University of British Columbia, Vancouver, BC, Canada.
| |
Collapse
|
23
|
Radrizzani S, Kudla G, Izsvák Z, Hurst LD. Selection on synonymous sites: the unwanted transcript hypothesis. Nat Rev Genet 2024; 25:431-448. [PMID: 38297070 DOI: 10.1038/s41576-023-00686-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/04/2023] [Indexed: 02/02/2024]
Abstract
Although translational selection to favour codons that match the most abundant tRNAs is not readily observed in humans, there is nonetheless selection in humans on synonymous mutations. We hypothesize that much of this synonymous site selection can be explained in terms of protection against unwanted RNAs - spurious transcripts, mis-spliced forms or RNAs derived from transposable elements or viruses. We propose not only that selection on synonymous sites functions to reduce the rate of creation of unwanted transcripts (for example, through selection on exonic splice enhancers and cryptic splice sites) but also that high-GC content (but low-CpG content), together with intron presence and position, is both particular to functional native mRNAs and used to recognize transcripts as native. In support of this hypothesis, transcription, nuclear export, liquid phase condensation and RNA degradation have all recently been shown to promote GC-rich transcripts and suppress AU/CpG-rich ones. With such 'traps' being set against AU/CpG-rich transcripts, the codon usage of native genes has, in turn, evolved to avoid such suppression. That parallel filters against AU/CpG-rich transcripts also affect the endosomal import of RNAs further supports the unwanted transcript hypothesis of synonymous site selection and explains the similar design rules that have enabled the successful use of transgenes and RNA vaccines.
Collapse
Affiliation(s)
- Sofia Radrizzani
- Milner Centre for Evolution, Department of Life Sciences, University of Bath, Bath, UK
- Milner Therapeutics Institute, Jeffrey Cheah Biomedical Centre, University of Cambridge, Cambridge, UK
| | - Grzegorz Kudla
- MRC Human Genetics Unit, Institute for Genetics and Cancer, The University of Edinburgh, Edinburgh, UK
| | - Zsuzsanna Izsvák
- Max-Delbrück-Center for Molecular Medicine in the Helmholtz Society, Berlin, Germany
| | - Laurence D Hurst
- Milner Centre for Evolution, Department of Life Sciences, University of Bath, Bath, UK.
| |
Collapse
|
24
|
Fuse-Murakami T, Matsumoto R, Kanamori T. N-Terminal Amino Acid Affects the Translation Efficiency at Lower Temperatures in a Reconstituted Protein Synthesis System. Int J Mol Sci 2024; 25:5264. [PMID: 38791303 PMCID: PMC11120837 DOI: 10.3390/ijms25105264] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2024] [Revised: 05/07/2024] [Accepted: 05/10/2024] [Indexed: 05/26/2024] Open
Abstract
The Escherichia coli (E. coli)-based protein synthesis using recombinant elements (PURE) system is a cell-free protein synthesis system reconstituted from purified factors essential for E. coli translation. The PURE system is widely used for basic and synthetic biology applications. One of the major challenges associated with the PURE system is that the protein yield of the system varies depending on the protein. Studies have reported that the efficiency of translation is significantly affected by nucleotide and amino acid sequences, especially in the N-terminal region. Here, we investigated the inherent effect of various N-terminal sequences on protein synthesis using the PURE system. We found that a single amino acid substitution in the N-terminal region significantly altered translation efficiency in the PURE system, especially at low temperatures. This result gives us useful suggestions for the expression of the protein of interest in vitro and in vivo.
Collapse
Affiliation(s)
| | | | - Takashi Kanamori
- GeneFrontier Corporation, 273-1 Kashiwa, Kashiwa-shi 277-0005, Chiba, Japan
| |
Collapse
|
25
|
Zabolotskii AI, Riabkova NS. A new look at the fluorescent protein-based approach for identifying optimal coding sequence for recombinant protein expression in E. coli. Biotechnol J 2024; 19:e2300343. [PMID: 38622786 DOI: 10.1002/biot.202300343] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Revised: 03/06/2024] [Accepted: 03/20/2024] [Indexed: 04/17/2024]
Abstract
Due to the degeneracy of the genetic code, most amino acids are encoded by several codons. The choice among synonymous codons at the N-terminus of genes has a profound effect on protein expression in Escherichia coli. This is often explained by the different contributions of synonymous codons to mRNA secondary structure formation. Strong secondary structures at the 5'-end of mRNA interfere with ribosome binding and affect the process of translation initiation. In silico optimization of the gene 5'-end can significantly increase the level of protein expression; however, this method is not always effective due to the uncertainty of the exact mechanism by which synonymous substitutions affect expression; thus, it may produce nonoptimal variants as well as miss some of the best producers. In this paper, an alternative approach is proposed based on screening a partially randomized library of expression constructs comprising hundreds of selected synonymous variants. The effect of such substitutions was evaluated using the gene of interest fused to the reporter gene of the fluorescent protein with subsequent screening for the most promising candidates according to the reporter's signal intensity. The power of the approach is demonstrated by a significant increase in the prokaryotic expression of three proteins: canine cystatin C, human BCL2-associated athanogene 3 and human cardiac troponin I. This simple approach was suggested which may provide an efficient, easy, and inexpensive optimization method for poorly expressed proteins in bacteria.
Collapse
|
26
|
Köppl C, Buchinger W, Striedner G, Cserjan-Puschmann M. Modifications of the 5' region of the CASPON TM tag's mRNA further enhance soluble recombinant protein production in Escherichia coli. Microb Cell Fact 2024; 23:86. [PMID: 38509572 PMCID: PMC10953258 DOI: 10.1186/s12934-024-02350-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2023] [Accepted: 02/27/2024] [Indexed: 03/22/2024] Open
Abstract
BACKGROUND Escherichia coli is one of the most commonly used host organisms for the production of biopharmaceuticals, as it allows for cost-efficient and fast recombinant protein expression. However, challenging proteins are often produced with low titres or as inclusion bodies, and the manufacturing process needs to be developed individually for each protein. Recently, we developed the CASPONTM technology, a generic fusion tag-based platform process for high-titer soluble expression including a standardized downstream processing and highly specific enzymatic cleavage of the fusion tag. To assess potential strategies for further improvement of the N-terminally fused CASPONTM tag, we modified the 5'UTR and 5' region of the tag-coding mRNA to optimize the ribosome-mRNA interactions. RESULTS In the present work, we found that by modifying the 5'UTR sequence of a pET30acer plasmid-based system, expression of the fusion protein CASPONTM-tumour necrosis factor α was altered in laboratory-scale carbon-limited fed-batch cultivations, but no significant increase in expression titre was achieved. Translation efficiency was highest for a construct carrying an expression enhancer element and additionally possessing a very favourable interaction energy between ribosome and mRNA (∆Gtotal). However, a construct with comparatively low transcriptional efficiency, which lacked the expression enhancer sequence and carried the most favourable ∆Gtotal tested, led to the highest recombinant protein formation alongside the reference pET30a construct. Furthermore, we found, that by introducing synonymous mutations within the nucleotide sequence of the T7AC element of the CASPONTM tag, utilizing a combination of rare and non-rare codons, the free folding energy of the nucleotides at the 5' end (-4 to + 37) of the transcript encoding the CASPONTM tag increased by 6 kcal/mol. Surprisingly, this new T7ACrare variant led to improved recombinant protein titres by 1.3-fold up to 5.3-fold, shown with three industry-relevant proteins in lab-scale carbon limited fed-batch fermentations under industrially relevant conditions. CONCLUSIONS This study reveals some of the complex interdependencies between the ribosome and mRNA that govern recombinant protein expression. By modifying the 5'UTR to obtain an optimized interaction energy between the mRNA and the ribosome (ΔGtotal), transcript levels were changed, highlighting the different translation efficiencies of individual transcripts. It was shown that the highest recombinant titre was not obtained by the construct with the most efficient translation but by a construct with a generally high transcript amount coupled with a favourable ΔGtotal. Furthermore, an unexpectedly high potential to enhance expression by introducing silent mutations including multiple rare codons into the 5'end of the CAPONTM tag's mRNA was identified. Although the titres of the fusion proteins were dramatically increased, no formation of inclusion bodies or negative impact on cell growth was observed. We hypothesize that the drastic increase in titre is most likely caused by better ribosomal binding site accessibility. Our study, which demonstrates the influence of changes in ribosome-mRNA interactions on protein expression under industrially relevant production conditions, opens the door to the applicability of the new T7ACrare tag in biopharmaceutical industry using the CASPONTM platform process.
Collapse
Affiliation(s)
- Christoph Köppl
- Austrian Centre of Industrial Biotechnology, Muthgasse 18, Vienna, 1190, Austria
- Department of Biotechnology, Institute of Bioprocess Science and Engineering, University of Natural Resources and Life Sciences, Muthgasse 18, Vienna, 1190, Austria
| | - Wolfgang Buchinger
- Biopharma Austria, Development Operations, Boehringer Ingelheim Regional Center Vienna GmbH & Co KG, Dr.-Boehringer-Gasse 5-11, Vienna, A-1121, Austria
| | - Gerald Striedner
- Austrian Centre of Industrial Biotechnology, Muthgasse 18, Vienna, 1190, Austria
- Department of Biotechnology, Institute of Bioprocess Science and Engineering, University of Natural Resources and Life Sciences, Muthgasse 18, Vienna, 1190, Austria
| | - Monika Cserjan-Puschmann
- Austrian Centre of Industrial Biotechnology, Muthgasse 18, Vienna, 1190, Austria.
- Department of Biotechnology, Institute of Bioprocess Science and Engineering, University of Natural Resources and Life Sciences, Muthgasse 18, Vienna, 1190, Austria.
| |
Collapse
|
27
|
Eskandari A, Nezhad NG, Leow TC, Rahman MBA, Oslan SN. Essential factors, advanced strategies, challenges, and approaches involved for efficient expression of recombinant proteins in Escherichia coli. Arch Microbiol 2024; 206:152. [PMID: 38472371 DOI: 10.1007/s00203-024-03871-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2023] [Revised: 12/31/2023] [Accepted: 01/25/2024] [Indexed: 03/14/2024]
Abstract
Producing recombinant proteins is a major accomplishment of biotechnology in the past century. Heterologous hosts, either eukaryotic or prokaryotic, are used for the production of these proteins. The utilization of microbial host systems continues to dominate as the most efficient and affordable method for biotherapeutics and food industry productions. Hence, it is crucial to analyze the limitations and advantages of microbial hosts to enhance the efficient production of recombinant proteins on a large scale. E. coli is widely used as a host for the production of recombinant proteins. Researchers have identified certain obstacles with this host, and given the growing demand for recombinant protein production, there is an immediate requirement to enhance this host. The following review discusses the elements contributing to the manifestation of recombinant protein. Subsequently, it sheds light on innovative approaches aimed at improving the expression of recombinant protein. Lastly, it delves into the obstacles and optimization methods associated with translation, mentioning both cis-optimization and trans-optimization, producing soluble recombinant protein, and engineering the metal ion transportation. In this context, a comprehensive description of the distinct features will be provided, and this knowledge could potentially enhance the expression of recombinant proteins in E. coli.
Collapse
Affiliation(s)
- Azadeh Eskandari
- Enzyme and Microbial Technology Research Centre, Universiti Putra Malaysia, 43400 UPM, Serdang, Selangor, Malaysia
- Department of Biochemistry, FacultyofBiotechnologyand BiomolecularSciences, Universiti Putra Malaysia, 43400 UPM, Serdang, Selangor, Malaysia
| | - Nima Ghahremani Nezhad
- Enzyme and Microbial Technology Research Centre, Universiti Putra Malaysia, 43400 UPM, Serdang, Selangor, Malaysia
- Department of Cell and Molecular Biology, Faculty of Biotechnology and Biomolecular Sciences, Universiti Putra Malaysia, 43400 UPM, Serdang, Selangor, Malaysia
| | - Thean Chor Leow
- Enzyme and Microbial Technology Research Centre, Universiti Putra Malaysia, 43400 UPM, Serdang, Selangor, Malaysia
- Department of Cell and Molecular Biology, Faculty of Biotechnology and Biomolecular Sciences, Universiti Putra Malaysia, 43400 UPM, Serdang, Selangor, Malaysia
- Enzyme Technology and X-Ray Crystallography Laboratory, VacBio 5, Institute of Bioscience, Universiti Putra Malaysia, 43400 UPM, Serdang, Selangor, Malaysia
| | | | - Siti Nurbaya Oslan
- Enzyme and Microbial Technology Research Centre, Universiti Putra Malaysia, 43400 UPM, Serdang, Selangor, Malaysia.
- Department of Biochemistry, FacultyofBiotechnologyand BiomolecularSciences, Universiti Putra Malaysia, 43400 UPM, Serdang, Selangor, Malaysia.
- Enzyme Technology and X-Ray Crystallography Laboratory, VacBio 5, Institute of Bioscience, Universiti Putra Malaysia, 43400 UPM, Serdang, Selangor, Malaysia.
| |
Collapse
|
28
|
Ma Z, Wang W, Zhang D, Zhang Y, Zhao Y, Li X, Zhao L, Cheng J, Xu D, Yang X, Liu J, He L, Chen Z, Gong P, Zhang X. Polymorphisms of PLIN1 and MOGAT1 genes and their association with feed efficiency in Hu sheep. Gene 2024; 897:148072. [PMID: 38081333 DOI: 10.1016/j.gene.2023.148072] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2023] [Revised: 11/22/2023] [Accepted: 12/08/2023] [Indexed: 12/21/2023]
Abstract
Feed cost accounts for a high proportion of sheep production, and improving sheep's utilization of feed will reduce production costs and improve economic benefits. The purpose of this study was to investigate the expression characteristics of PLIN1 and MOGAT1 genes and the relationship between their polymorphisms and feed efficiency traits in Hu sheep, and to find molecular Genetic marker that can be used in breeding. The expression levels of PLIN1 and MOGAT1 genes in various tissues were determined using quantitative real-time PCR (qRT-PCR). The results showed that PLIN1 and MOGAT1 genes were widely expressed in heart, liver, spleen, lungs, kidneys, rumen, duodenum, muscle, lymph, and tail fat. The PLIN1 gene had the highest expression level in in the tail fat compared to the other nine tissues. The expression levels of MOGAT1 gene in liver, tail fat, lung and heart was significantly higher than in kidney, muscle and lymph. The expression level of MOGAT1 was lowest in muscle compared to the other tissues (heart, liver, spleen, lung, rumen and tail fat). We recorded the body weight (BW80 and BW180) and feed intake (FI) information of 985 male Hu sheep at 80 and 180 days of age, and calculated the daily average feed intake (ADFI), average daily gain (ADG), and feed conversion rate (FCR) from 80 to 180 days of age. Two intronic mutations, g.18517910 A > G and g.224856118 G > C, were identified in PLIN1 and MOGAT1 genes by PCR amplification and Sanger sequencing. MassARRAY ® SNP detection technology was used to genotype the DNA of 985 Hu sheep and analyze its association with feed efficiency traits. The results showed that the SNP g.18517910 A > G was significantly associated with BW80, BW180, FI, ADFI and FCR (P < 0.05), while SNP g.2248561118 G > C was significantly associated with FCR (P < 0.05). Meanwhile, significant differences were also observed in different combinations of genotypes (P < 0.05). Therefore, these two polymorphic loci can serve as candidate molecular markers for improving feed utilization efficiency in Hu sheep.
Collapse
Affiliation(s)
- Zongwu Ma
- College of Animal Science and Technology, Gansu Agricultural University, Lanzhou, Gansu 730070, China
| | - Weimin Wang
- The State Key Laboratory of Grassland Agro-ecosystems, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou Gansu 730020, China
| | - Deyin Zhang
- The State Key Laboratory of Grassland Agro-ecosystems, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou Gansu 730020, China
| | - Yukun Zhang
- The State Key Laboratory of Grassland Agro-ecosystems, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou Gansu 730020, China
| | - Yuan Zhao
- The State Key Laboratory of Grassland Agro-ecosystems, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou Gansu 730020, China
| | - Xiaolong Li
- The State Key Laboratory of Grassland Agro-ecosystems, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou Gansu 730020, China
| | - Liming Zhao
- The State Key Laboratory of Grassland Agro-ecosystems, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou Gansu 730020, China
| | - Jiangbo Cheng
- The State Key Laboratory of Grassland Agro-ecosystems, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou Gansu 730020, China
| | - Dan Xu
- College of Animal Science and Technology, Gansu Agricultural University, Lanzhou, Gansu 730070, China
| | - Xiaobin Yang
- College of Animal Science and Technology, Gansu Agricultural University, Lanzhou, Gansu 730070, China
| | - Jia Liu
- College of Animal Science and Technology, Gansu Agricultural University, Lanzhou, Gansu 730070, China
| | - Lijuan He
- College of Animal Science and Technology, Gansu Agricultural University, Lanzhou, Gansu 730070, China
| | - Zhanyu Chen
- College of Animal Science and Technology, Gansu Agricultural University, Lanzhou, Gansu 730070, China
| | - Ping Gong
- Institute of Animal Husbandry Quality Standards, Xinjiang Academy of Animal Science, Urumqi, 830057, China.
| | - Xiaoxue Zhang
- College of Animal Science and Technology, Gansu Agricultural University, Lanzhou, Gansu 730070, China.
| |
Collapse
|
29
|
Love AM, Nair NU. Specific codons control cellular resources and fitness. SCIENCE ADVANCES 2024; 10:eadk3485. [PMID: 38381824 PMCID: PMC10881034 DOI: 10.1126/sciadv.adk3485] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/16/2023] [Accepted: 01/18/2024] [Indexed: 02/23/2024]
Abstract
As cellular engineering progresses from simply overexpressing proteins to imparting complex phenotypes through multigene expression, judicious appropriation of cellular resources is essential. Since codon use is degenerate and biased, codons may control cellular resources at a translational level. We investigate how partitioning transfer RNA (tRNA) resources by incorporating dissimilar codon usage can drastically alter interdependence of expression level and burden on the host. By isolating the effect of individual codons' use during translation elongation while eliminating confounding factors, we show that codon choice can trans-regulate fitness of the host and expression of other heterologous or native genes. We correlate specific codon usage patterns with host fitness and derive a coding scheme for multigene expression called the Codon Health Index (CHI, χ). This empirically derived coding scheme (χ) enables the design of multigene expression systems that avoid catastrophic cellular burden and is robust across several proteins and conditions.
Collapse
Affiliation(s)
- Aaron M. Love
- Manus Bio, Waltham, MA 02453, USA
- Department of Chemical and Biological Engineering, Tufts University, Medford, MA 02155, USA
| | - Nikhil U. Nair
- Department of Chemical and Biological Engineering, Tufts University, Medford, MA 02155, USA
| |
Collapse
|
30
|
Lai HY, Cooper TF. Interaction with a phage gene underlie costs of a β-lactamase. mBio 2024; 15:e0277623. [PMID: 38194254 PMCID: PMC10865808 DOI: 10.1128/mbio.02776-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Accepted: 12/05/2023] [Indexed: 01/10/2024] Open
Abstract
The fitness cost of an antibiotic resistance gene (ARG) can differ across host strains, creating refuges that allow the maintenance of an ARG in the absence of direct selection for its resistance phenotype. Despite the importance of such ARG-host interactions for predicting ARG dynamics, the basis of ARG fitness costs and their variability between hosts are not well understood. We determined the genetic basis of a host-dependent cost of a β-lactamase, blaTEM-116*, that conferred a significant cost in one Escherichia coli strain but was close to neutral in 11 other Escherichia spp. strains. Selection of a blaTEM-116*-encoding plasmid in the strain in which it initially had a high cost resulted in rapid and parallel compensation for that cost through mutations in a P1-like phage gene, relAP1. When the wild-type relAP1 gene was added to a strain in which it was not present and in which blaTEM-116* was neutral, it caused the ARG to become costly. Thus, relAP1 is both necessary and sufficient to explain blaTEM-116* costs in at least some host backgrounds. To our knowledge, these findings represent the first demonstrated case of the cost of an ARG being influenced by a genetic interaction with a phage gene. The interaction between a phage gene and a plasmid-borne ARG highlights the complexity of selective forces determining the maintenance and spread of ARGs and, by extension, encoding phage and plasmids in natural bacterial communities.IMPORTANCEAntibiotic resistance genes (ARGs) play a major role in the increasing problem of antibiotic resistance in clinically relevant bacteria. Selection of these genes occurs in the presence of antibiotics, but their eventual success also depends on the sometimes substantial costs they impose on host bacteria in antibiotic-free environments. We evolved an ARG that confers resistance to penicillin-type antibiotics in one host in which it did confer a cost and in one host in which it did not. We found that costs were rapidly and consistently reduced through parallel genetic changes in a gene encoded by a phage that was infecting the costly host. The unmutated version of this gene was sufficient to cause the ARG to confer a cost in a host in which it was originally neutral, demonstrating an antagonism between the two genetic elements and underlining the range and complexity of pressures determining ARG dynamics in natural populations.
Collapse
Affiliation(s)
- Huei-Yi Lai
- School of Natural Sciences, Massey University, Auckland, New Zealand
| | - Tim F. Cooper
- School of Natural Sciences, Massey University, Auckland, New Zealand
| |
Collapse
|
31
|
Yang DD, Rusch LM, Widney KA, Morgenthaler AB, Copley SD. Synonymous edits in the Escherichia coli genome have substantial and condition-dependent effects on fitness. Proc Natl Acad Sci U S A 2024; 121:e2316834121. [PMID: 38252823 PMCID: PMC10835057 DOI: 10.1073/pnas.2316834121] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Accepted: 12/19/2023] [Indexed: 01/24/2024] Open
Abstract
CRISPR-Cas-based genome editing is widely used in bacteria at scales ranging from construction of individual mutants to massively parallel libraries. This procedure relies on guide RNA-directed cleavage of the genome followed by repair with a template that introduces a desired mutation along with synonymous "immunizing" mutations to prevent re-cleavage of the genome after editing. Because the immunizing mutations do not change the protein sequence, they are often assumed to be neutral. However, synonymous mutations can change mRNA structures in ways that alter levels of the encoded proteins. We have tested the assumption that immunizing mutations are neutral by constructing a library of over 50,000 edits that consist of only synonymous mutations in Escherichia coli. Thousands of edits had substantial effects on fitness during growth of E. coli on acetate, a poor carbon source that is toxic at high concentrations. The percentage of high-impact edits varied considerably between genes and at different positions within genes. We reconstructed clones with high-impact edits and found that 69% indeed had significant effects on growth in acetate. Interestingly, fewer edits affected fitness during growth in glucose, a preferred carbon source, suggesting that changes in protein expression caused by synonymous mutations may be most important when an organism encounters challenging conditions. Finally, we showed that synonymous edits can have widespread effects; a synonymous edit at the 5' end of ptsI altered expression of hundreds of genes. Our results suggest that the synonymous immunizing edits introduced during CRISPR-Cas-based genome editing should not be assumed to be innocuous.
Collapse
Affiliation(s)
- Dong-Dong Yang
- Department of Molecular, Cellular and Developmental Biology and the Cooperative Institute for Research in Environmental Sciences, University of Colorado Boulder, Boulder, CO80309
| | - Leo M. Rusch
- Department of Molecular, Cellular and Developmental Biology and the Cooperative Institute for Research in Environmental Sciences, University of Colorado Boulder, Boulder, CO80309
| | - Karl A. Widney
- Department of Molecular, Cellular and Developmental Biology and the Cooperative Institute for Research in Environmental Sciences, University of Colorado Boulder, Boulder, CO80309
| | - Andrew B. Morgenthaler
- Department of Molecular, Cellular and Developmental Biology and the Cooperative Institute for Research in Environmental Sciences, University of Colorado Boulder, Boulder, CO80309
- Amyris, Inc., Emeryville, CA94608
| | - Shelley D. Copley
- Department of Molecular, Cellular and Developmental Biology and the Cooperative Institute for Research in Environmental Sciences, University of Colorado Boulder, Boulder, CO80309
| |
Collapse
|
32
|
Rao X, Li D, Su Z, Nomura CT, Chen S, Wang Q. A smart RBS library and its prediction model for robust and accurate fine-tuning of gene expression in Bacillus species. Metab Eng 2024; 81:1-9. [PMID: 37951459 DOI: 10.1016/j.ymben.2023.11.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2023] [Revised: 10/17/2023] [Accepted: 11/05/2023] [Indexed: 11/14/2023]
Abstract
Bacillus species, such as Bacillus subtilis and Bacillus licheniformis, are important industrial bacteria. However, there is a lack of standardized and predictable genetic tools for convenient and reproducible assembly of genetic modules in Bacillus species to realize their full potential. In this study, we constructed a Ribosome Binding Site (RBS) library in B. licheniformis, which provides incremental regulation of expression levels over a 104-fold range. Additionally, we developed a model to quantify the resulting translation rates. We successfully demonstrated the robust expression of various target genes using the RBS library and showed that the model accurately predicts the translation rates of arbitrary coding genes. Importantly, we also extended the use of the RBS library and prediction model to B. subtilis, B. thuringiensis, and B. amyloliquefacie. The versatility of the RBS library and its prediction model enables quantification of biological behavior, facilitating reliable forward engineering of gene expression.
Collapse
Affiliation(s)
- Xiaolan Rao
- State Key Laboratory of Biocatalysis and Enzyme Engineering, Environmental Microbial Technology Center of Hubei Province, Hubei University, Wuhan 430062, PR China
| | - Dian Li
- State Key Laboratory of Biocatalysis and Enzyme Engineering, Environmental Microbial Technology Center of Hubei Province, Hubei University, Wuhan 430062, PR China
| | - Zhaowei Su
- State Key Laboratory of Biocatalysis and Enzyme Engineering, Environmental Microbial Technology Center of Hubei Province, Hubei University, Wuhan 430062, PR China
| | | | - Shouwen Chen
- State Key Laboratory of Biocatalysis and Enzyme Engineering, Environmental Microbial Technology Center of Hubei Province, Hubei University, Wuhan 430062, PR China.
| | - Qin Wang
- State Key Laboratory of Biocatalysis and Enzyme Engineering, Environmental Microbial Technology Center of Hubei Province, Hubei University, Wuhan 430062, PR China.
| |
Collapse
|
33
|
Bryant OJ, Lastovka F, Powell J, Chung BYW. The distinct translational landscapes of gram-negative Salmonella and gram-positive Listeria. Nat Commun 2023; 14:8167. [PMID: 38071303 PMCID: PMC10710512 DOI: 10.1038/s41467-023-43759-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2023] [Accepted: 11/17/2023] [Indexed: 12/18/2023] Open
Abstract
Translational control in pathogenic bacteria is fundamental to gene expression and affects virulence and other infection phenotypes. We used an enhanced ribosome profiling protocol coupled with parallel transcriptomics to capture accurately the global translatome of two evolutionarily distant pathogenic bacteria-the Gram-negative bacterium Salmonella and the Gram-positive bacterium Listeria. We find that the two bacteria use different mechanisms to translationally regulate protein synthesis. In Salmonella, in addition to the expected correlation between translational efficiency and cis-regulatory features such as Shine-Dalgarno (SD) strength and RNA secondary structure around the initiation codon, our data reveal an effect of the 2nd and 3rd codons, where the presence of tandem lysine codons (AAA-AAA) enhances translation in both Salmonella and E. coli. Strikingly, none of these features are seen in efficiently translated Listeria transcripts. Instead, approximately 20% of efficiently translated Listeria genes exhibit 70 S footprints seven nt upstream of the authentic start codon, suggesting that these genes may be subject to a novel translational initiation mechanism. Our results show that SD strength is not a direct hallmark of translational efficiency in all bacteria. Instead, Listeria has evolved additional mechanisms to control gene expression level that are distinct from those utilised by Salmonella and E. coli.
Collapse
Affiliation(s)
- Owain J Bryant
- Department of Pathology, University of Cambridge, Tennis Court Road, Cambridge, CB2 1QP, UK
- Centre for Structural Biology, National Cancer Institute, 21702, Frederick, MD, USA
| | - Filip Lastovka
- Department of Pathology, University of Cambridge, Tennis Court Road, Cambridge, CB2 1QP, UK
| | - Jessica Powell
- Department of Pathology, University of Cambridge, Tennis Court Road, Cambridge, CB2 1QP, UK
| | - Betty Y-W Chung
- Department of Pathology, University of Cambridge, Tennis Court Road, Cambridge, CB2 1QP, UK.
| |
Collapse
|
34
|
Bajaj P, Bhasin M, Varadarajan R. Molecular bases for strong phenotypic effects of single synonymous codon substitutions in the E. coli ccdB toxin gene. BMC Genomics 2023; 24:732. [PMID: 38049728 PMCID: PMC10694988 DOI: 10.1186/s12864-023-09817-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2023] [Accepted: 11/18/2023] [Indexed: 12/06/2023] Open
Abstract
BACKGROUND Single synonymous codon mutations typically have only minor or no effects on gene function. Here, we estimate the effects on cell growth of ~ 200 single synonymous codon mutations in an operonic context by mutating almost all positions of ccdB, the 101-residue long cytotoxin of the ccdAB Toxin-Antitoxin (TA) operon to most degenerate codons. Phenotypes were assayed by transforming the mutant library into CcdB sensitive and resistant E. coli strains, isolating plasmid pools, and subjecting them to deep sequencing. Since autoregulation is a hallmark of TA operons, phenotypes obtained for ccdB synonymous mutants after transformation in a RelE toxin reporter strain followed by deep sequencing provided information on the amount of CcdAB complex formed. RESULTS Synonymous mutations in the N-terminal region involved in translation initiation showed the strongest non-neutral phenotypic effects. We observe an interplay of numerous factors, namely, location of the codon, codon usage, t-RNA abundance, formation of anti-Shine Dalgarno sequences, predicted transcript secondary structure, and evolutionary conservation in determining phenotypic effects of ccdB synonymous mutations. Incorporation of an N-terminal, hyperactive synonymous mutation, in the background of the single synonymous codon mutant library sufficiently increased translation initiation, such that mutational effects on either folding or termination of translation became more apparent. Introduction of putative pause sites not only affects the translational rate, but might also alter the folding kinetics of the protein in vivo. CONCLUSION In summary, the study provides novel insights into diverse mechanisms by which synonymous mutations modulate gene function. This information is useful in optimizing heterologous gene expression in E. coli and understanding the molecular bases for alteration in gene expression that arise due to synonymous mutations.
Collapse
Affiliation(s)
- Priyanka Bajaj
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, 560012, India
- Present address: Department of Bioengineering and Therapeutic Sciences, University of CA - San Francisco, San Francisco, CA, 94158, USA
| | - Munmun Bhasin
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, 560012, India
| | - Raghavan Varadarajan
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, 560012, India.
| |
Collapse
|
35
|
Foo JL, Kitano S, Susanto AV, Jin Z, Lin Y, Luo Z, Huang L, Liang Z, Mitchell LA, Yang K, Wong A, Cai Y, Cai J, Stracquadanio G, Bader JS, Boeke JD, Dai J, Chang MW. Establishing chromosomal design-build-test-learn through a synthetic chromosome and its combinatorial reconfiguration. CELL GENOMICS 2023; 3:100435. [PMID: 38020970 PMCID: PMC10667554 DOI: 10.1016/j.xgen.2023.100435] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/27/2023] [Revised: 08/19/2023] [Accepted: 10/06/2023] [Indexed: 12/01/2023]
Abstract
Chromosome-level design-build-test-learn cycles (chrDBTLs) allow systematic combinatorial reconfiguration of chromosomes with ease. Here, we established chrDBTL with a redesigned synthetic Saccharomyces cerevisiae chromosome XV, synXV. We designed and built synXV to harbor strategically inserted features, modified elements, and synonymously recoded genes throughout the chromosome. Based on the recoded chromosome, we developed a method to enable chrDBTL: CRISPR-Cas9-mediated mitotic recombination with endoreduplication (CRIMiRE). CRIMiRE allowed the creation of customized wild-type/synthetic combinations, accelerating genotype-phenotype mapping and synthetic chromosome redesign. We also leveraged synXV as a "build-to-learn" model organism for translation studies by ribosome profiling. We conducted a locus-to-locus comparison of ribosome occupancy between synXV and the wild-type chromosome, providing insight into the effects of codon changes and redesigned features on translation dynamics in vivo. Overall, we established synXV as a versatile reconfigurable system that advances chrDBTL for understanding biological mechanisms and engineering strains.
Collapse
Affiliation(s)
- Jee Loon Foo
- NUS Synthetic Biology for Clinical and Technological Innovation (SynCTI), National University of Singapore, Singapore 117456, Singapore
- Synthetic Biology Translational Research Programme, Yong Loo Lin School of Medicine, National University of Singapore, Singapore 117456, Singapore
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore 117597, Singapore
- Wilmar-NUS Corporate Laboratory (WIL@NUS), National University of Singapore, Singapore 117599, Singapore
| | - Shohei Kitano
- NUS Synthetic Biology for Clinical and Technological Innovation (SynCTI), National University of Singapore, Singapore 117456, Singapore
- Synthetic Biology Translational Research Programme, Yong Loo Lin School of Medicine, National University of Singapore, Singapore 117456, Singapore
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore 117597, Singapore
- Wilmar-NUS Corporate Laboratory (WIL@NUS), National University of Singapore, Singapore 117599, Singapore
| | - Adelia Vicanatalita Susanto
- NUS Synthetic Biology for Clinical and Technological Innovation (SynCTI), National University of Singapore, Singapore 117456, Singapore
- Synthetic Biology Translational Research Programme, Yong Loo Lin School of Medicine, National University of Singapore, Singapore 117456, Singapore
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore 117597, Singapore
- Wilmar-NUS Corporate Laboratory (WIL@NUS), National University of Singapore, Singapore 117599, Singapore
| | - Zhu Jin
- NUS Synthetic Biology for Clinical and Technological Innovation (SynCTI), National University of Singapore, Singapore 117456, Singapore
- Synthetic Biology Translational Research Programme, Yong Loo Lin School of Medicine, National University of Singapore, Singapore 117456, Singapore
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore 117597, Singapore
- Wilmar-NUS Corporate Laboratory (WIL@NUS), National University of Singapore, Singapore 117599, Singapore
| | - Yicong Lin
- CAS Key Laboratory of Quantitative Engineering Biology, Guangdong Provincial Key Laboratory of Synthetic Genomics and Shenzhen Key Laboratory of Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Zhouqing Luo
- CAS Key Laboratory of Quantitative Engineering Biology, Guangdong Provincial Key Laboratory of Synthetic Genomics and Shenzhen Key Laboratory of Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
| | - Linsen Huang
- CAS Key Laboratory of Quantitative Engineering Biology, Guangdong Provincial Key Laboratory of Synthetic Genomics and Shenzhen Key Laboratory of Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Zhenzhen Liang
- CAS Key Laboratory of Quantitative Engineering Biology, Guangdong Provincial Key Laboratory of Synthetic Genomics and Shenzhen Key Laboratory of Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Leslie A. Mitchell
- Institute for Systems Genetics, NYU Langone Health, New York, NY 10016, USA
| | - Kun Yang
- Department of Biomedical Engineering, NYU Tandon School of Engineering, Brooklyn, NY 11201, USA
| | - Adison Wong
- NUS Synthetic Biology for Clinical and Technological Innovation (SynCTI), National University of Singapore, Singapore 117456, Singapore
- Singapore Institute of Technology, 10 Dover Drive, Singapore 138683, Singapore
| | - Yizhi Cai
- Manchester Institute of Biotechnology, University of Manchester, 131 Princess Street, Manchester M1 7DN, UK
| | - Jitong Cai
- High-Throughput Biological Center and Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21205, USA
| | - Giovanni Stracquadanio
- High-Throughput Biological Center and Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21205, USA
- School of Biological Sciences, The University of Edinburgh, Edinburgh EH9 3BF, UK
| | - Joel S. Bader
- High-Throughput Biological Center and Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21205, USA
| | - Jef D. Boeke
- Department of Biochemistry and Molecular Pharmacology, NYU Langone Health, New York, NY 10016, USA
- Institute for Systems Genetics, NYU Langone Health, New York, NY 10016, USA
- Department of Biomedical Engineering, NYU Tandon School of Engineering, Brooklyn, NY 11201, USA
| | - Junbiao Dai
- CAS Key Laboratory of Quantitative Engineering Biology, Guangdong Provincial Key Laboratory of Synthetic Genomics and Shenzhen Key Laboratory of Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Matthew Wook Chang
- NUS Synthetic Biology for Clinical and Technological Innovation (SynCTI), National University of Singapore, Singapore 117456, Singapore
- Synthetic Biology Translational Research Programme, Yong Loo Lin School of Medicine, National University of Singapore, Singapore 117456, Singapore
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore 117597, Singapore
- Wilmar-NUS Corporate Laboratory (WIL@NUS), National University of Singapore, Singapore 117599, Singapore
| |
Collapse
|
36
|
Lewin LE, Daniels KG, Hurst LD. Genes for highly abundant proteins in Escherichia coli avoid 5' codons that promote ribosomal initiation. PLoS Comput Biol 2023; 19:e1011581. [PMID: 37878567 PMCID: PMC10599525 DOI: 10.1371/journal.pcbi.1011581] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Accepted: 10/09/2023] [Indexed: 10/27/2023] Open
Abstract
In many species highly expressed genes (HEGs) over-employ the synonymous codons that match the more abundant iso-acceptor tRNAs. Bacterial transgene codon randomization experiments report, however, that enrichment with such "translationally optimal" codons has little to no effect on the resultant protein level. By contrast, consistent with the view that ribosomal initiation is rate limiting, synonymous codon usage following the 5' ATG greatly influences protein levels, at least in part by modifying RNA stability. For the design of bacterial transgenes, for simple codon based in silico inference of protein levels and for understanding selection on synonymous mutations, it would be valuable to computationally determine initiation optimality (IO) scores for codons for any given species. One attractive approach is to characterize the 5' codon enrichment of HEGs compared with the most lowly expressed genes, just as translational optimality scores of codons have been similarly defined employing the full gene body. Here we determine the viability of this approach employing a unique opportunity: for Escherichia coli there is both the most extensive protein abundance data for native genes and a unique large-scale transgene codon randomization experiment enabling objective definition of the 5' codons that cause, rather than just correlate with, high protein abundance (that we equate with initiation optimality, broadly defined). Surprisingly, the 5' ends of native genes that specify highly abundant proteins avoid such initiation optimal codons. We find that this is probably owing to conflicting selection pressures particular to native HEGs, including selection favouring low initiation rates, this potentially enabling high efficiency of ribosomal usage and low noise. While the classical HEG enrichment approach does not work, rendering simple prediction of native protein abundance from 5' codon content futile, we report evidence that initiation optimality scores derived from the transgene experiment may hold relevance for in silico transgene design for a broad spectrum of bacteria.
Collapse
Affiliation(s)
- Loveday E. Lewin
- The Milner Centre for Evolution, Department of Life Sciences, University of Bath, Bath, United Kingdom
| | - Kate G. Daniels
- The Milner Centre for Evolution, Department of Life Sciences, University of Bath, Bath, United Kingdom
| | - Laurence D. Hurst
- The Milner Centre for Evolution, Department of Life Sciences, University of Bath, Bath, United Kingdom
| |
Collapse
|
37
|
Umemoto S, Kondo T, Fujino T, Hayashi G, Murakami H. Large-scale analysis of mRNA sequences localized near the start and amber codons and their impact on the diversity of mRNA display libraries. Nucleic Acids Res 2023; 51:7465-7479. [PMID: 37395404 PMCID: PMC10415131 DOI: 10.1093/nar/gkad555] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2022] [Revised: 06/14/2023] [Accepted: 06/18/2023] [Indexed: 07/04/2023] Open
Abstract
Extremely diverse libraries are essential for effectively selecting functional peptides or proteins, and mRNA display technology is a powerful tool for generating such libraries with over 1012-1013 diversity. Particularly, the protein-puromycin linker (PuL)/mRNA complex formation yield is determining for preparing the libraries. However, how mRNA sequences affect the complex formation yield remains unclear. To study the effects of N-terminal and C-terminal coding sequences on the complex formation yield, puromycin-attached mRNAs containing three random codons after the start codon (32768 sequences) or seven random bases next to the amber codon (6480 sequences) were translated. Enrichment scores were calculated by dividing the appearance rate of every sequence in protein-PuL/mRNA complexes by that in total mRNAs. The wide range of enrichment scores (0.09-2.10 for N-terminal and 0.30-4.23 for C-terminal coding sequences) indicated that the N-terminal and C-terminal coding sequences strongly affected the complex formation yield. Using C-terminal GGC-CGA-UAG-U sequences, which resulted in the highest enrichment scores, we constructed highly diverse libraries of monobodies and macrocyclic peptides. The present study provides insights into how mRNA sequences affect the protein/mRNA complex formation yield and will accelerate the identification of functional peptides and proteins involved in various biological processes and having therapeutic applications.
Collapse
Affiliation(s)
- Shun Umemoto
- Department of Biomolecular Engineering, Graduate School of Engineering, Nagoya University, Furo-cho, Chikusa-ku, Nagoya 464-8603, Japan
| | - Taishi Kondo
- Department of Biomolecular Engineering, Graduate School of Engineering, Nagoya University, Furo-cho, Chikusa-ku, Nagoya 464-8603, Japan
| | - Tomoshige Fujino
- Department of Biomolecular Engineering, Graduate School of Engineering, Nagoya University, Furo-cho, Chikusa-ku, Nagoya 464-8603, Japan
| | - Gosuke Hayashi
- Department of Biomolecular Engineering, Graduate School of Engineering, Nagoya University, Furo-cho, Chikusa-ku, Nagoya 464-8603, Japan
- Japan Science and Technology Agency (JST), PRESTO, 4-1-8 Honcho, Kawaguchi, Saitama 332-0012, Japan
| | - Hiroshi Murakami
- Department of Biomolecular Engineering, Graduate School of Engineering, Nagoya University, Furo-cho, Chikusa-ku, Nagoya 464-8603, Japan
- Institute of Nano-Life-Systems, Institutes of Innovation for Future Society, Nagoya University, Furo-cho, Chikusa-ku, Nagoya 464-8603, Japan
| |
Collapse
|
38
|
Anwar AM, Khodary SM, Ahmed EA, Osama A, Ezzeldin S, Tanios A, Mahgoub S, Magdeldin S. gtAI: an improved species-specific tRNA adaptation index using the genetic algorithm. Front Mol Biosci 2023; 10:1218518. [PMID: 37469707 PMCID: PMC10352787 DOI: 10.3389/fmolb.2023.1218518] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2023] [Accepted: 06/22/2023] [Indexed: 07/21/2023] Open
Abstract
The tRNA adaptation index (tAI) is a translation efficiency metric that considers weighted values (S ij values) for codon-tRNA wobble interaction efficiencies. The initial implementation of the tAI had significant flaws. For instance, generated S ij weights were optimized based on gene expression in Saccharomyces cerevisiae, which is expected to vary among different species. Consequently, a species-specific approach (stAI) was developed to overcome those limitations. However, the stAI method employed a hill climbing algorithm to optimize the S ij weights, which is not ideal for obtaining the best set of S ij weights because it could struggle to find the global maximum given a complex search space, even after using different starting positions. In addition, it did not perform well in computing the tAI of fungal genomes in comparison with the original implementation. We developed a novel approach named genetic tAI (gtAI) implemented as a Python package (https://github.com/AliYoussef96/gtAI), which employs a genetic algorithm to obtain the best set of S ij weights and follows a new codon usage-based workflow that better computes the tAI of genomes from the three domains of life. The gtAI has significantly improved the correlation with the codon adaptation index (CAI) and the prediction of protein abundance (empirical data) compared to the stAI.
Collapse
Affiliation(s)
- Ali Mostafa Anwar
- Proteomics and Metabolomics Research Program, Basic Research Department, Children’s Cancer Hospital 57357 (CCHE-57357), Cairo, Egypt
| | - Saif M. Khodary
- Department of Genetics, Faculty of Agriculture, Cairo University, Giza, Egypt
| | - Eman Ali Ahmed
- Proteomics and Metabolomics Research Program, Basic Research Department, Children’s Cancer Hospital 57357 (CCHE-57357), Cairo, Egypt
- Department of Pharmacology, Faculty of Veterinary Medicine, Suez Canal University, Ismailia, Egypt
| | - Aya Osama
- Proteomics and Metabolomics Research Program, Basic Research Department, Children’s Cancer Hospital 57357 (CCHE-57357), Cairo, Egypt
| | - Shahd Ezzeldin
- Proteomics and Metabolomics Research Program, Basic Research Department, Children’s Cancer Hospital 57357 (CCHE-57357), Cairo, Egypt
| | - Anthony Tanios
- Proteomics and Metabolomics Research Program, Basic Research Department, Children’s Cancer Hospital 57357 (CCHE-57357), Cairo, Egypt
| | - Sebaey Mahgoub
- Proteomics and Metabolomics Research Program, Basic Research Department, Children’s Cancer Hospital 57357 (CCHE-57357), Cairo, Egypt
| | - Sameh Magdeldin
- Proteomics and Metabolomics Research Program, Basic Research Department, Children’s Cancer Hospital 57357 (CCHE-57357), Cairo, Egypt
- Department of Physiology, Faculty of Veterinary Medicine, Suez Canal University, Ismailia, Egypt
| |
Collapse
|
39
|
Li J, Li P, Liu Q, Li J, Qi H. Translation initiation consistency between in vivo and in vitro bacterial protein expression systems. Front Bioeng Biotechnol 2023; 11:1201580. [PMID: 37304134 PMCID: PMC10248181 DOI: 10.3389/fbioe.2023.1201580] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2023] [Accepted: 05/17/2023] [Indexed: 06/13/2023] Open
Abstract
Strict on-demand control of protein synthesis is a crucial aspect of synthetic biology. The 5'-terminal untranslated region (5'-UTR) is an essential bacterial genetic element that can be designed for the regulation of translation initiation. However, there is insufficient systematical data on the consistency of 5'-UTR function among various bacterial cells and in vitro protein synthesis systems, which is crucial for the standardization and modularization of genetic elements in synthetic biology. Here, more than 400 expression cassettes comprising the GFP gene under the regulation of various 5'-UTRs were systematically characterized to evaluate the protein translation consistency in the two popular Escherichia coli strains of JM109 and BL21, as well as an in vitro protein expression system based on cell lysate. In contrast to the very strong correlation between the two cellular systems, the consistency between in vivo and in vitro protein translation was lost, whereby both in vivo and in vitro translation evidently deviated from the estimation of the standard statistical thermodynamic model. Finally, we found that the absence of nucleotide C and complex secondary structure in the 5'-UTR significantly improve the efficiency of protein translation, both in vitro and in vivo.
Collapse
Affiliation(s)
- Jiaojiao Li
- School of Chemical Engineering and Technology, Tianjin University, Tianjin, China
- Frontier Science Center for Synthetic Biology (Ministry of Education), Tianjin University, Tianjin, China
| | - Peixian Li
- School of Chemical Engineering and Technology, Tianjin University, Tianjin, China
- Frontier Science Center for Synthetic Biology (Ministry of Education), Tianjin University, Tianjin, China
| | - Qian Liu
- School of Chemical Engineering and Technology, Tianjin University, Tianjin, China
- Frontier Science Center for Synthetic Biology (Ministry of Education), Tianjin University, Tianjin, China
| | - Jinjin Li
- School of Chemical Engineering and Technology, Tianjin University, Tianjin, China
- Frontier Science Center for Synthetic Biology (Ministry of Education), Tianjin University, Tianjin, China
| | - Hao Qi
- School of Chemical Engineering and Technology, Tianjin University, Tianjin, China
- Frontier Science Center for Synthetic Biology (Ministry of Education), Tianjin University, Tianjin, China
- Zhejiang Shaoxing Research Institute of Tianjin University, Shaoxing, China
| |
Collapse
|
40
|
Nagao A, Nakanishi Y, Yamaguchi Y, Mishina Y, Karoji M, Toya T, Fujita T, Iwasaki S, Miyauchi K, Sakaguchi Y, Suzuki T. Quality control of protein synthesis in the early elongation stage. Nat Commun 2023; 14:2704. [PMID: 37198183 PMCID: PMC10192219 DOI: 10.1038/s41467-023-38077-5] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2021] [Accepted: 04/14/2023] [Indexed: 05/19/2023] Open
Abstract
In the early stage of bacterial translation, peptidyl-tRNAs frequently dissociate from the ribosome (pep-tRNA drop-off) and are recycled by peptidyl-tRNA hydrolase. Here, we establish a highly sensitive method for profiling of pep-tRNAs using mass spectrometry, and successfully detect a large number of nascent peptides from pep-tRNAs accumulated in Escherichia coli pthts strain. Based on molecular mass analysis, we found about 20% of the peptides bear single amino-acid substitutions of the N-terminal sequences of E. coli ORFs. Detailed analysis of individual pep-tRNAs and reporter assay revealed that most of the substitutions take place at the C-terminal drop-off site and that the miscoded pep-tRNAs rarely participate in the next round of elongation but dissociate from the ribosome. These findings suggest that pep-tRNA drop-off is an active mechanism by which the ribosome rejects miscoded pep-tRNAs in the early elongation, thereby contributing to quality control of protein synthesis after peptide bond formation.
Collapse
Affiliation(s)
- Asuteka Nagao
- Department of Chemistry and Biotechnology, Graduate School of Engineering, University of Tokyo, Bunkyo-ku, Tokyo, 113-8656, Japan.
| | - Yui Nakanishi
- Department of Chemistry and Biotechnology, Graduate School of Engineering, University of Tokyo, Bunkyo-ku, Tokyo, 113-8656, Japan
| | - Yutaro Yamaguchi
- Department of Chemistry and Biotechnology, Graduate School of Engineering, University of Tokyo, Bunkyo-ku, Tokyo, 113-8656, Japan
| | - Yoshifumi Mishina
- Department of Chemistry and Biotechnology, Graduate School of Engineering, University of Tokyo, Bunkyo-ku, Tokyo, 113-8656, Japan
| | - Minami Karoji
- Department of Chemistry and Biotechnology, Graduate School of Engineering, University of Tokyo, Bunkyo-ku, Tokyo, 113-8656, Japan
| | - Takafumi Toya
- Department of Chemistry and Biotechnology, Graduate School of Engineering, University of Tokyo, Bunkyo-ku, Tokyo, 113-8656, Japan
| | - Tomoya Fujita
- RNA Systems Biochemistry Laboratory, RIKEN Cluster for Pioneering Research, Wako, Saitama, 351-0198, Japan
| | - Shintaro Iwasaki
- RNA Systems Biochemistry Laboratory, RIKEN Cluster for Pioneering Research, Wako, Saitama, 351-0198, Japan
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba, 277-8561, Japan
| | - Kenjyo Miyauchi
- Department of Chemistry and Biotechnology, Graduate School of Engineering, University of Tokyo, Bunkyo-ku, Tokyo, 113-8656, Japan
| | - Yuriko Sakaguchi
- Department of Chemistry and Biotechnology, Graduate School of Engineering, University of Tokyo, Bunkyo-ku, Tokyo, 113-8656, Japan
| | - Tsutomu Suzuki
- Department of Chemistry and Biotechnology, Graduate School of Engineering, University of Tokyo, Bunkyo-ku, Tokyo, 113-8656, Japan.
| |
Collapse
|
41
|
Ojima-Kato T, Nishikawa Y, Furukawa Y, Kojima T, Nakano H. Nascent MSKIK Peptide Cancels Ribosomal Stalling by Arrest Peptides in Escherichia coli. J Biol Chem 2023; 299:104676. [PMID: 37028767 DOI: 10.1016/j.jbc.2023.104676] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2022] [Revised: 03/22/2023] [Accepted: 03/24/2023] [Indexed: 04/09/2023] Open
Abstract
The insertion of the DNA sequence encoding SKIK peptide adjacent to the M start codon of a difficult-to-express protein enhances protein production in Escherichia coli. In this report, we reveal that the increased production of the SKIK-tagged protein is not due to codon usage of the SKIK sequence. Furthermore, we found that insertion of SKIK or MSKIK just before the SecM arrest peptide (FSTPVWISQAQGIRAGP), which causes ribosomal stalling on mRNA, greatly increased the production of the protein containing the SecM arrest peptide in the E. coli reconstituted cell-free protein synthesis system (PURE system). A similar translation enhancement phenomenon by MSKIK was observed for the CmlA leader peptide, a ribosome arrest peptide, whose arrest is induced by chloramphenicol. These results strongly suggest that the nascent MSKIK peptide prevents or releases ribosomal stalling immediately following its generation during the translation process, resulting in an increase of protein production.
Collapse
Affiliation(s)
- Teruyo Ojima-Kato
- Graduate School of Bioagricultural Sciences, Nagoya University, Furo-cho, Chikusa-ku, Nagoya 464-8601, Japan.
| | - Yuma Nishikawa
- Graduate School of Bioagricultural Sciences, Nagoya University, Furo-cho, Chikusa-ku, Nagoya 464-8601, Japan
| | - Yuki Furukawa
- Graduate School of Bioagricultural Sciences, Nagoya University, Furo-cho, Chikusa-ku, Nagoya 464-8601, Japan
| | - Takaaki Kojima
- Graduate School of Bioagricultural Sciences, Nagoya University, Furo-cho, Chikusa-ku, Nagoya 464-8601, Japan
| | - Hideo Nakano
- Graduate School of Bioagricultural Sciences, Nagoya University, Furo-cho, Chikusa-ku, Nagoya 464-8601, Japan
| |
Collapse
|
42
|
Khandia R, Pandey MK, Rzhepakovsky IV, Khan AA, Alexiou A. Synonymous Codon Variant Analysis for Autophagic Genes Dysregulated in Neurodegeneration. Mol Neurobiol 2023; 60:2252-2267. [PMID: 36637744 DOI: 10.1007/s12035-022-03081-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2022] [Accepted: 09/27/2022] [Indexed: 01/14/2023]
Abstract
Neurodegenerative disorders are often a culmination of the accumulation of abnormally folded proteins and defective organelles. Autophagy is a process of removing these defective proteins, organelles, and harmful substances from the body, and it works to maintain homeostasis. If autophagic removal of defective proteins has interfered, it affects neuronal health. Some of the autophagic genes are specifically found to be associated with neurodegenerative phenotypes. Non-functional, mutated, or gene copies having silent mutations, often termed synonymous variants, might explain this. However, these synonymous variant which codes for exactly similar proteins have different translation rates, stability, and gene expression profiling. Hence, it would be interesting to study the pattern of synonymous variant usage. In the study, synonymous variant usage in various transcripts of autophagic genes ATG5, ATG7, ATG8A, ATG16, and ATG17/FIP200 reported to cause neurodegeneration (if dysregulated) is studied. These genes were analyzed for their synonymous variant usage; nucleotide composition; any possible nucleotide skew in a gene; physical properties of autophagic protein including GRAVY and AROMA; hydropathicity; instability index; and frequency of acidic, basic, neutral amino acids; and gene expression level. The study will help understand various evolutionary forces acting on these genes and the possible augmentation of a gene if showing unusual behavior.
Collapse
Affiliation(s)
- Rekha Khandia
- Department of Biochemistry and Genetics, Barkatullah University, Bhopal, 462026, India.
| | - Megha Katare Pandey
- Department of Translational Medicine, All India Institute of Medical Sciences, Bhopal, 462020, India
| | | | - Azmat Ali Khan
- Pharmaceutical Biotechnology Laboratory, Department of Pharmaceutical Chemistry, College of Pharmacy, King Saud University, Riyadh, 11451, Saudi Arabia.
| | - Athanasios Alexiou
- Novel Global Community Educational Foundation, Hebersham, Australia
- AFNP Med, Wien, Austria
| |
Collapse
|
43
|
Höllerer S, Jeschek M. Ultradeep characterisation of translational sequence determinants refutes rare-codon hypothesis and unveils quadruplet base pairing of initiator tRNA and transcript. Nucleic Acids Res 2023; 51:2377-2396. [PMID: 36727459 PMCID: PMC10018350 DOI: 10.1093/nar/gkad040] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2022] [Revised: 12/05/2022] [Accepted: 01/13/2023] [Indexed: 02/03/2023] Open
Abstract
Translation is a key determinant of gene expression and an important biotechnological engineering target. In bacteria, 5'-untranslated region (5'-UTR) and coding sequence (CDS) are well-known mRNA parts controlling translation and thus cellular protein levels. However, the complex interaction of 5'-UTR and CDS has so far only been studied for few sequences leading to non-generalisable and partly contradictory conclusions. Herein, we systematically assess the dynamic translation from over 1.2 million 5'-UTR-CDS pairs in Escherichia coli to investigate their collective effect using a new method for ultradeep sequence-function mapping. This allows us to disentangle and precisely quantify effects of various sequence determinants of translation. We find that 5'-UTR and CDS individually account for 53% and 20% of variance in translation, respectively, and show conclusively that, contrary to a common hypothesis, tRNA abundance does not explain expression changes between CDSs with different synonymous codons. Moreover, the obtained large-scale data provide clear experimental evidence for a base-pairing interaction between initiator tRNA and mRNA beyond the anticodon-codon interaction, an effect that is often masked for individual sequences and therefore inaccessible to low-throughput approaches. Our study highlights the indispensability of ultradeep sequence-function mapping to accurately determine the contribution of parts and phenomena involved in gene regulation.
Collapse
Affiliation(s)
- Simon Höllerer
- Department of Biosystems Science and Engineering, Swiss Federal Institute of Technology – ETH Zurich, Basel CH-4058, Switzerland
| | - Markus Jeschek
- Department of Biosystems Science and Engineering, Swiss Federal Institute of Technology – ETH Zurich, Basel CH-4058, Switzerland
- Institute of Microbiology, Synthetic Microbiology Group, University of Regensburg, Regensburg D-93053, Germany
| |
Collapse
|
44
|
Fages-Lartaud M, Mueller Y, Elie F, Courtade G, Hohmann-Marriott MF. Standard Intein Gene Expression Ramps (SIGER) for Protein-Independent Expression Control. ACS Synth Biol 2023; 12:1058-1071. [PMID: 36920366 PMCID: PMC10127266 DOI: 10.1021/acssynbio.2c00530] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/16/2023]
Abstract
Coordination of multigene expression is one of the key challenges of metabolic engineering for the development of cell factories. Constraints on translation initiation and early ribosome kinetics of mRNA are imposed by features of the 5'UTR in combination with the start of the gene, referred to as the "gene ramp", such as rare codons and mRNA secondary structures. These features strongly influence the translation yield and protein quality by regulating the ribosome distribution on mRNA strands. The utilization of genetic expression sequences, such as promoters and 5'UTRs in combination with different target genes, leads to a wide variety of gene ramp compositions with irregular translation rates, leading to unpredictable levels of protein yield and quality. Here, we present the Standard Intein Gene Expression Ramp (SIGER) system for controlling protein expression. The SIGER system makes use of inteins to decouple the translation initiation features from the gene of a target protein. We generated sequence-specific gene expression sequences for two inteins (DnaB and DnaX) that display defined levels of protein expression. Additionally, we used inteins that possess the ability to release the C-terminal fusion protein in vivo to avoid the impairment of protein functionality by the fused intein. Overall, our results show that SIGER systems are unique tools to mitigate the undesirable effects of gene ramp variation and to control the relative ratios of enzymes involved in molecular pathways. As a proof of concept of the potential of the system, we also used a SIGER system to express two difficult-to-produce proteins, GumM and CBM73.
Collapse
Affiliation(s)
- Maxime Fages-Lartaud
- Department of Biotechnology and Food Science, Norwegian University of Science and Technology, Trondheim N-7491, Norway
| | - Yasmin Mueller
- Department of Biotechnology and Food Science, Norwegian University of Science and Technology, Trondheim N-7491, Norway
| | - Florence Elie
- Department of Biotechnology and Food Science, Norwegian University of Science and Technology, Trondheim N-7491, Norway
| | - Gaston Courtade
- Department of Biotechnology and Food Science, Norwegian University of Science and Technology, Trondheim N-7491, Norway
| | - Martin Frank Hohmann-Marriott
- Department of Biotechnology and Food Science, Norwegian University of Science and Technology, Trondheim N-7491, Norway.,United Scientists CORE (Limited), Dunedin 9016, Aotearoa, New Zealand
| |
Collapse
|
45
|
Wang Z, Wang C, Zhang Y, Liu S, Cheng Y, Wang S, Xia P, Hao L. Porcine IGF-1R synonymous mutations in the extracellular domain affect proliferation and differentiation of skeletal muscle cells. Gene 2023; 854:147098. [PMID: 36496177 DOI: 10.1016/j.gene.2022.147098] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2022] [Revised: 11/24/2022] [Accepted: 11/29/2022] [Indexed: 12/13/2022]
Abstract
OBJECTIVE Miniature pigs are considered ideal organ donors for xenotransplantation in humans, but the mechanism underlying their dwarfism remains to be elucidated. IGF-1R is a crucial factor in body size formation in mammals, including skeletal muscle formation and development. The extracellular domain (ECD) binds to the ligand, a phenomenon that results in the activation of downstream pathways. METHODS In this study, the coding sequences of two IGF-1R ECD haplotypes of the large Landrace (LP) pig and the small Bama Xiang (BM) pig were cloned into pcDNA3.1 vectors to generate pcDNA3.1-LP and pcDNA3.1-BM. The two recombinant vectors were then transfected into skeletal muscle cells. RESULTS IGF-1R transcript was found to be expressed at higher levels in the pcDNA3.1-LP group than in the pcDNA3.1-BM group. The IGF-1R ECD from LP promoted cell proliferation and CyclinD1 expression, and promoted the phosphorylation of protein kinase B (to yield p-AKT). Moreover, the IGF-1R ECD from LP increased cell differentiation and the expression of myogenic determination factor (MyoD). CONCLUSION Our data indicated that the IGF-1R ECD haplotypes between pig breeds with different body sizes affect IGF-1R expression, in turn affecting the proliferation and differentiation of skeletal muscle cells by activating downstream signalling pathways.
Collapse
Affiliation(s)
- Zhaoguo Wang
- College of Animal Science, Jilin University, Changchun, Jilin 130061, China
| | - Chunli Wang
- College of Animal Science, Jilin University, Changchun, Jilin 130061, China
| | - Ying Zhang
- College of Animal Science, Jilin University, Changchun, Jilin 130061, China
| | - Songcai Liu
- College of Animal Science, Jilin University, Changchun, Jilin 130061, China.
| | - Yunyun Cheng
- Ministry of Health Key Laboratory of Radiobiology, College of Public Health, Jilin University, Changchun, Jilin 130061, China
| | - Siyao Wang
- College of Animal Science, Jilin University, Changchun, Jilin 130061, China
| | - Peijun Xia
- College of Animal Science, Jilin University, Changchun, Jilin 130061, China
| | - Linlin Hao
- College of Animal Science, Jilin University, Changchun, Jilin 130061, China.
| |
Collapse
|
46
|
Kohl MP, Kompatscher M, Clementi N, Holl L, Erlacher M. Initiation at AUGUG and GUGUG sequences can lead to translation of overlapping reading frames in E. coli. Nucleic Acids Res 2023; 51:271-289. [PMID: 36546769 PMCID: PMC9841429 DOI: 10.1093/nar/gkac1175] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2022] [Revised: 11/16/2022] [Accepted: 11/30/2022] [Indexed: 12/24/2022] Open
Abstract
During initiation, the ribosome is tasked to efficiently recognize open reading frames (ORFs) for accurate and fast translation of mRNAs. A critical step is start codon recognition, which is modulated by initiation factors, mRNA structure, a Shine Dalgarno (SD) sequence and the start codon itself. Within the Escherichia coli genome, we identified more than 50 annotated initiation sites harboring AUGUG or GUGUG sequence motifs that provide two canonical start codons, AUG and GUG, in immediate proximity. As these sites may challenge start codon recognition, we studied if and how the ribosome is accurately guided to the designated ORF, with a special focus on the SD sequence as well as adenine at the fourth coding sequence position (A4). By in vitro and in vivo experiments, we characterized key requirements for unambiguous start codon recognition, but also discovered initiation sites that lead to the translation of both overlapping reading frames. Our findings corroborate the existence of an ambiguous translation initiation mechanism, implicating a multitude of so far unrecognized ORFs and translation products in bacteria.
Collapse
Affiliation(s)
- Maximilian P Kohl
- Institute of Genomics and RNomics, Biocenter, Medical University of Innsbruck, 6020 Innsbruck, Austria
| | - Maria Kompatscher
- Institute of Genomics and RNomics, Biocenter, Medical University of Innsbruck, 6020 Innsbruck, Austria
| | - Nina Clementi
- Institute of Genomics and RNomics, Biocenter, Medical University of Innsbruck, 6020 Innsbruck, Austria
| | - Lena Holl
- Institute of Genomics and RNomics, Biocenter, Medical University of Innsbruck, 6020 Innsbruck, Austria
| | - Matthias D Erlacher
- Institute of Genomics and RNomics, Biocenter, Medical University of Innsbruck, 6020 Innsbruck, Austria
| |
Collapse
|
47
|
Zabolotskii AI, Kozlovskiy SV, Katrukha AG. The Influence of the Nucleotide Composition of Genes and Gene Regulatory Elements on the Efficiency of Protein Expression in Escherichia coli. BIOCHEMISTRY (MOSCOW) 2023; 88:S176-S191. [PMID: 37069120 DOI: 10.1134/s0006297923140109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/22/2023]
Abstract
Recombinant proteins expressed in Escherichia coli are widely used in biochemical research and industrial processes. At the same time, achieving higher protein expression levels and correct protein folding still remains the key problem, since optimization of nutrient media, growth conditions, and methods for induction of protein synthesis do not always lead to the desired result. Often, low protein expression is determined by the sequences of the expressed genes and their regulatory regions. The genetic code is degenerated; 18 out of 20 amino acids are encoded by more than one codon. Choosing between synonymous codons in the coding sequence can significantly affect the level of protein expression and protein folding due to the influence of the gene nucleotide composition on the probability of formation of secondary mRNA structures that affect the ribosome binding at the translation initiation phase, as well as the ribosome movement along the mRNA during elongation, which, in turn, influences the mRNA degradation and the folding of the nascent protein. The nucleotide composition of the mRNA untranslated regions, in particular the promoter and Shine-Dalgarno sequences, also affects the efficiency of mRNA transcription, translation, and degradation. In this review, we describe the genetic principles that determine the efficiency of protein production in Escherichia coli.
Collapse
Affiliation(s)
- Artur I Zabolotskii
- Faculty of Biology, Lomonosov Moscow State University, Moscow, 119991, Russia.
| | | | - Alexey G Katrukha
- Faculty of Biology, Lomonosov Moscow State University, Moscow, 119991, Russia
| |
Collapse
|
48
|
Xu K, Tong Y, Li Y, Tao J, Rao S, Li J, Zhou J, Liu S. Autoinduction Expression Modules for Regulating Gene Expression in Bacillus subtilis. ACS Synth Biol 2022; 11:4220-4225. [PMID: 36468943 DOI: 10.1021/acssynbio.2c00304] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Although quorum sensing (QS) promoters that can autonomously activate gene expression have been identified and engineered in Bacillus subtilis, researchers focus on quantifying individual promoters while ignoring the interaction between other genetic regulatory elements. Here, we constructed the autoinduction expression modules consisting of promoters responsive to QS ComQXPA, ribosome binding sites (RBSs), and terminators. Using superfolder green fluorescent protein (sfGFP) as a reporter gene, three individual element libraries were generated from 945 promoters, 12,000 RBSs, and 425 terminators by random mutation, de novo design, and database mining strategies, respectively. Then, the efficiency of three libraries in regulating gene expression was further enhanced by engineering the core region of each optimal element. After hybridizing the element libraries, the generated expression modules exhibited a 627-fold range in regulating gene expression without significantly affecting the autoinduction initiation. Here, the hybrid modules with broad expression strength may benefit the application of QS-based autoinduction systems in B. subtilis.
Collapse
Affiliation(s)
- Kuidong Xu
- National Engineering Research Center for Cereal Fermentation and Food Biomanufacturing, Jiangnan University, Wuxi 214122, China.,Science Center for Future Foods, Jiangnan University, Wuxi 214122, China.,Jiangsu Provisional Research Center for Bioactive Product Processing Technology, Jiangnan University, 1800 Lihu Road, Wuxi 214122, China
| | - Yi Tong
- National Engineering Research Center for Corn Deep Processing, Jilin COFCO Biochemical Co. Ltd, Changchun 130033, China
| | - Yi Li
- National Engineering Research Center for Corn Deep Processing, Jilin COFCO Biochemical Co. Ltd, Changchun 130033, China
| | - Jin Tao
- National Engineering Research Center for Corn Deep Processing, Jilin COFCO Biochemical Co. Ltd, Changchun 130033, China
| | - Shengqi Rao
- College of Food Science and Engineering, Yangzhou University, Yangzhou, Jiangsu 214122, China
| | - Jianghua Li
- Science Center for Future Foods, Jiangnan University, Wuxi 214122, China.,School of Biotechnology, Jiangnan University, 1800 Lihu Road, Wuxi 214122, China
| | - Jingwen Zhou
- National Engineering Research Center for Cereal Fermentation and Food Biomanufacturing, Jiangnan University, Wuxi 214122, China.,Science Center for Future Foods, Jiangnan University, Wuxi 214122, China.,Jiangsu Provisional Research Center for Bioactive Product Processing Technology, Jiangnan University, 1800 Lihu Road, Wuxi 214122, China
| | - Song Liu
- National Engineering Research Center for Cereal Fermentation and Food Biomanufacturing, Jiangnan University, Wuxi 214122, China.,Science Center for Future Foods, Jiangnan University, Wuxi 214122, China
| |
Collapse
|
49
|
Sethi A, Bhandawat A, Pati PK. Engineering medicinal plant-derived CYPs: a promising strategy for production of high-valued secondary metabolites. PLANTA 2022; 256:119. [PMID: 36378350 PMCID: PMC9664027 DOI: 10.1007/s00425-022-04024-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/05/2022] [Accepted: 10/30/2022] [Indexed: 06/16/2023]
Abstract
Cytochorme P450s (CYPs) play a critical role in the catalysis of secondary metabolite biosynthetic pathways. For their commercial use, various strategies for metabolic pathway engineering using CYP as a potential target have been explored. Plants produce a vast diversity of secondary metabolites which are being used to treat various ailments and diseases. Some of these metabolites are difficult to obtain in large quantities limiting their industrial use. Cytochrome P450 enzymes (CYPs) are important catalysts in the biosynthesis of highly valued secondary metabolites, and are found in all domains of life. With the development of high-throughput sequencing and high-resolution mass spectrometry, new biosynthetic pathways and associated CYPs are being identified. In this review, we present CYPs identified from medicinal plants as a potential game changer in the metabolic engineering of secondary metabolic pathways. We present the achievements made so far in enhancing the production of important bioactivities through pathway engineering, giving some popular examples. At last, current challenges and possible strategies to overcome the limitations associated with CYP engineering to enhance the biosynthesis of target secondary metabolites are also highlighted.
Collapse
Affiliation(s)
- Anshika Sethi
- Department of Biotechnology, Guru Nanak Dev University, Amritsar, Punjab, 143 005, India
| | - Abhishek Bhandawat
- Department of Biotechnology, Guru Nanak Dev University, Amritsar, Punjab, 143 005, India
| | - Pratap Kumar Pati
- Department of Biotechnology, Guru Nanak Dev University, Amritsar, Punjab, 143 005, India.
| |
Collapse
|
50
|
Mushtaq M, Khan S, Hassan M, Al-Harbi AI, Hameed AR, Khan K, Ismail S, Irfan M, Ahmad S. Computational Design of a Chimeric Vaccine against Plesiomonas shigelloides Using Pan-Genome and Reverse Vaccinology. Vaccines (Basel) 2022; 10:1886. [PMID: 36366394 PMCID: PMC9697808 DOI: 10.3390/vaccines10111886] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2022] [Revised: 11/06/2022] [Accepted: 11/07/2022] [Indexed: 08/29/2023] Open
Abstract
The swift emergence of antibiotic resistance (AR) in bacterial pathogens to make themselves adaptable to changing environments has become an alarming health issue. To prevent AR infection, many ways can be accomplished such as by decreasing the misuse of antibiotics in human and animal medicine. Among these AR bacterial species, Plesiomonas shigelloides is one of the etiological agents of intestinal infection in humans. It is a gram-negative rod-shaped bacterium that is highly resistant to several classes of antibiotics, and no licensed vaccine against the aforementioned pathogen is available. Hence, substantial efforts are required to screen protective antigens from the pathogen whole genome that can be subjected easily to experimental evaluations. Here, we employed a reverse vaccinology (RV) approach to design a multi-antigenic epitopes based vaccine against P. shigelloides. The complete genomes of P. shigelloides were retrieved from the National Center for Biotechnological Information (NCBI) that on average consist of 5226 proteins. The complete proteomes were subjected to different subtractive proteomics filters, and in the results of that analysis, out of total proteins, 2399 were revealed as non-redundant and 2827 as redundant proteins. The non-redundant proteins were further checked for subcellular localization analysis, in which three were localized in the extracellular matrix, eight were outer membrane, and 13 were found in the periplasmic membrane. All surface localized proteins were found to be virulent. Out of a total of 24 virulent proteins, three proteins (flagellar hook protein (FlgE), hypothetical protein, and TonB-dependent hemoglobin/transferrin/lactoferrin family receptor protein) were considered as potential vaccine targets and subjected to epitopes prediction. The predicted epitopes were further examined for antigenicity, toxicity, and solubility. A total of 10 epitopes were selected (GFKESRAEF, VQVPTEAGQ, KINENGVVV, ENKALSQET, QGYASANDE, RLNPTDSRW, TLDYRLNPT, RVTKKQSDK, GEREGKNRP, RDKKTNQPL). The selected epitopes were linked with each other via specific GPGPG linkers in order to design a multi-epitopes vaccine construct, and linked with cholera toxin B subunit adjuvant to make the designed vaccine construct more efficient in terms of antigenicity. The 3D structure of the vaccine construct was modeled ab initio as no appropriate template was available. Furthermore, molecular docking was carried out to check the interaction affinity of the designed vaccine with major histocompatibility complex (MHC-)I (PDB ID: 1L1Y), MHC-II (1KG0), and toll-like receptor 4 ((TLR-4) (PDB: 4G8A). Molecular dynamic simulation was applied to evaluate the dynamic behavior of vaccine-receptor complexes. Lastly, the binding free energies of the vaccine with receptors were estimated by using MMPB/GBSA methods. All of the aforementioned analyses concluded that the designed vaccine molecule as a good candidate to be used in experimental studies to disclose its immune protective efficacy in animal models.
Collapse
Affiliation(s)
- Mahnoor Mushtaq
- Department of Health and Biological Sciences, Abasyn University, Peshawar 25000, Pakistan
| | - Saifullah Khan
- Institute of Biotechnology and Microbiology, Bacha Khan University, Charsadda 24461, Pakistan
| | - Muhammad Hassan
- Department of Pharmacy, Bacha Khan University, Charsadda 24461, Pakistan
| | - Alhanouf I. Al-Harbi
- Department of Medical Laboratory, College of Applied Medical Sciences, Taibah University, Yanbu 46411, Saudi Arabia
| | - Alaa R. Hameed
- Department of Medical Laboratory Techniques, School of Life Sciences, Dijlah University College, Baghdad 59058, Iraq
| | | | - Saba Ismail
- Department of Biological Sciences, National University of Medical Sciences, Rawalpindi 46000, Pakistan
| | - Muhammad Irfan
- Department of Oral Biology, College of Dentistry, University of Florida, Gainesville, FL 32611, USA
| | - Sajjad Ahmad
- Department of Health and Biological Sciences, Abasyn University, Peshawar 25000, Pakistan
| |
Collapse
|