1
|
Lin J, Wang X, Liu T, Teng Y, Cui W. Diffusion-Based Generative Network for de Novo Synthetic Promoter Design. ACS Synth Biol 2024; 13:1513-1522. [PMID: 38613497 DOI: 10.1021/acssynbio.4c00041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/15/2024]
Abstract
Computer-aided promoter design is a major development trend in synthetic promoter engineering. Various deep learning models have been used to evaluate or screen synthetic promoters, but there have been few works on de novo promoter design. To explore the potential ability of generative models in promoter design, we established a diffusion-based generative model for promoter design in Escherichia coli. The model was completely driven by sequence data and could study the essential characteristics of natural promoters, thus generating synthetic promoters similar to natural promoters in structure and component. We also improved the calculation method of FID indicator, using a convolution layer to extract the feature matrix of the promoter sequence instead. As a result, we got an FID equal to 1.37, which meant synthetic promoters have a distribution similar to that of natural ones. Our work provides a fresh approach to de novo promoter design, indicating that a completely data-driven generative model is feasible for promoter design.
Collapse
Affiliation(s)
- Jianfeng Lin
- School of Automation Science and Engineering, South China University of Technology, Guangzhou 510641, China
| | - Xin Wang
- School of Automation Science and Engineering, South China University of Technology, Guangzhou 510641, China
| | - Tuoyu Liu
- State Key Laboratory of Pathogen and Biosecurity, Beijing Institute of Microbiology and Epidemiology, Beijing 100071, China
| | - Yue Teng
- State Key Laboratory of Pathogen and Biosecurity, Beijing Institute of Microbiology and Epidemiology, Beijing 100071, China
| | - Wei Cui
- School of Automation Science and Engineering, South China University of Technology, Guangzhou 510641, China
| |
Collapse
|
2
|
Guo J, Lin LF, Oraskovich SV, Rivera de Jesús JA, Listgarten J, Schaffer DV. Computationally guided AAV engineering for enhanced gene delivery. Trends Biochem Sci 2024; 49:457-469. [PMID: 38531696 DOI: 10.1016/j.tibs.2024.03.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2023] [Revised: 02/22/2024] [Accepted: 03/01/2024] [Indexed: 03/28/2024]
Abstract
Gene delivery vehicles based on adeno-associated viruses (AAVs) are enabling increasing success in human clinical trials, and they offer the promise of treating a broad spectrum of both genetic and non-genetic disorders. However, delivery efficiency and targeting must be improved to enable safe and effective therapies. In recent years, considerable effort has been invested in creating AAV variants with improved delivery, and computational approaches have been increasingly harnessed for AAV engineering. In this review, we discuss how computationally designed AAV libraries are enabling directed evolution. Specifically, we highlight approaches that harness sequences outputted by next-generation sequencing (NGS) coupled with machine learning (ML) to generate new functional AAV capsids and related regulatory elements, pushing the frontier of what vector engineering and gene therapy may achieve.
Collapse
Affiliation(s)
- Jingxuan Guo
- California Institute for Quantitative Biosciences (QB3), University of California, Berkeley, CA 94720, USA
| | - Li F Lin
- Department of Chemical and Biomolecular Engineering, University of California, Berkeley, CA 94720, USA
| | - Sydney V Oraskovich
- Department of Bioengineering, University of California, Berkeley, CA 94720, USA; Graduate Program in Bioengineering, University of California, San Francisco and University of California, Berkeley, CA 94720, USA
| | - Julio A Rivera de Jesús
- Department of Bioengineering, University of California, Berkeley, CA 94720, USA; Graduate Program in Bioengineering, University of California, San Francisco and University of California, Berkeley, CA 94720, USA; Department of Neurological Surgery, University of California, San Francisco, CA 94143, USA
| | - Jennifer Listgarten
- Department of Electrical Engineering and Computer Science, University of California, Berkeley, CA 94720, USA
| | - David V Schaffer
- California Institute for Quantitative Biosciences (QB3), University of California, Berkeley, CA 94720, USA; Department of Chemical and Biomolecular Engineering, University of California, Berkeley, CA 94720, USA; Department of Bioengineering, University of California, Berkeley, CA 94720, USA; Department of Molecular and Cell Biology, University of California, Berkeley, CA 94720, USA.
| |
Collapse
|
3
|
Goshisht MK. Machine Learning and Deep Learning in Synthetic Biology: Key Architectures, Applications, and Challenges. ACS Omega 2024; 9:9921-9945. [PMID: 38463314 PMCID: PMC10918679 DOI: 10.1021/acsomega.3c05913] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Revised: 01/19/2024] [Accepted: 01/30/2024] [Indexed: 03/12/2024]
Abstract
Machine learning (ML), particularly deep learning (DL), has made rapid and substantial progress in synthetic biology in recent years. Biotechnological applications of biosystems, including pathways, enzymes, and whole cells, are being probed frequently with time. The intricacy and interconnectedness of biosystems make it challenging to design them with the desired properties. ML and DL have a synergy with synthetic biology. Synthetic biology can be employed to produce large data sets for training models (for instance, by utilizing DNA synthesis), and ML/DL models can be employed to inform design (for example, by generating new parts or advising unrivaled experiments to perform). This potential has recently been brought to light by research at the intersection of engineering biology and ML/DL through achievements like the design of novel biological components, best experimental design, automated analysis of microscopy data, protein structure prediction, and biomolecular implementations of ANNs (Artificial Neural Networks). I have divided this review into three sections. In the first section, I describe predictive potential and basics of ML along with myriad applications in synthetic biology, especially in engineering cells, activity of proteins, and metabolic pathways. In the second section, I describe fundamental DL architectures and their applications in synthetic biology. Finally, I describe different challenges causing hurdles in the progress of ML/DL and synthetic biology along with their solutions.
Collapse
Affiliation(s)
- Manoj Kumar Goshisht
- Department of Chemistry, Natural and
Applied Sciences, University of Wisconsin—Green
Bay, Green
Bay, Wisconsin 54311-7001, United States
| |
Collapse
|
4
|
Boob AG, Chen J, Zhao H. Enabling pathway design by multiplex experimentation and machine learning. Metab Eng 2024; 81:70-87. [PMID: 38040110 DOI: 10.1016/j.ymben.2023.11.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Revised: 11/01/2023] [Accepted: 11/25/2023] [Indexed: 12/03/2023]
Abstract
The remarkable metabolic diversity observed in nature has provided a foundation for sustainable production of a wide array of valuable molecules. However, transferring the biosynthetic pathway to the desired host often runs into inherent failures that arise from intermediate accumulation and reduced flux resulting from competing pathways within the host cell. Moreover, the conventional trial and error methods utilized in pathway optimization struggle to fully grasp the intricacies of installed pathways, leading to time-consuming and labor-intensive experiments, ultimately resulting in suboptimal yields. Considering these obstacles, there is a pressing need to explore the enzyme expression landscape and identify the optimal pathway configuration for enhanced production of molecules. This review delves into recent advancements in pathway engineering, with a focus on multiplex experimentation and machine learning techniques. These approaches play a pivotal role in overcoming the limitations of traditional methods, enabling exploration of a broader design space and increasing the likelihood of discovering optimal pathway configurations for enhanced production of molecules. We discuss several tools and strategies for pathway design, construction, and optimization for sustainable and cost-effective microbial production of molecules ranging from bulk to fine chemicals. We also highlight major successes in academia and industry through compelling case studies.
Collapse
Affiliation(s)
- Aashutosh Girish Boob
- Department of Chemical and Biomolecular Engineering, University of Illinois Urbana-Champaign, Urbana, IL, 61801, United States; Carl R. Woese Institute for Genomic Biology, University of Illinois Urbana-Champaign, Urbana, IL, 61801, United States; DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Junyu Chen
- Department of Bioengineering, University of Illinois Urbana-Champaign, Urbana, IL, 61801, United States; Carl R. Woese Institute for Genomic Biology, University of Illinois Urbana-Champaign, Urbana, IL, 61801, United States; DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Huimin Zhao
- Department of Chemical and Biomolecular Engineering, University of Illinois Urbana-Champaign, Urbana, IL, 61801, United States; Department of Bioengineering, University of Illinois Urbana-Champaign, Urbana, IL, 61801, United States; Carl R. Woese Institute for Genomic Biology, University of Illinois Urbana-Champaign, Urbana, IL, 61801, United States; DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois Urbana-Champaign, Urbana, Illinois 61801, United States.
| |
Collapse
|
5
|
Fooladi S, Rabiee N, Iravani S. Genetically engineered bacteria: a new frontier in targeted drug delivery. J Mater Chem B 2023; 11:10072-10087. [PMID: 37873584 DOI: 10.1039/d3tb01805a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]
Abstract
Genetically engineered bacteria (GEB) have shown significant promise to revolutionize modern medicine. These engineered bacteria with unique properties such as enhanced targeting, versatility, biofilm disruption, reduced drug resistance, self-amplification capabilities, and biodegradability represent a highly promising approach for targeted drug delivery and cancer theranostics. This innovative approach involves modifying bacterial strains to function as drug carriers, capable of delivering therapeutic agents directly to specific cells or tissues. Unlike synthetic drug delivery systems, GEB are inherently biodegradable and can be naturally eliminated from the body, reducing potential long-term side effects or complications associated with residual foreign constituents. However, several pivotal challenges such as safety and controllability need to be addressed. Researchers have explored novel tactics to improve their capabilities and overcome existing challenges, including synthetic biology tools (e.g., clustered regularly interspaced short palindromic repeats (CRISPR) and bioinformatics-driven design), microbiome engineering, combination therapies, immune system interaction, and biocontainment strategies. Because of the remarkable advantages and tangible progress in this field, GEB may emerge as vital tools in personalized medicine, providing precise and controlled drug delivery for various diseases (especially cancer). In this context, future directions include the integration of nanotechnology with GEB, the focus on microbiota-targeted therapies, the incorporation of programmable behaviors, the enhancement in immunotherapy treatments, and the discovery of non-medical applications. In this way, careful ethical considerations and regulatory frameworks are necessary for developing GEB-based systems for targeted drug delivery. By addressing safety concerns, ensuring informed consent, promoting equitable access, understanding long-term effects, mitigating dual-use risks, and fostering public engagement, these engineered bacteria can be employed as promising delivery vehicles in bio- and nanomedicine. In this review, recent advances related to the application of GEB in targeted drug delivery and cancer therapy are discussed, covering crucial challenging issues and future perspectives.
Collapse
Affiliation(s)
- Saba Fooladi
- Yale Cardiovascular Research Center, Section of Cardiovascular Medicine, Department of Internal Medicine, Yale School of Medicine, New Haven, CT 06511, USA
| | - Navid Rabiee
- Centre for Molecular Medicine and Innovative Therapeutics, Murdoch University, Perth, WA 6150, Australia.
- School of Engineering, Macquarie University, Sydney, New South Wales, 2109, Australia
| | - Siavash Iravani
- Independent Researcher, W Nazar ST, Boostan Ave, Isfahan, Iran.
| |
Collapse
|
6
|
Lopez-Martinez E, Manteca A, Ferruz N, Cortajarena AL. Statistical Analysis and Tokenization of Epitopes to Construct Artificial Neoepitope Libraries. ACS Synth Biol 2023; 12:2812-2818. [PMID: 37703075 PMCID: PMC10594869 DOI: 10.1021/acssynbio.3c00201] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2023] [Indexed: 09/14/2023]
Abstract
Epitopes are specific regions on an antigen's surface that the immune system recognizes. Epitopes are usually protein regions on foreign immune-stimulating entities such as viruses and bacteria, and in some cases, endogenous proteins may act as antigens. Identifying epitopes is crucial for accelerating the development of vaccines and immunotherapies. However, mapping epitopes in pathogen proteomes is challenging using conventional methods. Screening artificial neoepitope libraries against antibodies can overcome this issue. Here, we applied conventional sequence analysis and methods inspired in natural language processing to reveal specific sequence patterns in the linear epitopes deposited in the Immune Epitope Database (www.iedb.org) that can serve as building blocks for the design of universal epitope libraries. Our results reveal that amino acid frequency in annotated linear epitopes differs from that in the human proteome. Aromatic residues are overrepresented, while the presence of cysteines is practically null in epitopes. Byte pair encoding tokenization shows high frequencies of tryptophan in tokens of 5, 6, and 7 amino acids, corroborating the findings of the conventional sequence analysis. These results can be applied to reduce the diversity of linear epitope libraries by orders of magnitude.
Collapse
Affiliation(s)
- Elena Lopez-Martinez
- Centre
for Cooperative Research in Biomaterials (CIC biomaGUNE), Basque Research and Technology Alliance (BRTA), Paseo de Miramón 194, Donostia-San Sebastián, 20014 Spain
| | - Aitor Manteca
- Centre
for Cooperative Research in Biomaterials (CIC biomaGUNE), Basque Research and Technology Alliance (BRTA), Paseo de Miramón 194, Donostia-San Sebastián, 20014 Spain
| | - Noelia Ferruz
- Molecular
Biology Institute of Barcelona (IBMB-CSIC), Barcelona Science Park, Baldiri Reixac, 15-21, 08028, Barcelona, Spain
| | - Aitziber L. Cortajarena
- Centre
for Cooperative Research in Biomaterials (CIC biomaGUNE), Basque Research and Technology Alliance (BRTA), Paseo de Miramón 194, Donostia-San Sebastián, 20014 Spain
- IKERBASQUE, Basque
Foundation for Science, Plaza Euskadi 5, 48009 Bilbao, Spain
| |
Collapse
|
7
|
Yang W, Li D, Huang R. EVMP: enhancing machine learning models for synthetic promoter strength prediction by Extended Vision Mutant Priority framework. Front Microbiol 2023; 14:1215609. [PMID: 37476664 PMCID: PMC10354429 DOI: 10.3389/fmicb.2023.1215609] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2023] [Accepted: 06/19/2023] [Indexed: 07/22/2023] Open
Abstract
Introduction In metabolic engineering and synthetic biology applications, promoters with appropriate strengths are critical. However, it is time-consuming and laborious to annotate promoter strength by experiments. Nowadays, constructing mutation-based synthetic promoter libraries that span multiple orders of magnitude of promoter strength is receiving increasing attention. A number of machine learning (ML) methods are applied to synthetic promoter strength prediction, but existing models are limited by the excessive proximity between synthetic promoters. Methods In order to enhance ML models to better predict the synthetic promoter strength, we propose EVMP(Extended Vision Mutant Priority), a universal framework which utilize mutation information more effectively. In EVMP, synthetic promoters are equivalently transformed into base promoter and corresponding k-mer mutations, which are input into BaseEncoder and VarEncoder, respectively. EVMP also provides optional data augmentation, which generates multiple copies of the data by selecting different base promoters for the same synthetic promoter. Results In Trc synthetic promoter library, EVMP was applied to multiple ML models and the model effect was enhanced to varying extents, up to 61.30% (MAE), while the SOTA(state-of-the-art) record was improved by 15.25% (MAE) and 4.03% (R2). Data augmentation based on multiple base promoters further improved the model performance by 17.95% (MAE) and 7.25% (R2) compared with non-EVMP SOTA record. Discussion In further study, extended vision (or k-mer) is shown to be essential for EVMP. We also found that EVMP can alleviate the over-smoothing phenomenon, which may contributes to its effectiveness. Our work suggests that EVMP can highlight the mutation information of synthetic promoters and significantly improve the prediction accuracy of strength. The source code is publicly available on GitHub: https://github.com/Tiny-Snow/EVMP.
Collapse
Affiliation(s)
- Weiqin Yang
- Institute of Marine Science and Technology, Shandong University, Qingdao, China
- School of Computer Science and Technology, Shandong University, Qingdao, China
| | - Dexin Li
- Institute of Marine Science and Technology, Shandong University, Qingdao, China
- School of Computer Science and Technology, Shandong University, Qingdao, China
| | - Ranran Huang
- Institute of Marine Science and Technology, Shandong University, Qingdao, China
| |
Collapse
|
8
|
Paylar B, Längkvist M, Jass J, Olsson PE. Utilization of Computer Classification Methods for Exposure Prediction and Gene Selection in Daphnia magna Toxicogenomics. Biology (Basel) 2023; 12:biology12050692. [PMID: 37237504 DOI: 10.3390/biology12050692] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/10/2023] [Revised: 05/02/2023] [Accepted: 05/06/2023] [Indexed: 05/28/2023]
Abstract
Zinc (Zn) is an essential element that influences many cellular functions. Depending on bioavailability, Zn can cause both deficiency and toxicity. Zn bioavailability is influenced by water hardness. Therefore, water quality analysis for health-risk assessment should consider both Zn concentration and water hardness. However, exposure media selection for traditional toxicology tests are set to defined hardness levels and do not represent the diverse water chemistry compositions observed in nature. Moreover, these tests commonly use whole organism endpoints, such as survival and reproduction, which require high numbers of test animals and are labor intensive. Gene expression stands out as a promising alternative to provide insight into molecular events that can be used for risk assessment. In this work, we apply machine learning techniques to classify the Zn concentrations and water hardness from Daphnia magna gene expression by using quantitative PCR. A method for gene ranking was explored using techniques from game theory, namely, Shapley values. The results show that standard machine learning classifiers can classify both Zn concentration and water hardness simultaneously, and that Shapley values are a versatile and useful alternative for gene ranking that can provide insight about the importance of individual genes.
Collapse
Affiliation(s)
- Berkay Paylar
- The Life Science Center-Biology, School of Science and Technology, Örebro University, SE-701 82 Örebro, Sweden
| | - Martin Längkvist
- Center for Applied Autonomous Sensor Systems, Örebro University, SE-701 82 Örebro, Sweden
| | - Jana Jass
- The Life Science Center-Biology, School of Science and Technology, Örebro University, SE-701 82 Örebro, Sweden
| | - Per-Erik Olsson
- The Life Science Center-Biology, School of Science and Technology, Örebro University, SE-701 82 Örebro, Sweden
| |
Collapse
|
9
|
Yuan Y, Gao F, Chang Y, Zhao Q, He X. Advances of mRNA vaccine in tumor: a maze of opportunities and challenges. Biomark Res 2023; 11:6. [PMID: 36650562 PMCID: PMC9845107 DOI: 10.1186/s40364-023-00449-w] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2022] [Accepted: 01/10/2023] [Indexed: 01/19/2023] Open
Abstract
High-frequency mutations in tumor genomes could be exploited as an asset for developing tumor vaccines. In recent years, with the tremendous breakthrough in genomics, intelligence algorithm, and in-depth insight of tumor immunology, it has become possible to rapidly target genomic alterations in tumor cell and rationally select vaccine targets. Among a variety of candidate vaccine platforms, the early application of mRNA was limited by instability low efficiency and excessive immunogenicity until the successful development of mRNA vaccines against SARS-COV-2 broken of technical bottleneck in vaccine preparation, allowing tumor mRNA vaccines to be prepared rapidly in an economical way with good performance of stability and efficiency. In this review, we systematically summarized the classification and characteristics of tumor antigens, the general process and methods for screening neoantigens, the strategies of vaccine preparations and advances in clinical trials, as well as presented the main challenges in the current mRNA tumor vaccine development.
Collapse
Affiliation(s)
- Yuan Yuan
- grid.413247.70000 0004 1808 0969Department of Gastroenterology, Zhongnan Hospital of Wuhan University, Wuhan, China ,grid.412793.a0000 0004 1799 5032Department of Gastroenterology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Fan Gao
- grid.413247.70000 0004 1808 0969Department of Gastroenterology, Zhongnan Hospital of Wuhan University, Wuhan, China ,grid.412793.a0000 0004 1799 5032Department of Gastroenterology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Ying Chang
- grid.413247.70000 0004 1808 0969Department of Gastroenterology, Zhongnan Hospital of Wuhan University, Wuhan, China ,grid.413247.70000 0004 1808 0969Hubei Clinical Center and Key Laboratory of Intestinal and Colorectal Diseases, Wuhan, China
| | - Qiu Zhao
- grid.413247.70000 0004 1808 0969Department of Gastroenterology, Zhongnan Hospital of Wuhan University, Wuhan, China ,grid.413247.70000 0004 1808 0969Hubei Clinical Center and Key Laboratory of Intestinal and Colorectal Diseases, Wuhan, China
| | - Xingxing He
- grid.413247.70000 0004 1808 0969Department of Gastroenterology, Zhongnan Hospital of Wuhan University, Wuhan, China ,grid.412793.a0000 0004 1799 5032Department of Gastroenterology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China ,grid.413247.70000 0004 1808 0969Hubei Clinical Center and Key Laboratory of Intestinal and Colorectal Diseases, Wuhan, China
| |
Collapse
|
10
|
Zabolotskii AI, Kozlovskiy SV, Katrukha AG. The Influence of the Nucleotide Composition of Genes and Gene Regulatory Elements on the Efficiency of Protein Expression in Escherichia coli. Biochemistry Moscow 2023; 88:S176-S191. [PMID: 37069120 DOI: 10.1134/s0006297923140109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/22/2023]
Abstract
Recombinant proteins expressed in Escherichia coli are widely used in biochemical research and industrial processes. At the same time, achieving higher protein expression levels and correct protein folding still remains the key problem, since optimization of nutrient media, growth conditions, and methods for induction of protein synthesis do not always lead to the desired result. Often, low protein expression is determined by the sequences of the expressed genes and their regulatory regions. The genetic code is degenerated; 18 out of 20 amino acids are encoded by more than one codon. Choosing between synonymous codons in the coding sequence can significantly affect the level of protein expression and protein folding due to the influence of the gene nucleotide composition on the probability of formation of secondary mRNA structures that affect the ribosome binding at the translation initiation phase, as well as the ribosome movement along the mRNA during elongation, which, in turn, influences the mRNA degradation and the folding of the nascent protein. The nucleotide composition of the mRNA untranslated regions, in particular the promoter and Shine-Dalgarno sequences, also affects the efficiency of mRNA transcription, translation, and degradation. In this review, we describe the genetic principles that determine the efficiency of protein production in Escherichia coli.
Collapse
Affiliation(s)
- Artur I Zabolotskii
- Faculty of Biology, Lomonosov Moscow State University, Moscow, 119991, Russia.
| | | | - Alexey G Katrukha
- Faculty of Biology, Lomonosov Moscow State University, Moscow, 119991, Russia
| |
Collapse
|
11
|
Yu W, Xu X, Jin K, Liu Y, Li J, Du G, Lv X, Liu L. Genetically encoded biosensors for microbial synthetic biology: From conceptual frameworks to practical applications. Biotechnol Adv 2023; 62:108077. [PMID: 36502964 DOI: 10.1016/j.biotechadv.2022.108077] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2022] [Revised: 12/06/2022] [Accepted: 12/06/2022] [Indexed: 12/13/2022]
Abstract
Genetically encoded biosensors are the vital components of synthetic biology and metabolic engineering, as they are regarded as powerful devices for the dynamic control of genotype metabolism and evolution/screening of desirable phenotypes. This review summarized the recent advances in the construction and applications of different genetically encoded biosensors, including fluorescent protein-based biosensors, nucleic acid-based biosensors, allosteric transcription factor-based biosensors and two-component system-based biosensors. First, the construction frameworks of these biosensors were outlined. Then, the recent progress of biosensor applications in creating versatile microbial cell factories for the bioproduction of high-value chemicals was summarized. Finally, the challenges and prospects for constructing robust and sophisticated biosensors were discussed. This review provided theoretical guidance for constructing genetically encoded biosensors to create desirable microbial cell factories for sustainable bioproduction.
Collapse
Affiliation(s)
- Wenwen Yu
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China; Science Center for Future Foods, Jiangnan University, Wuxi 214122, China
| | - Xianhao Xu
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China; Science Center for Future Foods, Jiangnan University, Wuxi 214122, China
| | - Ke Jin
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China; Science Center for Future Foods, Jiangnan University, Wuxi 214122, China
| | - Yanfeng Liu
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China; Science Center for Future Foods, Jiangnan University, Wuxi 214122, China
| | - Jianghua Li
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China; Science Center for Future Foods, Jiangnan University, Wuxi 214122, China
| | - Guocheng Du
- Science Center for Future Foods, Jiangnan University, Wuxi 214122, China
| | - Xueqin Lv
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China; Science Center for Future Foods, Jiangnan University, Wuxi 214122, China
| | - Long Liu
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China; Science Center for Future Foods, Jiangnan University, Wuxi 214122, China.
| |
Collapse
|
12
|
Beardall WA, Stan GB, Dunlop MJ. Deep Learning Concepts and Applications for Synthetic Biology. GEN Biotechnology 2022; 1:360-371. [PMID: 36061221 PMCID: PMC9428732 DOI: 10.1089/genbio.2022.0017] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/27/2022] [Accepted: 07/14/2022] [Indexed: 12/24/2022]
Abstract
Synthetic biology has a natural synergy with deep learning. It can be used to generate large data sets to train models, for example by using DNA synthesis, and deep learning models can be used to inform design, such as by generating novel parts or suggesting optimal experiments to conduct. Recently, research at the interface of engineering biology and deep learning has highlighted this potential through successes including the design of novel biological parts, protein structure prediction, automated analysis of microscopy data, optimal experimental design, and biomolecular implementations of artificial neural networks. In this review, we present an overview of synthetic biology-relevant classes of data and deep learning architectures. We also highlight emerging studies in synthetic biology that capitalize on deep learning to enable novel understanding and design, and discuss challenges and future opportunities in this space.
Collapse
Affiliation(s)
- William A.V. Beardall
- Department of Bioengineering, Imperial College London, London, United Kingdom
- Imperial College Centre of Excellence in Synthetic Biology, Imperial College London, London, United Kingdom
| | - Guy-Bart Stan
- Department of Bioengineering, Imperial College London, London, United Kingdom
- Imperial College Centre of Excellence in Synthetic Biology, Imperial College London, London, United Kingdom
| | - Mary J. Dunlop
- Department of Biomedical Engineering, Boston University, Boston, Massachusetts, USA
- Biological Design Center, Boston University, Boston, Massachusetts, USA
| |
Collapse
|
13
|
Cho KT, Sen TZ, Andorf CM. Predicting Tissue-Specific mRNA and Protein Abundance in Maize: A Machine Learning Approach. Front Artif Intell 2022; 5:830170. [PMID: 35719692 PMCID: PMC9204276 DOI: 10.3389/frai.2022.830170] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2021] [Accepted: 04/26/2022] [Indexed: 11/13/2022] Open
Abstract
Machine learning and modeling approaches have been used to classify protein sequences for a broad set of tasks including predicting protein function, structure, expression, and localization. Some recent studies have successfully predicted whether a given gene is expressed as mRNA or even translated to proteins potentially, but given that not all genes are expressed in every condition and tissue, the challenge remains to predict condition-specific expression. To address this gap, we developed a machine learning approach to predict tissue-specific gene expression across 23 different tissues in maize, solely based on DNA promoter and protein sequences. For class labels, we defined high and low expression levels for mRNA and protein abundance and optimized classifiers by systematically exploring various methods and combinations of k-mer sequences in a two-phase approach. In the first phase, we developed Markov model classifiers for each tissue and built a feature vector based on the predictions. In the second phase, the feature vector was used as an input to a Bayesian network for final classification. Our results show that these methods can achieve high classification accuracy of up to 95% for predicting gene expression for individual tissues. By relying on sequence alone, our method works in settings where costly experimental data are unavailable and reveals useful insights into the functional, evolutionary, and regulatory characteristics of genes.
Collapse
Affiliation(s)
- Kyoung Tak Cho
- Department of Computer Science, Iowa State University, Ames, IA, United States
| | - Taner Z. Sen
- USDA-ARS, Crop Improvement and Genetics Research Unit, Albany, CA, United States
| | - Carson M. Andorf
- USDA-ARS, Corn Insects and Crop Genetics Research Unit, Ames, IA, United States
- *Correspondence: Carson M. Andorf
| |
Collapse
|
14
|
Singh RK, Prasad A, Maurya J, Prasad M. Regulation of small RNA-mediated high temperature stress responses in crop plants. Plant Cell Rep 2022; 41:765-773. [PMID: 34228188 DOI: 10.1007/s00299-021-02745-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/14/2021] [Accepted: 06/24/2021] [Indexed: 05/20/2023]
Abstract
Small RNAs have emerged as key players of gene expression regulation. Several lines of evidences highlight their role in modulating high temperature stress responsiveness in plants. Throughout their life cycle, plants have to regulate their gene expression at various developmental phases, physiological changes, and in response to biotic or environmental stress. High temperature is one the most common abiotic stress for crop plants, that results in impaired morphology, physiology, and yield. However, plants have certain mechanisms that enable them to withstand such conditions by modulating the expression of stress-related genes. Small RNA (sRNA)-regulated gene expression is one such mechanism which is ubiquitous in all eukaryotes. The sRNAs mainly include micro RNAs (miRNAs) and small interfering RNAs (siRNAs). They are primarily associated with the gene silencing either through translation inhibition, mRNA degradation, or DNA methylation. During high temperature stress the increased or decreased level of miRNAs altered the protein accumulation of target transcripts and, therefore, regulate stress responses. Several reports are available in plants which are genetically engineered through expressing artificial miRNAs resulted in thermotolerance. sRNAs have also been reported to bring the epigenetic changes on chromatin region through RNA-dependent DNA methylation (RdDM). The present article draws a brief illustration of sRNA origin, their functional mechanisms, role in high temperature stress, and possible application for developing stress tolerant crop plants.
Collapse
Affiliation(s)
- Roshan Kumar Singh
- National Institute of Plant Genome Research, Aruna Asaf Ali Marg, New Delhi, 110067, India
| | - Ashish Prasad
- National Institute of Plant Genome Research, Aruna Asaf Ali Marg, New Delhi, 110067, India
| | - Jyoti Maurya
- National Institute of Plant Genome Research, Aruna Asaf Ali Marg, New Delhi, 110067, India
| | - Manoj Prasad
- National Institute of Plant Genome Research, Aruna Asaf Ali Marg, New Delhi, 110067, India.
| |
Collapse
|
15
|
Cazier AP, Blazeck J. Advances in promoter engineering: novel applications and predefined transcriptional control. Biotechnol J 2021; 16:e2100239. [PMID: 34351706 DOI: 10.1002/biot.202100239] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2021] [Revised: 07/30/2021] [Accepted: 08/03/2021] [Indexed: 11/08/2022]
Abstract
Synthetic biology continues to progress by relying on more robust tools for transcriptional control, of which promoters are the most fundamental component. Numerous studies have sought to characterize promoter function, determine principles to guide their engineering, and create promoters with stronger expression or tailored inducible control. In this review, we will summarize promoter architecture and highlight recent advances in the field, focusing on the novel applications of inducible promoter design and engineering towards metabolic engineering and cellular therapeutic development. Additionally, we will highlight how the expansion of new, machine learning techniques for modeling and engineering promoter sequences are enabling more accurate prediction of promoter characteristics. This article is protected by copyright. All rights reserved.
Collapse
Affiliation(s)
- Andrew P Cazier
- School of Chemical and Biomolecular Engineering, Georgia Institute of Technology, 311 Ferst St. NW, Atlanta, Georgia, 30332, USA
| | - John Blazeck
- School of Chemical and Biomolecular Engineering, Georgia Institute of Technology, 311 Ferst St. NW, Atlanta, Georgia, 30332, USA
| |
Collapse
|
16
|
Abstract
Biocatalysis has undergone revolutionary progress in the past century. Benefited by the integration of multidisciplinary technologies, natural enzymatic reactions are constantly being explored. Protein engineering gives birth to robust biocatalysts that are widely used in industrial production. These research achievements have gradually constructed a network containing natural enzymatic synthesis pathways and artificially designed enzymatic cascades. Nowadays, the development of artificial intelligence, automation, and ultra-high-throughput technology provides infinite possibilities for the discovery of novel enzymes, enzymatic mechanisms and enzymatic cascades, and gradually complements the lack of remaining key steps in the pathway design of enzymatic total synthesis. Therefore, the research of biocatalysis is gradually moving towards the era of novel technology integration, intelligent manufacturing and enzymatic total synthesis.
Collapse
Affiliation(s)
- Dong Yi
- Department of Biotechnology & Enzyme Catalysis, Institute of Biochemistry, University GreifswaldFelix-Hausdorff-Str. 4D-17487 GreifswaldGermany
| | - Thomas Bayer
- Department of Biotechnology & Enzyme Catalysis, Institute of Biochemistry, University GreifswaldFelix-Hausdorff-Str. 4D-17487 GreifswaldGermany
| | - Christoffel P. S. Badenhorst
- Department of Biotechnology & Enzyme Catalysis, Institute of Biochemistry, University GreifswaldFelix-Hausdorff-Str. 4D-17487 GreifswaldGermany
| | - Shuke Wu
- Department of Biotechnology & Enzyme Catalysis, Institute of Biochemistry, University GreifswaldFelix-Hausdorff-Str. 4D-17487 GreifswaldGermany
| | - Mark Doerr
- Department of Biotechnology & Enzyme Catalysis, Institute of Biochemistry, University GreifswaldFelix-Hausdorff-Str. 4D-17487 GreifswaldGermany
| | - Matthias Höhne
- Department of Biotechnology & Enzyme Catalysis, Institute of Biochemistry, University GreifswaldFelix-Hausdorff-Str. 4D-17487 GreifswaldGermany
| | - Uwe T. Bornscheuer
- Department of Biotechnology & Enzyme Catalysis, Institute of Biochemistry, University GreifswaldFelix-Hausdorff-Str. 4D-17487 GreifswaldGermany
| |
Collapse
|
17
|
Abstract
Data-driven machine learning is the method of choice for predicting molecular phenotypes from nucleotide sequence, modeling gene expression events including protein-DNA binding, chromatin states as well as mRNA and protein levels. Deep neural networks automatically learn informative sequence representations and interpreting them enables us to improve our understanding of the regulatory code governing gene expression. Here, we review the latest developments that apply shallow or deep learning to quantify molecular phenotypes and decode the cis-regulatory grammar from prokaryotic and eukaryotic sequencing data. Our approach is to build from the ground up, first focusing on the initiating protein-DNA interactions, then specific coding and non-coding regions, and finally on advances that combine multiple parts of the gene and mRNA regulatory structures, achieving unprecedented performance. We thus provide a quantitative view of gene expression regulation from nucleotide sequence, concluding with an information-centric overview of the central dogma of molecular biology.
Collapse
Affiliation(s)
- Jan Zrimec
- Department of Biology and Biological Engineering, Chalmers University of Technology, Gothenburg, Sweden
| | - Filip Buric
- Department of Biology and Biological Engineering, Chalmers University of Technology, Gothenburg, Sweden
| | - Mariia Kokina
- Department of Biology and Biological Engineering, Chalmers University of Technology, Gothenburg, Sweden
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kongens Lyngby, Denmark
| | - Victor Garcia
- School of Life Sciences and Facility Management, Zurich University of Applied Sciences, Wädenswil, Switzerland
| | - Aleksej Zelezniak
- Department of Biology and Biological Engineering, Chalmers University of Technology, Gothenburg, Sweden
- Science for Life Laboratory, Stockholm, Sweden
| |
Collapse
|
18
|
Abstract
mRNA vaccines have evolved from being a mere curiosity to emerging as COVID-19 vaccine front-runners. Recent advancements in the field of RNA technology, vaccinology, and nanotechnology have generated interest in delivering safe and effective mRNA therapeutics. In this review, we discuss design and self-assembly of mRNA vaccines. Self-assembly, a spontaneous organization of individual molecules, allows for design of nanoparticles with customizable properties. We highlight the materials commonly utilized to deliver mRNA, their physicochemical characteristics, and other relevant considerations, such as mRNA optimization, routes of administration, cellular fate, and immune activation, that are important for successful mRNA vaccination. We also examine the COVID-19 mRNA vaccines currently in clinical trials. mRNA vaccines are ready for the clinic, showing tremendous promise in the COVID-19 vaccine race, and have pushed the boundaries of gene therapy.
Collapse
Affiliation(s)
- Jeonghwan Kim
- Department of Pharmaceutical Sciences, College of Pharmacy, Oregon State University, Robertson Life Science Building, 2730 South Moody Avenue, Portland, Oregon 97201, USA
| | - Yulia Eygeris
- Department of Pharmaceutical Sciences, College of Pharmacy, Oregon State University, Robertson Life Science Building, 2730 South Moody Avenue, Portland, Oregon 97201, USA
| | - Mohit Gupta
- Department of Pharmaceutical Sciences, College of Pharmacy, Oregon State University, Robertson Life Science Building, 2730 South Moody Avenue, Portland, Oregon 97201, USA
| | - Gaurav Sahay
- Department of Pharmaceutical Sciences, College of Pharmacy, Oregon State University, Robertson Life Science Building, 2730 South Moody Avenue, Portland, Oregon 97201, USA; Department of Biomedical Engineering, Oregon Health & Science University, Robertson Life Science Building, 2730 South Moody Avenue, Portland, Oregon 97201, USA; Department of Ophthalmology, Casey Eye Institute, Oregon Health & Science University, Portland, Oregon 97239, USA.
| |
Collapse
|
19
|
Abstract
The design and optimization of biological systems is an inherently complex undertaking that requires careful balancing of myriad synergistic and antagonistic variables. However, despite this complexity, much synthetic biology research is predicated on One Factor at A Time (OFAT) experimentation; the genetic and environmental variables affecting the activity of a system of interest are sequentially altered while all other variables are held constant. Beyond being time and resource intensive, OFAT experimentation crucially ignores the effect of interactions between factors. Given the ubiquity of interacting genetic and environmental factors in biology this failure to account for interaction effects in OFAT experimentation can result in the development of suboptimal systems. To address these limitations, an increasing number of studies have turned to Design of Experiments (DoE), a suite of methods that enable efficient, systematic exploration and exploitation of complex design spaces. This review provides an overview of DoE for synthetic biologists. Key concepts and commonly used experimental designs are introduced, and we discuss the advantages of DoE as compared to OFAT experimentation. We dissect the applicability of DoE in the context of synthetic biology and review studies which have successfully employed these methods, illustrating the potential of statistical experimental design to guide the design, characterization, and optimization of biological protocols, pathways, and processes.
Collapse
Affiliation(s)
- James Gilman
- Institute for Bioengineering, School of Engineering, University of Edinburgh, Edinburgh EH8 9YL, U.K
| | - Laura Walls
- Institute for Bioengineering, School of Engineering, University of Edinburgh, Edinburgh EH8 9YL, U.K
| | - Lucia Bandiera
- Institute for Bioengineering, School of Engineering, University of Edinburgh, Edinburgh EH8 9YL, U.K
| | - Filippo Menolascina
- Institute for Bioengineering, School of Engineering, University of Edinburgh, Edinburgh EH8 9YL, U.K
| |
Collapse
|
20
|
Goris T, Pérez‐Valero Á, Martínez I, Yi D, Fernández‐Calleja L, San León D, Bornscheuer UT, Magadán‐Corpas P, Lombó F, Nogales J. Repositioning microbial biotechnology against COVID-19: the case of microbial production of flavonoids. Microb Biotechnol 2021; 14:94-110. [PMID: 33047877 PMCID: PMC7675739 DOI: 10.1111/1751-7915.13675] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2020] [Revised: 09/17/2020] [Accepted: 09/18/2020] [Indexed: 12/19/2022] Open
Abstract
Coronavirus-related disease 2019 (COVID-19) became a pandemic in February 2020, and worldwide researchers try to tackle the disease with approved drugs of all kinds, or to develop novel compounds inhibiting viral spreading. Flavonoids, already investigated as antivirals in general, also might bear activities specific for the viral agent causing COVID-19, SARS-CoV-2. Microbial biotechnology and especially synthetic biology may help to produce flavonoids, which are exclusive plant secondary metabolites, at a larger scale or indeed to find novel pharmaceutically active flavonoids. Here, we review the state of the art in (i) antiviral activity of flavonoids specific for coronaviruses and (ii) results derived from computational studies, mostly docking studies mainly inhibiting specific coronaviral proteins such as the 3CL (main) protease, the spike protein or the RNA-dependent RNA polymerase. In the end, we strive towards a synthetic biology pipeline making the fast and tailored production of valuable antiviral flavonoids possible by applying the last concepts of division of labour through co-cultivation/microbial community approaches to the DBTL (Design, Build, Test, Learn) principle.
Collapse
Affiliation(s)
- Tobias Goris
- Department of Molecular Toxicology, Research Group Intestinal MicrobiologyGerman Institute of Human Nutrition Potsdam‐RehbrueckeArthur‐Scheunert‐Allee 114‐116NuthetalBrandenburg14558Germany
| | - Álvaro Pérez‐Valero
- Research Unit “Biotechnology in Nutraceuticals and Bioactive Compounds‐BIONUC”Departamento de Biología Funcional, Área de MicrobiologíaUniversidad de OviedoOviedoSpain
- Instituto Universitario de Oncología del Principado de AsturiasOviedoSpain
- Instituto de Investigación Sanitaria del Principado de AsturiasOviedoSpain
| | - Igor Martínez
- Department of Systems BiologyCentro Nacional de BiotecnologíaCSICMadridSpain
| | - Dong Yi
- Department of Biotechnology & Enzyme CatalysisInstitute of BiochemistryUniversity GreifswaldFelix‐Hausdorff‐Str. 4GreifswaldD‐17487Germany
| | - Luis Fernández‐Calleja
- Research Unit “Biotechnology in Nutraceuticals and Bioactive Compounds‐BIONUC”Departamento de Biología Funcional, Área de MicrobiologíaUniversidad de OviedoOviedoSpain
- Instituto Universitario de Oncología del Principado de AsturiasOviedoSpain
- Instituto de Investigación Sanitaria del Principado de AsturiasOviedoSpain
| | - David San León
- Department of Systems BiologyCentro Nacional de BiotecnologíaCSICMadridSpain
| | - Uwe T. Bornscheuer
- Department of Biotechnology & Enzyme CatalysisInstitute of BiochemistryUniversity GreifswaldFelix‐Hausdorff‐Str. 4GreifswaldD‐17487Germany
| | - Patricia Magadán‐Corpas
- Research Unit “Biotechnology in Nutraceuticals and Bioactive Compounds‐BIONUC”Departamento de Biología Funcional, Área de MicrobiologíaUniversidad de OviedoOviedoSpain
- Instituto Universitario de Oncología del Principado de AsturiasOviedoSpain
- Instituto de Investigación Sanitaria del Principado de AsturiasOviedoSpain
| | - Felipe Lombó
- Research Unit “Biotechnology in Nutraceuticals and Bioactive Compounds‐BIONUC”Departamento de Biología Funcional, Área de MicrobiologíaUniversidad de OviedoOviedoSpain
- Instituto Universitario de Oncología del Principado de AsturiasOviedoSpain
- Instituto de Investigación Sanitaria del Principado de AsturiasOviedoSpain
| | - Juan Nogales
- Department of Systems BiologyCentro Nacional de BiotecnologíaCSICMadridSpain
- Interdisciplinary Platform for Sustainable Plastics towards a Circular Economy‐Spanish National Research Council (SusPlast‐CSIC)MadridSpain
| |
Collapse
|
21
|
Nieuwkoop T, Finger-Bou M, van der Oost J, Claassens NJ. The Ongoing Quest to Crack the Genetic Code for Protein Production. Mol Cell 2020; 80:193-209. [PMID: 33010203 DOI: 10.1016/j.molcel.2020.09.014] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2020] [Revised: 08/10/2020] [Accepted: 09/10/2020] [Indexed: 01/05/2023]
Abstract
Understanding the genetic design principles that determine protein production remains a major challenge. Although the key principles of gene expression were discovered 50 years ago, additional factors are still being uncovered. Both protein-coding and non-coding sequences harbor elements that collectively influence the efficiency of protein production by modulating transcription, mRNA decay, and translation. The influences of many contributing elements are intertwined, which complicates a full understanding of the individual factors. In natural genes, a functional balance between these factors has been obtained in the course of evolution, whereas for genetic-engineering projects, our incomplete understanding still limits optimal design of synthetic genes. However, notable advances have recently been made, supported by high-throughput analysis of synthetic gene libraries as well as by state-of-the-art biomolecular techniques. We discuss here how these advances further strengthen understanding of the gene expression process and how they can be harnessed to optimize protein production.
Collapse
Affiliation(s)
- Thijs Nieuwkoop
- Laboratory of Microbiology, Wageningen University, Stippeneng 4, 6708 WE Wageningen, the Netherlands
| | - Max Finger-Bou
- Laboratory of Microbiology, Wageningen University, Stippeneng 4, 6708 WE Wageningen, the Netherlands
| | - John van der Oost
- Laboratory of Microbiology, Wageningen University, Stippeneng 4, 6708 WE Wageningen, the Netherlands
| | - Nico J Claassens
- Laboratory of Microbiology, Wageningen University, Stippeneng 4, 6708 WE Wageningen, the Netherlands.
| |
Collapse
|
22
|
Sergeeva D, Lee GM, Nielsen LK, Grav LM. Multicopy Targeted Integration for Accelerated Development of High-Producing Chinese Hamster Ovary Cells. ACS Synth Biol 2020; 9:2546-2561. [PMID: 32835482 DOI: 10.1021/acssynbio.0c00322] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
The ever-growing biopharmaceutical industry relies on the production of recombinant therapeutic proteins in Chinese hamster ovary (CHO) cells. The traditional timelines of CHO cell line development can be significantly shortened by the use of targeted gene integration (TI). However, broad use of TI has been limited due to the low specific productivity (qP) of TI-generated clones. Here, we show a 10-fold increase in the qP of therapeutic glycoproteins in CHO cells through the development and optimization of a multicopy TI method. We used a recombinase-mediated cassette exchange (RMCE) platform to investigate the effect of gene copy number, 5' and 3' gene regulatory elements, and landing pad features on qP. We evaluated the limitations of multicopy expression from a single genomic site as well as multiple genomic sites and found that a transcriptional bottleneck can appear with an increase in gene dosage. We created a dual-RMCE system for simultaneous multicopy TI in two genomic sites and generated isogenic high-producing clones with qP of 12-14 pg/cell/day and product titer close to 1 g/L in fed-batch. Our study provides an extensive characterization of the multicopy TI method and elucidates the relationship between gene copy number and protein expression in mammalian cells. Moreover, it demonstrates that TI-generated CHO cells are capable of producing therapeutic proteins at levels that can support their industrial manufacture.
Collapse
Affiliation(s)
- Daria Sergeeva
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kgs. Lyngby 2800, Denmark
| | - Gyun Min Lee
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kgs. Lyngby 2800, Denmark
- Department of Biological Sciences, KAIST, Daejeon 34141, Republic of Korea
| | - Lars Keld Nielsen
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kgs. Lyngby 2800, Denmark
- Australian Institute for Bioengineering and Nanotechnology, University of Queensland, Brisbane 4072, Australia
| | - Lise Marie Grav
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kgs. Lyngby 2800, Denmark
| |
Collapse
|
23
|
Chaudhuri A, Das S, Das B. Localization elements and zip codes in the intracellular transport and localization of messenger RNAs in Saccharomyces cerevisiae. Wiley Interdiscip Rev RNA 2020; 11:e1591. [PMID: 32101377 DOI: 10.1002/wrna.1591] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/03/2019] [Revised: 02/05/2020] [Accepted: 02/07/2020] [Indexed: 12/13/2022]
Abstract
Intracellular trafficking and localization of mRNAs provide a mechanism of regulation of expression of genes with excellent spatial control. mRNA localization followed by localized translation appears to be a mechanism of targeted protein sorting to a specific cell-compartment, which is linked to the establishment of cell polarity, cell asymmetry, embryonic axis determination, and neuronal plasticity in metazoans. However, the complexity of the mechanism and the components of mRNA localization in higher organisms prompted the use of the unicellular organism Saccharomyces cerevisiae as a simplified model organism to study this vital process. Current knowledge indicates that a variety of mRNAs are asymmetrically and selectively localized to the tip of the bud of the daughter cells, to the vicinity of endoplasmic reticulum, mitochondria, and nucleus in this organism, which are connected to diverse cellular processes. Interestingly, specific cis-acting RNA localization elements (LEs) or RNA zip codes play a crucial role in the localization and trafficking of these localized mRNAs by providing critical binding sites for the specific RNA-binding proteins (RBPs). In this review, we present a comprehensive account of mRNA localization in S. cerevisiae, various types of localization elements influencing the mRNA localization, and the RBPs, which bind to these LEs to implement a number of vital physiological processes. Finally, we emphasize the significance of this process by highlighting their connection to several neuropathological disorders and cancers. This article is categorized under: RNA Export and Localization > RNA Localization.
Collapse
Affiliation(s)
- Anusha Chaudhuri
- Department of Life Science and Biotechnology, Jadavpur University, Kolkata, India
| | - Subhadeep Das
- Department of Life Science and Biotechnology, Jadavpur University, Kolkata, India
| | - Biswadip Das
- Department of Life Science and Biotechnology, Jadavpur University, Kolkata, India
| |
Collapse
|