1
|
Ranaghan MJ, Li JJ, Laprise DM, Garvie CW. Assessing optimal: inequalities in codon optimization algorithms. BMC Biol 2021; 19:36. [PMID: 33607980 PMCID: PMC7893858 DOI: 10.1186/s12915-021-00968-8] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2020] [Accepted: 01/26/2021] [Indexed: 12/16/2022] Open
Abstract
BACKGROUND Custom genes have become a common resource in recombinant biology over the last 20 years due to the plummeting cost of DNA synthesis. These genes are often "optimized" to non-native sequences for overexpression in a non-native host by substituting synonymous codons within the coding DNA sequence (CDS). A handful of studies have compared native and optimized CDSs, reporting different levels of soluble product due to the accumulation of misfolded aggregates, variable activity of enzymes, and (at least one report of) a change in substrate specificity. No study, to the best of our knowledge, has performed a practical comparison of CDSs generated from different codon optimization algorithms or reported the corresponding protein yields. RESULTS In our efforts to understand what factors constitute an optimized CDS, we identified that there is little consensus among codon-optimization algorithms, a roughly equivalent chance that an algorithm-optimized CDS will increase or diminish recombinant yields as compared to the native DNA, a near ubiquitous use of a codon database that was last updated in 2007, and a high variability of output CDSs by some algorithms. We present a case study, using KRas4B, to demonstrate that a median codon frequency may be a better predictor of soluble yields than the more commonly utilized CAI metric. CONCLUSIONS We present a method for visualizing, analyzing, and comparing algorithm-optimized DNA sequences for recombinant protein expression. We encourage researchers to consider if DNA optimization is right for their experiments, and work towards improving the reproducibility of published recombinant work by publishing non-native CDSs.
Collapse
Affiliation(s)
- Matthew J Ranaghan
- Center for the Development of Therapeutics, The Broad Institute of MIT and Harvard, 415 Main Street, Cambridge, MA, 02142, USA.
| | - Jeffrey J Li
- Center for the Development of Therapeutics, The Broad Institute of MIT and Harvard, 415 Main Street, Cambridge, MA, 02142, USA
| | - Dylan M Laprise
- Center for the Development of Therapeutics, The Broad Institute of MIT and Harvard, 415 Main Street, Cambridge, MA, 02142, USA
| | - Colin W Garvie
- Center for the Development of Therapeutics, The Broad Institute of MIT and Harvard, 415 Main Street, Cambridge, MA, 02142, USA
| |
Collapse
|
2
|
Cripwell RA, Rose SH, van Zyl WH. Expression and comparison of codon optimised Aspergillus tubingensis amylase variants in Saccharomyces cerevisiae. FEMS Yeast Res 2017. [DOI: 10.1093/femsyr/fox040] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
|
3
|
Chen Y, Liu Y, Zhang G, Wang A, Dong Z, Qi Y, Wang J, Zhao B, Li N, Jiang M. Human papillomavirus L1 protein expressed in Escherichia coli self-assembles into virus-like particles that are highly immunogenic. Virus Res 2016; 220:97-103. [DOI: 10.1016/j.virusres.2016.04.017] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2016] [Revised: 04/17/2016] [Accepted: 04/19/2016] [Indexed: 12/13/2022]
|
4
|
DNASynth: a computer program for assembly of artificial gene parts in decreasing temperature. BIOMED RESEARCH INTERNATIONAL 2015; 2015:413262. [PMID: 25629047 PMCID: PMC4300049 DOI: 10.1155/2015/413262] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/30/2014] [Revised: 10/08/2014] [Accepted: 10/11/2014] [Indexed: 11/23/2022]
Abstract
Artificial gene synthesis requires consideration of nucleotide sequence development as well as long DNA molecule assembly protocols. The nucleotide sequence of the molecule must meet many conditions including particular preferences of the host organism for certain codons, avoidance of specific regulatory subsequences, and a lack of secondary structures that inhibit expression. The chemical synthesis of DNA molecule has limitations in terms of strand length; thus, the creation of artificial genes requires the assembly of long DNA molecules from shorter fragments.
In the approach presented, the algorithm and the computer program address both tasks: developing the optimal nucleotide sequence to encode a given peptide for a given host organism and determining the long DNA assembly protocol. These tasks are closely connected; a change in codon usage may lead to changes in the optimal assembly protocol, and the lack of a simple assembly protocol may be addressed by changing the nucleotide sequence. The computer program presented in this study was tested with real data from an experiment in a wet biological laboratory to synthesize a peptide. The benefit of the presented algorithm and its application is the shorter time, compared to polymerase cycling assembly, needed to produce a ready synthetic gene.
Collapse
|
5
|
Liu X, Deng R, Wang J, Wang X. COStar: A D-star Lite-based dynamic search algorithm for codon optimization. J Theor Biol 2014; 344:19-30. [DOI: 10.1016/j.jtbi.2013.11.022] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2013] [Revised: 11/24/2013] [Accepted: 11/26/2013] [Indexed: 01/29/2023]
|
6
|
Nørholm MH, Toddo S, Virkki MT, Light S, von Heijne G, Daley DO. Improved production of membrane proteins in Escherichia coli
by selective codon substitutions. FEBS Lett 2013; 587:2352-8. [DOI: 10.1016/j.febslet.2013.05.063] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2013] [Revised: 05/24/2013] [Accepted: 05/27/2013] [Indexed: 11/29/2022]
|
7
|
Reuveni S, Meilijson I, Kupiec M, Ruppin E, Tuller T. Genome-scale analysis of translation elongation with a ribosome flow model. PLoS Comput Biol 2011; 7:e1002127. [PMID: 21909250 PMCID: PMC3164701 DOI: 10.1371/journal.pcbi.1002127] [Citation(s) in RCA: 122] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2011] [Accepted: 06/06/2011] [Indexed: 11/18/2022] Open
Abstract
We describe the first large scale analysis of gene translation that is based on a model that takes into account the physical and dynamical nature of this process. The Ribosomal Flow Model (RFM) predicts fundamental features of the translation process, including translation rates, protein abundance levels, ribosomal densities and the relation between all these variables, better than alternative ('non-physical') approaches. In addition, we show that the RFM can be used for accurate inference of various other quantities including genes' initiation rates and translation costs. These quantities could not be inferred by previous predictors. We find that increasing the number of available ribosomes (or equivalently the initiation rate) increases the genomic translation rate and the mean ribosome density only up to a certain point, beyond which both saturate. Strikingly, assuming that the translation system is tuned to work at the pre-saturation point maximizes the predictive power of the model with respect to experimental data. This result suggests that in all organisms that were analyzed (from bacteria to Human), the global initiation rate is optimized to attain the pre-saturation point. The fact that similar results were not observed for heterologous genes indicates that this feature is under selection. Remarkably, the gap between the performance of the RFM and alternative predictors is strikingly large in the case of heterologous genes, testifying to the model's promising biotechnological value in predicting the abundance of heterologous proteins before expressing them in the desired host.
Collapse
Affiliation(s)
- Shlomi Reuveni
- Department of Statistics and Operations Research, School of Mathematical Sciences, Tel Aviv University, Ramat Aviv, Israel
- School of Chemistry, Tel Aviv University, Ramat Aviv, Israel
| | - Isaac Meilijson
- Department of Statistics and Operations Research, School of Mathematical Sciences, Tel Aviv University, Ramat Aviv, Israel
| | - Martin Kupiec
- Molecular Microbiology and Biotechnology, Tel Aviv University, Ramat Aviv, Israel
| | - Eytan Ruppin
- School of Computer Sciences, Tel Aviv University, Ramat Aviv, Israel
- School of Medicine, Tel Aviv University, Ramat Aviv, Israel
| | - Tamir Tuller
- Faculty of Mathematics and Computer Science, Weizmann Institute of Science, Rehovot, Israel
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, Israel
- * E-mail:
| |
Collapse
|
8
|
Nørholm MHH, Light S, Virkki MTI, Elofsson A, von Heijne G, Daley DO. Manipulating the genetic code for membrane protein production: what have we learnt so far? BIOCHIMICA ET BIOPHYSICA ACTA-BIOMEMBRANES 2011; 1818:1091-6. [PMID: 21884679 DOI: 10.1016/j.bbamem.2011.08.018] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/20/2011] [Revised: 08/04/2011] [Accepted: 08/15/2011] [Indexed: 12/19/2022]
Abstract
With synthetic gene services, molecular cloning is as easy as ordering a pizza. However choosing the right RNA code for efficient protein production is less straightforward, more akin to deciding on the pizza toppings. The possibility to choose synonymous codons in the gene sequence has ignited a discussion that dates back 50 years: Does synonymous codon use matter? Recent studies indicate that replacement of particular codons for synonymous codons can improve expression in homologous or heterologous hosts, however it is not always successful. Furthermore it is increasingly apparent that membrane protein biogenesis can be codon-sensitive. Single synonymous codon substitutions can influence mRNA stability, mRNA structure, translational initiation, translational elongation and even protein folding. Synonymous codon substitutions therefore need to be carefully evaluated when membrane proteins are engineered for higher production levels and further studies are needed to fully understand how to select the codons that are optimal for higher production. This article is part of a Special Issue entitled: Protein Folding in Membranes.
Collapse
Affiliation(s)
- Morten H H Nørholm
- Center for Biomembrane Research, Department of Biochemistry and Biophysics, Stockholm University, SE-106 91, Sweden.
| | | | | | | | | | | |
Collapse
|
9
|
Strategies for high-level recombinant protein expression in transgenic microalgae: A review. Biotechnol Adv 2010; 28:910-8. [DOI: 10.1016/j.biotechadv.2010.08.006] [Citation(s) in RCA: 127] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2010] [Revised: 08/03/2010] [Accepted: 08/13/2010] [Indexed: 11/22/2022]
|
10
|
Maertens B, Spriestersbach A, von Groll U, Roth U, Kubicek J, Gerrits M, Graf M, Liss M, Daubert D, Wagner R, Schäfer F. Gene optimization mechanisms: a multi-gene study reveals a high success rate of full-length human proteins expressed in Escherichia coli. Protein Sci 2010; 19:1312-26. [PMID: 20506237 PMCID: PMC2970903 DOI: 10.1002/pro.408] [Citation(s) in RCA: 74] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
The genetic code is universal, but recombinant protein expression in heterologous systems is often hampered by divergent codon usage. Here, we demonstrate that reprogramming by standardized multi-parameter gene optimization software and de novo gene synthesis is a suitable general strategy to improve heterologous protein expression. This study compares expression levels of 94 full-length human wt and sequence-optimized genes coding for pharmaceutically important proteins such as kinases and membrane proteins in E. coli. Fluorescence-based quantification revealed increased protein yields for 70% of in vivo expressed optimized genes compared to the wt DNA sequences and also resulted in increased amounts of protein that can be purified. The improvement in transgene expression correlated with higher mRNA levels in our analyzed examples. In all cases tested, expression levels using wt genes in tRNA-supplemented bacterial strains were outperformed by optimized genes expressed in non-supplemented host cells.
Collapse
|
11
|
Multifactorial determinants of protein expression in prokaryotic open reading frames. J Mol Biol 2010; 402:905-18. [PMID: 20727358 DOI: 10.1016/j.jmb.2010.08.010] [Citation(s) in RCA: 79] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2010] [Revised: 07/27/2010] [Accepted: 08/05/2010] [Indexed: 01/21/2023]
Abstract
A quantitative description of the relationship between protein expression levels and open reading frame (ORF) nucleotide sequences is important for understanding natural systems, designing synthetic systems, and optimizing heterologous expression. Codon identity, mRNA secondary structure, and nucleotide composition within ORFs markedly influence expression levels. Bioinformatic analysis of ORF sequences in 816 bacterial genomes revealed that these features show distinct regional trends. To investigate their effects on protein expression, we designed 285 synthetic genes and determined corresponding expression levels in vitro using Escherichia coli extracts. We developed a mathematical function, parameterized using this synthetic gene data set, which enables computation of protein expression levels from ORF nucleotide sequences. In addition to its practical application in the design of heterologous expression systems, this equation provides mechanistic insight into the factors that control translation efficiency. We found that expression is strongly dependent on the presence of high AU content and low secondary structure in the ORF 5' region. Choice of high-frequency codons contributes to a lesser extent. The 3' terminal AU content makes modest, but detectable contributions. We present a model for the effect of these factors on the three phases of ribosomal function: initiation, elongation, and termination.
Collapse
|
12
|
Abstract
Proteins are the most versatile among the various biological building blocks and a mature field of protein engineering has lead to many industrial and biomedical applications. But the strength of proteins—their versatility, dynamics and interactions—also complicates and hinders systems engineering. Therefore, the design of more sophisticated, multi-component protein systems appears to lag behind, in particular, when compared to the engineering of gene regulatory networks. Yet, synthetic biologists have started to tinker with the information flow through natural signaling networks or integrated protein switches. A successful strategy common to most of these experiments is their focus on modular interactions between protein domains or domains and peptide motifs. Such modular interaction swapping has rewired signaling in yeast, put mammalian cell morphology under the control of light, or increased the flux through a synthetic metabolic pathway. Based on this experience, we outline an engineering framework for the connection of reusable protein interaction devices into self-sufficient circuits. Such a framework should help to ‘refacture’ protein complexity into well-defined exchangeable devices for predictive engineering. We review the foundations and initial success stories of protein synthetic biology and discuss the challenges and promises on the way from protein- to protein systems design.
Collapse
Affiliation(s)
- Raik Grünberg
- EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG), UPF, 08003 Barcelona, Spain.
| | | |
Collapse
|
13
|
Han JH, Choi YS, Kim WJ, Jeon YH, Lee SK, Lee BJ, Ryu KS. Codon optimization enhances protein expression of human peptide deformylase in E. coli. Protein Expr Purif 2009; 70:224-30. [PMID: 19825416 DOI: 10.1016/j.pep.2009.10.005] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2009] [Revised: 10/06/2009] [Accepted: 10/06/2009] [Indexed: 01/27/2023]
Abstract
Human peptide deformylase (hPDF), located in the mitochondria, has recently become a promising target for anti-cancer therapy. However, the expression of the hPDF gene in Escherichia coli is not efficient likely due to extremely high levels of GC content as well as the presence of rare codons. We performed codon optimization of the hPDF gene in order to reduce GC content and to eliminate rare codons. Putative stable secondary structures of the optimized gene were also reduced. Codon optimization increased the expression of hPDF protein (residues 63-243) presumably by reducing the GC content. A large amount of soluble hPDF was obtained upon its fusion with thioredoxin (Trx-hPDF), although an insoluble fraction was still dominant. We confirmed that Co(2+) is an optimal metal for increasing the activity of purified Trx-hPDF, and that actinonin acts as an efficient inhibitor. Therefore, a large amount of purified hPDF protein would provide many benefits for the screening of various drug candidates.
Collapse
Affiliation(s)
- Ji-Hoon Han
- Division of Magnetic Resonance, Korea Basic Science Institute Ochang Campus, Cheongwon-Gun, Ochang-Eup, Yangcheong-Ri 804-1, Chungcheongbuk-Do 363-883, Republic of Korea
| | | | | | | | | | | | | |
Collapse
|
14
|
Welch M, Govindarajan S, Ness JE, Villalobos A, Gurney A, Minshull J, Gustafsson C. Design parameters to control synthetic gene expression in Escherichia coli. PLoS One 2009; 4:e7002. [PMID: 19759823 PMCID: PMC2736378 DOI: 10.1371/journal.pone.0007002] [Citation(s) in RCA: 265] [Impact Index Per Article: 16.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2009] [Accepted: 08/17/2009] [Indexed: 01/10/2023] Open
Abstract
BACKGROUND Production of proteins as therapeutic agents, research reagents and molecular tools frequently depends on expression in heterologous hosts. Synthetic genes are increasingly used for protein production because sequence information is easier to obtain than the corresponding physical DNA. Protein-coding sequences are commonly re-designed to enhance expression, but there are no experimentally supported design principles. PRINCIPAL FINDINGS To identify sequence features that affect protein expression we synthesized and expressed in E. coli two sets of 40 genes encoding two commercially valuable proteins, a DNA polymerase and a single chain antibody. Genes differing only in synonymous codon usage expressed protein at levels ranging from undetectable to 30% of cellular protein. Using partial least squares regression we tested the correlation of protein production levels with parameters that have been reported to affect expression. We found that the amount of protein produced in E. coli was strongly dependent on the codons used to encode a subset of amino acids. Favorable codons were predominantly those read by tRNAs that are most highly charged during amino acid starvation, not codons that are most abundant in highly expressed E. coli proteins. Finally we confirmed the validity of our models by designing, synthesizing and testing new genes using codon biases predicted to perform well. CONCLUSION The systematic analysis of gene design parameters shown in this study has allowed us to identify codon usage within a gene as a critical determinant of achievable protein expression levels in E. coli. We propose a biochemical basis for this, as well as design algorithms to ensure high protein production from synthetic genes. Replication of this methodology should allow similar design algorithms to be empirically derived for any expression system.
Collapse
|
15
|
Williams JA, Carnes AE, Hodgson CP. Plasmid DNA vaccine vector design: impact on efficacy, safety and upstream production. Biotechnol Adv 2009; 27:353-70. [PMID: 19233255 PMCID: PMC2693335 DOI: 10.1016/j.biotechadv.2009.02.003] [Citation(s) in RCA: 121] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2008] [Revised: 02/02/2009] [Accepted: 02/07/2009] [Indexed: 10/21/2022]
Abstract
Critical molecular and cellular biological factors impacting design of licensable DNA vaccine vectors that combine high yield and integrity during bacterial production with increased expression in mammalian cells are reviewed. Food and Drug Administration (FDA), World Health Organization (WHO) and European Medical Agencies (EMEA) regulatory guidance's are discussed, as they relate to vector design and plasmid fermentation. While all new vectors will require extensive preclinical testing to validate safety and performance prior to clinical use, regulatory testing burden for follow-on products can be reduced by combining carefully designed synthetic genes with existing validated vector backbones. A flowchart for creation of new synthetic genes, combining rationale design with bioinformatics, is presented. The biology of plasmid replication is reviewed, and process engineering strategies that reduce metabolic burden discussed. Utilizing recently developed low metabolic burden seed stock and fermentation strategies, optimized vectors can now be manufactured in high yields exceeding 2 g/L, with specific plasmid yields of 5% total dry cell weight.
Collapse
|
16
|
Raymond A, Lovell S, Lorimer D, Walchli J, Mixon M, Wallace E, Thompkins K, Archer K, Burgin A, Stewart L. Combined protein construct and synthetic gene engineering for heterologous protein expression and crystallization using Gene Composer. BMC Biotechnol 2009; 9:37. [PMID: 19383143 PMCID: PMC2680836 DOI: 10.1186/1472-6750-9-37] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2008] [Accepted: 04/21/2009] [Indexed: 01/29/2023] Open
Abstract
Background With the goal of improving yield and success rates of heterologous protein production for structural studies we have developed the database and algorithm software package Gene Composer. This freely available electronic tool facilitates the information-rich design of protein constructs and their engineered synthetic gene sequences, as detailed in the accompanying manuscript. Results In this report, we compare heterologous protein expression levels from native sequences to that of codon engineered synthetic gene constructs designed by Gene Composer. A test set of proteins including a human kinase (P38α), viral polymerase (HCV NS5B), and bacterial structural protein (FtsZ) were expressed in both E. coli and a cell-free wheat germ translation system. We also compare the protein expression levels in E. coli for a set of 11 different proteins with greatly varied G:C content and codon bias. Conclusion The results consistently demonstrate that protein yields from codon engineered Gene Composer designs are as good as or better than those achieved from the synonymous native genes. Moreover, structure guided N- and C-terminal deletion constructs designed with the aid of Gene Composer can lead to greater success in gene to structure work as exemplified by the X-ray crystallographic structure determination of FtsZ from Bacillus subtilis. These results validate the Gene Composer algorithms, and suggest that using a combination of synthetic gene and protein construct engineering tools can improve the economics of gene to structure research.
Collapse
Affiliation(s)
- Amy Raymond
- deCODE biostructures Inc, 7869 NE Day Road West, Bainbridge Island, WA 98110, USA.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
17
|
Welch M, Villalobos A, Gustafsson C, Minshull J. You're one in a googol: optimizing genes for protein expression. J R Soc Interface 2009; 6 Suppl 4:S467-76. [PMID: 19324676 DOI: 10.1098/rsif.2008.0520.focus] [Citation(s) in RCA: 80] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
A vast number of different nucleic acid sequences can all be translated by the genetic code into the same amino acid sequence. These sequences are not all equally useful however; the exact sequence chosen can have profound effects on the expression of the encoded protein. Despite the importance of protein-coding sequences, there has been little systematic study to identify parameters that affect expression. This is probably because protein expression has largely been tackled on an ad hoc basis in many independent projects: once a sequence has been obtained that yields adequate expression for that project, there is little incentive to continue work on the problem. Synthetic biology may now provide the impetus to transform protein expression folklore into design principles, so that DNA sequences may easily be designed to express any protein in any system. In this review, we offer a brief survey of the literature, outline the major challenges in interpreting existing data and constructing robust design algorithms, and propose a way to proceed towards the goal of rational sequence engineering.
Collapse
Affiliation(s)
- Mark Welch
- DNA 2.0, Inc., 1430 O'Brien Drive, Menlo Park, CA 94025, USA
| | | | | | | |
Collapse
|
18
|
Ye H, Huang MC, Li MH, Ying JY. Experimental analysis of gene assembly with TopDown one-step real-time gene synthesis. Nucleic Acids Res 2009; 37:e51. [PMID: 19264797 PMCID: PMC2673447 DOI: 10.1093/nar/gkp118] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
Herein we present a simple, cost-effective TopDown (TD) gene synthesis method that eliminates the interference between the polymerase chain reactions (PCR) assembly and amplification in one-step gene synthesis. The method involves two key steps: (i) design of outer primers and assembly oligonucleotide set with a melting temperature difference of >10°C and (ii) utilization of annealing temperatures to selectively control the efficiencies of oligonucleotide assembly and full-length template amplification. In addition, we have combined the proposed method with real-time PCR to analyze the step-wise efficiency and the kinetics of the gene synthesis process. Gel electrophoresis results are compared with real-time fluorescence signals to investigate the effects of oligonucleotide concentration, outer primer concentration, stringency of annealing temperature, and number of PCR cycles. Analysis of the experimental results has led to insights into the gene synthesis process. We further discuss the conditions for preventing the formation of spurious DNA products. The TD real-time gene synthesis method provides a simple and efficient method for assembling fairly long DNA sequence, and aids in optimizing gene synthesis conditions. To our knowledge, this is the first report that utilizes real-time PCR for gene synthesis.
Collapse
Affiliation(s)
- Hongye Ye
- Institute of Bioengineering and Nanotechnology, The Nanos, Singapore
| | | | | | | |
Collapse
|
19
|
Xiong AS, Peng RH, Zhuang J, Gao F, Li Y, Cheng ZM, Yao QH. Chemical gene synthesis: strategies, softwares, error corrections, and applications. FEMS Microbiol Rev 2008; 32:522-40. [DOI: 10.1111/j.1574-6976.2008.00109.x] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
|