251
|
Chemical synthesis of the mouse mitochondrial genome. Nat Methods 2010; 7:901-3. [DOI: 10.1038/nmeth.1515] [Citation(s) in RCA: 273] [Impact Index Per Article: 18.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2010] [Accepted: 09/08/2010] [Indexed: 12/13/2022]
|
252
|
Maertens B, Spriestersbach A, von Groll U, Roth U, Kubicek J, Gerrits M, Graf M, Liss M, Daubert D, Wagner R, Schäfer F. Gene optimization mechanisms: a multi-gene study reveals a high success rate of full-length human proteins expressed in Escherichia coli. Protein Sci 2010; 19:1312-26. [PMID: 20506237 PMCID: PMC2970903 DOI: 10.1002/pro.408] [Citation(s) in RCA: 74] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
The genetic code is universal, but recombinant protein expression in heterologous systems is often hampered by divergent codon usage. Here, we demonstrate that reprogramming by standardized multi-parameter gene optimization software and de novo gene synthesis is a suitable general strategy to improve heterologous protein expression. This study compares expression levels of 94 full-length human wt and sequence-optimized genes coding for pharmaceutically important proteins such as kinases and membrane proteins in E. coli. Fluorescence-based quantification revealed increased protein yields for 70% of in vivo expressed optimized genes compared to the wt DNA sequences and also resulted in increased amounts of protein that can be purified. The improvement in transgene expression correlated with higher mRNA levels in our analyzed examples. In all cases tested, expression levels using wt genes in tRNA-supplemented bacterial strains were outperformed by optimized genes expressed in non-supplemented host cells.
Collapse
|
253
|
Raab D, Graf M, Notka F, Schödl T, Wagner R. The GeneOptimizer Algorithm: using a sliding window approach to cope with the vast sequence space in multiparameter DNA sequence optimization. SYSTEMS AND SYNTHETIC BIOLOGY 2010; 4:215-25. [PMID: 21189842 PMCID: PMC2955205 DOI: 10.1007/s11693-010-9062-3] [Citation(s) in RCA: 146] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/19/2009] [Revised: 07/16/2010] [Accepted: 08/02/2010] [Indexed: 11/25/2022]
Abstract
One of the main advantages of de novo gene synthesis is the fact that it frees the researcher from any limitations imposed by the use of natural templates. To make the most out of this opportunity, efficient algorithms are needed to calculate a coding sequence, combining different requirements, such as adapted codon usage or avoidance of restriction sites, in the best possible way. We present an algorithm where a “variation window” covering several amino acid positions slides along the coding sequence. Candidate sequences are built comprising the already optimized part of the complete sequence and all possible combinations of synonymous codons representing the amino acids within the window. The candidate sequences are assessed with a quality function, and the first codon of the best candidates’ variation window is fixed. Subsequently the window is shifted by one codon position. As an example of a freely accessible software implementing the algorithm, we present the Mr. Gene web-application. Additionally two experimental applications of the algorithm are shown.
Collapse
|
254
|
Bulky high-mannose-type N-glycan blocks the taste-modifying activity of miraculin. Biochim Biophys Acta Gen Subj 2010; 1800:986-92. [DOI: 10.1016/j.bbagen.2010.06.003] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2010] [Revised: 05/17/2010] [Accepted: 06/04/2010] [Indexed: 11/20/2022]
|
255
|
Multifactorial determinants of protein expression in prokaryotic open reading frames. J Mol Biol 2010; 402:905-18. [PMID: 20727358 DOI: 10.1016/j.jmb.2010.08.010] [Citation(s) in RCA: 79] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2010] [Revised: 07/27/2010] [Accepted: 08/05/2010] [Indexed: 01/21/2023]
Abstract
A quantitative description of the relationship between protein expression levels and open reading frame (ORF) nucleotide sequences is important for understanding natural systems, designing synthetic systems, and optimizing heterologous expression. Codon identity, mRNA secondary structure, and nucleotide composition within ORFs markedly influence expression levels. Bioinformatic analysis of ORF sequences in 816 bacterial genomes revealed that these features show distinct regional trends. To investigate their effects on protein expression, we designed 285 synthetic genes and determined corresponding expression levels in vitro using Escherichia coli extracts. We developed a mathematical function, parameterized using this synthetic gene data set, which enables computation of protein expression levels from ORF nucleotide sequences. In addition to its practical application in the design of heterologous expression systems, this equation provides mechanistic insight into the factors that control translation efficiency. We found that expression is strongly dependent on the presence of high AU content and low secondary structure in the ORF 5' region. Choice of high-frequency codons contributes to a lesser extent. The 3' terminal AU content makes modest, but detectable contributions. We present a model for the effect of these factors on the three phases of ribosomal function: initiation, elongation, and termination.
Collapse
|
256
|
Cheong WC, Lim LS, Huang MC, Bode M, Li MH. New insights into the de novo gene synthesis using the automatic kinetics switch approach. Anal Biochem 2010; 406:51-60. [PMID: 20599643 DOI: 10.1016/j.ab.2010.06.036] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2010] [Revised: 06/18/2010] [Accepted: 06/23/2010] [Indexed: 12/27/2022]
Abstract
Here we present a simple, highly efficient, universal automatic kinetics switch (AKS) gene synthesis method that enables synthesis of DNA up to 1.6kbp from 1nM oligonucleotide with just one polymerase chain reaction (PCR) process. This method eliminates the interference between the PCR assembly and amplification in one-step gene synthesis and simultaneously maximizes the amplification of emerged desired DNA by using a pair of flanked primers. In addition, we describe an analytical model of PCR gene synthesis based on the thermodynamics and kinetics of DNA hybridization. The kinetics difference between standard PCR amplification and one-step PCR gene synthesis is analyzed using this model and is validated using real-time gene synthesis with eight gene segments (318-1656bp). The effects of oligonucleotide concentration, stringency of annealing temperature, annealing time, extension time, and PCR buffer conditions are examined systematically. Analysis of the experimental results leads to new insights into the gene synthesis process and aids in optimizing gene synthesis conditions. We further extend this method for multiplexing gene assembly with a total DNA length up to 5.74kbp from 1nM oligonucleotide.
Collapse
Affiliation(s)
- Wai Chye Cheong
- Institute of Bioengineering and Nanotechnology, The Nanos, Singapore, Singapore
| | | | | | | | | |
Collapse
|
257
|
Baronio R, Danziger SA, Hall LV, Salmon K, Hatfield GW, Lathrop RH, Kaiser P. All-codon scanning identifies p53 cancer rescue mutations. Nucleic Acids Res 2010; 38:7079-88. [PMID: 20581117 PMCID: PMC2978351 DOI: 10.1093/nar/gkq571] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
In vitro scanning mutagenesis strategies are valuable tools to identify critical residues in proteins and to generate proteins with modified properties. We describe the fast and simple All-Codon Scanning (ACS) strategy that creates a defined gene library wherein each individual codon within a specific target region is changed into all possible codons with only a single codon change per mutagenesis product. ACS is based on a multiplexed overlapping mutagenesis primer design that saturates only the targeted gene region with single codon changes. We have used ACS to produce single amino-acid changes in small and large regions of the human tumor suppressor protein p53 to identify single amino-acid substitutions that can restore activity to inactive p53 found in human cancers. Single-tube reactions were used to saturate defined 30-nt regions with all possible codon changes. The same technique was used in 20 parallel reactions to scan the 600-bp fragment encoding the entire p53 core domain. Identification of several novel p53 cancer rescue mutations demonstrated the utility of the ACS approach. ACS is a fast, simple and versatile method, which is useful for protein structure–function analyses and protein design or evolution problems.
Collapse
Affiliation(s)
- Roberta Baronio
- Institute for Genomics and Bioinformatics, Department of Biomedical Engineering, University of California, Irvine, CA 92697, USA
| | | | | | | | | | | | | |
Collapse
|
258
|
Sengupta D, Heilshorn SC. Protein-Engineered Biomaterials: Highly Tunable Tissue Engineering Scaffolds. TISSUE ENGINEERING PART B-REVIEWS 2010; 16:285-93. [DOI: 10.1089/ten.teb.2009.0591] [Citation(s) in RCA: 108] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Affiliation(s)
- Debanti Sengupta
- Department of Chemistry, Stanford University, Stanford, California
| | - Sarah C. Heilshorn
- Department of Materials Science and Engineering, Stanford University, Stanford, California
| |
Collapse
|
259
|
Richardson SM, Nunley PW, Yarrington RM, Boeke JD, Bader JS. GeneDesign 3.0 is an updated synthetic biology toolkit. Nucleic Acids Res 2010; 38:2603-6. [PMID: 20211837 PMCID: PMC2860129 DOI: 10.1093/nar/gkq143] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2009] [Revised: 02/16/2010] [Accepted: 02/17/2010] [Indexed: 11/14/2022] Open
Abstract
GeneDesign is a set of web applications that provides public access to a nucleotide manipulation pipeline for synthetic biology. The server is public and freely accessible, and the source is available for download under the New BSD License. Since GeneDesign was published and made publicly available 3 years ago, we have made its code base more efficient, added several algorithms and modules, updated the restriction enzyme library, added batch processing capabilities, and added several command line modules, all of which we briefly describe here.
Collapse
Affiliation(s)
- Sarah M. Richardson
- McKusick-Nathans Institute of Genetic Medicine, High Throughput Biology Center, Johns Hopkins University School of Medicine, 733 North Broadway, Baltimore, MD 21205, Department of Biology and Department of Biomedical Engineering, Johns Hopkins University, 3400 North Charles Street, Baltimore, MD 21215, USA
| | - Paul W. Nunley
- McKusick-Nathans Institute of Genetic Medicine, High Throughput Biology Center, Johns Hopkins University School of Medicine, 733 North Broadway, Baltimore, MD 21205, Department of Biology and Department of Biomedical Engineering, Johns Hopkins University, 3400 North Charles Street, Baltimore, MD 21215, USA
| | - Robert M. Yarrington
- McKusick-Nathans Institute of Genetic Medicine, High Throughput Biology Center, Johns Hopkins University School of Medicine, 733 North Broadway, Baltimore, MD 21205, Department of Biology and Department of Biomedical Engineering, Johns Hopkins University, 3400 North Charles Street, Baltimore, MD 21215, USA
| | - Jef D. Boeke
- McKusick-Nathans Institute of Genetic Medicine, High Throughput Biology Center, Johns Hopkins University School of Medicine, 733 North Broadway, Baltimore, MD 21205, Department of Biology and Department of Biomedical Engineering, Johns Hopkins University, 3400 North Charles Street, Baltimore, MD 21215, USA
| | - Joel S. Bader
- McKusick-Nathans Institute of Genetic Medicine, High Throughput Biology Center, Johns Hopkins University School of Medicine, 733 North Broadway, Baltimore, MD 21205, Department of Biology and Department of Biomedical Engineering, Johns Hopkins University, 3400 North Charles Street, Baltimore, MD 21215, USA
| |
Collapse
|
260
|
Fernández-Robledo JA, Vasta GR. Production of recombinant proteins from protozoan parasites. Trends Parasitol 2010; 26:244-54. [PMID: 20189877 PMCID: PMC2862126 DOI: 10.1016/j.pt.2010.02.004] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2009] [Revised: 01/05/2010] [Accepted: 02/05/2010] [Indexed: 12/20/2022]
Abstract
Although the past decade has witnessed sequencing from an increasing number of parasites, modern high-throughput DNA sequencing technologies have the potential to generate complete genome sequences at even higher rates. Along with the discovery of genes that might constitute potential targets for chemotherapy or vaccination, the need for novel protein expression platforms has become a pressing matter. In addition to reviewing the advantages and limitations of the currently available and emerging expression systems, we discuss novel approaches that could overcome current limitations, including the 'pseudoparasite' concept, an expression platform in which the choice of the surrogate organism is based on its phylogenetic affinity to the target parasite, while taking advantage of the whole engineered organism as a vaccination adjuvant.
Collapse
Affiliation(s)
- José A Fernández-Robledo
- Department of Microbiology and Immunology, University of Maryland School of Medicine, IMET, 701 E. Pratt Street, Suite 236, Baltimore, MD 21202-3101, USA. <>
| | | |
Collapse
|
261
|
On relevance of codon usage to expression of synthetic and natural genes in Escherichia coli. Genetics 2010; 185:1129-34. [PMID: 20421604 DOI: 10.1534/genetics.110.115477] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
A recent investigation concluded that codon bias did not affect expression of green fluorescent protein (GFP) variants in Escherichia coli, while stability of an mRNA secondary structure near the 5' end played a dominant role. We demonstrate that combining the two variables using regression trees or support vector regression yields a biologically plausible model with better support in the GFP data set and in other experimental data: codon usage is relevant for protein levels if the 5' mRNA structures are not strong. Natural E. coli genes had weaker 5' mRNA structures than the examined set of GFP variants and did not exhibit a correlation between the folding free energy of 5' mRNA structures and protein expression.
Collapse
|
262
|
Abstract
Proteins are the most versatile among the various biological building blocks and a mature field of protein engineering has lead to many industrial and biomedical applications. But the strength of proteins—their versatility, dynamics and interactions—also complicates and hinders systems engineering. Therefore, the design of more sophisticated, multi-component protein systems appears to lag behind, in particular, when compared to the engineering of gene regulatory networks. Yet, synthetic biologists have started to tinker with the information flow through natural signaling networks or integrated protein switches. A successful strategy common to most of these experiments is their focus on modular interactions between protein domains or domains and peptide motifs. Such modular interaction swapping has rewired signaling in yeast, put mammalian cell morphology under the control of light, or increased the flux through a synthetic metabolic pathway. Based on this experience, we outline an engineering framework for the connection of reusable protein interaction devices into self-sufficient circuits. Such a framework should help to ‘refacture’ protein complexity into well-defined exchangeable devices for predictive engineering. We review the foundations and initial success stories of protein synthetic biology and discuss the challenges and promises on the way from protein- to protein systems design.
Collapse
Affiliation(s)
- Raik Grünberg
- EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG), UPF, 08003 Barcelona, Spain.
| | | |
Collapse
|
263
|
|
264
|
Translation efficiency is determined by both codon bias and folding energy. Proc Natl Acad Sci U S A 2010; 107:3645-50. [PMID: 20133581 DOI: 10.1073/pnas.0909910107] [Citation(s) in RCA: 410] [Impact Index Per Article: 27.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Synonymous mutations do not alter the protein produced yet can have a significant effect on protein levels. The mechanisms by which this effect is achieved are controversial; although some previous studies have suggested that codon bias is the most important determinant of translation efficiency, a recent study suggested that mRNA folding at the beginning of genes is the dominant factor via its effect on translation initiation. Using the Escherichia coli and Saccharomyces cerevisiae transcriptomes, we conducted a genome-scale study aiming at dissecting the determinants of translation efficiency. There is a significant association between codon bias and translation efficiency across all endogenous genes in E. coli and S. cerevisiae but no association between folding energy and translation efficiency, demonstrating the role of codon bias as an important determinant of translation efficiency. However, folding energy does modulate the strength of association between codon bias and translation efficiency, which is maximized at very weak mRNA folding (i.e., high folding energy) levels. We find a strong correlation between the genomic profiles of ribosomal density and genomic profiles of folding energy across mRNA, suggesting that lower folding energies slow down the ribosomes and decrease translation efficiency. Accordingly, we find that selection forces act near uniformly to decrease the folding energy at the beginning of genes. In summary, these findings testify that in endogenous genes, folding energy affects translation efficiency in a global manner that is not related to the expression levels of individual genes, and thus cannot be detected by correlation with their expression levels.
Collapse
|
265
|
Functional viral metagenomics and the next generation of molecular tools. Trends Microbiol 2009; 18:20-9. [PMID: 19896852 DOI: 10.1016/j.tim.2009.10.001] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2009] [Revised: 10/14/2009] [Accepted: 10/19/2009] [Indexed: 12/13/2022]
Abstract
The enzymes of bacteriophages and other viruses have been essential research tools since the first days of molecular biology. However, the current repertoire of viral enzymes only hints at their overall potential. The most commonly used enzymes are derived from a surprisingly small number of cultivated viruses, which is remarkable considering the extreme abundance and diversity of viruses revealed over the past decade by metagenomic analysis. To access the treasure trove of enzymes hidden in the global virosphere and develop them for research, therapeutic and diagnostic uses, improvements are needed in our ability to rapidly and efficiently discover, express and characterize viral genes to produce useful proteins. In this paper, we discuss improvements to sampling and cloning methods, functional and genomics-based screens, and expression systems, which should accelerate discovery of new enzymes and other viral proteins for use in research and medicine.
Collapse
|