1
|
Hans S, Gimpel M, Glauche F, Neubauer P, Cruz-Bournazou MN. Automated Cell Treatment for Competence and Transformation of Escherichia coli in a High-Throughput Quasi-Turbidostat Using Microtiter Plates. Microorganisms 2018; 6:E60. [PMID: 29941834 PMCID: PMC6163857 DOI: 10.3390/microorganisms6030060] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2018] [Revised: 06/10/2018] [Accepted: 06/22/2018] [Indexed: 12/20/2022] Open
Abstract
Metabolic engineering and genome editing strategies often lead to large strain libraries of a bacterial host. Nevertheless, the generation of competent cells is the basis for transformation and subsequent screening of these strains. While preparation of competent cells is a standard procedure in flask cultivations, parallelization becomes a challenging task when working with larger libraries and liquid handling stations as transformation efficiency depends on a distinct physiological state of the cells. We present a robust method for the preparation of competent cells and their transformation. The strength of the method is that all cells on the plate can be maintained at a high growth rate until all cultures have reached a defined cell density regardless of growth rate and lag phase variabilities. This allows sufficient transformation in automated high throughput facilities and solves important scheduling issues in wet-lab library screenings. We address the problem of different growth rates, lag phases, and initial cell densities inspired by the characteristics of continuous cultures. The method functions on a fully automated liquid handling platform including all steps from the inoculation of the liquid cultures to plating and incubation on agar plates. The key advantage of the developed method is that it enables cell harvest in 96 well plates at a predefined time by keeping fast growing cells in the exponential phase as in turbidostat cultivations. This is done by a periodic monitoring of cell growth and a controlled dilution specific for each well. With the described methodology, we were able to transform different strains in parallel. The transformants produced can be picked and used in further automated screening experiments. This method offers the possibility to transform any combination of strain- and plasmid library in an automated high-throughput system, overcoming an important bottleneck in the high-throughput screening and the overall chain of bioprocess development.
Collapse
Affiliation(s)
- Sebastian Hans
- Chair of Bioprocess Engineering, Institute of Biotechnology, Technische Universität Berlin, Ackerstraße 76, D-13357 Berlin, Germany.
| | - Matthias Gimpel
- Chair of Bioprocess Engineering, Institute of Biotechnology, Technische Universität Berlin, Ackerstraße 76, D-13357 Berlin, Germany.
| | - Florian Glauche
- Chair of Bioprocess Engineering, Institute of Biotechnology, Technische Universität Berlin, Ackerstraße 76, D-13357 Berlin, Germany.
| | - Peter Neubauer
- Chair of Bioprocess Engineering, Institute of Biotechnology, Technische Universität Berlin, Ackerstraße 76, D-13357 Berlin, Germany.
| | - Mariano Nicolas Cruz-Bournazou
- Chair of Bioprocess Engineering, Institute of Biotechnology, Technische Universität Berlin, Ackerstraße 76, D-13357 Berlin, Germany.
| |
Collapse
|
2
|
Hughes SR, Riedmuller SB, Mertens JA, Li XL, Bischoff KM, Cotta MA, Farrelly PJ. Development of a Liquid Handler Component for a Plasmid-Based Functional Proteomic Robotic Workcell. ACTA ACUST UNITED AC 2016. [DOI: 10.1016/j.jala.2005.06.009] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
A Hudson Control Group, Inc. ProLink Express™ robotic workcell to conduct plasmid-based functional proteomics is being developed for optimization of protein open reading frames (ORF). The initial phase of this project is to design and assemble a Xantus liquid handler from Sias, Inc. modified by Hudson so that a workcell track component can be placed within the Xantus® gripper tool work area. The liquid handler is designed to produce plasmids using the Qiagen Turbo® plasmid preparation kit. This design allows processing of up to four 96-well plates in one run. The procedure eliminates disposable tips and provides an advanced wash system to prevent cross contamination. To evaluate liquid handler operation, a mutagenized cellulase F ORF plasmid library was prepared from wild-type cellulase F (Chen, H.; Li, X.-L.; Blum, D. L.; Ximenes, E. A.; Ljungdahl, L. G. CelF of Orpinomyces PC-2 has an intron and encodes a cellulase (CelF) containing a carbohydrate-binding module. Applied Biochemistry and Biotechnology 2003, 105–108, 775–785; Li, X.-L.; Chen, H.; Ljungdahl, L. G. Two cellulases, CelA and CelC, from the polycentric anaerobic fungus Orpinomyces strain PC-2 contain N-terminal docking domains for a cellulase-hemicellulase complex. Applied and Environmental Microbiology 1997, 63(12), 4721–4728) using a novel Invitrogen Gateway® cloning strategy. For the automated reproducibility run, the average yield of plasmid was 5.35 μg per well from 1.347 mL of starting culture. Four plates were processed automatically on the liquid handler in 374 min compared to at least 441 min for the same plate operations performed manually. The quality and quantity of plasmids prepared on the liquid handler made the implementation of the following workcell protocols possible: DNA sequencing, in vitro transcription/translation, and transformation of bacterial and yeast strains for protein expression.
Collapse
Affiliation(s)
- Stephen R. Hughes
- BBC Research Unit, USDA, ARS, NCAUR, 1815 North University Street, Peoria, IL 61604
| | | | - Jeffrey A. Mertens
- FBT Research Unit, USDA, ARS, NCAUR, 1815 North University Street, Peoria, IL 61604
| | - Xin-Liang Li
- FBT Research Unit, USDA, ARS, NCAUR, 1815 North University Street, Peoria, IL 61604
| | - Kenneth M. Bischoff
- BBC Research Unit, USDA, ARS, NCAUR, 1815 North University Street, Peoria, IL 61604
| | - Michael A. Cotta
- FBT Research Unit, USDA, ARS, NCAUR, 1815 North University Street, Peoria, IL 61604
| | | |
Collapse
|
3
|
Das C, Nadler T, Strug I. Detergent Analysis in Protein Samples Using Mid-Infrared (MIR) Spectroscopy. CURRENT PROTOCOLS IN PROTEIN SCIENCE 2015; 81:29.12.1-29.12.15. [PMID: 26237674 DOI: 10.1002/0471140864.ps2912s81] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
Quantitating relative levels of detergent present in protein preparations or samples derived from biological material, such as tissue or body fluids, is important because the presence of detergent may affect downstream analyses as well as protein structure/function. Especially because sample volumes, analysts' available time, and other resources may be limited, a method that consumes little sample and that is rapid and simple is needed for detergent analysis. It would also be preferable to have a method that is generally applicable across many aliphatic chain-containing molecules with many different physical properties. In this unit, methods are described for analyzing detergents and proteins in detergent-protein mixtures using mid-infrared (MIR) spectroscopy. A protocol is also included for efficient removal of unbound detergents from a protein sample accompanied by MIR-based monitoring of both detergent and protein content. This rapid monitoring of sample preparation during the workflow enables users to make timely decisions about sample preparation strategies that maximize both analyte purity and yield.
Collapse
Affiliation(s)
| | | | - Ivona Strug
- EMD Millipore Corporation, Danvers, Massachusetts
| |
Collapse
|
4
|
Design and Construction of a First-Generation High-Throughput Integrated Robotic Molecular Biology Platform for Bioenergy Applications. ACTA ACUST UNITED AC 2011; 16:292-307. [DOI: 10.1016/j.jala.2011.04.004] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2011] [Indexed: 01/01/2023]
Abstract
The molecular biological techniques for plasmid-based assembly and cloning of gene open reading frames are essential for elucidating the function of the proteins encoded by the genes. High-throughput integrated robotic molecular biology platforms that have the capacity to rapidly clone and express heterologous gene open reading frames in bacteria and yeast and to screen large numbers of expressed proteins for optimized function are an important technology for improving microbial strains Published by Elsevier Inc. on behalf of the Society for Laboratory Automation and Screening for biofuel production. The process involves the production of full-length complementary DNA libraries as a source of plasmid-based clones to express the desired proteins in active form for determination of their functions. Proteins that were identified by high-throughput screening as having desired characteristics are overexpressed in microbes to enable them to perform functions that will allow more cost-effective and sustainable production of biofuels. Because the plasmid libraries are composed of several thousand unique genes, automation of the process is essential. This review describes the design and implementation of an automated integrated programmable robotic workcell capable of producing complementary DNA libraries, colony picking, isolating plasmid DNA, transforming yeast and bacteria, expressing protein, and performing appropriate functional assays. These operations will allow tailoring microbial strains to use renewable feedstocks for production of biofuels, bioderived chemicals, fertilizers, and other coproducts for profitable and sustainable biorefineries.
Collapse
|
5
|
Lim HK, Mansell TJ, Linderman SW, Fisher AC, Dyson MR, DeLisa MP. Mining mammalian genomes for folding competent proteins using Tat-dependent genetic selection in Escherichia coli. Protein Sci 2010; 18:2537-49. [PMID: 19830686 DOI: 10.1002/pro.262] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
Recombinant expression of eukaryotic proteins in Escherichia coli is often limited by poor folding and solubility. To address this problem, we employed a recently developed genetic selection for protein folding and solubility based on the bacterial twin-arginine translocation (Tat) pathway to rapidly identify properly folded recombinant proteins or soluble protein domains of mammalian origin. The coding sequences for 29 different mammalian polypeptides were cloned as sandwich fusions between an N-terminal Tat export signal and a C-terminal selectable marker, namely beta-lactamase. Hence, expression of the selectable marker and survival on selective media was linked to Tat export of the target mammalian protein. Since the folding quality control feature of the Tat pathway prevents export of misfolded proteins, only correctly folded fusion proteins reached the periplasm and conferred cell survival. In general, the ability to confer growth was found to relate closely to the solubility profile and molecular weight of the protein, although other features such as number of contiguous hydrophobic amino acids and cysteine content may also be important. These results highlight the capacity of Tat selection to reveal the folding potential of mammalian proteins and protein domains without the need for structural or functional information about the target protein.
Collapse
Affiliation(s)
- Hyung-Kwon Lim
- School of Chemical and Biomolecular Engineering, Cornell University, Ithaca, New York 14853, USA
| | | | | | | | | | | |
Collapse
|
6
|
Listwan P, Pédelacq JD, Lockard M, Bell C, Terwilliger TC, Waldo GS. The optimization of in vitro high-throughput chemical lysis of Escherichia coli. Application to ACP domain of the polyketide synthase ppsC from Mycobacterium tuberculosis. ACTA ACUST UNITED AC 2010; 11:41-9. [PMID: 20069378 PMCID: PMC2855807 DOI: 10.1007/s10969-009-9077-8] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2009] [Accepted: 12/30/2009] [Indexed: 11/26/2022]
Abstract
Protein production in Escherichia coli involves high-level expression in a culture, followed by harvesting of the cells and finally their disruption, or lysis, to release the expressed proteins. We compare three high-throughput chemical lysis methods to sonication, using a robotic platform and methodologies developed in our laboratory [1]. Under the same expression conditions, all lysis methods varied in the degree of released soluble proteins. With a set of 96 test proteins, we used our split GFP to quantify the soluble and insoluble protein fractions after lysis. Both the amount of soluble protein and the percentage recovered in the soluble fraction using SoluLyse were well correlated with sonication. Two other methods, Bugbuster and lysozyme, did not correlate well with sonication. Considering the effects of lysis methods on protein solubility is especially important when accurate protein solubility measurements are needed, for example, when testing adjuvants, growth media, temperature, or when establishing the effects of truncation or sequence variation on protein stability.
Collapse
Affiliation(s)
- Pawel Listwan
- Bioscience Division, MS-M888, Los Alamos National Laboratory, Bikini Atoll Rd, SM30, Los Alamos, NM 87545, USA.
| | | | | | | | | | | |
Collapse
|
7
|
Kurotani A, Takagi T, Toyama M, Shirouzu M, Yokoyama S, Fukami Y, Tokmakov AA. Comprehensive bioinformatics analysis of cell-free protein synthesis: identification of multiple protein properties that correlate with successful expression. FASEB J 2009; 24:1095-104. [PMID: 19940260 DOI: 10.1096/fj.09-139527] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
High-throughput cell-free protein synthesis is being used increasingly in structural/functional genomics projects. However, the factors determining expression success are poorly understood. Here, we evaluated the expression of 3066 human proteins and their domains in a bacterial cell-free system and analyzed the correlation of protein expression with 39 physicochemical and structural properties of proteins. As a result of the bioinformatics analysis performed, we determined the 18 most influential features that affect protein amenability to cell-free expression. They include protein length; hydrophobicity; pI; content of charged, nonpolar, and aromatic residues;, cysteine content; solvent accessibility; presence of coiled coil; content of intrinsically disordered and structured (alpha-helix and beta-sheet) sequence; number of disulfide bonds and functional domains; presence of transmembrane regions; PEST motifs; and signaling sequences. This study represents the first comprehensive bioinformatics analysis of heterologous protein synthesis in a cell-free system. The rules and correlations revealed here provide a plethora of important insights into rationalization of cell-free protein production and can be of practical use for protein engineering with the aim of increasing expression success.-Kurotani, A., Takagi, T., Toyama, M., Shirouzu, M., Yokoyama, S., Fukami, Y., Tokmakov, A. A. Comprehensive bioinformatics analysis of cell-free protein synthesis: identification of multiple protein properties that correlate with successful expression.
Collapse
|
8
|
High quality protein microarray using in situ protein purification. BMC Biotechnol 2009; 9:72. [PMID: 19698181 PMCID: PMC2746808 DOI: 10.1186/1472-6750-9-72] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2009] [Accepted: 08/23/2009] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND In the postgenomic era, high throughput protein expression and protein microarray technologies have progressed markedly permitting screening of therapeutic reagents and discovery of novel protein functions. Hexa-histidine is one of the most commonly used fusion tags for protein expression due to its small size and convenient purification via immobilized metal ion affinity chromatography (IMAC). This purification process has been adapted to the protein microarray format, but the quality of in situ His-tagged protein purification on slides has not been systematically evaluated. We established methods to determine the level of purification of such proteins on metal chelate-modified slide surfaces. Optimized in situ purification of His-tagged recombinant proteins has the potential to become the new gold standard for cost-effective generation of high-quality and high-density protein microarrays. RESULTS Two slide surfaces were examined, chelated Cu2+ slides suspended on a polyethylene glycol (PEG) coating and chelated Ni2+ slides immobilized on a support without PEG coating. Using PEG-coated chelated Cu2+ slides, consistently higher purities of recombinant proteins were measured. An optimized wash buffer (PBST) composed of 10 mM phosphate buffer, 2.7 mM KCl, 140 mM NaCl and 0.05% Tween 20, pH 7.4, further improved protein purity levels. Using Escherichia coli cell lysates expressing 90 recombinant Streptococcus pneumoniae proteins, 73 proteins were successfully immobilized, and 66 proteins were in situ purified with greater than 90% purity. We identified several antigens among the in situ-purified proteins via assays with anti-S. pneumoniae rabbit antibodies and a human patient antiserum, as a demonstration project of large scale microarray-based immunoproteomics profiling. The methodology is compatible with higher throughput formats of in vivo protein expression, eliminates the need for resin-based purification and circumvents protein solubility and denaturation problems caused by buffer exchange steps and freeze-thaw cycles, which are associated with resin-based purification, intermittent protein storage and deposition on microarrays. CONCLUSION An optimized platform for in situ protein purification on microarray slides using His-tagged recombinant proteins is a desirable tool for the screening of novel protein functions and protein-protein interactions. In the context of immunoproteomics, such protein microarrays are complimentary to approaches using non-recombinant methods to discover and characterize bacterial antigens.
Collapse
|
9
|
Park JS, Han KY, Lee JH, Song JA, Ahn KY, Seo HS, Sim SJJ, Kim SW, Lee J. Solubility enhancement of aggregation-prone heterologous proteins by fusion expression using stress-responsive Escherichia coli protein, RpoS. BMC Biotechnol 2008; 8:15. [PMID: 18282304 PMCID: PMC2278137 DOI: 10.1186/1472-6750-8-15] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2007] [Accepted: 02/19/2008] [Indexed: 11/10/2022] Open
Abstract
Background The most efficient method for enhancing solubility of recombinant proteins appears to use the fusion expression partners. Although commercial fusion partners including maltose binding protein and glutathione-S-transferase have shown good performance in enhancing the solubility, they cannot be used for the proprietory production of commercially value-added proteins and likely cannot serve as universal helpers to solve all protein solubility and folding issues. Thus, novel fusion partners will continue to be developed through systematic investigations including proteome mining presented in this study. Results We analyzed the Escherichia coli proteome response to the exogenous stress of guanidine hydrochloride using 2-dimensional gel electrophoresis and found that RpoS (RNA polymerase sigma factor) was significantly stress responsive. While under the stress condition the total number of soluble proteins decreased by about 7 %, but a 6-fold increase in the level of RpoS was observed, indicating that RpoS is a stress-induced protein. As an N-terminus fusion expression partner, RpoS increased significantly the solubility of many aggregation-prone heterologous proteins in E. coli cytoplasm, indicating that RpoS is a very effective solubility enhancer for the synthesis of many recombinant proteins. RpoS was also well suited for the production of a biologically active fusion mutant of Pseudomonas putida cutinase. Conclusion RpoS is highly effective as a strong solubility enhancer for aggregation-prone heterologous proteins when it is used as a fusion expression partner in an E. coli expression system. The results of these findings may, therefore, be useful in the production of other biologically active industrial enzymes, as successfully demonstrated by cutinase.
Collapse
Affiliation(s)
- Jin-Seung Park
- Department of Chemical and Biological Engineering, Korea University, Anam-Dong 5-1, Sungbuk-Ku, Seoul 136-713, South Korea.
| | | | | | | | | | | | | | | | | |
Collapse
|
10
|
Kwon K, Pieper R, Shallom S, Grose C, Kwon E, Do Y, Latham S, Alami H, Huang ST, Gatlin C, Papazisi L, Fleischmann R, Peterson S. A correlation analysis of protein characteristics associated with genome-wide high throughput expression and solubility of Streptococcus pneumoniae proteins. Protein Expr Purif 2007; 55:368-78. [PMID: 17703947 DOI: 10.1016/j.pep.2007.06.006] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2007] [Revised: 06/12/2007] [Accepted: 06/18/2007] [Indexed: 12/01/2022]
Abstract
We have developed and evaluated a highly parallel protein expression and purification system using ORFs derived from the pathogenic bacterium Streptococcus pneumoniae as a representative test case in conjunction with the Gateway cloning technology. Establishing high throughput protein production capability is essential for genome-wide characterization of protein function. In this study, we focused on protein expression and purification outcomes generated from an expression vector which encodes an NH(2)-terminal hexa-histidine tag and a COOH-terminal S-tag. Purified recombinant proteins were validated by SDS-PAGE, followed by in-gel digestion and identification by MALDI-TOF/TOF analysis. Starting with 1360 sequence-validated destination clones we examined correlation analyses of expression and solubility of a wide variety of recombinant proteins. In total, 428 purified proteins (31%) were recovered in soluble form. We describe a semi-quantitative scoring method using an S-tag assay to improve the throughput and efficiency of expression and solubility studies for recombinant proteins. Given a relatively large dataset derived from proteins representing all functional groups in a microbial genome we correlated various protein characteristics as they relate to protein expression outcomes.
Collapse
Affiliation(s)
- Keehwan Kwon
- J. Craig Venter Institute, 9704 Medical Center Drive, Rockville, MD 20850, USA
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
11
|
Ventzki R, Stegemann J, Martinez L, de Marco A. Automated protein analysis by online detection of laser-induced fluorescence in slab gels and 3-D geometry gels. Electrophoresis 2006; 27:3338-48. [PMID: 16850506 DOI: 10.1002/elps.200600006] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) still remains the most reliable and comprehensive analytical method for the evaluation of protein extracts. However, conventional SDS-PAGE is time-consuming and, thus, unpractical if several tens or hundreds of samples must be examined. We show that SDS-PAGE protein analysis can be automated using slab gel DNA sequencers and compare the instrument's performance with conventional SDS-PAGE in terms of resolution, sensitivity and sample capacity. Labeled protein bands are detected online by laser-induced fluorescence (LIF) and the acquired signals are electronically stored for further processing, avoiding gel staining and scanning. Appropriate software allows immediate display of recorded data and convenient evaluation. The method provides a higher sensitivity and dynamic range than conventional Coomassie-stained gels and the resolution of proteins with different masses is independent of the polyacrylamide concentration. Internal markers can also be used for direct quantification and assignment of the molecular masses. Additionally, we present a novel electrophoresis instrument for the simultaneous separation and online LIF detection of all samples of a microtiterplate in parallel lanes in a 3-D geometry gel cylinder. The specific gel thermostatting concept prevents irregular sample migration (smiling) and improves the reproducibility and comparability of individual separation patterns. In combination with the expected large capacity of 384 or 1,536 samples, this makes the instrument a valuable tool for high-throughput comparative screening applications.
Collapse
Affiliation(s)
- Robert Ventzki
- Scientific Core Facilities, Services & Technology Unit, European Molecular Biology Laboratory, Heidelberg, Germany
| | | | | | | |
Collapse
|
12
|
Dieckman LJ, Hanly WC, Collart ER. Strategies for high-throughput gene cloning and expression. GENETIC ENGINEERING 2006; 27:179-90. [PMID: 16382877 DOI: 10.1007/0-387-25856-6_10] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/05/2023]
Abstract
High-throughput approaches for gene cloning and expression require the development of new, nonstandard tools for use by molecular biologists and biochemists. We have developed and implemented a series of methods that enable the production of expression constructs in 96-well plate format. A screening process is described that facilitates the identification of bacterial clones expressing soluble protein. Application of the solubility screen then provides a plate map that identifies the location of wells containing clones producing soluble proteins. A series of semi-automated methods can then be applied for validation of solubility and production of freezer stocks for the protein production group. This process provides an 80% success rate for the identification of clones producing soluble protein and results in a significant decrease in the level of effort required for the labor-intensive components of validation and preparation of freezer stocks. This process is customized for large-scale structural genomics programs that rely on the production of large amounts of soluble proteins for crystallization trials.
Collapse
Affiliation(s)
- L J Dieckman
- Biosciences Division, Argonne National Laboratory, Argonne, Illinois 60439, USA
| | | | | |
Collapse
|
13
|
Shimada K, Nagano M, Kawai M, Koga H. Influences of amino acid features of glutathioneS-transferase fusion proteins on their solubility. Proteomics 2005; 5:3859-63. [PMID: 16145715 DOI: 10.1002/pmic.200402085] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
We have previously described our strategy for high-throughput (HT) production of recombinant antigens for anti-mKIAA antibody generation, which involves using shotgun fragments generated during entire sequencing of mKIAA cDNAs. We applied this strategy to 1628 mouse KIAA (mKIAA) cDNA fragments, and 84.2% of the GST-mKIAA fusion proteins were successfully purified. The solubility of the proteins was predicted by a small-scale bacterial culture, and a large-scale culture was then performed according to the expected results. Among them, 43.8% of the proteins were purified as a soluble form and 56.2% as an insoluble form. The average yield of the soluble proteins was 0.15 nmol/mL of bacterial culture, and that of the insoluble proteins was 0.55 nmol/mL Statistical analysis of the data revealed a significant correlation between amino acid features of the recombinant proteins and their solubility. To achieve the most effective and feasible protein expression, we constructed a decision tree in which the analyzed data were reflected. The information described here may provide practical guidelines for HT production of recombinant proteins.
Collapse
Affiliation(s)
- Kiyo Shimada
- Chiba Industry Advancement Center, Mihama-ku, Chiba, Japan
| | | | | | | |
Collapse
|
14
|
Dai X, Chen Q, Lian M, Zhou Y, Zhou M, Lu S, Chen Y, Luo J, Gu X, Jiang Y, Luo M, Zheng X. Systematic high-yield production of human secreted proteins in Escherichia coli. Biochem Biophys Res Commun 2005; 332:593-601. [PMID: 15896719 DOI: 10.1016/j.bbrc.2005.04.163] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2005] [Accepted: 04/29/2005] [Indexed: 12/31/2022]
Abstract
Human secreted proteins play a very important role in signal transduction. In order to study all potential secreted proteins identified from the human genome sequence, systematic production of large amounts of biologically active secreted proteins is a prerequisite. We selected 25 novel genes as a trial case for establishing a reliable expression system to produce active human secreted proteins in Escherichia coli. Expression of proteins with or without signal peptides was examined and compared in E. coli strains. The results indicated that deletion of signal peptides, to a certain extent, can improve the expression of these proteins and their solubilities. More importantly, under expression conditions such as induction temperature, N-terminus fusion peptides need to be optimized in order to express adequate amounts of soluble proteins. These recombinant proteins were characterized as well-folded proteins. This system enables us to rapidly obtain soluble and highly purified human secreted proteins for further functional studies.
Collapse
Affiliation(s)
- Xueyu Dai
- National Laboratory of Protein Engineering and Plant Genetic Engineering, College of Life Sciences, Peking University, Beijing 100871, China
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
15
|
Shen W, Yun S, Tam B, Dalal K, Pio FF. Target selection of soluble protein complexes for structural proteomics studies. Proteome Sci 2005; 3:3. [PMID: 15904526 PMCID: PMC1156946 DOI: 10.1186/1477-5956-3-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2004] [Accepted: 05/18/2005] [Indexed: 11/23/2022] Open
Abstract
Background Protein expression in E. coli is the most commonly used system to produce protein for structural studies, because it is fast and inexpensive and can produce large quantity of proteins. However, when proteins from other species such as mammalian are produced in this system, problems of protein expression and solubility arise [1]. Structural genomics project are currently investigating proteomics pipelines that would produce sufficient quantities of recombinant proteins for structural studies of protein complexes. To investigate how the E. coli protein expression system could be used for this purpose, we purified apoptotic binary protein complexes formed between members of the Caspase Associated Recruitment Domain (CARD) family. Results A combinatorial approach to the generation of protein complexes was performed between members of the CARD domain protein family that have the ability to form hetero-dimers between each other. In our method, each gene coding for a specific protein partner is cloned in pET-28b (Novagen) and PGEX2T (Amersham) expression vectors. All combinations of protein complexes are then obtained by reconstituting complexes from purified components in native conditions, after denaturation-renaturation or co-expression. Our study applied to 14 soluble CARD domain proteins revealed that co-expression studies perform better than native and denaturation-renaturation methods. In this study, we confirm existing interactions obtained in vivoin mammalian cells and also predict new interactions. Conclusion The simplicity of this screening method could be easily scaled up to identify soluble protein complexes for structural genomic projects. This study reports informative statistics on the solubility of human protein complexes expressed in E.coli belonging to the human CARD protein family.
Collapse
Affiliation(s)
- Weiping Shen
- Department of Molecular Biology and Biochemistry, Simon Fraser University, 8888 University Drive, Burnaby, British Columbia, Canada, V5A 1S6
| | - Steven Yun
- Department of Molecular Biology and Biochemistry, Simon Fraser University, 8888 University Drive, Burnaby, British Columbia, Canada, V5A 1S6
| | - Bonny Tam
- Department of Molecular Biology and Biochemistry, Simon Fraser University, 8888 University Drive, Burnaby, British Columbia, Canada, V5A 1S6
| | - Kush Dalal
- Department of Molecular Biology and Biochemistry, Simon Fraser University, 8888 University Drive, Burnaby, British Columbia, Canada, V5A 1S6
| | - Frederic F Pio
- Department of Molecular Biology and Biochemistry, Simon Fraser University, 8888 University Drive, Burnaby, British Columbia, Canada, V5A 1S6
| |
Collapse
|
16
|
Dyson MR, Shadbolt SP, Vincent KJ, Perera RL, McCafferty J. Production of soluble mammalian proteins in Escherichia coli: identification of protein features that correlate with successful expression. BMC Biotechnol 2004; 4:32. [PMID: 15598350 PMCID: PMC544853 DOI: 10.1186/1472-6750-4-32] [Citation(s) in RCA: 195] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2004] [Accepted: 12/14/2004] [Indexed: 12/23/2022] Open
Abstract
BACKGROUND In the search for generic expression strategies for mammalian protein families several bacterial expression vectors were examined for their ability to promote high yields of soluble protein. Proteins studied included cell surface receptors (Ephrins and Eph receptors, CD44), kinases (EGFR-cytoplasmic domain, CDK2 and 4), proteases (MMP1, CASP2), signal transduction proteins (GRB2, RAF1, HRAS) and transcription factors (GATA2, Fli1, Trp53, Mdm2, JUN, FOS, MAD, MAX). Over 400 experiments were performed where expression of 30 full-length proteins and protein domains were evaluated with 6 different N-terminal and 8 C-terminal fusion partners. Expression of an additional set of 95 mammalian proteins was also performed to test the conclusions of this study. RESULTS Several protein features correlated with soluble protein expression yield including molecular weight and the number of contiguous hydrophobic residues and low complexity regions. There was no relationship between successful expression and protein pI, grand average of hydropathicity (GRAVY), or sub-cellular location. Only small globular cytoplasmic proteins with an average molecular weight of 23 kDa did not require a solubility enhancing tag for high level soluble expression. Thioredoxin (Trx) and maltose binding protein (MBP) were the best N-terminal protein fusions to promote soluble expression, but MBP was most effective as a C-terminal fusion. 63 of 95 mammalian proteins expressed at soluble levels of greater than 1 mg/l as N-terminal H10-MBP fusions and those that failed possessed, on average, a higher molecular weight and greater number of contiguous hydrophobic amino acids and low complexity regions. CONCLUSIONS By analysis of the protein features identified here, this study will help predict which mammalian proteins and domains can be successfully expressed in E. coli as soluble product and also which are best targeted for a eukaryotic expression system. In some cases proteins may be truncated to minimise molecular weight and the numbers of contiguous hydrophobic amino acids and low complexity regions to aid soluble expression in E. coli.
Collapse
Affiliation(s)
- Michael R Dyson
- The Atlas of Gene Expression Project, The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - S Paul Shadbolt
- The Atlas of Gene Expression Project, The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Karen J Vincent
- The Atlas of Gene Expression Project, The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Rajika L Perera
- The Atlas of Gene Expression Project, The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - John McCafferty
- The Atlas of Gene Expression Project, The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| |
Collapse
|
17
|
Luan CH, Qiu S, Finley JB, Carson M, Gray RJ, Huang W, Johnson D, Tsao J, Reboul J, Vaglio P, Hill DE, Vidal M, Delucas LJ, Luo M. High-throughput expression of C. elegans proteins. Genome Res 2004; 14:2102-10. [PMID: 15489332 PMCID: PMC528926 DOI: 10.1101/gr.2520504] [Citation(s) in RCA: 84] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Proteome-scale studies of protein three-dimensional structures should provide valuable information for both investigating basic biology and developing therapeutics. Critical for these endeavors is the expression of recombinant proteins. We selected Caenorhabditis elegans as our model organism in a structural proteomics initiative because of the high quality of its genome sequence and the availability of its ORFeome, protein-encoding open reading frames (ORFs), in a flexible recombinational cloning format. We developed a robotic pipeline for recombinant protein expression, applying the Gateway cloning/expression technology and utilizing a stepwise automation strategy on an integrated robotic platform. Using the pipeline, we have carried out heterologous protein expression experiments on 10,167 ORFs of C. elegans. With one expression vector and one Escherichia coli strain, protein expression was observed for 4854 ORFs, and 1536 were soluble. Bioinformatics analysis of the data indicates that protein hydrophobicity is a key determining factor for an ORF to yield a soluble expression product. This protein expression effort has investigated the largest number of genes in any organism to date. The pipeline described here is applicable to high-throughput expression of recombinant proteins for other species, both prokaryotic and eukaryotic, provided that ORFeome resources become available.
Collapse
Affiliation(s)
- Chi-Hao Luan
- Center for Biophysical Sciences and Engineering, Southeast Collaboratory for Structural Genomics, University of Alabama at Birmingham, Birmingham, Alabama 35294, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|