1
|
High-throughput protein expression screening and purification in Escherichia coli. Methods 2011; 55:65-72. [DOI: 10.1016/j.ymeth.2011.08.010] [Citation(s) in RCA: 75] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2011] [Revised: 07/25/2011] [Accepted: 08/11/2011] [Indexed: 11/18/2022] Open
|
2
|
Chayen NE. High-throughput protein crystallization. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2010; 77:1-22. [PMID: 20663479 DOI: 10.1016/s1876-1623(09)77001-4] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/13/2023]
Abstract
Structural genomics projects have led to great progress in the field of structural biology. Considerable advances have been made in the automation of all stages of the pipeline from clone to structure. This chapter focuses on crystallization that is one of the major bottlenecks in this pipeline. It discusses new developments and describes a variety of techniques for high-throughput screening and optimizing of conditions for crystallization.
Collapse
Affiliation(s)
- Naomi E Chayen
- Department of Surgery and Cancer, Faculty of Medicine, Imperial College London, London, UK
| |
Collapse
|
3
|
Groisillier A, Hervé C, Jeudy A, Rebuffet E, Pluchon PF, Chevolot Y, Flament D, Geslin C, Morgado IM, Power D, Branno M, Moreau H, Michel G, Boyen C, Czjzek M. MARINE-EXPRESS: taking advantage of high throughput cloning and expression strategies for the post-genomic analysis of marine organisms. Microb Cell Fact 2010; 9:45. [PMID: 20546566 PMCID: PMC2897777 DOI: 10.1186/1475-2859-9-45] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2010] [Accepted: 06/14/2010] [Indexed: 11/14/2022] Open
Abstract
Background The production of stable and soluble proteins is one of the most important steps prior to structural and functional studies of biological importance. We investigated the parallel production in a medium throughput strategy of genes coding for proteins from various marine organisms, using protocols that involved recombinatorial cloning, protein expression screening and batch purification. This strategy was applied in order to respond to the need for post-genomic validation of the recent success of a large number of marine genomic projects. Indeed, the upcoming challenge is to go beyond the bioinformatic data, since the bias introduced through the genomes of the so called model organisms leads to numerous proteins of unknown function in the still unexplored world of the oceanic organisms. Results We present here the results of expression tests for 192 targets using a 96-well plate format. Genes were PCR amplified and cloned in parallel into expression vectors pFO4 and pGEX-4T-1, in order to express proteins N-terminally fused to a six-histidine-tag and to a GST-tag, respectively. Small-scale expression and purification permitted isolation of 84 soluble proteins and 34 insoluble proteins, which could also be used in refolding assays. Selected examples of proteins expressed and purified to a larger scale are presented. Conclusions The objective of this program was to get around the bottlenecks of soluble, active protein expression and crystallization for post-genomic validation of a number of proteins that come from various marine organisms. Multiplying the constructions, vectors and targets treated in parallel is important for the success of a medium throughput strategy and considerably increases the chances to get rapid access to pure and soluble protein samples, needed for the subsequent biochemical characterizations. Our set up of a medium throughput strategy applied to genes from marine organisms had a mean success rate of 44% soluble protein expression from marine bacteria, archaea as well as eukaryotic organisms. This success rate compares favorably with other protein screening projects, particularly for eukaryotic proteins. Several purified targets have already formed the base for experiments aimed at post-genomic validation.
Collapse
Affiliation(s)
- Agnès Groisillier
- UPMC Univ Paris 6, UMR 7139 Végétaux marins et Biomolécules, LIA DIAMS, Station Biologique, Roscoff, France
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
4
|
Busso D, Thierry JC, Moras D. The structural biology and genomics platform in strasbourg: an overview. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2008; 426:523-36. [PMID: 18542888 DOI: 10.1007/978-1-60327-058-8_35] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
This chapter describes the modules and facilities of the Structural Biology and Genomics Platform (SBGP), Strasbourg, France. The platform consists of three modules (cloning, mini-expression screening; optimization-large scale protein production; characterization, crystallization) with dedicated scientists, and other facilities for purifying recombinant proteins and solving three-dimensional (3D) structures. Strong collaborations have been established with the Integrative Bioinformatics and Genomics group, located in the same institition, for target selection and domains definition. To handle large numbers of samples, classical and new protocols were adapted to automation, increasing reproducibility and reducing error risks as well. Using the platform and its facilities, over 2,000 expression vectors have been constructed and more than 40 novel structures, of mostly human proteins, have been solved.
Collapse
Affiliation(s)
- Didier Busso
- Structural Biology and Genomics Platform, IGBMC, CNRS/INSERM/Université Louis Pasteur, Illkirch, France
| | | | | |
Collapse
|
5
|
Automated recombinant protein expression screening in Escherichia coli. Methods Mol Biol 2008. [PMID: 18542863 DOI: 10.1007/978-1-60327-058-8_10] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register]
Abstract
To fit the requirements of structural genomics programs, new as well as classical methods have been adapted to automation. This chapter describes the automated procedure developed within the Structural Biology and Genomics Platform, Strasbourg for performing recombinant protein expression screening in Escherichia coli. The procedure consists of parallel competent cells transformation, cell plating, and liquid culture inoculation, implemented for up to 96 samples at a time.
Collapse
|
6
|
Busso D, Stierlé M, Thierry JC, Moras D. A comparison of inoculation methods to simplify recombinant protein expression screening in Escherichia coli. Biotechniques 2008; 44:101-6. [PMID: 18254387 DOI: 10.2144/000112632] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
In the past five years, Structural Genomics (SG) initiatives have established an automated pipeline for protein production in Escherichia coli to rapidly screen various conditions, resulting in soluble expression of recombinant proteins to aid in carrying out structural studies. However, some steps of the procedure are still extensive and require manual handling. Here, we present a comparative study of one step of the process, E. coli cultivation, using a set of 12 expression vectors encoding for fusion proteins of seven independent target proteins. First, we show that performing E. coli growth in auto-inducible medium (ZYM-5052) results in a comparable protein expression/solubility profile to that obtained when growing cells in classical Luria-Bertani (LB) medium. Second, we show that the transformation mix can be used directly to inoculate a culture, saving time and circumventing the error-prone step of colony picking, without impairing cell growth and the protein expression/solubility profile. Thus, we show that a basic, but nevertheless essential, step of a protein production pipeline, E. coli cultivation, can be simplified to a single event that is fully compatible with complete automation.
Collapse
Affiliation(s)
- Didier Busso
- Institut de Génetique et de Biologie Moléculaire et Cellulaire/Centre National de Recherche Scientifique/Institut National de la Santé de la Recherche Médicale, Université Louis Pasteur, Illkirch, France.
| | | | | | | |
Collapse
|
7
|
Robin G, Cowieson NP, Guncar G, Forwood JK, Listwan P, Hume DA, Kobe B, Martin JL, Huber T. A general target selection method for crystallographic proteomics. Methods Mol Biol 2008; 426:27-35. [PMID: 18542855 DOI: 10.1007/978-1-60327-058-8_2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
Increasing the success in obtaining structures and maximizing the value of the structures determined are the two major goals of target selection in structural proteomics. This chapter presents an efficient and flexible target selection procedure supplemented with a Web-based resource that is suitable for small- to large-scale structural genomics projects that use crystallography as the major means of structure determination. Based on three criteria, biological significance, structural novelty, and "crystallizability," the approach first removes (filters) targets that do not meet minimal criteria and then ranks the remaining targets based on their "crystallizability" estimates. This novel procedure was designed to maximize selection efficiency, and its prevailing criteria categories make it suitable for a broad range of structural proteomics projects.
Collapse
Affiliation(s)
- Gautier Robin
- Institute for Molecular Bioscience, University of Queensland, Brisbane, Australia
| | | | | | | | | | | | | | | | | |
Collapse
|
8
|
Prodromou C, Savva R, Driscoll PC. DNA fragmentation-based combinatorial approaches to soluble protein expression Part I. Generating DNA fragment libraries. Drug Discov Today 2007; 12:931-8. [PMID: 17993411 DOI: 10.1016/j.drudis.2007.08.012] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2007] [Revised: 08/16/2007] [Accepted: 08/21/2007] [Indexed: 11/28/2022]
Abstract
In addressing a new drug discovery target, the generation of tractable protein substrates for functional and structural analyses can represent a significant hurdle. Traditional approaches rely on protein expression trials of multiple variants in various systems, frequently with limited success. The increasing knowledge base derived from genomics and structural proteomics initiatives assists the bioinformatics-led design of these experiments. Nevertheless, for many eukaryotic polypeptides, particularly those with relatively few homologues, the generation of useful protein products can still be a major challenge. This review describes the basis of efforts to forge an alternative 'domain-hunting' paradigm, based upon combinatorial sampling of expression construct libraries derived by fragmentation of the encoding DNA template, namely the methods and considerations in generating fragment length DNA from target genes. An accompanying review focuses upon the expression screening of such combinatorial DNA libraries for the sampling of the corresponding set of protein fragments.
Collapse
Affiliation(s)
- Chrisostomos Prodromou
- Section of Structural Biology, Institute of Cancer Research, Chester Beatty Laboratories, 237 Fulham Road, London SW3 6JB, United Kingdom
| | | | | |
Collapse
|
9
|
Marsden RL, Lewis TA, Orengo CA. Towards a comprehensive structural coverage of completed genomes: a structural genomics viewpoint. BMC Bioinformatics 2007; 8:86. [PMID: 17349043 PMCID: PMC1829165 DOI: 10.1186/1471-2105-8-86] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2006] [Accepted: 03/09/2007] [Indexed: 11/25/2022] Open
Abstract
Background Structural genomics initiatives were established with the aim of solving protein structures on a large-scale. For many initiatives, such as the Protein Structure Initiative (PSI), the primary aim of target selection is focussed towards structurally characterising protein families which, so far, lack a structural representative. It is therefore of considerable interest to gain insights into the number and distribution of these families, and what efforts may be required to achieve a comprehensive structural coverage across all protein families. Results In this analysis we have derived a comprehensive domain annotation of the genomes using CATH, Pfam-A and Newfam domain families. We consider what proportions of structurally uncharacterised families are accessible to high-throughput structural genomics pipelines, specifically those targeting families containing multiple prokaryotic orthologues. In measuring the domain coverage of the genomes, we show the benefits of selecting targets from both structurally uncharacterised domain families, whilst in addition, pursuing additional targets from large structurally characterised protein superfamilies. Conclusion This work suggests that such a combined approach to target selection is essential if structural genomics is to achieve a comprehensive structural coverage of the genomes, leading to greater insights into structure and the mechanisms that underlie protein evolution.
Collapse
Affiliation(s)
- Russell L Marsden
- Department of Biochemistry and Molecular Biology, University College London, Gower Street, London WC1E 6BT, UK
| | - Tony A Lewis
- Department of Biochemistry and Molecular Biology, University College London, Gower Street, London WC1E 6BT, UK
| | - Christine A Orengo
- Department of Biochemistry and Molecular Biology, University College London, Gower Street, London WC1E 6BT, UK
| |
Collapse
|
10
|
Puri M, Robin G, Cowieson N, Forwood JK, Listwan P, Hu SH, Guncar G, Huber T, Kellie S, Hume DA, Kobe B, Martin JL. Focusing in on structural genomics: The University of Queensland structural biology pipeline. ACTA ACUST UNITED AC 2006; 23:281-9. [PMID: 17097918 DOI: 10.1016/j.bioeng.2006.09.002] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2006] [Revised: 09/22/2006] [Accepted: 09/25/2006] [Indexed: 10/24/2022]
Abstract
The flood of new genomic sequence information together with technological innovations in protein structure determination have led to worldwide structural genomics (SG) initiatives. The goals of SG initiatives are to accelerate the process of protein structure determination, to fill in protein fold space and to provide information about the function of uncharacterized proteins. In the long-term, these outcomes are likely to impact on medical biotechnology and drug discovery, leading to a better understanding of disease as well as the development of new therapeutics. Here we describe the high throughput pipeline established at the University of Queensland in Australia. In this focused pipeline, the targets for structure determination are proteins that are expressed in mouse macrophage cells and that are inferred to have a role in innate immunity. The aim is to characterize the molecular structure and the biochemical and cellular function of these targets by using a parallel processing pipeline. The pipeline is designed to work with tens to hundreds of target gene products and comprises target selection, cloning, expression, purification, crystallization and structure determination. The structures from this pipeline will provide insights into the function of previously uncharacterized macrophage proteins and could lead to the validation of new drug targets for chronic obstructive pulmonary disease and arthritis.
Collapse
Affiliation(s)
- Munish Puri
- Institute for Molecular Bioscience, University of Queensland, Brisbane, Queensland, Australia.
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
11
|
Surade S, Klein M, Stolt-Bergner PC, Muenke C, Roy A, Michel H. Comparative analysis and "expression space" coverage of the production of prokaryotic membrane proteins for structural genomics. Protein Sci 2006; 15:2178-89. [PMID: 16943447 PMCID: PMC2242615 DOI: 10.1110/ps.062312706] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2006] [Revised: 06/23/2006] [Accepted: 06/23/2006] [Indexed: 10/24/2022]
Abstract
Membrane proteins comprise up to one-third of prokaryotic and eukaryotic genomes, but only a very small number of membrane protein structures are known. Membrane proteins are challenging targets for structural biology, primarily due to the difficulty in producing and purifying milligram quantities of these proteins. We are evaluating different methods to produce and purify large numbers of prokaryotic membrane proteins for subsequent structural and functional analysis. Here, we present the comparative expression data for 37 target proteins, all of them secondary transporters, from the mesophilic organism Salmonella typhimurium and the two hyperthermophilic organisms Aquifex aeolicus and Pyrococcus furiosus in three different Escherichia coli expression vectors. In addition, we study the use of Lactococcus lactis as a host for integral membrane protein expression. Overall, 78% of the targets were successfully produced under at least one set of conditions. Analysis of these results allows us to assess the role of different variables in increasing "expression space" coverage for our set of targets. This analysis implies that to maximize the number of nonhomologous targets that are expressed, orthologous targets should be chosen and tested in two vectors with different types of promoters, using C-terminal tags. In addition, E. coli is shown to be a robust host for the expression of prokaryotic transporters, and is superior to L. lactis. These results therefore suggest appropriate strategies for high-throughput heterologous overproduction of membrane proteins.
Collapse
Affiliation(s)
- Sachin Surade
- Department of Molecular Membrane Biology, Max Planck Institute for Biophysics, Germany
| | | | | | | | | | | |
Collapse
|
12
|
Bravo J, Aloy P. Target selection for complex structural genomics. Curr Opin Struct Biol 2006; 16:385-92. [PMID: 16713251 DOI: 10.1016/j.sbi.2006.05.003] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2006] [Revised: 04/25/2006] [Accepted: 05/04/2006] [Indexed: 01/05/2023]
Abstract
Most cellular processes are carried out by macromolecular assemblies and regulated through a complex network of transient protein-protein interactions. Genome-wide interaction discovery experiments are already delivering the first drafts of whole organism interactomes and, thus, depicting the limits of the interaction space. However, a complete understanding of molecular interactions can only come from high-resolution three-dimensional structures, as they provide key atomic details about the binding interfaces. The launch of structural genomics initiatives focused on protein interactions and complexes could quickly fill up the interaction space with structural details, offering a new perspective on how cell networks operate at atomic level. Clear target selection strategies that rationally identify the key interactions and complexes that should be first tackled are fundamental to maximize the return, minimize the costs and prevent experimental difficulties.
Collapse
Affiliation(s)
- Jerónimo Bravo
- Centro Nacional de Investigaciones Oncológicas, C/Melchor Fernández Almagro 3, 28029 Madrid, Spain
| | | |
Collapse
|
13
|
Busso D, Delagoutte-Busso B, Moras D. Construction of a set Gateway-based destination vectors for high-throughput cloning and expression screening in Escherichia coli. Anal Biochem 2005; 343:313-21. [PMID: 15993367 DOI: 10.1016/j.ab.2005.05.015] [Citation(s) in RCA: 166] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2005] [Revised: 05/10/2005] [Accepted: 05/12/2005] [Indexed: 11/29/2022]
Abstract
We describe here the construction of a 10-Gateway-based vector set applicable for high-throughput cloning and for expressing recombinant proteins in Escherichia coli. Plasmids bear elements required to produce recombinant proteins under control of the T7 promoter and encode different N-terminal partners. Since the vector set is derived from a unique backbone, a consistent comparison of the impact of fusion partner(s) on protein expression and solubility is easily amenable. Finally, a sequence encoding a six-histidine tag has been inserted to be in frame with the cloned open reading frame either at its C terminus or at the N terminus, giving the flexibility of choosing the six-histidine tag location for further purification. To test the applicability of our vector set, expression and solubility profile and six-histidine tag accessibility have been demonstrated for two Bacillus subtilis signaling proteins' encoding genes (SBGP codes E0508 and E0511).
Collapse
Affiliation(s)
- Didier Busso
- Département de Biologie et de Génomique Structurales, IGBMC, CNRS/INSERM/Université Louis Pasteur, Parc d'Innovation, 1 rue Laurent Fries, BP10142, 67404 Illkirch Cedex, France.
| | | | | |
Collapse
|