1
|
Dai H, Umarov R, Kuwahara H, Li Y, Song L, Gao X. Sequence2Vec: a novel embedding approach for modeling transcription factor binding affinity landscape. Bioinformatics 2017; 33:3575-3583. [PMID: 28961686 PMCID: PMC5870668 DOI: 10.1093/bioinformatics/btx480] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2017] [Revised: 07/07/2017] [Accepted: 07/26/2017] [Indexed: 12/29/2022] Open
Abstract
MOTIVATION An accurate characterization of transcription factor (TF)-DNA affinity landscape is crucial to a quantitative understanding of the molecular mechanisms underpinning endogenous gene regulation. While recent advances in biotechnology have brought the opportunity for building binding affinity prediction methods, the accurate characterization of TF-DNA binding affinity landscape still remains a challenging problem. RESULTS Here we propose a novel sequence embedding approach for modeling the transcription factor binding affinity landscape. Our method represents DNA binding sequences as a hidden Markov model which captures both position specific information and long-range dependency in the sequence. A cornerstone of our method is a novel message passing-like embedding algorithm, called Sequence2Vec, which maps these hidden Markov models into a common nonlinear feature space and uses these embedded features to build a predictive model. Our method is a novel combination of the strength of probabilistic graphical models, feature space embedding and deep learning. We conducted comprehensive experiments on over 90 large-scale TF-DNA datasets which were measured by different high-throughput experimental technologies. Sequence2Vec outperforms alternative machine learning methods as well as the state-of-the-art binding affinity prediction methods. AVAILABILITY AND IMPLEMENTATION Our program is freely available at https://github.com/ramzan1990/sequence2vec. CONTACT xin.gao@kaust.edu.sa or lsong@cc.gatech.edu. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Hanjun Dai
- College of Computing, Georgia Institute of Technology, Atlanta, GA, USA
| | - Ramzan Umarov
- King Abdullah University of Science and Technology (KAUST), Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, Thuwal, Saudi Arabia
| | - Hiroyuki Kuwahara
- King Abdullah University of Science and Technology (KAUST), Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, Thuwal, Saudi Arabia
| | - Yu Li
- King Abdullah University of Science and Technology (KAUST), Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, Thuwal, Saudi Arabia
| | - Le Song
- College of Computing, Georgia Institute of Technology, Atlanta, GA, USA
| | - Xin Gao
- King Abdullah University of Science and Technology (KAUST), Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, Thuwal, Saudi Arabia
| |
Collapse
|
2
|
Wang X, Kuwahara H, Gao X. Modeling DNA affinity landscape through two-round support vector regression with weighted degree kernels. BMC SYSTEMS BIOLOGY 2014; 8 Suppl 5:S5. [PMID: 25605483 PMCID: PMC4305984 DOI: 10.1186/1752-0509-8-s5-s5] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Abstract
BACKGROUND A quantitative understanding of interactions between transcription factors (TFs) and their DNA binding sites is key to the rational design of gene regulatory networks. Recent advances in high-throughput technologies have enabled high-resolution measurements of protein-DNA binding affinity. Importantly, such experiments revealed the complex nature of TF-DNA interactions, whereby the effects of nucleotide changes on the binding affinity were observed to be context dependent. A systematic method to give high-quality estimates of such complex affinity landscapes is, thus, essential to the control of gene expression and the advance of synthetic biology. RESULTS Here, we propose a two-round prediction method that is based on support vector regression (SVR) with weighted degree (WD) kernels. In the first round, a WD kernel with shifts and mismatches is used with SVR to detect the importance of subsequences with different lengths at different positions. The subsequences identified as important in the first round are then fed into a second WD kernel to fit the experimentally measured affinities. To our knowledge, this is the first attempt to increase the accuracy of the affinity prediction by applying two rounds of string kernels and by identifying a small number of crucial k-mers. The proposed method was tested by predicting the binding affinity landscape of Gcn4p in Saccharomyces cerevisiae using datasets from HiTS-FLIP. Our method explicitly identified important subsequences and showed significant performance improvements when compared with other state-of-the-art methods. Based on the identified important subsequences, we discovered two surprisingly stable 10-mers and one sensitive 10-mer which were not reported before. Further test on four other TFs in S. cerevisiae demonstrated the generality of our method. CONCLUSION We proposed in this paper a two-round method to quantitatively model the DNA binding affinity landscape. Since the ability to modify genetic parts to fine-tune gene expression rates is crucial to the design of biological systems, such a tool may play an important role in the success of synthetic biology going forward.
Collapse
|
3
|
A protein adaptor to locate a functional protein dimer on molecular switchboard. Methods 2014; 67:142-50. [DOI: 10.1016/j.ymeth.2013.10.014] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2013] [Revised: 10/09/2013] [Accepted: 10/16/2013] [Indexed: 01/25/2023] Open
|
4
|
Oheix E, Peacock AFA. Metal-ion-regulated miniature DNA-binding proteins based on GCN4 and non-native regulation sites. Chemistry 2014; 20:2829-39. [PMID: 24478111 DOI: 10.1002/chem.201303747] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2013] [Indexed: 02/06/2023]
Abstract
The design of artificial peptide dimers containing polypyridine switching domains, for which metal-ion coordination is shown to regulate DNA binding, is reported. Short peptides, based on the basic domain of the GCN4 transcription factor (GCN4bd), dimerised with either 2,2'-bipyridine (bipy(GCN4bd)2 ) or 2,2':6',2''-terpyridine (terpy(GCN4bd)2 ) linker units, undergo a conformational rearrangement on Cu(II) and Zn(II) coordination. Depending on the linker substitution pattern, this is proposed to alter the relative alignment of the two peptide moieties, and in turn regulate DNA binding. Circular dichroism and UV-visible spectroscopy reveal that Cu(II) and Zn(II) coordination promotes binding to DNA containing the CRE target site, but to a differing and opposite degree for the two linkers, and that the metal-ion affinity for terpy(GCN4bd)2 is enhanced in the presence of CRE DNA. Binding to DNA containing the shorter AP1 target site, which lacks a single nucleobase pair compared to CRE, as well as half-CRE, which contains only half of the CRE target site, was also investigated. Cu(II) and Zn(II) coordination to terpy(GCN4bd)2 promotes binding to AP1 DNA, and to a lesser extent half-CRE DNA. Whereas, bipy(GCN4bd)2 , for which interpeptide distances are largely independent of metal-ion coordination and less suitable for binding to these shorter sites, displays allosteric ineffective behaviour in these cases. These findings for the first time demonstrate that biomolecular recognition, and specifically sequence-selective DNA binding, can be controlled by metal-ion coordination to designed switching units, non-native regulation sites, in artificial biomolecules. We believe that in the future these could find a wide range of applications in biotechnology.
Collapse
Affiliation(s)
- Emmanuel Oheix
- School of Chemistry, University of Birmingham, Edgbaston, B15 2TT (UK)
| | | |
Collapse
|
5
|
Nutiu R, Friedman RC, Luo S, Khrebtukova I, Silva D, Li R, Zhang L, Schroth GP, Burge CB. Direct measurement of DNA affinity landscapes on a high-throughput sequencing instrument. Nat Biotechnol 2011; 29:659-64. [PMID: 21706015 DOI: 10.1038/nbt.1882] [Citation(s) in RCA: 145] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2011] [Accepted: 04/27/2011] [Indexed: 01/11/2023]
Abstract
Several methods for characterizing DNA-protein interactions are available, but none have demonstrated both high throughput and quantitative measurement of affinity. Here we describe 'high-throughput sequencing'-'fluorescent ligand interaction profiling' (HiTS-FLIP), a technique for measuring quantitative protein-DNA binding affinity at unprecedented depth. In this approach, the optics built into a high-throughput sequencer are used to visualize in vitro binding of a protein to sequenced DNA in a flow cell. Application of HiTS-FLIP to the protein Gcn4 (Gcn4p), the master regulator of the yeast amino acid starvation response, yielded ~440 million binding measurements, enabling determination of dissociation constants for all 12-mer sequences having submicromolar affinity. These data revealed a complex interdependency between motif positions, allowed improved discrimination of in vivo Gcn4p binding sites and regulatory targets relative to previous methods and showed that sets of genes with different promoter affinities to Gcn4p have distinct functions and expression kinetics. Broad application of this approach should increase understanding of the interactions that drive transcription.
Collapse
Affiliation(s)
- Razvan Nutiu
- Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
6
|
Pensato S, Renda M, Leccia F, Saviano M, D'Andrea LD, Pedone C, Pedone PV, Romanelli A. PNA zipper as a dimerization tool: development of a bZip mimic. Biopolymers 2010; 93:434-41. [PMID: 19938072 DOI: 10.1002/bip.21357] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
The article describes the use of a PNA duplex (PNA zipper) as a tool to dimerize or bring in close proximity two polypeptides or protein domains. The amino acid sequence to be dimerized is covalently bound to complementary PNA sequences. Annealing of the PNA strands results in dimer formation. To test the ability of the "PNA-zipper" as a dimerization tool, we designed a GCN4 mimetic, where the leucine-zipper dimerization domain was replaced by the PNA zipper, whereas the basic DNA-binding domain was covalently attached to the PNA. The molecule was assembled by chemical ligation of the peptide corresponding to the DNA-binding domain of GCN4 modified with a succinyl thioester with two complementary PNAs harboring a cysteine residue. Electromobility-shift experiments show the ability of the PNA zipper-GCN4 to bind selected DNA duplexes. The PNA zipper-GCN4 binds both the TRE and CRE DNA sites, but it does not bind TRE and CRE mutants containing even a single base mutation, as the native GCN4. The ability to fold upon complexation with DNA was investigated by CD. A good correlation between the ability of the PNA zipper-GCN4 to fold into alpha helices and the ability to bind DNA was found.
Collapse
|
7
|
Seong KM, Park H, Kim SJ, Ha HN, Lee JY, Kim J. A new method for the construction of a mutant library with a predictable occurrence rate using Poisson distribution. J Microbiol Methods 2007; 69:442-50. [PMID: 17428560 DOI: 10.1016/j.mimet.2007.02.010] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2006] [Revised: 02/15/2007] [Accepted: 02/19/2007] [Indexed: 11/24/2022]
Abstract
A yeast transcriptional activator, Gcn4p, induces the expression of genes that are involved in amino acid and purine biosynthetic pathways under amino acid starvation. Gcn4p has an acidic activation domain in the central region and a bZIP domain in the C-terminus that is divided into the DNA-binding motif and dimerization leucine zipper motif. In order to identify amino acids in the DNA-binding motif of Gcn4p which are involved in transcriptional activation, we constructed mutant libraries in the DNA-binding motif through an innovative application of random mutagenesis. Mutant library made by oligonucleotides which were mutated randomly using the Poisson distribution showed that the actual mutation frequency was in good agreement with expected values. This method could save the time and effort to create a mutant library with a predictable mutation frequency. Based on the studies using the mutant libraries constructed by the new method, the specific residues of the DNA-binding domain in Gcn4p appear to be involved in the transcriptional activities on a conserved binding site.
Collapse
Affiliation(s)
- Ki Moon Seong
- Laboratory of Biochemistry, School of Life Sciences and Biotechnology, Korea University, Seoul 136-701, Republic of Korea
| | | | | | | | | | | |
Collapse
|
8
|
Namoto K, Gardiner J, Kimmerlin T, Seebach D. Investigation of the Interactions ofβ-Peptides with DNA Duplexes by Circular Dichroism Spectroscopy. Helv Chim Acta 2006. [DOI: 10.1002/hlca.200690277] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
9
|
Son S, Tanrikulu IC, Tirrell DA. Stabilization of bzip peptides through incorporation of fluorinated aliphatic residues. Chembiochem 2006; 7:1251-7. [PMID: 16758500 DOI: 10.1002/cbic.200500420] [Citation(s) in RCA: 57] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Two fluorinated amino acids, 5,5,5-trifluoroisoleucine (5TFI) and (2S,3R)-4,4,4-trifluorovaline (4TFV), which have been shown to serve as isoleucine surrogates in protein synthesis in Escherichia coli, have been incorporated in vivo into basic leucine zipper (bzip) peptides derived from GCN4. The extents of residue-specific incorporation of 5TFI and 4TFV were 90 and 88 %, respectively, of the encoded isoleucine residues, as evidenced by MALDI mass spectrometry and amino acid analysis. Both circular dichroism and equilibrium sedimentation studies of the fluorinated bzip peptides indicated preservation of secondary and higher-order protein structure. Thermal-denaturation experiments showed an increase of 27 degrees C in melting temperature when isoleucine was replaced by 5TFI. However, the T(m) of the peptide containing 4TFV was increased by only 4 degrees C over that of the peptide containing valine. Similar trends were observed in chemical denaturation studies in which DeltaDeltaG(unfold) in water was determined to be 2.1 or 0.3 kcal mol(-1) upon incorporation of 5TFI or 4TFV, respectively. When the fluorinated peptides were tested for DNA binding, both their affinity and specificity were similar to those of the respective hydrogenated peptides. These results suggest that fluorinated amino acids, even when introduced into the same positions, can have markedly different effects on the physical properties of proteins, while having little impact on secondary and higher-order structure.
Collapse
Affiliation(s)
- Soojin Son
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA 91125, USA
| | | | | |
Collapse
|
10
|
Ishida C, Aranda C, Valenzuela L, Riego L, Deluna A, Recillas-Targa F, Filetici P, López-Revilla R, González A. The UGA3-GLT1 intergenic region constitutes a promoter whose bidirectional nature is determined by chromatin organization in Saccharomyces cerevisiae. Mol Microbiol 2006; 59:1790-806. [PMID: 16553884 DOI: 10.1111/j.1365-2958.2006.05055.x] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Transcription of an important number of divergent genes of Saccharomyces cerevisiae is controlled by intergenic regions, which constitute factual bidirectional promoters. However, few of such promoters have been characterized in detail. The analysis of the UGA3-GLT1 intergenic region has provided an interesting model to study the joint action of two global transcriptional activators that had been considered to act independently. Our results show that Gln3p and Gcn4p exert their effect upon cis-acting elements, which are shared in a bidirectional promoter. Accordingly, when yeast is grown on a low-quality nitrogen source, or under amino acid deprivation, the expression of both UGA3 and GLT1 is induced through the action of both these global transcriptional modulators that bind to a region of the bidirectional promoter. In addition, we demonstrate that chromatin organization plays a major role in the bidirectional properties of the UGA3-GLT1 promoter, through the action of an upstream Abf1p-binding consensus sequence and a polydAdT(tract). Mutations in these cis-elements differentially affect transcription of UGA3 and GLT1, and thus alter the overall relative expression. This is the first example of an intergenic region constituting a promoter whose bidirectional character is determined by chromatin organization.
Collapse
Affiliation(s)
- Cecilia Ishida
- Departamento de Genética Molecular, Instituto de Fisiología Celular, Universidad Nacional Autónoma de México, México
| | | | | | | | | | | | | | | | | |
Collapse
|
11
|
Peng X, Karuturi RKM, Miller LD, Lin K, Jia Y, Kondu P, Wang L, Wong LS, Liu ET, Balasubramanian MK, Liu J. Identification of cell cycle-regulated genes in fission yeast. Mol Biol Cell 2004; 16:1026-42. [PMID: 15616197 PMCID: PMC551471 DOI: 10.1091/mbc.e04-04-0299] [Citation(s) in RCA: 141] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
Cell cycle progression is both regulated and accompanied by periodic changes in the expression levels of a large number of genes. To investigate cell cycle-regulated transcriptional programs in the fission yeast Schizosaccharomyces pombe, we developed a whole-genome oligonucleotide-based DNA microarray. Microarray analysis of both wild-type and cdc25 mutant cell cultures was performed to identify transcripts whose levels oscillated during the cell cycle. Using an unsupervised algorithm, we identified 747 genes that met the criteria for cell cycle-regulated expression. Peaks of gene expression were found to be distributed throughout the entire cell cycle. Furthermore, we found that four promoter motifs exhibited strong association with cell cycle phase-specific expression. Examination of the regulation of MCB motif-containing genes through the perturbation of DNA synthesis control/MCB-binding factor (DSC/MBF)-mediated transcription in arrested synchronous cdc10 mutant cell cultures revealed a subset of functional targets of the DSC/MBF transcription factor complex, as well as certain gene promoter requirements. Finally, we compared our data with those for the budding yeast Saccharomyces cerevisiae and found approximately 140 genes that are cell cycle regulated in both yeasts, suggesting that these genes may play an evolutionarily conserved role in regulation of cell cycle-specific processes. Our complete data sets are available at http://giscompute.gis.a-star.edu.sg/~gisljh/CDC.
Collapse
Affiliation(s)
- Xu Peng
- Genome Institute of Singapore, Singapore 138672, Singapore
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
12
|
Siggers TW, Silkov A, Honig B. Structural alignment of protein--DNA interfaces: insights into the determinants of binding specificity. J Mol Biol 2004; 345:1027-45. [PMID: 15644202 DOI: 10.1016/j.jmb.2004.11.010] [Citation(s) in RCA: 59] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2004] [Revised: 11/04/2004] [Accepted: 11/05/2004] [Indexed: 11/29/2022]
Abstract
A new method is introduced to structurally align interfaces observed in protein--DNA complexes. The method is based on a procedure that describes the interfacial geometry in terms of the spatial relationships between individual amino acid--nucleotide pairs. An amino acid--amino acid similarity matrix, S, is defined that provides a quantitative measure of the geometric relationships of amino acids in different interfaces and the entire stretch of "local" DNA within some distance of each amino acid. S is used as a substitution matrix in a dynamic programming algorithm that aligns the interfacial amino acids of the two complexes. The quality of the alignment is determined by an interface alignment score, IAS, that provides a quantitative measure of the similarity in the docking geometry between two protein--DNA complexes. We have clustered a large set of protein--DNA complexes based on their IAS values. In general, proteins within a single family form identifiable clusters. Subgroup clustering is often observed within families offering a fine-grained description of docking geometries. Although proteins with similar folds tend to dock in similar ways, important differences are observed even for structural motifs that almost perfectly align. Relationships are observed between the interfaces formed in cognate and non-cognate complexes involving the same proteins indicating a strong driving force to maintain certain contacts, even if this requires a distortion of the DNA. There are cases where inter-family similarities are greater than intra-family similarities. Our method offers the possibility of comparing different protein--DNA interfaces in a detailed, objective and quantitative fashion. This offers the possibility of new approaches to the description of the determinants of molecular recognition and to the prediction of protein and DNA sequence combinations that are optimal for binding.
Collapse
Affiliation(s)
- Trevor W Siggers
- Department of Biochemistry and Molecular Biophysics, Howard Hughes Medical Institute, Columbia University, 630 West 168th Street, New York, NY 10032, USA
| | | | | |
Collapse
|
13
|
Dragan AI, Frank L, Liu Y, Makeyeva EN, Crane-Robinson C, Privalov PL. Thermodynamic Signature of GCN4-bZIP Binding to DNA Indicates the Role of Water in Discriminating Between the AP-1 and ATF/CREB Sites. J Mol Biol 2004; 343:865-78. [PMID: 15476806 DOI: 10.1016/j.jmb.2004.08.101] [Citation(s) in RCA: 38] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2004] [Revised: 08/16/2004] [Accepted: 08/26/2004] [Indexed: 11/30/2022]
Abstract
The energetic basis of GCN4-bZIP complexes with the AP-1 and ATF/CREB sites was investigated by optical methods and scanning and isothermal titration microcalorimetry. The dissociation constant of the bZIP dimer was found to be significantly higher than that of its isolated leucine zipper domain: at 20 degrees C it is 1.45microM and increases with temperature. To avoid complications from dissociation of this dimer, DNA binding experiments were carried out using an SS crosslinked version of the bZIP. The thermodynamic characteristics of the bZIP/DNA association measured at different temperatures and salt concentrations were corrected for the contribution of refolding the basic segment upon binding, determined from the scanning calorimetric experiments. Fluorescence anisotropy titration experiments showed that the association constants of the bZIP at 20 degrees C with the AP-1 and ATF/CREB binding sites do not differ much, being 1.5nM and 6.4nM, corresponding to Gibbs energies of -49kJmol(-1) and -46kJmol(-1), respectively. Almost half of the Gibbs energy is attributable to the electrostatic component, resulting from the entropic effect of counterion release upon DNA association with the bZIP and is identical for both sites. In contrast to the Gibbs energies, the enthalpies of association of the fully folded bZIP with the AP-1 and ATF/CREB sites, and correspondingly the entropies of association, are very different. bZIP binding to the AP-1 site is characterized by a substantially larger negative enthalpy and non-electrostatic entropy than to the ATF/CREB site, implying that the AP-1 complex incorporates significantly more water molecules than the ATF/CREB complex.
Collapse
Affiliation(s)
- Anatoly I Dragan
- Department of Biology, Johns Hopkins University, Baltimore, MD 21218, USA
| | | | | | | | | | | |
Collapse
|
14
|
Abstract
We hypothesize that we can exploit what Nature has already evolved by manipulating the alpha-helix molecular recognition scaffold. Therefore, minimalist proteins capable of sequence-specific, high-affinity binding of DNA were generated to probe how proteins are used and can be used to recognize DNA. The already minimal basic region/leucine zipper motif (bZIP) of GCN4 was reduced to an even more simplified structure by substitution with alanine residues-hence, a generic, Ala-based, helical scaffold. The proteins generated, wt bZIP, 4A, 11A, and 18A, contain 0, 4, 11, and 18 alanine mutations in their DNA-binding basic regions, respectively. All alanine mutants still retain alpha-helical structure and DNA-binding function, despite loss of virtually all Coulombic protein-DNA interactions. Mass spectrometry allowed characterization of proteins and post-translational modifications. Fluorescence anisotropy and DNase I footprinting were used to measure in situ binding of these mutant proteins to DNA duplexes containing target sites AP-1 (5'-TGACTCA-3'), ATF/CREB (5'-TGACGTCA-3'), or nonspecific DNA. The roles of van der Waals and Coulombic interactions toward binding specificity and affinity are being investigated. Thus, both DNA-binding specificity and affinity are maintained in all our bZIP derivatives. This Ala-rich scaffold may be useful in design and synthesis of small, alpha-helical proteins with desired DNA-recognition properties.
Collapse
Affiliation(s)
- Jumi A. Shin
- Department of Chemistry, University of Toronto, Mississauga, Ontario L5L 1C6, Canada
| |
Collapse
|
15
|
Hollenbeck JJ, McClain DL, Oakley MG. The role of helix stabilizing residues in GCN4 basic region folding and DNA binding. Protein Sci 2002; 11:2740-7. [PMID: 12381856 PMCID: PMC2373721 DOI: 10.1110/ps.0211102] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
Basic region leucine zipper (bZip) proteins contain a bipartite DNA-binding motif consisting of a coiled-coil leucine zipper dimerization domain and a highly charged basic region that directly contacts DNA. The basic region is largely unfolded in the absence of DNA, but adopts a helical conformation upon DNA binding. Although a coil --> helix transition is entropically unfavorable, this conformational change positions the DNA-binding residues appropriately for sequence-specific interactions with DNA. The N-terminal residues of the GCN4 DNA-binding domain, DPAAL, make no DNA contacts and are not part of the conserved basic region, but are nonetheless important for DNA binding. Asp and Pro are often found at the N-termini of alpha-helices, and such N-capping motifs can stabilize alpha-helical structure. In the present study, we investigate whether these two residues serve to stabilize a helical conformation in the GCN4 basic region, lowering the energetic cost for DNA binding. Our results suggest that the presence of these residues contributes significantly to helical structure and to the DNA-binding ability of the basic region in the absence of the leucine zipper. Similar helix-capping motifs are found in approximately half of all bZip domains, and the implications of these findings for in vivo protein function are discussed.
Collapse
Affiliation(s)
- Jessica J Hollenbeck
- Department of Chemistry, Indiana University, Bloomington, Indiana 47405-7102, USA
| | | | | |
Collapse
|
16
|
Bird GH, Lajmi AR, Shin JA. Sequence-specific recognition of DNA by hydrophobic, alanine-rich mutants of the basic region/leucine zipper motif investigated by fluorescence anisotropy. Biopolymers 2002; 65:10-20. [PMID: 12209468 DOI: 10.1002/bip.10205] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
We generated minimalist proteins capable of sequence-specific, high-affinity binding of DNA to probe how proteins are used and can be used to recognize DNA. In order to quantify binding affinities and specificities in our protein-DNA system, we used fluorescence anisotropy to measure in situ the thermodynamics of binding of alanine-rich mutants of the GCN4 basic region/leucine zipper (bZIP) domain to DNA duplexes containing target sites AP-1 (5'-TGACTCA-3') or ATF/CREB (5'-TGACGTCA-3'). We simplified the alpha-helical bZIP molecular recognition scaffold by alanine substitution: 4A, 11A, and 18A contain four, eleven, and eighteen alanine mutations in their DNA-binding basic regions, respectively. DNase I footprinting analysis demonstrates that all bZIP mutants retain the sequence-specific DNA-binding function of native GCN4 bZIP. Titration of fluorescein-labeled oligonucleotide duplexes with increasing amounts of protein yielded low nanomolar dissociation constants for all bZIP mutants in complex with the AP-1 and ATF/CREB sites: binding to the nonspecific control duplex was > 1000-fold weaker. Remarkably, the most heavily mutated protein 18A, containing 24 alanines in its 27-residue basic region, still binds AP-1 and ATF/CREB with dissociation constants of 15 and 7.8 nM, respectively. Similarly, wild-type bZIP binds these sites with K(d) values of 9.1 and 14 nM. 11A also displays low nanomolar dissociation constants for AP-1 and ATF/CREB, while 4A binds these sites with approximately 10-fold weaker K(d) values. Thus, both DNA-binding specificity and affinity are maintained in all our bZIP derivatives. This Ala-rich scaffold may be useful in design and synthesis of small alpha-helical proteins with desired DNA-recognition properties capable of serving as therapeutics targeting transcription.
Collapse
Affiliation(s)
- Gregory H Bird
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA 15260, USA
| | | | | |
Collapse
|
17
|
Spode I, Maiwald D, Hollenberg CP, Suckow M. ATF/CREB sites present in sub-telomeric regions of Saccharomyces cerevisiae chromosomes are part of promoters and act as UAS/URS of highly conserved COS genes. J Mol Biol 2002; 319:407-20. [PMID: 12051917 DOI: 10.1016/s0022-2836(02)00322-4] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
A highly conserved 48 bp DNA element was identified present at 26 chromosome ends of Saccharomyces cerevisiae. Each element harbours an ideal or a mutated ATF/CREB site, which is a well-known target sequence for bZip transcription factors. In all cases, the sub-telomeric ATF/CREB site element (SACE) is a direct extension of the respective sub-telomeric coreX element. Eight SACEs are part of very long quasi-identical regions of several kilobases, including a sub-telomeric COS open reading frame. Three of these eight SACEs harbour an ideal ATF/CREB site, four a triple-exchange variant (5'-ATGGTATCAT-3'; GTA variant), and one a single exchange variant with a C to G exchange at the left side of the center of symmetry. We analyzed the function of the SACE of the left arm of chromosome VIII in vivo and found its ATF/CREB site to act as UAS/URS of the COS8 promoter, effected by the yeast bZip proteins Sko1p, Aca1p, and Aca2p. Cos8 protein was found in proximity to the nuclear membrane, where it accumulated, especially during cell division. When the ATF/CREB site of the COS8 promoter was exchanged with the GTA variant, the regulation was changed. COS8 was then regulated by Hac1p, a bZip protein known to be involved in the unfolded protein response of S. cerevisiae, indicating, for the first time, a possible functional category for the Cos proteins of S. cerevisiae.
Collapse
Affiliation(s)
- Igo Spode
- Institut für Mikrobiologie and Biologisch-Medizinisches Forschungszentrum, Heinrich-Heine-Universität, Universitätsstr. 1, 40225 Dusseldorf, Germany
| | | | | | | |
Collapse
|
18
|
Lee YS, Gurnon DG, Hollenbeck JJ, Oakley MG. Selection of a high-affinity DNA pool for a bZip protein with an out-of-phase alignment of the basic region relative to the leucine zipper. Bioorg Med Chem 2001. [DOI: 10.1016/s0968-0896(01)00158-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
19
|
Schumacher MA, Goodman RH, Brennan RG. The structure of a CREB bZIP.somatostatin CRE complex reveals the basis for selective dimerization and divalent cation-enhanced DNA binding. J Biol Chem 2000; 275:35242-7. [PMID: 10952992 DOI: 10.1074/jbc.m007293200] [Citation(s) in RCA: 135] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
The cAMP responsive element-binding protein (CREB) is central to second messenger regulated transcription. To elucidate the structural mechanisms of DNA binding and selective dimerization of CREB, we determined to 3.0 A resolution, the structure of the CREB bZIP (residues 283-341) bound to a 21-base pair deoxynucleotide that encompasses the canonical 8-base pair somatostatin cAMP response element (SSCRE). The CREB dimer is stabilized in part by ionic interactions from Arg(314) to Glu(319') and Glu(328) to Lys(333') as well as a hydrogen bond network that links the carboxamide side chains of Gln(322')-Asn(321)-Asn(321')-Gln(322). Critical to family selective dimerization are intersubunit hydrogen bonds between basic region residue Tyr(307) and leucine zipper residue Glu(312), which are conserved in all CREB/CREM/ATF-1 family members. Strikingly, the structure reveals a hexahydrated Mg(2+) ion bound in the cavity between the basic region and SSCRE that makes a water-mediated DNA contact. DNA binding studies demonstrate that Mg(2+) ions enhance CREB bZIP:SSCRE binding by more than 25-fold and suggest a possible physiological role for this ion in somatostatin cAMP response element and potentially other CRE-mediated gene expression.
Collapse
Affiliation(s)
- M A Schumacher
- Department of Biochemistry and Molecular Biology and the Vollum Institute, Oregon Health Sciences University, Portland, Oregon 97201-3098, USA
| | | | | |
Collapse
|
20
|
Suckow M, Kisters-Woike B, Hollenberg CP. A novel feature of DNA recognition: a mutant Gcn4p bZip peptide with dual DNA binding specificities dependent of half-site spacing. J Mol Biol 1999; 286:983-7. [PMID: 10047475 DOI: 10.1006/jmbi.1999.2537] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Homodimeric DNA-binding proteins with relaxed half-site spacing requirements for their DNA targets have been described. As an example, the yeast transcriptional activator Gcn4p binds in vitro equally well to the AP1 site (5'A4T3G2A1C0T1'C2'A3'T4'3') and the ATF/CREB site (5'A4T3G2A1C0G0'T1'C2'A3'T4'3'), which have identical but differently spaced half-site blocks. We describe a novel feature for the bZip class of DNA-binding proteins. The N-14 mutant of a Gcn4p-derived bZip peptide shows a diametrically opposed base-pair recognition specificity depending on the half-site spacing of its DNA target: on pseudo-palindromic, AP1 site-like binding sites, guanine is required in position 2 for proper binding; in contrast, on palindromic, ATF/CREB site-like targets, position 2 must be cytosine to prevent a loss of binding. Modeling studies suggest that the different base-pair requirements on differently spaced DNA targets are due to minimal alterations of the distances between the relevant atoms of the N-14 side-chain and the corresponding target groups on the DNA. Although the N-14 peptide does not have a natural counterpart, its behavior hints at the possibility that dual binding modi dependent on half-site spacing may occur also for natural homodimeric DNA-binding proteins.
Collapse
Affiliation(s)
- M Suckow
- Institut für Mikrobiologie, Heinrich-Heine-Universität, Universitätsstr. 1 40225, Düsseldorf, Germany
| | | | | |
Collapse
|
21
|
Lundblad JR, Kwok RP, Laurance ME, Huang MS, Richards JP, Brennan RG, Goodman RH. The human T-cell leukemia virus-1 transcriptional activator Tax enhances cAMP-responsive element-binding protein (CREB) binding activity through interactions with the DNA minor groove. J Biol Chem 1998; 273:19251-9. [PMID: 9668114 DOI: 10.1074/jbc.273.30.19251] [Citation(s) in RCA: 57] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Tax-1, the transcriptional activation protein of human T-cell leukemia virus-1, increases transcription from the human T-cell leukemia virus-1 long terminal repeat and specific cellular promoters through interactions with cellular DNA-binding proteins. The Tax response elements (TxREs) of the long terminal repeat resemble cAMP response elements (CREs), the target of cAMP-responsive element-binding protein (CREB). CREB binds the TxRE with reduced affinity; however, the interaction is specifically enhanced by Tax. Using a fluorescence quenching method, we determined that CREB dimerizes in the absence of DNA, and that Tax does not enhance dimerization. DNA footprinting of the TxRE with 1, 10-phenanthroline-copper complex demonstrates that Tax contacts DNA and extends the footprint of CREB to GC-rich sequences flanking the core CRE-like element. The minor groove-binding drug chromomycin A3, but not distamycin A, disrupted Tax-enhanced CREB binding to the TxRE. Substitution of the guanine-rich sequences flanking the core of the TxRE with inosine residues also blocked the Tax effect. Finally, the IC-substituted TxRE binds CREB with increased affinity, suggesting flanking DNA influences the binding of CREB to the core CRE-like element. These data indicate that Tax does not regulate DNA binding of CREB by altering dimerization, but rather enhances DNA binding by additionally interacting with the minor groove of flanking DNA sequences.
Collapse
Affiliation(s)
- J R Lundblad
- Division of Molecular Medicine, Oregon Health Sciences University, Portland, Oregon 97201, USA.
| | | | | | | | | | | | | |
Collapse
|
22
|
Abstract
The Pichia pastoris TRP1 and HIS3 genes were cloned by complementation of the Saccharomyces cerevisiae trip1 and his3 mutants, respectively, and their nucleotide sequence was determined. The P. pastoris TRP1 gene includes an open reading frame (ORF) of 714 nucleotides corresponding to a polypeptide of 237 amino acids whose sequence shares about 40% identity with that of TRP1 encoding proteins in other yeast species. DNA sequencing showed that an ORF of 858 nucleotides, encoding a protein of 285 amino acids with high homology to inorganic pyrophosphatases (IPP1), is located downstream of the P. pastoris TRP1 gene. Both genes converge in this chromosomal region, showing a genetic organization analogous to that found in the Kluyveromyces lactis genome. The P. pastoris HIS3 gene possesses an ORF of 675 nucleotides, encoding a polypeptide of 224 amino acids which shows 74.1% identity to the homologous S. cerevisiae protein. The hexameric consensus GCN4 binding sequence (TGACTC), characteristic of many amino acid biosynthetic genes, is present in the promoter region.
Collapse
Affiliation(s)
- I Cosano
- Departamento de Microbiología II, Facultad de Farmacia, Universidad Complutense, Madrid, Spain
| | | | | | | |
Collapse
|
23
|
Suckow M, Hollenberg CP. The activation specificities of wild-type and mutant Gcn4p in vivo can be different from the DNA binding specificities of the corresponding bZip peptides in vitro. J Mol Biol 1998; 276:887-902. [PMID: 9566194 DOI: 10.1006/jmbi.1997.1565] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Single amino acid substitutions which previously have been shown to alter the DNA binding specificity of a Gcn4p bZip peptide in vitro were transformed to full length Gcn4p, and activation of a test promoter carrying various palindromic and pseudo-palindromic binding sites was measured. All mutations were found to have different phenotypes, and the first change-of-specificity mutants for Gcn4p in vivo are described. The comparison of plasmids encoding no protein or a particular Gcn4p mutant with broadened activation specificity in gcn4 and gcn4 acr1 genetic backgrounds revealed three new DNA targets of the yeast Acr1p repressor. Surprisingly, we found the activation specificities Gcn4p and the mutants tested in vivo to be generally different from DNA binding specificities of the corresponding bZip peptides in vitro. Especially, the proteins respond differently, in vitro and in vivo, on changes in half site spacing of the DNA binding sites. We present data which largely exclude that the differences between in vivo and in vitro-derived results are due to differences in protein structure, or to the presence of competing protein factors in the yeast cell. We conclude that the differences between in vitro and in vivo-derived results are caused by differences in the degree of flexibility of the target DNA sequences in vitro and in vivo.
Collapse
Affiliation(s)
- M Suckow
- Institut für Mikrobiologie der Heinrich-Heine-Universität, Düsseldorf, Germany
| | | |
Collapse
|
24
|
Wang P, Larson TG, Chen CH, Pawlyk DM, Clark JA, Nuss DL. Cloning and characterization of a general amino acid control transcriptional activator from the chestnut blight fungus Cryphonectria parasitica. Fungal Genet Biol 1998; 23:81-94. [PMID: 9501479 DOI: 10.1006/fgbi.1997.1023] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
We have cloned and characterized a homologue of the Neurospora crassa general amino acid control gene cpc-1 from the chestnut blight fungus Cryphonectria parasitica. The deduced amino acid sequence of C. parasitica CPC1 (cpCPC1) contains regions with significant homology to the transcriptional activation, DNA binding, and dimerization domains previously defined for N. crassa CPC1 (ncCPC1) and the equivalent "b-ZIP" transcription factor from Saccharomyces cerevisiae, GCN4 (scGCN4). Treatment of C. parasitica with low levels of the protein synthesis inhibitor cycloheximide caused cpc-1 transcript levels to undergo a rapid, transient increase similar to that reported for the mammalian b-ZIP transactivators, c-Jun and c-Fos. Northern analysis also revealed that amino acid starvation of C. parasitica elicits an increase in cpc-1 transcript levels. Hypovirus infection did not affect this increase, although transcript accumulation for several amino acid biosynthetic genes was slightly diminished in the hypovirus-containing strain. Recombinant cpCPC1 specifically bound to the consensus DNA binding element (AP-1), 5'-A/GTGACTCAT-3', also located upstream of the C. parasitica cpc-1 coding region. Constitutive transgenic expression of a DNA binding defective cpCPC1 mutant impaired the ability of C. parasitica to adjust to amino acid starvation. Moreover, these transformants showed reduced ability to grow on host chestnut tissue. Our results define a general amino acid control transactivator in a plant pathogenic fungus and suggest that functional modulation of this factor can influence fungal virulence.
Collapse
Affiliation(s)
- P Wang
- Center for Agricultural Biotechnology, University of Maryland Biotechnology Institute, University of Maryland, College Park 20742, USA
| | | | | | | | | | | |
Collapse
|
25
|
Craig LE, Nealen ML, Strandberg JD, Zink MC. Differential replication of ovine lentivirus in endothelial cells cultured from different tissues. Virology 1997; 238:316-26. [PMID: 9400604 DOI: 10.1006/viro.1997.8830] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Blood-brain barrier dysfunction has been postulated to be important in the pathogenesis of HIV dementia. This study used an in vitro model of the blood-brain barrier to determine the effects of ovine lentivirus (OvLV) infection on endothelial cells. The replication of two American OvLV isolates and two lcelandic OvLV isolates in pure cultures of endothelial cells isolated from brain was compared to replication in endothelial cells from adipose, lung, and aorta. Inoculation with the two American isolates resulted in 100 times greater reverse transcriptase (RT) activity in supernatant of the microvascular endothelial cells (brain, lung, and adipose) than in the macrovascular endothelial cells (aorta). Conversely, inoculation with the two lcelandic isolates resulted in 100 times higher RT activity in aortic, lung, and adipose endothelial cells than in the brain endothelial cells. Transmission electron microscopy of the brain capillary endothelial cells infected with the American isolates revealed polarized viral budding from the lateral cell membrane and a loss of tight junctions. Replication of OvLV in brain capillary endothelial cells could play a role in the pathogenesis of lentiviral encephalitis by altering blood-brain barrier integrity.
Collapse
Affiliation(s)
- L E Craig
- Division of Comparative Medicine, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA
| | | | | | | |
Collapse
|
26
|
Takenaka S, Sato H, Itakura Y, Kondo H, Takagi M. Construction of a dimeric DNA-binding peptide model by peptide-anthraquinone conjugation. INTERNATIONAL JOURNAL OF PEPTIDE AND PROTEIN RESEARCH 1996; 48:397-400. [PMID: 8919061 DOI: 10.1111/j.1399-3011.1996.tb00857.x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
A peptide-anthraquinone conjugate was designed and synthesized containing linked peptide chains composed of Asp-Pro-Ala-Ala-Leu-Lys-Arg-Ala-Arg-Asn-Thr-Glu-Ala- Ala-Arg-Arg-Ser-Arg-Ala-Arg-Lys-Leu-Gln-Arg-Met, representing the basic region of GCN4. The two peptides were joined with anthraquinone at its 1- and 8-positions in a two-fold symmetric fashion, mimicking dimeric DNA-binding proteins. Experimental data indicated both an interaction of the anthraquinone moiety with the DNA double strand and an increase in the alpha-helicity of the peptide moieties of the ligand when it binds to DNA.
Collapse
Affiliation(s)
- S Takenaka
- Department of Chemical Science and Technology, Kyushu University, Fukuoka, Japan
| | | | | | | | | |
Collapse
|
27
|
Suckow M, Lopata M, Seydel A, Kisters-Woike B, von Wilcken-Bergmann B, Müller-Hill B. Mutant bZip-DNA complexes with four quasi-identical protein-DNA interfaces. EMBO J 1996; 15:598-606. [PMID: 8599943 PMCID: PMC449978] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023] Open
Abstract
The complex between the yeast transcriptional activator GCN4 and the palindromic ATF/CREB site 5'- A4T3G2A1C0*G0'T1'C2'A3'T4'-3' shows dyad symmetry. The basic region of GCN4 contains a segment of 18 amino acids with a partially palindromic sequence: N-LKRARNTEA*ARRSRARKL-C. Symmetric residues are underlined. Apart from the ATF/CREB site, GCN4 also binds well to the symmetric variants with guanine in position 4 (5'-G4T3G2A1C0*G0'T1'C2'A3'C4'-3') or thymine in position 0 (5'-A4T3G2A1T0*A0'T1'C2'A3'T4'-3'). The half-sites of these sequences can be regarded as short pseudo-palindromes with central guanine 2/cytosine 2' base pairs. We investigated whether the geometry of the peptide of the basic region of GCN4 could be functionally related to the pseudo-palindromic character of some target half-sites. Since inspection of the X-ray structures of GCN4-DNA complexes reveals that several amino acid-DNA interactions are symmetric within the wild-type half-complexes, we introduced mutations into a GCN4 bZip peptide that improve the symmetry of the peptide. We found that most of the constructs retain specific DNA recognition. For one mutant, we conclude that it is not only capable of forming DNA complexes showing the well-known overall dyad symmetry, but that the protein-DNA interface of each half-complex can be divided further into two quasi-identical, quasi-symmetric substructures.
Collapse
Affiliation(s)
- M Suckow
- Institut für Genetik der Universität zu Köln, Germany
| | | | | | | | | | | |
Collapse
|
28
|
Olive M, Williams SC, Dezan C, Johnson PF, Vinson C. Design of a C/EBP-specific, dominant-negative bZIP protein with both inhibitory and gain-of-function properties. J Biol Chem 1996; 271:2040-7. [PMID: 8567657 DOI: 10.1074/jbc.271.4.2040] [Citation(s) in RCA: 49] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023] Open
Abstract
We have developed a bZIP protein, GBF-F, with both dominant-negative (DN) and gain-of-function properties. GBF-F is a chimera consisting of two components: the DNA binding (basic) region from the plant bZIP protein GBF-1 (GBF) and a leucine zipper (F) designed to preferentially heterodimerize with the C/EBP alpha leucine zipper. Biochemical studies show that GBF-F preferentially forms heterodimers with C/EBP alpha and thus binds a chimeric DNA sequence composed of the half-sites recognized by the C/EBP and GBF basic regions. Transient transfections in HepG2 hepatoma cells show that both components of GBF-F are necessary for inhibition of C/EBP alpha transactivation. When the C/EBP alpha leucine zipper is replaced with that of either GCN4 or VBP, the resulting protein can transactivate a C/EBP cis-element but is not inhibited by GBF-F, indicating that the specificity of dominant-negative action is determined by the leucine zipper. All known members of the C/EBP family contain similar leucine zipper regions and are inhibited by GBF-F. GBF-F also exhibits gain-of-function properties, since, with the essential cooperation of a C/EBP family member, it can transactivate a promoter containing the chimeric C/EBP/GBF site. This protein therefore has potential utility both as a dominant-negative inhibitor of C/EBP function and as an activator protein with novel DNA sequence specificity.
Collapse
Affiliation(s)
- M Olive
- Laboratory of Biochemistry, National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | | | | | | | | |
Collapse
|
29
|
Svetlov VV, Cooper TG. Review: compilation and characteristics of dedicated transcription factors in Saccharomyces cerevisiae. Yeast 1995; 11:1439-84. [PMID: 8750235 DOI: 10.1002/yea.320111502] [Citation(s) in RCA: 57] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
Affiliation(s)
- V V Svetlov
- Department of Microbiology and Immunology, University of Tennessee, Memphis 36163, USA
| | | |
Collapse
|
30
|
Stanojevic D, Verdine GL. Deconstruction of GCN4/GCRE into a monomeric peptide-DNA complex. NATURE STRUCTURAL BIOLOGY 1995; 2:450-7. [PMID: 7664107 DOI: 10.1038/nsb0695-450] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
Here we describe a system that enables short peptides to bind DNA sequence-specifically. Linking the peptide covalently to DNA through a disulphide bond eliminates the unfavourable energetic cost of diffusion and thus potentiates the peptide-DNA interaction. By this approach we have deconstructed the GCN4/DNA complex into its elemental DNA recognition units. We find that the GCN4 basic region contacts the two half-sites with very different affinities and propose that this thermodynamic asymmetry plays a role in differential regulation of gene expression. Specific binding of the peptide to DNA stabilizes the disulphide bond toward reduction suggesting a novel approach to the discovery of new DNA-binding specificities.
Collapse
Affiliation(s)
- D Stanojevic
- Department of Chemistry, Harvard University, Cambridge, Massachusetts 02138, USA
| | | |
Collapse
|
31
|
Okagami M, Ueno M, Makino K, Shimomura M, Saito I, Morii T, Sugiura Y. Sequence-specific DNA binding by covalently constrained peptide dimers of the basic leucine zipper protein GCN4. Bioorg Med Chem 1995; 3:777-84. [PMID: 7582955 DOI: 10.1016/0968-0896(95)00047-k] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
DNA binding of covalently bonded peptide dimers was studied by using enantiomeric and C2-symmetric templates as a dimerization module. Amino acid sequence of the peptide is derived from that of DNA contact region of the basic leucine zipper protein GCN4. These peptide dimers were designed to possess different constraints with respect to the orientation of two peptides. The basic region peptides were covalently linked to the enantiomeric template at the C-terminal ends. Two peptides are arranged either in a right-handed or left-handed geometry depending on the chirality of the template. The GCN4 basic region dimers with both right-handed and left-handed geometries show equal affinity to the native GCN4 binding DNA sequences, 5'-ATGACTCAT-3' and 5'-ATGACGTCAT-3', as revealed by the gel mobility shift assay. Specific recognition of the palindromic DNA sequence by the peptide dimers was confirmed by the DNase I footprinting. Circular dichroism spectroscopic study indicates that the basic region peptides bound the target DNA sequence in a helical conformation. The degree to which a chiral constraint effects may depend on the geometry of two DNA binding domains in the parent protein-DNA complex and on a position to apply the chiral constraint.
Collapse
Affiliation(s)
- M Okagami
- Department of Polymer Science and Engineering, Kyoto Institute of Technology, Japan
| | | | | | | | | | | | | |
Collapse
|
32
|
Metallo SJ, Schepartz A. Distribution of labor among bZIP segments in the control of DNA affinity and specificity. CHEMISTRY & BIOLOGY 1994; 1:143-51. [PMID: 9383384 DOI: 10.1016/1074-5521(94)90004-3] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
BACKGROUND The basic region-leucine zipper (bZIP) family of proteins use an atypically simple motif for DNA recognition, yet family members discriminate differently between target sites that differ only in half-site spacing. Two such sites are the cAMP-response element (CRE) and the AP-1 target site. Fos/Jun prefers the AP-1 site (ATGACTCAT), while CRE-BP1 prefers CRE (ATGACGTCAT), and GCN4 binds both sites with equal affinity. We therefore asked what determines the relative specificity for CRE and AP-1 sites in bZIP proteins. RESULTS Here we show that CRE/AP-1 specificity in CRE-BP1 is encoded within the spacer and basic segments of the bZIP element. Of these two regions, the basic segment is the more important. This specificity is in part achieved at the expense of affinity. CONCLUSIONS The small size and simplicity of the bZIP recognition helix was already unusual; our findings show that the information that determines the target site specificity of members of the bZIP family of proteins is even more condensed than expected. These results suggest that it may be possible to design surprisingly small proteins that bind DNA with high sequence specificity, although it may be more difficult to achieve high-affinity binding in small proteins.
Collapse
Affiliation(s)
- S J Metallo
- Department of Chemistry, Yale University, New Haven, CT 06511-8118, USA
| | | |
Collapse
|
33
|
Tamaru H, Nishida T, Harashima T, Inoue H. Transcriptional activation of a cycloheximide-inducible gene encoding laccase is mediated by cpc-1, the cross-pathway control gene, in Neurospora crassa. MOLECULAR & GENERAL GENETICS : MGG 1994; 243:548-54. [PMID: 8208246 DOI: 10.1007/bf00284203] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
Expression of the laccase gene (lacc) of Neurospora crassa is transcriptionally inducible by the protein synthesis inhibitor cycloheximide. A lni-1 mutation, conferring the laccase non-inducible phenotype, was found to be a cpc-1 allele. Northern blots probed with plasmid pLA1, which carries the lacc gene revealed that the cpc-1 mutation abolishes the induced transcription of the lacc gene, indicating requirement of the cpc-1 gene for transcriptional activation of the lacc gene. In Northern blots probed with plasmid pAB1, which bears arg-2 a gene whose transcription is under the control of CPC1, the level of the arg-2 transcript was shown to increase several-fold in wild-type mycelia but remained low in cpc-1 mycelia, after treatment with cycloheximide. This suggests that inhibition of protein synthesis with cycloheximide, as well as amino acid limitation, elicits the CPC1-mediated cross-pathway control. Characterization of the lacc upstream region using a series of 5'-deletion plasmids led to the identification of a 170 bp DNA region required for the induced lacc expression. Sequence analysis of this DNA region demonstrated that it includes a 9 bp sequence with dyad symmetry, ATGAATCAT, which differs only by a central base pair from ATGA(C/G)TCAT, the recognition sequence characteristic of CPC1 and GCN4 binding sites. Possible mechanisms by which CPC1 mediates transcriptional activation of the lacc gene are discussed.
Collapse
Affiliation(s)
- H Tamaru
- Department of Regulation Biology, Faculty of Science, Saitama University, Urawa, Japan
| | | | | | | |
Collapse
|
34
|
|
35
|
Ellenberger T. Getting a grip on DNA recognition: structures of the basic region leucine zipper, and the basic region helix-loop-helix DNA-binding domains. Curr Opin Struct Biol 1994. [DOI: 10.1016/s0959-440x(94)90054-x] [Citation(s) in RCA: 58] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
36
|
Lee KA, Masson N. Transcriptional regulation by CREB and its relatives. BIOCHIMICA ET BIOPHYSICA ACTA 1993; 1174:221-33. [PMID: 8373801 DOI: 10.1016/0167-4781(93)90191-f] [Citation(s) in RCA: 149] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Affiliation(s)
- K A Lee
- Imperial Cancer Research Fund, Clare Hall Laboratories, Potters Bar, UK
| | | |
Collapse
|
37
|
Weinstock KG, Strathern JN. Molecular genetics in Saccharomyces kluyveri: the HIS3 homolog and its use as a selectable marker gene in S. kluyveri and Saccharomyces cerevisiae. Yeast 1993; 9:351-61. [PMID: 8511965 DOI: 10.1002/yea.320090405] [Citation(s) in RCA: 25] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023] Open
Abstract
We cloned the Saccharomyces kluyveri HIS3 homolog, k-HIS3, and made a partial deletion of the gene. The k-HIS3 gene complemented a HIS3 deletion in S. cerevisiae. The DNA sequences of the open reading frames (ORFs) of the HIS3 homologs are 70% identical at the DNA level and 83% identical at the deduced amino acid level. The ORF upstream of the k-HIS3 gene is related to the PET56 gene of S. cerevisiae found upstream of the HIS3 gene of S. cerevisiae. The ORF downstream from the k-HIS3 gene is not related to the DED1 gene found downstream of the HIS3 gene in S. cerevisiae.
Collapse
Affiliation(s)
- K G Weinstock
- Laboratory of Eukaryotic Gene Expression, ABL-Basic Research Program, NCI-Frederick Cancer Research and Development Center, MD 21702-1201
| | | |
Collapse
|
38
|
Ellenberger TE, Brandl CJ, Struhl K, Harrison SC. The GCN4 basic region leucine zipper binds DNA as a dimer of uninterrupted alpha helices: crystal structure of the protein-DNA complex. Cell 1992; 71:1223-37. [PMID: 1473154 DOI: 10.1016/s0092-8674(05)80070-4] [Citation(s) in RCA: 718] [Impact Index Per Article: 21.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
The yeast transcriptional activator GCN4 is 1 of over 30 identified eukaryotic proteins containing the basic region leucine zipper (bZIP) DNA-binding motif. We have determined the crystal structure of the GCN4 bZIP element complexed with DNA at 2.9 A resolution. The bZIP dimer is a pair of continuous alpha helices that form a parallel coiled coil over their carboxy-terminal 30 residues and gradually diverge toward their amino termini to pass through the major groove of the DNA-binding site. The coiled-coil dimerization interface is oriented almost perpendicular to the DNA axis, giving the complex the appearance of the letter T. There are no kinks or sharp bends in either bZIP monomer. Numerous contacts to DNA bases and phosphate oxygens are made by basic region residues that are conserved in the bZIP protein family. The details of the bZIP dimer interaction with DNA can explain recognition of the AP-1 site by the GCN4 protein.
Collapse
Affiliation(s)
- T E Ellenberger
- Harvard University, Department of Biochemistry and Molecular Biology, Cambridge, Massachusetts 02138
| | | | | | | |
Collapse
|