101
|
Lutz B, Sinner C, Heuermann G, Verma A, Schug A. eSBMTools 1.0: enhanced native structure-based modeling tools. Bioinformatics 2013; 29:2795-6. [DOI: 10.1093/bioinformatics/btt478] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
|
102
|
Anand P, Schug A, Wenzel W. Structure based design of protein linkers for zinc finger nuclease. FEBS Lett 2013; 587:3231-5. [PMID: 23994524 DOI: 10.1016/j.febslet.2013.08.015] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2013] [Revised: 08/05/2013] [Accepted: 08/12/2013] [Indexed: 12/18/2022]
Abstract
Zinc finger nucleases are a promising tool to edit DNA in many biological applications, in particular for gene knockout. Despite many efforts the number of genes that can be effectively targeted with ZFNs remains severely limited, as available constructs cannot address arbitrary gene sequences. Here, we develop a novel concept to significantly enhance the number of DNA sequences that can be targeted by ZFN. Using an efficient computational model, we provide an extensive library of possible linker molecules between individual zinc finger motifs in the construct that can skip up to 10 base pairs between adjacent zinc finger recognition sites in the DNA sequence, which increases the number of genes that can be efficiently targeted by more than an order of magnitude.
Collapse
Affiliation(s)
- Priya Anand
- Institute of Nanotechnology, Karlsruhe Institute of Technology, 76021 Karlsruhe, Germany
| | | | | |
Collapse
|
103
|
From principal component to direct coupling analysis of coevolution in proteins: low-eigenvalue modes are needed for structure prediction. PLoS Comput Biol 2013; 9:e1003176. [PMID: 23990764 PMCID: PMC3749948 DOI: 10.1371/journal.pcbi.1003176] [Citation(s) in RCA: 100] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2012] [Accepted: 06/24/2013] [Indexed: 11/19/2022] Open
Abstract
Various approaches have explored the covariation of residues in multiple-sequence alignments of homologous proteins to extract functional and structural information. Among those are principal component analysis (PCA), which identifies the most correlated groups of residues, and direct coupling analysis (DCA), a global inference method based on the maximum entropy principle, which aims at predicting residue-residue contacts. In this paper, inspired by the statistical physics of disordered systems, we introduce the Hopfield-Potts model to naturally interpolate between these two approaches. The Hopfield-Potts model allows us to identify relevant ‘patterns’ of residues from the knowledge of the eigenmodes and eigenvalues of the residue-residue correlation matrix. We show how the computation of such statistical patterns makes it possible to accurately predict residue-residue contacts with a much smaller number of parameters than DCA. This dimensional reduction allows us to avoid overfitting and to extract contact information from multiple-sequence alignments of reduced size. In addition, we show that low-eigenvalue correlation modes, discarded by PCA, are important to recover structural information: the corresponding patterns are highly localized, that is, they are concentrated in few sites, which we find to be in close contact in the three-dimensional protein fold. Extracting functional and structural information about protein families from the covariation of residues in multiple sequence alignments is an important challenge in computational biology. Here we propose a statistical-physics inspired framework to analyze those covariations, which naturally unifies existing methods in the literature. Our approach allows us to identify statistically relevant ‘patterns’ of residues, specific to a protein family. We show that many patterns correspond to a small number of sites on the protein sequence, in close contact on the 3D fold. Hence, those patterns allow us to make accurate predictions about the contact map from sequence data only. Further more, we show that the dimensional reduction, which is achieved by considering only the statistically most significant patterns, avoids overfitting in small sequence alignments, and improves our capacity of extracting residue contacts in this case.
Collapse
|
104
|
Johnson ME, Hummer G. Evolutionary pressure on the topology of protein interface interaction networks. J Phys Chem B 2013; 117:13098-106. [PMID: 23701316 DOI: 10.1021/jp402944e] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The densely connected structure of protein-protein interaction (PPI) networks reflects the functional need of proteins to cooperate in cellular processes. However, PPI networks do not adequately capture the competition in protein binding. By contrast, the interface interaction network (IIN) studied here resolves the modular character of protein-protein binding and distinguishes between simultaneous and exclusive interactions that underlie both cooperation and competition. We show that the topology of the IIN is under evolutionary pressure, and we connect topological features of the IIN to specific biological functions. To reveal the forces shaping the network topology, we use a sequence-based computational model of interface binding along with network analysis. We find that the more fragmented structure of IINs, in contrast to the dense PPI networks, arises in large part from the competition between specific and nonspecific binding. The need to minimize nonspecific binding favors specific network motifs, including a minimal number of cliques (i.e., fully connected subgraphs) and many disconnected fragments. Validating the model, we find that these network characteristics are closely mirrored in the IIN of clathrin-mediated endocytosis. Features unexpected on the basis of our motif analysis are found to indicate either exceptional binding selectivity or important regulatory functions.
Collapse
Affiliation(s)
- Margaret E Johnson
- Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health , Bethesda, Maryland 20892-0520, United States
| | | |
Collapse
|
105
|
Walther TH, Gottselig C, Grage SL, Wolf M, Vargiu AV, Klein MJ, Vollmer S, Prock S, Hartmann M, Afonin S, Stockwald E, Heinzmann H, Nolandt OV, Wenzel W, Ruggerone P, Ulrich AS. Folding and self-assembly of the TatA translocation pore based on a charge zipper mechanism. Cell 2013; 152:316-26. [PMID: 23332763 DOI: 10.1016/j.cell.2012.12.017] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2010] [Revised: 08/20/2012] [Accepted: 12/13/2012] [Indexed: 12/11/2022]
Abstract
We propose a concept for the folding and self-assembly of the pore-forming TatA complex from the Twin-arginine translocase and of other membrane proteins based on electrostatic "charge zippers." Each subunit of TatA consists of a transmembrane segment, an amphiphilic helix (APH), and a C-terminal densely charged region (DCR). The sequence of charges in the DCR is complementary to the charge pattern on the APH, suggesting that the protein can be "zipped up" by a ladder of seven salt bridges. The length of the resulting hairpin matches the lipid bilayer thickness, hence a transmembrane pore could self-assemble via intra- and intermolecular salt bridges. The steric feasibility was rationalized by molecular dynamics simulations, and experimental evidence was obtained by monitoring the monomer-oligomer equilibrium of specific charge mutants. Similar "charge zippers" are proposed for other membrane-associated proteins, e.g., the biofilm-inducing peptide TisB, the human antimicrobial peptide dermcidin, and the pestiviral E(RNS) protein.
Collapse
Affiliation(s)
- Torsten H Walther
- Karlsruhe Institute of Technology, Institute of Biological Interfaces, Institute of Organic Chemistry and CFN, Fritz-Haber-Weg 6, 76131 Karlsruhe, Germany
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
106
|
Abstract
Co-evolution is a fundamental component of the theory of evolution and is essential for understanding the relationships between species in complex ecological networks. A wide range of co-evolution-inspired computational methods has been designed to predict molecular interactions, but it is only recently that important advances have been made. Breakthroughs in the handling of phylogenetic information and in disentangling indirect relationships have resulted in an improved capacity to predict interactions between proteins and contacts between different protein residues. Here, we review the main co-evolution-based computational approaches, their theoretical basis, potential applications and foreseeable developments.
Collapse
Affiliation(s)
- David de Juan
- Structural Biology and Biocomputing Programme, Spanish National Cancer Research Centre (CNIO), Madrid, Spain
| | | | | |
Collapse
|
107
|
Ekeberg M, Lövkvist C, Lan Y, Weigt M, Aurell E. Improved contact prediction in proteins: using pseudolikelihoods to infer Potts models. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2013; 87:012707. [PMID: 23410359 DOI: 10.1103/physreve.87.012707] [Citation(s) in RCA: 411] [Impact Index Per Article: 34.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/23/2012] [Indexed: 05/24/2023]
Abstract
Spatially proximate amino acids in a protein tend to coevolve. A protein's three-dimensional (3D) structure hence leaves an echo of correlations in the evolutionary record. Reverse engineering 3D structures from such correlations is an open problem in structural biology, pursued with increasing vigor as more and more protein sequences continue to fill the data banks. Within this task lies a statistical inference problem, rooted in the following: correlation between two sites in a protein sequence can arise from firsthand interaction but can also be network-propagated via intermediate sites; observed correlation is not enough to guarantee proximity. To separate direct from indirect interactions is an instance of the general problem of inverse statistical mechanics, where the task is to learn model parameters (fields, couplings) from observables (magnetizations, correlations, samples) in large systems. In the context of protein sequences, the approach has been referred to as direct-coupling analysis. Here we show that the pseudolikelihood method, applied to 21-state Potts models describing the statistical properties of families of evolutionarily related proteins, significantly outperforms existing approaches to the direct-coupling analysis, the latter being based on standard mean-field techniques. This improved performance also relies on a modified score for the coupling strength. The results are verified using known crystal structures of specific sequence instances of various protein families. Code implementing the new method can be found at http://plmdca.csc.kth.se/.
Collapse
Affiliation(s)
- Magnus Ekeberg
- Engineering Physics Program, KTH Royal Institute of Technology, 100 44 Stockholm, Sweden
| | | | | | | | | |
Collapse
|
108
|
Szurmant H, Hoch JA. Statistical analyses of protein sequence alignments identify structures and mechanisms in signal activation of sensor histidine kinases. Mol Microbiol 2012; 87:707-12. [PMID: 23279101 DOI: 10.1111/mmi.12128] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/10/2012] [Indexed: 01/31/2023]
Abstract
Statistical analyses of genome sequence-derived protein sequence data can identify amino acid residues that interact between proteins or between domains of a protein. These statistical methods are based on evolution-directed amino acid variation responding to structural and functional constraints in proteins. The identified residues form a basis for determining structure and folding of proteins as well as inferring mechanisms of protein function. When applied to two-component systems, several research groups have shown they can be used to identify the amino acid interactions between response regulators and histidine kinases and the specificity therein. Recently, statistical studies between the HisKA and HATPase-ATP-binding domains of histidine kinases identified amino acid interactions for both the inactive and the active catalytic states of such kinases. The identified interactions generated a model structure for the domain conformation of the active state. This conformation requires an unwinding of a portion of the C-terminal helix of the HisKA domain that destroys the inactive state residue contacts and suggests how signal-binding determines the equilibrium between the inactive and active states of histidine kinases. The rapidly accumulating protein sequence databases from genome, metagenome and microbiome studies are an important resource for functional and structural understanding of proteins and protein complexes in microbes.
Collapse
Affiliation(s)
- Hendrik Szurmant
- Department of Molecular and Experimental Medicine, The Scripps Research Institute, 10550 N Torrey Pines Rd, La Jolla, CA 92037, USA
| | | |
Collapse
|
109
|
Kensche PR, Duarte I, Huynen MA. A three-dimensional topology of complex I inferred from evolutionary correlations. BMC STRUCTURAL BIOLOGY 2012; 12:19. [PMID: 22857522 PMCID: PMC3436739 DOI: 10.1186/1472-6807-12-19] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/13/2012] [Accepted: 06/28/2012] [Indexed: 11/22/2022]
Abstract
Background The quaternary structure of eukaryotic NADH:ubiquinone oxidoreductase (complex I), the largest complex of the oxidative phosphorylation, is still mostly unresolved. Furthermore, it is unknown where transiently bound assembly factors interact with complex I. We therefore asked whether the evolution of complex I contains information about its 3D topology and the binding positions of its assembly factors. We approached these questions by correlating the evolutionary rates of eukaryotic complex I subunits using the mirror-tree method and mapping the results into a 3D representation by multidimensional scaling. Results More than 60% of the evolutionary correlation among the conserved seven subunits of the complex I matrix arm can be explained by the physical distance between the subunits. The three-dimensional evolutionary model of the eukaryotic conserved matrix arm has a striking similarity to the matrix arm quaternary structure in the bacterium Thermus thermophilus (rmsd=19 Å) and supports the previous finding that in eukaryotes the N-module is turned relative to the Q-module when compared to bacteria. By contrast, the evolutionary rates contained little information about the structure of the membrane arm. A large evolutionary model of 45 subunits and assembly factors allows to predict subunit positions and interactions (rmsd = 52.6 Å). The model supports an interaction of NDUFAF3, C8orf38 and C2orf56 during the assembly of the proximal matrix arm and the membrane arm. The model further suggests a tight relationship between the assembly factor NUBPL and NDUFA2, which both have been linked to iron-sulfur cluster assembly, as well as between NDUFA12 and its paralog, the assembly factor NDUFAF2. Conclusions The physical distance between subunits of complex I is a major correlate of the rate of protein evolution in the complex I matrix arm and is sufficient to infer parts of the complex’s structure with high accuracy. The resulting evolutionary model predicts the positions of a number of subunits and assembly factors.
Collapse
Affiliation(s)
- Philip R Kensche
- Center for Molecular and Biomolecular Informatics/Nijmegen Center for Molecular Life Sciences, Radboud University Medical Center, PO Box 9101, Nijmegen, HB, 6500, The Netherlands.
| | | | | |
Collapse
|
110
|
Structural basis of histidine kinase autophosphorylation deduced by integrating genomics, molecular dynamics, and mutagenesis. Proc Natl Acad Sci U S A 2012; 109:E1733-42. [PMID: 22670053 DOI: 10.1073/pnas.1201301109] [Citation(s) in RCA: 114] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
Signal transduction proteins such as bacterial sensor histidine kinases, designed to transition between multiple conformations, are often ruled by unstable transient interactions making structural characterization of all functional states difficult. This study explored the inactive and signal-activated conformational states of the two catalytic domains of sensor histidine kinases, HisKA and HATPase. Direct coupling analyses, a global statistical inference approach, was applied to >13,000 such domains from protein databases to identify residue contacts between the two domains. These contacts guided structural assembly of the domains using MAGMA, an advanced molecular dynamics docking method. The active conformation structure generated by MAGMA simultaneously accommodated the sequence derived residue contacts and the ATP-catalytic histidine contact. The validity of this structure was confirmed biologically by mutation of contact positions in the Bacillus subtilis sensor histidine kinase KinA and by restoration of activity in an inactive KinA(HisKA):KinD(HATPase) hybrid protein. These data indicate that signals binding to sensor domains activate sensor histidine kinases by causing localized strain and unwinding at the end of the C-terminal helix of the HisKA domain. This destabilizes the contact positions of the inactive conformation of the two domains, identified by previous crystal structure analyses and by the sequence analysis described here, inducing the formation of the active conformation. This study reveals that structures of unstable transient complexes of interacting proteins and of protein domains are accessible by applying this combination of cross-validating technologies.
Collapse
|
111
|
Ho BK, Perahia D, Buckle AM. Hybrid approaches to molecular simulation. Curr Opin Struct Biol 2012; 22:386-93. [PMID: 22633678 DOI: 10.1016/j.sbi.2012.05.005] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2012] [Revised: 05/07/2012] [Accepted: 05/08/2012] [Indexed: 10/28/2022]
Abstract
Molecular dynamics (MD) simulation is an established method for studying the conformational changes that are important for protein function. Recent advances in hardware and software have allowed MD simulations over the same timescales as experiment, improving the agreement between theory and experiment to a large extent. However, running such simulations are costly, in terms of resources, storage, and trajectory analysis. There is still a place for techniques that involve short MD simulations. In order to overcome the sampling paucity of short time-scales, hybrid methods that include some form of MD simulation can exploit certain features of the system of interest, often combining experimental information in surprising ways. Here, we review some recent hybrid approaches to the simulation of proteins.
Collapse
Affiliation(s)
- Bosco K Ho
- Department of Biochemistry and Molecular Biology, Monash University, Clayton, Victoria 3800, Australia
| | | | | |
Collapse
|
112
|
Noel JK, Whitford PC, Onuchic JN. The shadow map: a general contact definition for capturing the dynamics of biomolecular folding and function. J Phys Chem B 2012; 116:8692-702. [PMID: 22536820 DOI: 10.1021/jp300852d] [Citation(s) in RCA: 174] [Impact Index Per Article: 13.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Structure-based models (SBMs) are simplified models of the biomolecular dynamics that arise from funneled energy landscapes. We recently introduced an all-atom SBM that explicitly represents the atomic geometry of a biomolecule. While this initial study showed the robustness of the all-atom SBM Hamiltonian to changes in many of the energetic parameters, an important aspect, which has not been explored previously, is the definition of native interactions. In this study, we propose a general definition for generating atomically grained contact maps called "Shadow". The Shadow algorithm initially considers all atoms within a cutoff distance and then, controlled by a screening parameter, discards the occluded contacts. We show that this choice of contact map is not only well behaved for protein folding, since it produces consistently cooperative folding behavior in SBMs but also desirable for exploring the dynamics of macromolecular assemblies since, it distributes energy similarly between RNAs and proteins despite their disparate internal packing. All-atom structure-based models employing Shadow contact maps provide a general framework for exploring the geometrical features of biomolecules, especially the connections between folding and function.
Collapse
Affiliation(s)
- Jeffrey K Noel
- Center for Theoretical Biological Physics and Department of Physics, Rice University, Houston, Texas 77005, United States
| | | | | |
Collapse
|
113
|
Aurell E, Ekeberg M. Inverse Ising inference using all the data. PHYSICAL REVIEW LETTERS 2012; 108:090201. [PMID: 22463617 DOI: 10.1103/physrevlett.108.090201] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/29/2011] [Revised: 12/12/2011] [Indexed: 05/31/2023]
Abstract
We show that a method based on logistic regression, using all the data, solves the inverse Ising problem far better than mean-field calculations relying only on sample pairwise correlation functions, while still computationally feasible for hundreds of nodes. The largest improvement in reconstruction occurs for strong interactions. Using two examples, a diluted Sherrington-Kirkpatrick model and a two-dimensional lattice, we also show that interaction topologies can be recovered from few samples with good accuracy and that the use of l(1) regularization is beneficial in this process, pushing inference abilities further into low-temperature regimes.
Collapse
Affiliation(s)
- Erik Aurell
- ACCESS Linnaeus Centre, KTH, Stockholm, Sweden.
| | | |
Collapse
|
114
|
The Many Faces of Structure-Based Potentials: From Protein Folding Landscapes to Structural Characterization of Complex Biomolecules. COMPUTATIONAL MODELING OF BIOLOGICAL SYSTEMS 2012. [DOI: 10.1007/978-1-4614-2146-7_2] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
|
115
|
Marks DS, Colwell LJ, Sheridan R, Hopf TA, Pagnani A, Zecchina R, Sander C. Protein 3D structure computed from evolutionary sequence variation. PLoS One 2011; 6:e28766. [PMID: 22163331 PMCID: PMC3233603 DOI: 10.1371/journal.pone.0028766] [Citation(s) in RCA: 778] [Impact Index Per Article: 55.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2011] [Accepted: 11/14/2011] [Indexed: 11/19/2022] Open
Abstract
The evolutionary trajectory of a protein through sequence space is constrained by its function. Collections of sequence homologs record the outcomes of millions of evolutionary experiments in which the protein evolves according to these constraints. Deciphering the evolutionary record held in these sequences and exploiting it for predictive and engineering purposes presents a formidable challenge. The potential benefit of solving this challenge is amplified by the advent of inexpensive high-throughput genomic sequencing. In this paper we ask whether we can infer evolutionary constraints from a set of sequence homologs of a protein. The challenge is to distinguish true co-evolution couplings from the noisy set of observed correlations. We address this challenge using a maximum entropy model of the protein sequence, constrained by the statistics of the multiple sequence alignment, to infer residue pair couplings. Surprisingly, we find that the strength of these inferred couplings is an excellent predictor of residue-residue proximity in folded structures. Indeed, the top-scoring residue couplings are sufficiently accurate and well-distributed to define the 3D protein fold with remarkable accuracy. We quantify this observation by computing, from sequence alone, all-atom 3D structures of fifteen test proteins from different fold classes, ranging in size from 50 to 260 residues., including a G-protein coupled receptor. These blinded inferences are de novo, i.e., they do not use homology modeling or sequence-similar fragments from known structures. The co-evolution signals provide sufficient information to determine accurate 3D protein structure to 2.7–4.8 Å Cα-RMSD error relative to the observed structure, over at least two-thirds of the protein (method called EVfold, details at http://EVfold.org). This discovery provides insight into essential interactions constraining protein evolution and will facilitate a comprehensive survey of the universe of protein structures, new strategies in protein and drug design, and the identification of functional genetic variants in normal and disease genomes.
Collapse
Affiliation(s)
- Debora S Marks
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America.
| | | | | | | | | | | | | |
Collapse
|
116
|
Direct-coupling analysis of residue coevolution captures native contacts across many protein families. Proc Natl Acad Sci U S A 2011; 108:E1293-301. [PMID: 22106262 DOI: 10.1073/pnas.1111471108] [Citation(s) in RCA: 944] [Impact Index Per Article: 67.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The similarity in the three-dimensional structures of homologous proteins imposes strong constraints on their sequence variability. It has long been suggested that the resulting correlations among amino acid compositions at different sequence positions can be exploited to infer spatial contacts within the tertiary protein structure. Crucial to this inference is the ability to disentangle direct and indirect correlations, as accomplished by the recently introduced direct-coupling analysis (DCA). Here we develop a computationally efficient implementation of DCA, which allows us to evaluate the accuracy of contact prediction by DCA for a large number of protein domains, based purely on sequence information. DCA is shown to yield a large number of correctly predicted contacts, recapitulating the global structure of the contact map for the majority of the protein domains examined. Furthermore, our analysis captures clear signals beyond intradomain residue contacts, arising, e.g., from alternative protein conformations, ligand-mediated residue couplings, and interdomain interactions in protein oligomers. Our findings suggest that contacts predicted by DCA can be used as a reliable guide to facilitate computational predictions of alternative protein conformations, protein complex formation, and even the de novo prediction of protein domain structures, contingent on the existence of a large number of homologous sequences which are being rapidly made available due to advances in genome sequencing.
Collapse
|
117
|
Excited states of ribosome translocation revealed through integrative molecular modeling. Proc Natl Acad Sci U S A 2011; 108:18943-8. [PMID: 22080606 DOI: 10.1073/pnas.1108363108] [Citation(s) in RCA: 81] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
The dynamic nature of biomolecules leads to significant challenges when characterizing the structural properties associated with function. While X-ray crystallography and imaging techniques (such as cryo-electron microscopy) can reveal the structural details of stable molecular complexes, strategies must be developed to characterize configurations that exhibit only marginal stability (such as intermediates) or configurations that do not correspond to minima on the energy landscape (such as transition-state ensembles). Here, we present a methodology (MDfit) that utilizes molecular dynamics simulations to generate configurations of excited states that are consistent with available biophysical and biochemical measurements. To demonstrate the approach, we present a sequence of configurations that are suggested to be associated with transfer RNA (tRNA) movement through the ribosome (translocation). The models were constructed by combining information from X-ray crystallography, cryo-electron microscopy, and biochemical data. These models provide a structural framework for translocation that may be further investigated experimentally and theoretically to determine the precise energetic character of each configuration and the transition dynamics between them.
Collapse
|
118
|
Yuriev E, Agostino M, Ramsland PA. Challenges and advances in computational docking: 2009 in review. J Mol Recognit 2010; 24:149-64. [DOI: 10.1002/jmr.1077] [Citation(s) in RCA: 223] [Impact Index Per Article: 14.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2010] [Revised: 07/20/2010] [Accepted: 07/21/2010] [Indexed: 12/12/2022]
|
119
|
Jamros MA, Oliveira LC, Whitford PC, Onuchic JN, Adams JA, Blumenthal DK, Jennings PA. Proteins at work: a combined small angle X-RAY scattering and theoretical determination of the multiple structures involved on the protein kinase functional landscape. J Biol Chem 2010; 285:36121-8. [PMID: 20801888 DOI: 10.1074/jbc.m110.116947] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
C-terminal Src kinase (Csk) phosphorylates and down-regulates the Src family tyrosine kinases (SFKs). Crystallographic studies of Csk found an unusual arrangement of the SH2 and SH3 regulatory domains about the kinase core, forming a compact structure. However, recent structural studies of mutant Csk in the presence of an inhibitor indicate that the enzyme accesses an expanded structure. To investigate whether wt-Csk may also access open conformations we applied small angle x-ray scattering (SAXS). We find wt-Csk frequently occupies an extended conformation where the regulatory domains are removed from the kinase core. In addition, all-atom structure-based simulations indicate Csk occupies two free energy basins. These basins correspond to ensembles of distinct global conformations of Csk: a compact structure and an extended structure. The transitions between these structures are entropically driven and accessible via thermal fluctuations that break local interactions. We further characterized the ensemble by generating theoretical scattering curves for mixed populations of conformations from both basins and compared the predicted scattering curves to the experimental profile. This population-combination analysis is more consistent with the experimental data than any rigid model. It suggests that Csk adopts a broad ensemble of conformations in solution, populating extended conformations not observed in the crystal structure that may play an important role in the regulation of Csk. The methodology developed here is broadly applicable to biological macromolecules and will provide useful information about what ensembles of conformations are consistent with the experimental data as well as the ubiquitous dynamic reversible assembly processes inherent in biology.
Collapse
Affiliation(s)
- Michael A Jamros
- Department of Chemistry and Biochemistry, University of California, La Jolla, California 92093, USA
| | | | | | | | | | | | | |
Collapse
|
120
|
Noel JK, Whitford PC, Sanbonmatsu KY, Onuchic JN. SMOG@ctbp: simplified deployment of structure-based models in GROMACS. Nucleic Acids Res 2010; 38:W657-61. [PMID: 20525782 PMCID: PMC2896113 DOI: 10.1093/nar/gkq498] [Citation(s) in RCA: 266] [Impact Index Per Article: 17.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Molecular dynamics simulations with coarse-grained and/or simplified Hamiltonians are an effective means of capturing the functionally important long-time and large-length scale motions of proteins and RNAs. Structure-based Hamiltonians, simplified models developed from the energy landscape theory of protein folding, have become a standard tool for investigating biomolecular dynamics. SMOG@ctbp is an effort to simplify the use of structure-based models. The purpose of the web server is two fold. First, the web tool simplifies the process of implementing a well-characterized structure-based model on a state-of-the-art, open source, molecular dynamics package, GROMACS. Second, the tutorial-like format helps speed the learning curve of those unfamiliar with molecular dynamics. A web tool user is able to upload any multi-chain biomolecular system consisting of standard RNA, DNA and amino acids in PDB format and receive as output all files necessary to implement the model in GROMACS. Both Cα and all-atom versions of the model are available. SMOG@ctbp resides at http://smog.ucsd.edu.
Collapse
Affiliation(s)
- Jeffrey K Noel
- Center for Theoretical Biological Physics and Department of Physics, University of California at San Diego, 9500 Gilman Drive, La Jolla, CA 92093, USA
| | | | | | | |
Collapse
|
121
|
Jende I, Varughese KI, Devine KM. Amino acid identity at one position within the α1 helix of both the histidine kinase and the response regulator of the WalRK and PhoPR two-component systems plays a crucial role in the specificity of phosphotransfer. Microbiology (Reading) 2010; 156:1848-1859. [DOI: 10.1099/mic.0.037515-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Two-component systems usually function as cognate pairs, thereby ensuring an appropriate response to the detected signal. The ability to exclusively phosphorylate a partner protein, often in the presence of many competing homologous substrates, demonstrates a high level of specificity that must derive from the interacting surfaces of the two-component system. Here, we identify positions within the histidine kinases and response regulators of the WalRK and PhoPR two-component systems ofBacillus subtilisthat make a major contribution to the specificity of phosphotransfer. Changing the identity of the amino acid at position 11 within theα1 helix of WalK and at position 17 within theα1 helix of PhoP altered discrimination and allowed phosphotransfer to occur with the non-cognate partner. Changing amino acids at additional positions of the WalK kinase increased phosphotransfer, while changes at additional positions in PhoP only had an effect in the presence of the change at position 17. The importance of amino acid identity at these two positions is supported by the fact that the amino acid combinations of Ile and Ser in WalRK, and Leu and Gly in PhoPR, are very highly conserved among orthologues, while modelling indicates that these amino acid pairs are juxtaposed in the WalRK and PhoPR complexes.
Collapse
Affiliation(s)
- Inga Jende
- Smurfit Institute of Genetics, Trinity College Dublin, Dublin 2, Ireland
| | - Kottayil I. Varughese
- University of Arkansas for Medical Sciences, Department of Physiology and Biophysics, 4301 West Markham, Little Rock, AR 72205, USA
| | - Kevin M. Devine
- Smurfit Institute of Genetics, Trinity College Dublin, Dublin 2, Ireland
| |
Collapse
|
122
|
Schug A, Weigt M, Hoch JA, Onuchic JN, Hwa T, Szurmant H. Computational modeling of phosphotransfer complexes in two-component signaling. Methods Enzymol 2010; 471:43-58. [PMID: 20946841 DOI: 10.1016/s0076-6879(10)71003-x] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/10/2022]
Abstract
Two-component signal transduction systems enable cells in bacteria, fungi, and plants to react to extracellular stimuli. A sensor histidine kinase (SK) detects such stimuli with its sensor domains and transduces the input signals to a response regulator (RR) by trans-phosphorylation. This trans-phosphorylation reaction requires the formation of a complex formed by the two interacting proteins. The complex is stabilized by transient interactions. The nature of the transient interactions makes it challenging for experimental techniques to gain structural information. X-ray crystallography requires stable crystals, which are difficult to grow and stabilize. Similarly, the mere size of these systems proves problematic for NMR. Theoretical methods can, however, complement existing data. The statistical direct coupling analysis presented in the previous chapter reveals the interacting residues at the contact interface of the SK/RR pair. This information can be combined with the structures of the individual proteins in molecular dynamical simulation to generate structural models of the complex. The general approach, referred to as MAGMA, was tested on the sporulation phosphorelay phosphotransfer complex, the Spo0B/Spo0F pair, delivering crystal resolution accuracy. The MAGMA method is described here in a step-by-step explanation. The developed parameters are transferrable to other SK/RR systems.
Collapse
Affiliation(s)
- Alexander Schug
- Center for Theoretical Biological Physics, University of California San Diego, La Jolla, California, USA
| | | | | | | | | | | |
Collapse
|
123
|
Interaction fidelity in two-component signaling. Curr Opin Microbiol 2010; 13:190-7. [PMID: 20133181 DOI: 10.1016/j.mib.2010.01.007] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2009] [Revised: 01/08/2010] [Accepted: 01/11/2010] [Indexed: 01/26/2023]
Abstract
Two component signal transduction systems and phosphorelays have been adapted and amplified by bacteria to respond to a multitude of environmental, metabolic and cell cycle signals while maintaining essentially identical structures for the domains responsible for recognition and phosphotransfer between the sensor histidine kinase and the response regulator. Co-crystal structures of these domains have revealed the variable residues at the interaction surface of the two components responsible for interaction specificity in signal transfer. This information has formed the basis for the development and validation of statistical methods to identify interaction residues and surfaces from compiled databases of interacting proteins and holds forth the promise of determining structures of multi-protein complexes and signaling networks.
Collapse
|
124
|
News in brief. Nat Methods 2010. [DOI: 10.1038/nmeth0210-97] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|