1
|
Tessmer MH, Stoll S. Protein Modeling with DEER Spectroscopy. Annu Rev Biophys 2025; 54:35-57. [PMID: 39689263 DOI: 10.1146/annurev-biophys-030524-013431] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2024]
Abstract
Double electron-electron resonance (DEER) combined with site-directed spin labeling can provide distance distributions between selected protein residues to investigate protein structure and conformational heterogeneity. The utilization of the full quantitative information contained in DEER data requires effective protein and spin label modeling methods. Here, we review the application of DEER data to protein modeling. First, we discuss the significance of spin label modeling for accurate extraction of protein structural information and review the most popular label modeling methods. Next, we review several important aspects of protein modeling with DEER, including site selection, how DEER restraints are applied, common artifacts, and the unique potential of DEER data for modeling structural ensembles and conformational landscapes. Finally, we discuss common applications of protein modeling with DEER data and provide an outlook.
Collapse
Affiliation(s)
- Maxx H Tessmer
- Department of Chemistry, University of Washington, Seattle, Washington, USA;
| | - Stefan Stoll
- Department of Chemistry, University of Washington, Seattle, Washington, USA;
| |
Collapse
|
2
|
Cavender CE, Case DA, Chen JCH, Chong LT, Keedy DA, Lindorff-Larsen K, Mobley DL, Ollila OHS, Oostenbrink C, Robustelli P, Voelz VA, Wall ME, Wych DC, Gilson MK. Structure-Based Experimental Datasets for Benchmarking Protein Simulation Force Fields [Article v0.1]. ARXIV 2025:arXiv:2303.11056v2. [PMID: 40196146 PMCID: PMC11975311] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 04/09/2025]
Abstract
This review article provides an overview of structurally oriented experimental datasets that can be used to benchmark protein force fields, focusing on data generated by nuclear magnetic resonance (NMR) spectroscopy and room temperature (RT) protein crystallography. We discuss what the observables are, what they tell us about structure and dynamics, what makes them useful for assessing force field accuracy, and how they can be connected to molecular dynamics simulations carried out using the force field one wishes to benchmark. We also touch on statistical issues that arise when comparing simulations with experiment. We hope this article will be particularly useful to computational researchers and trainees who develop, benchmark, or use protein force fields for molecular simulations.
Collapse
Affiliation(s)
- Chapin E. Cavender
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
| | - David A. Case
- Department of Chemistry & Chemical Biology, Rutgers University, Piscataway, NJ, USA
| | - Julian C.-H. Chen
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, USA; Department of Chemistry and Biochemistry, The University of Toledo, Toledo, OH, USA
| | - Lillian T. Chong
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA, USA
| | - Daniel A. Keedy
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, NY, USA; Department of Chemistry and Biochemistry, City College of New York, New York, NY, USA; PhD Programs in Biochemistry, Biology, and Chemistry, CUNY Graduate Center, New York, NY, USA
| | - Kresten Lindorff-Larsen
- Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen N, Denmark
| | - David L. Mobley
- Department of Pharmaceutical Sciences, University of California Irvine, Irvine, CA, USA
| | - O. H. Samuli Ollila
- Institute of Biotechnology, University of Helsinki, Helsinki, Finland; VTT Technical Research Centre of Finland, Espoo, Finland
| | - Chris Oostenbrink
- Institute for Molecular Modeling and Simulation, University of Natural Resources and Life Sciences, Vienna, Austria
| | - Paul Robustelli
- Department of Chemistry, Dartmouth College, Hanover, NH, USA
| | - Vincent A. Voelz
- Department of Chemistry, Temple University, Philadelphia, PA, USA
| | - Michael E. Wall
- Computer, Computational, and Statistical Sciences Division, Los Alamos National Laboratory, Los Alamos, NM, USA; The Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - David C. Wych
- Computer, Computational, and Statistical Sciences Division, Los Alamos National Laboratory, Los Alamos, NM, USA; The Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - Michael K. Gilson
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
| |
Collapse
|
3
|
Esteban-Hofer L, Emmanouilidis L, Yulikov M, Allain FHT, Jeschke G. Ensemble structure of the N-terminal domain (1-267) of FUS in a biomolecular condensate. Biophys J 2024; 123:538-554. [PMID: 38279531 PMCID: PMC10938082 DOI: 10.1016/j.bpj.2024.01.023] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2023] [Revised: 11/06/2023] [Accepted: 01/22/2024] [Indexed: 01/28/2024] Open
Abstract
Solutions of some proteins phase separate into a condensed state of high protein concentration and a dispersed state of low concentration. Such behavior is observed in living cells for a number of RNA-binding proteins that feature intrinsically disordered domains. It is relevant for cell function via the formation of membraneless organelles and transcriptional condensates. On a basic level, the process can be studied in vitro on protein domains that are necessary and sufficient for liquid-liquid phase separation (LLPS). We have performed distance distribution measurements by electron paramagnetic resonance for 13 sections in an N-terminal domain (NTD) construct of the protein fused in sarcoma (FUS), consisting of the QGSY-rich domain and the RGG1 domain, in the denatured, dispersed, and condensed state. Using 10 distance distribution restraints for ensemble modeling and three such restraints for model validation, we have found that FUS NTD behaves as a random-coil polymer under good-solvent conditions in both the dispersed and condensed state. Conformation distribution in the biomolecular condensate is virtually indistinguishable from the one in an unrestrained ensemble, with the latter one being based on only residue-specific Ramachandran angle distributions. Over its whole length, FUS NTD is slightly more compact in the condensed than in the dispersed state, which is in line with the theory for random coils in good solvent proposed by de Gennes, Daoud, and Jannink. The estimated concentration in the condensate exceeds the overlap concentration resulting from this theory. The QGSY-rich domain is slightly more extended, slightly more hydrated, and has slightly higher propensity for LLPS than the RGG1 domain. Our results support previous suggestions that LLPS of FUS is driven by multiple transient nonspecific hydrogen bonding and π-sp2 interactions.
Collapse
Affiliation(s)
- Laura Esteban-Hofer
- ETH Zurich, Department of Chemistry and Applied Biosciences, Zurich, Switzerland
| | | | - Maxim Yulikov
- ETH Zurich, Department of Chemistry and Applied Biosciences, Zurich, Switzerland
| | | | - Gunnar Jeschke
- ETH Zurich, Department of Chemistry and Applied Biosciences, Zurich, Switzerland.
| |
Collapse
|
4
|
Zhang O, Haghighatlari M, Li J, Liu ZH, Namini A, Teixeira JMC, Forman-Kay JD, Head-Gordon T. Learning to evolve structural ensembles of unfolded and disordered proteins using experimental solution data. J Chem Phys 2023; 158:174113. [PMID: 37144719 PMCID: PMC10163956 DOI: 10.1063/5.0141474] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2023] [Accepted: 04/11/2023] [Indexed: 05/06/2023] Open
Abstract
The structural characterization of proteins with a disorder requires a computational approach backed by experiments to model their diverse and dynamic structural ensembles. The selection of conformational ensembles consistent with solution experiments of disordered proteins highly depends on the initial pool of conformers, with currently available tools limited by conformational sampling. We have developed a Generative Recurrent Neural Network (GRNN) that uses supervised learning to bias the probability distributions of torsions to take advantage of experimental data types such as nuclear magnetic resonance J-couplings, nuclear Overhauser effects, and paramagnetic resonance enhancements. We show that updating the generative model parameters according to the reward feedback on the basis of the agreement between experimental data and probabilistic selection of torsions from learned distributions provides an alternative to existing approaches that simply reweight conformers of a static structural pool for disordered proteins. Instead, the biased GRNN, DynamICE, learns to physically change the conformations of the underlying pool of the disordered protein to those that better agree with experiments.
Collapse
Affiliation(s)
- Oufan Zhang
- Kenneth S. Pitzer Theory Center and Department of Chemistry, University of California, Berkeley, California 94720, USA
| | - Mojtaba Haghighatlari
- Kenneth S. Pitzer Theory Center and Department of Chemistry, University of California, Berkeley, California 94720, USA
| | - Jie Li
- Kenneth S. Pitzer Theory Center and Department of Chemistry, University of California, Berkeley, California 94720, USA
| | | | - Ashley Namini
- Molecular Medicine Program, Hospital for Sick Children, Toronto, Ontario M5S 1A8, Canada
| | | | | | | |
Collapse
|
5
|
Czaplewski C, Gong Z, Lubecka EA, Xue K, Tang C, Liwo A. Recent Developments in Data-Assisted Modeling of Flexible Proteins. Front Mol Biosci 2022; 8:765562. [PMID: 35004845 PMCID: PMC8740120 DOI: 10.3389/fmolb.2021.765562] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2021] [Accepted: 12/06/2021] [Indexed: 11/13/2022] Open
Abstract
Many proteins can fold into well-defined conformations. However, intrinsically-disordered proteins (IDPs) do not possess a defined structure. Moreover, folded multi-domain proteins often digress into alternative conformations. Collectively, the conformational dynamics enables these proteins to fulfill specific functions. Thus, most experimental observables are averaged over the conformations that constitute an ensemble. In this article, we review the recent developments in the concept and methods for the determination of the dynamic structures of flexible peptides and proteins. In particular, we describe ways to extract information from nuclear magnetic resonance small-angle X-ray scattering (SAXS), and chemical cross-linking coupled with mass spectroscopy (XL-MS) measurements. All these techniques can be used to obtain ensemble-averaged restraints or to re-weight the simulated conformational ensembles.
Collapse
Affiliation(s)
| | - Zhou Gong
- Innovation Academy for Precision Measurement Science and Technology, Chinese Academy of Sciences, Wuhan, China
| | - Emilia A Lubecka
- Faculty of Electronics, Telecommunications and Informatics, Gdańsk University of Technology, Gdańsk, Poland
| | - Kai Xue
- PKU-Tsinghua Center for Life Sciences, Beijing National Laboratory for Molecular Sciences, College of Chemistry and Molecular Engineering, Peking University, Beijing, China
| | - Chun Tang
- PKU-Tsinghua Center for Life Sciences, Beijing National Laboratory for Molecular Sciences, College of Chemistry and Molecular Engineering, Peking University, Beijing, China
| | - Adam Liwo
- Faculty of Chemistry, University of Gdańsk, Gdańsk, Poland
| |
Collapse
|
6
|
Ritsch I, Esteban-Hofer L, Lehmann E, Emmanouilidis L, Yulikov M, Allain FHT, Jeschke G. Characterization of Weak Protein Domain Structure by Spin-Label Distance Distributions. Front Mol Biosci 2021; 8:636599. [PMID: 33912586 PMCID: PMC8072059 DOI: 10.3389/fmolb.2021.636599] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2020] [Accepted: 02/19/2021] [Indexed: 01/04/2023] Open
Abstract
Function of intrinsically disordered proteins may depend on deviation of their conformational ensemble from that of a random coil. Such deviation may be hard to characterize and quantify, if it is weak. We explored the potential of distance distributions between spin labels, as they can be measured by electron paramagnetic resonance techniques, for aiding such characterization. On the example of the intrinsically disordered N-terminal domain 1-267 of fused in sarcoma (FUS) we examined what such distance distributions can and cannot reveal on the random-coil reference state. On the example of the glycine-rich domain 188-320 of heterogeneous nuclear ribonucleoprotein A1 (hnRNP A1) we studied whether deviation from a random-coil ensemble can be robustly detected with 19 distance distribution restraints. We discuss limitations imposed by ill-posedness of the conversion of primary data to distance distributions and propose overlap of distance distributions as a fit criterion that can tackle this problem. For testing consistency and size sufficiency of the restraint set, we propose jack-knife resampling. At current desktop computers, our approach is expected to be viable for domains up to 150 residues and for between 10 and 50 distance distribution restraints.
Collapse
Affiliation(s)
- Irina Ritsch
- Department of Chemistry and Applied Biosciences, ETH Zürich, Zürich, Switzerland
| | - Laura Esteban-Hofer
- Department of Chemistry and Applied Biosciences, ETH Zürich, Zürich, Switzerland
| | | | | | - Maxim Yulikov
- Department of Chemistry and Applied Biosciences, ETH Zürich, Zürich, Switzerland
| | | | - Gunnar Jeschke
- Department of Chemistry and Applied Biosciences, ETH Zürich, Zürich, Switzerland
| |
Collapse
|
7
|
Wang W. Recent advances in atomic molecular dynamics simulation of intrinsically disordered proteins. Phys Chem Chem Phys 2021; 23:777-784. [PMID: 33355572 DOI: 10.1039/d0cp05818a] [Citation(s) in RCA: 47] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Intrinsically disordered proteins (IDPs) play important roles in cellular functions. The inherent structural heterogeneity of IDPs makes the high-resolution experimental characterization of IDPs extremely difficult. Molecular dynamics (MD) simulation could provide the atomic-level description of the structural and dynamic properties of IDPs. This perspective reviews the recent progress in atomic MD simulation studies of IDPs, including the development of force fields and sampling methods, as well as applications in IDP-involved protein-protein interactions. The employment of large-scale simulations and advanced sampling techniques allows more accurate estimation of the thermodynamics and kinetics of IDP-mediated protein interactions, and the holistic landscape of the binding process of IDPs is emerging.
Collapse
Affiliation(s)
- Wenning Wang
- Department of Chemistry, Multiscale Research Institute of Complex Systems and Institute of Biomedical Sciences, Fudan University, Shanghai 200438, China.
| |
Collapse
|
8
|
Tesei G, Martins JM, Kunze MBA, Wang Y, Crehuet R, Lindorff-Larsen K. DEER-PREdict: Software for efficient calculation of spin-labeling EPR and NMR data from conformational ensembles. PLoS Comput Biol 2021; 17:e1008551. [PMID: 33481784 PMCID: PMC7857587 DOI: 10.1371/journal.pcbi.1008551] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2020] [Revised: 02/03/2021] [Accepted: 11/19/2020] [Indexed: 11/25/2022] Open
Abstract
Owing to their plasticity, intrinsically disordered and multidomain proteins require descriptions based on multiple conformations, thus calling for techniques and analysis tools that are capable of dealing with conformational ensembles rather than a single protein structure. Here, we introduce DEER-PREdict, a software program to predict Double Electron-Electron Resonance distance distributions as well as Paramagnetic Relaxation Enhancement rates from ensembles of protein conformations. DEER-PREdict uses an established rotamer library approach to describe the paramagnetic probes which are bound covalently to the protein.DEER-PREdict has been designed to operate efficiently on large conformational ensembles, such as those generated by molecular dynamics simulation, to facilitate the validation or refinement of molecular models as well as the interpretation of experimental data. The performance and accuracy of the software is demonstrated with experimentally characterized protein systems: HIV-1 protease, T4 Lysozyme and Acyl-CoA-binding protein. DEER-PREdict is open source (GPLv3) and available at github.com/KULL-Centre/DEERpredict and as a Python PyPI package pypi.org/project/DEERPREdict.
Collapse
Affiliation(s)
- Giulio Tesei
- Structural Biology and NMR Laboratory & the Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - João M. Martins
- Structural Biology and NMR Laboratory & the Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Micha B. A. Kunze
- Structural Biology and NMR Laboratory & the Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Yong Wang
- Structural Biology and NMR Laboratory & the Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Ramon Crehuet
- Structural Biology and NMR Laboratory & the Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
- CSIC-Institute for Advanced Chemistry of Catalonia (IQAC), Barcelona, Spain
| | - Kresten Lindorff-Larsen
- Structural Biology and NMR Laboratory & the Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
9
|
Kakeshpour T, Ramanujam V, Barnes CA, Shen Y, Ying J, Bax A. A lowly populated, transient β-sheet structure in monomeric Aβ 1-42 identified by multinuclear NMR of chemical denaturation. Biophys Chem 2020; 270:106531. [PMID: 33453683 DOI: 10.1016/j.bpc.2020.106531] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2020] [Revised: 12/08/2020] [Accepted: 12/13/2020] [Indexed: 02/07/2023]
Abstract
Chemical denaturation is a well-established approach for probing the equilibrium between folded and unfolded states of proteins. We demonstrate applicability of this method to the detection of a small population of a transiently folded structural element in a system that is often considered to be intrinsically fully disordered. The 1HN, 15N, 13Cα, and 13C' chemical shifts of Aβ1-40 and Aβ1-42 peptides and their M35-oxidized variants were monitored as a function of urea concentration and compared to analogous urea titrations of synthetic pentapeptides of homologous sequence. Fitting of the chemical shift titrations yields a 10 ± 1% population for a structured element at the C-terminus of Aβ1-42 that folds with a cooperativity of m = 0.06 kcal/mol·M. The fit also yields the chemical shifts of the folded state and, using a database search, for Aβ1-42 these shifts identified an antiparallel intramolecular β-sheet for residues I32-A42, linked by a type I' β-turn at G37 and G38. The structure is destabilized by oxidation of M35. Paramagnetic relaxation rates and two previously reported weak, medium-range NOE interactions are consistent with this transient β-sheet. Introduction of the requisite A42C mutation and tagging with MTSL resulted in a small stabilization of this β-sheet. Chemical shift analysis suggests a C-terminal β-sheet may be present in Aβ1-40 too, but the turn type at G37 is not type I'. The approach to derive Transient Structure from chemical Denaturation by NMR (TSD-NMR), demonstrated here for Aβ peptides, provides a sensitive tool for identifying the presence of lowly populated, transiently ordered elements in proteins that are considered to be intrinsically disordered, and permits extraction of structural data for such elements.
Collapse
Affiliation(s)
- Tayeb Kakeshpour
- Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, Bethesda, MD 20892, USA
| | - Venkat Ramanujam
- Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, Bethesda, MD 20892, USA
| | - C Ashley Barnes
- Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, Bethesda, MD 20892, USA
| | - Yang Shen
- Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, Bethesda, MD 20892, USA
| | - Jinfa Ying
- Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, Bethesda, MD 20892, USA
| | - Ad Bax
- Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, Bethesda, MD 20892, USA.
| |
Collapse
|
10
|
Gomes GNW, Krzeminski M, Namini A, Martin EW, Mittag T, Head-Gordon T, Forman-Kay JD, Gradinaru CC. Conformational Ensembles of an Intrinsically Disordered Protein Consistent with NMR, SAXS, and Single-Molecule FRET. J Am Chem Soc 2020; 142:15697-15710. [PMID: 32840111 PMCID: PMC9987321 DOI: 10.1021/jacs.0c02088] [Citation(s) in RCA: 105] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Intrinsically disordered proteins (IDPs) have fluctuating heterogeneous conformations, which makes their structural characterization challenging. Although challenging, characterization of the conformational ensembles of IDPs is of great interest, since their conformational ensembles are the link between their sequences and functions. An accurate description of IDP conformational ensembles depends crucially on the amount and quality of the experimental data, how it is integrated, and if it supports a consistent structural picture. We used integrative modeling and validation to apply conformational restraints and assess agreement with the most common structural techniques for IDPs: Nuclear Magnetic Resonance (NMR) spectroscopy, Small-angle X-ray Scattering (SAXS), and single-molecule Förster Resonance Energy Transfer (smFRET). Agreement with such a diverse set of experimental data suggests that details of the generated ensembles can now be examined with a high degree of confidence. Using the disordered N-terminal region of the Sic1 protein as a test case, we examined relationships between average global polymeric descriptions and higher-moments of their distributions. To resolve apparent discrepancies between smFRET and SAXS inferences, we integrated SAXS data with NMR data and reserved the smFRET data for independent validation. Consistency with smFRET, which was not guaranteed a priori, indicates that, globally, the perturbative effects of NMR or smFRET labels on the Sic1 ensemble are minimal. Analysis of the ensembles revealed distinguishing features of Sic1, such as overall compactness and large end-to-end distance fluctuations, which are consistent with biophysical models of Sic1's ultrasensitive binding to its partner Cdc4. Our results underscore the importance of integrative modeling and validation in generating and drawing conclusions from IDP conformational ensembles.
Collapse
Affiliation(s)
- Gregory-Neal W Gomes
- Department of Physics, University of Toronto, Toronto, Ontario M5G 1X8, Canada.,Department of Chemical and Physical Sciences, University of Toronto Mississauga, Mississauga, Ontario L5L 1C6, Canada
| | - Mickaël Krzeminski
- Molecular Medicine Program, Hospital for Sick Children, Toronto, Ontario M5S 1A8, Canada.,Department of Biochemistry, University of Toronto, Toronto, Ontario M5G 1X8, Canada
| | - Ashley Namini
- Department of Physics, University of Toronto, Toronto, Ontario M5G 1X8, Canada.,Department of Chemical and Physical Sciences, University of Toronto Mississauga, Mississauga, Ontario L5L 1C6, Canada
| | - Erik W Martin
- Department of Structural Biology, St. Jude Children's Research Hospital, Memphis, Tennessee 38105, United States
| | - Tanja Mittag
- Department of Structural Biology, St. Jude Children's Research Hospital, Memphis, Tennessee 38105, United States
| | - Teresa Head-Gordon
- Departments of Chemistry, Bioengineering, Chemical and Biomolecular Engineering University of California, Berkeley, California 94720, United States
| | - Julie D Forman-Kay
- Molecular Medicine Program, Hospital for Sick Children, Toronto, Ontario M5S 1A8, Canada.,Department of Biochemistry, University of Toronto, Toronto, Ontario M5G 1X8, Canada
| | - Claudiu C Gradinaru
- Department of Physics, University of Toronto, Toronto, Ontario M5G 1X8, Canada.,Department of Chemical and Physical Sciences, University of Toronto Mississauga, Mississauga, Ontario L5L 1C6, Canada
| |
Collapse
|
11
|
Lincoff J, Haghighatlari M, Krzeminski M, Teixeira JMC, Gomes GNW, Gradinaru CC, Forman-Kay JD, Head-Gordon T. Extended Experimental Inferential Structure Determination Method in Determining the Structural Ensembles of Disordered Protein States. Commun Chem 2020; 3:74. [PMID: 32775701 PMCID: PMC7409953 DOI: 10.1038/s42004-020-0323-0] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2020] [Accepted: 04/22/2020] [Indexed: 01/12/2023] Open
Abstract
Proteins with intrinsic or unfolded state disorder comprise a new frontier in structural biology, requiring the characterization of diverse and dynamic structural ensembles. We introduce a comprehensive Bayesian framework, the Extended Experimental Inferential Structure Determination (X-EISD) method, that calculates the maximum log-likelihood of a disordered protein ensemble. X-EISD accounts for the uncertainties of a range of experimental data and back-calculation models from structures, including NMR chemical shifts, J-couplings, Nuclear Overhauser Effects (NOEs), paramagnetic relaxation enhancements (PREs), residual dipolar couplings (RDCs), hydrodynamic radii (R h ), single molecule fluorescence Förster resonance energy transfer (smFRET) and small angle X-ray scattering (SAXS). We apply X-EISD to the joint optimization against experimental data for the unfolded drkN SH3 domain and find that combining a local data type, such as chemical shifts or J-couplings, paired with long-ranged restraints such as NOEs, PREs or smFRET, yields structural ensembles in good agreement with all other data types if combined with representative IDP conformers.
Collapse
Affiliation(s)
- James Lincoff
- Department of Chemical and Biomolecular Engineering, University of California, Berkeley, CA 94720 USA
- Pitzer Center for Theoretical Chemistry, University of California, Berkeley, CA 94720 USA
- Present Address: Cardiovascular Research Institute, University of California, San Francisco, CA 94158 USA
| | - Mojtaba Haghighatlari
- Pitzer Center for Theoretical Chemistry, University of California, Berkeley, CA 94720 USA
- Department of Chemistry, University of California, Berkeley, CA 94720 USA
| | - Mickael Krzeminski
- Molecular Structure and Function Program, Hospital for Sick Children, Toronto, Ontario M5G 0A4 Canada
| | - João M. C. Teixeira
- Molecular Structure and Function Program, Hospital for Sick Children, Toronto, Ontario M5G 0A4 Canada
- Department of Biochemistry, University of Toronto, Toronto, Ontario M5S 1A8 Canada
| | - Gregory-Neal W. Gomes
- Department of Chemical and Physical Sciences, University of Toronto Mississauga, Mississauga, Ontario L5L 1C6 Canada
| | - Claudiu C. Gradinaru
- Department of Chemical and Physical Sciences, University of Toronto Mississauga, Mississauga, Ontario L5L 1C6 Canada
| | - Julie D. Forman-Kay
- Molecular Structure and Function Program, Hospital for Sick Children, Toronto, Ontario M5G 0A4 Canada
- Department of Biochemistry, University of Toronto, Toronto, Ontario M5S 1A8 Canada
| | - Teresa Head-Gordon
- Department of Chemical and Biomolecular Engineering, University of California, Berkeley, CA 94720 USA
- Pitzer Center for Theoretical Chemistry, University of California, Berkeley, CA 94720 USA
- Department of Chemistry, University of California, Berkeley, CA 94720 USA
- Department of Bioengineering, University of California, Berkeley, CA 94720 USA
| |
Collapse
|
12
|
Recent Advances in Computational Protocols Addressing Intrinsically Disordered Proteins. Biomolecules 2019; 9:biom9040146. [PMID: 30979035 PMCID: PMC6523529 DOI: 10.3390/biom9040146] [Citation(s) in RCA: 42] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2019] [Revised: 04/09/2019] [Accepted: 04/10/2019] [Indexed: 01/09/2023] Open
Abstract
Intrinsically disordered proteins (IDP) are abundant in the human genome and have recently emerged as major therapeutic targets for various diseases. Unlike traditional proteins that adopt a definitive structure, IDPs in free solution are disordered and exist as an ensemble of conformations. This enables the IDPs to signal through multiple signaling pathways and serve as scaffolds for multi-protein complexes. The challenge in studying IDPs experimentally stems from their disordered nature. Nuclear magnetic resonance (NMR), circular dichroism, small angle X-ray scattering, and single molecule Förster resonance energy transfer (FRET) can give the local structural information and overall dimension of IDPs, but seldom provide a unified picture of the whole protein. To understand the conformational dynamics of IDPs and how their structural ensembles recognize multiple binding partners and small molecule inhibitors, knowledge-based and physics-based sampling techniques are utilized in-silico, guided by experimental structural data. However, efficient sampling of the IDP conformational ensemble requires traversing the numerous degrees of freedom in the IDP energy landscape, as well as force-fields that accurately model the protein and solvent interactions. In this review, we have provided an overview of the current state of computational methods for studying IDP structure and dynamics and discussed the major challenges faced in this field.
Collapse
|
13
|
Lincoff J, Sasmal S, Head-Gordon T. The combined force field-sampling problem in simulations of disordered amyloid-β peptides. J Chem Phys 2019; 150:104108. [PMID: 30876367 DOI: 10.1063/1.5078615] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Molecular dynamics simulations of intrinsically disordered proteins (IDPs) can provide high resolution structural ensembles if the force field is accurate enough and if the simulation sufficiently samples the conformational space of the IDP with the correct weighting of sub-populations. Here, we investigate the combined force field-sampling problem by testing a standard force field as well as newer fixed charge force fields, the latter specifically motivated for better description of unfolded states and IDPs, and comparing them with a standard temperature replica exchange (TREx) protocol and a non-equilibrium Temperature Cool Walking (TCW) sampling algorithm. The force field and sampling combinations are used to characterize the structural ensembles of the amyloid-beta peptides Aβ42 and Aβ43, which both should be random coils as shown recently by experimental nuclear magnetic resonance (NMR) and 2D Förster resonance energy transfer (FRET) experiments. The results illustrate the key importance of the sampling algorithm: while the standard force field using TREx is in poor agreement with the NMR J-coupling and nuclear Overhauser effect and 2D FRET data, when using the TCW method, the standard and optimized protein-water force field combinations are in very good agreement with the same experimental data since the TCW sampling method produces qualitatively different ensembles than TREx. We also discuss the relative merit of the 2D FRET data when validating structural ensembles using the different force fields and sampling protocols investigated in this work for small IDPs such as the Aβ42 and Aβ43 peptides.
Collapse
Affiliation(s)
- James Lincoff
- Department of Chemical and Biomolecular Engineering, University of California, Berkeley, California 94720, USA
| | - Sukanya Sasmal
- Department of Chemical and Biomolecular Engineering, University of California, Berkeley, California 94720, USA
| | - Teresa Head-Gordon
- Department of Chemical and Biomolecular Engineering, University of California, Berkeley, California 94720, USA
| |
Collapse
|
14
|
Chan-Yao-Chong M, Durand D, Ha-Duong T. Molecular Dynamics Simulations Combined with Nuclear Magnetic Resonance and/or Small-Angle X-ray Scattering Data for Characterizing Intrinsically Disordered Protein Conformational Ensembles. J Chem Inf Model 2019; 59:1743-1758. [PMID: 30840442 DOI: 10.1021/acs.jcim.8b00928] [Citation(s) in RCA: 39] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
The concept of intrinsically disordered proteins (IDPs) has emerged relatively slowly, but over the past 20 years, it has become an intense research area in structural biology. Indeed, because of their considerable flexibility and structural heterogeneity, the determination of IDP conformational ensemble is particularly challenging and often requires a combination of experimental measurements and computational approaches. With the improved accuracy of all-atom force fields and the increasing computing performances, molecular dynamics (MD) simulations have become more and more reliable to generate realistic conformational ensembles. And the combination of MD simulations with experimental approaches, such as nuclear magnetic resonance (NMR) and/or small-angle X-ray scattering (SAXS) allows one to converge toward a more accurate and exhaustive description of IDP structures. In this Review, we discuss the state of the art of MD simulations of IDP conformational ensembles, with a special focus on studies that back-calculated and directly compared theoretical and experimental NMR or SAXS observables, such as chemical shifts (CS), 3J-couplings (3Jc), residual dipolar couplings (RDC), or SAXS intensities. We organize the review in three parts. In the first section, we discuss the studies which used NMR and/or SAXS data to test and validate the development of force fields or enhanced sampling techniques. In the second part, we explore different methods for the refinement of MD-derived structural ensembles, such as NMR or SAXS data-restrained MD simulations or ensemble reweighting to better fit experiments. Finally, we survey some recent studies combining MD simulations with NMR and/or SAXS measurements to investigate the relationship between IDP conformational ensemble and biological activity, as well as their implication in human diseases. From this review, we noticed that quite a few studies compared MD-generated conformational ensembles with both NMR and SAXS measurements to validate IDP structures at both local and global levels. Yet, beside the IDP propensity to form local secondary structures, their dynamic extension or compactness also appears important for their activity. Thus, we believe that a close synergy between MD simulations, NMR, and SAXS experiments would be greatly appropriate to address the challenges of characterizing the disordered structures of proteins and their complexes, relative to their biological functions.
Collapse
Affiliation(s)
- Maud Chan-Yao-Chong
- BioCIS, Université Paris-Sud, CNRS , Université Paris-Saclay , 92290 Châtenay-Malabry , France.,Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Univ. Paris-Sud , Université Paris-Saclay , 91198 , Gif-sur-Yvette cedex, France
| | - Dominique Durand
- Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Univ. Paris-Sud , Université Paris-Saclay , 91198 , Gif-sur-Yvette cedex, France
| | - Tâp Ha-Duong
- BioCIS, Université Paris-Sud, CNRS , Université Paris-Saclay , 92290 Châtenay-Malabry , France
| |
Collapse
|
15
|
Kooshapur H, Schwieters CD, Tjandra N. Conformational Ensemble of Disordered Proteins Probed by Solvent Paramagnetic Relaxation Enhancement (sPRE). Angew Chem Int Ed Engl 2018. [DOI: 10.1002/ange.201807365] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Affiliation(s)
- Hamed Kooshapur
- Laboratory of Structural Biophysics, National Heart, Lung and Blood Institute; National Institutes of Health; Bethesda MD 20892 USA
| | - Charles D. Schwieters
- Office of Intramural Research; Center for Information Technology; National Institutes of Health; Bethesda MD 20892 USA
| | - Nico Tjandra
- Laboratory of Structural Biophysics, National Heart, Lung and Blood Institute; National Institutes of Health; Bethesda MD 20892 USA
| |
Collapse
|
16
|
Kooshapur H, Schwieters CD, Tjandra N. Conformational Ensemble of Disordered Proteins Probed by Solvent Paramagnetic Relaxation Enhancement (sPRE). Angew Chem Int Ed Engl 2018; 57:13519-13522. [PMID: 30125451 DOI: 10.1002/anie.201807365] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2018] [Revised: 07/27/2018] [Indexed: 01/05/2023]
Abstract
Characterization of the conformational ensemble of disordered proteins is highly important for understanding protein folding and aggregation mechanisms, but remains a computational and experimental challenge owing to the dynamic nature of these proteins. New observables that can provide unique insights into transient residual structures in disordered proteins are needed. Here using denatured ubiquitin as a model system, NMR solvent paramagnetic relaxation enhancement (sPRE) measurements provide an accurate and highly sensitive probe for detecting low populations of residual structure in a disordered protein. Furthermore, a new ensemble calculation approach based on sPRE restraints in conjunction with residual dipolar couplings (RDCs) and small-angle X-ray scattering (SAXS) is used to define the conformational ensemble of disordered proteins at atomic resolution. The approach presented should be applicable to a wide range of dynamic macromolecules.
Collapse
Affiliation(s)
- Hamed Kooshapur
- Laboratory of Structural Biophysics, National Heart, Lung and Blood Institute, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Charles D Schwieters
- Office of Intramural Research, Center for Information Technology, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Nico Tjandra
- Laboratory of Structural Biophysics, National Heart, Lung and Blood Institute, National Institutes of Health, Bethesda, MD, 20892, USA
| |
Collapse
|