1
|
Latham AP, Zhang W, Tempkin JOB, Otsuka S, Ellenberg J, Sali A. Integrative spatiotemporal modeling of biomolecular processes: Application to the assembly of the nuclear pore complex. Proc Natl Acad Sci U S A 2025; 122:e2415674122. [PMID: 40085653 PMCID: PMC11929490 DOI: 10.1073/pnas.2415674122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2024] [Accepted: 02/06/2025] [Indexed: 03/16/2025] Open
Abstract
Dynamic processes involving biomolecules are essential for the function of the cell. Here, we introduce an integrative method for computing models of these processes based on multiple heterogeneous sources of information, including time-resolved experimental data and physical models of dynamic processes. First, for each time point, a set of coarse models of compositional and structural heterogeneity is computed (heterogeneity models). Second, for each heterogeneity model, a set of static integrative structure models is computed (a snapshot model). Finally, these snapshot models are selected and connected into a series of trajectories that optimize the likelihood of both the snapshot models and transitions between them (a trajectory model). The method is demonstrated by application to the assembly process of the human nuclear pore complex in the context of the reforming nuclear envelope during mitotic cell division, based on live-cell correlated electron tomography, bulk fluorescence correlation spectroscopy-calibrated quantitative live imaging, and a structural model of the fully assembled nuclear pore complex. Modeling of the assembly process improves the model precision over static integrative structure modeling alone. The method is applicable to a wide range of time-dependent systems in cell biology and is available to the broader scientific community through an implementation in the open source Integrative Modeling Platform (IMP) software.
Collapse
Affiliation(s)
- Andrew P. Latham
- Department of Bioengineering and Therapeutic Sciences, Quantitative Biosciences Institute, University of California, San Francisco, CA94143
- Department of Pharmaceutical Chemistry, Quantitative Biosciences Institute, University of California, San Francisco, CA94143
| | - Wanlu Zhang
- Cell Biology and Biophysics Unit, European Molecular Biology Laboratory, Heidelberg69117, Germany
| | - Jeremy O. B. Tempkin
- Department of Bioengineering and Therapeutic Sciences, Quantitative Biosciences Institute, University of California, San Francisco, CA94143
- Department of Pharmaceutical Chemistry, Quantitative Biosciences Institute, University of California, San Francisco, CA94143
| | - Shotaro Otsuka
- Cell Biology and Biophysics Unit, European Molecular Biology Laboratory, Heidelberg69117, Germany
| | - Jan Ellenberg
- Cell Biology and Biophysics Unit, European Molecular Biology Laboratory, Heidelberg69117, Germany
| | - Andrej Sali
- Department of Bioengineering and Therapeutic Sciences, Quantitative Biosciences Institute, University of California, San Francisco, CA94143
- Department of Pharmaceutical Chemistry, Quantitative Biosciences Institute, University of California, San Francisco, CA94143
| |
Collapse
|
2
|
Peulen TO, Hemmen K, Greife A, Webb BM, Felekyan S, Sali A, Seidel CAM, Sanabria H, Heinze KG. tttrlib: modular software for integrating fluorescence spectroscopy, imaging, and molecular modeling. Bioinformatics 2025; 41:btaf025. [PMID: 39836627 PMCID: PMC11796090 DOI: 10.1093/bioinformatics/btaf025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2024] [Revised: 01/04/2025] [Accepted: 01/17/2025] [Indexed: 01/23/2025] Open
Abstract
SUMMARY We introduce software for reading, writing and processing fluorescence single-molecule and image spectroscopy data and developing analysis pipelines to unify various spectroscopic analysis tools. Our software can be used for processing multiple experiment types, e.g. for time-resolved single-molecule spectroscopy, laser scanning microscopy, fluorescence correlation spectroscopy and image correlation spectroscopy. The software is file format agnostic and processes multiple time-resolved data formats and outputs. Our software eliminates the need for data conversion and mitigates data archiving issues. AVAILABILITY AND IMPLEMENTATION tttrlib is available via pip (https://pypi.org/project/tttrlib/) and bioconda while the open-source code is available via GitHub (https://github.com/fluorescence-tools/tttrlib). Presented examples and additional documentation demonstrating how to implement in vitro and live-cell image spectroscopy analysis are available at https://docs.peulen.xyz/tttrlib and https://zenodo.org/records/14002224.
Collapse
Affiliation(s)
- Thomas-Otavio Peulen
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, and Quantitative Biosciences Institute, University of California, San Francisco, CA, 94143, United States
- Rudolf Virchow Center for Integrative and Translational Bioimaging, Julius-Maximilians-University Würzburg (JMU), Würzburg, 97080, Germany
| | - Katherina Hemmen
- Rudolf Virchow Center for Integrative and Translational Bioimaging, Julius-Maximilians-University Würzburg (JMU), Würzburg, 97080, Germany
| | - Annemarie Greife
- Chair of Molecular Physical Chemistry, Heinrich-Heine University, Düsseldorf, 40225, Germany
| | - Benjamin M Webb
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, and Quantitative Biosciences Institute, University of California, San Francisco, CA, 94143, United States
| | - Suren Felekyan
- Chair of Molecular Physical Chemistry, Heinrich-Heine University, Düsseldorf, 40225, Germany
| | - Andrej Sali
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, and Quantitative Biosciences Institute, University of California, San Francisco, CA, 94143, United States
| | - Claus A M Seidel
- Chair of Molecular Physical Chemistry, Heinrich-Heine University, Düsseldorf, 40225, Germany
| | - Hugo Sanabria
- Department of Physics & Astronomy, Clemson University, Clemson, SC, 29634, United States
| | - Katrin G Heinze
- Rudolf Virchow Center for Integrative and Translational Bioimaging, Julius-Maximilians-University Würzburg (JMU), Würzburg, 97080, Germany
| |
Collapse
|
3
|
Sanbonmatsu K. Supercomputing in the biological sciences: Toward Zettascale and Yottascale simulations. Curr Opin Struct Biol 2024; 88:102889. [PMID: 39163795 DOI: 10.1016/j.sbi.2024.102889] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2024] [Revised: 07/03/2024] [Accepted: 07/03/2024] [Indexed: 08/22/2024]
Abstract
Molecular simulations of biological systems tend to be significantly more compute-intensive than those in materials science and astrophysics, due to important contributions of long-range electrostatic forces and large numbers of time steps (>1E9) required. Simulations of biomolecular complexes of microseconds to milliseconds are considered state-of-the-art today. However, these time scales are miniscule in comparison to physiological time scales relevant to molecular machine activity, drug action, and elongation cycles for protein synthesis, RNA synthesis, and DNA synthesis (seconds to days). While an exascale supercomputer has simulated an entire virus for nanoseconds, this supercomputer would need to be 10 billion times faster to simulate that virus for 3 hours of physiological time, demonstrating the insatiable need for computing power. With growing interest in computational drug design from the pharmaceutical sector, the biological sciences are positioned to be an industry driver in computing.
Collapse
Affiliation(s)
- Karissa Sanbonmatsu
- Los Alamos National Laboratory, United States; New Mexico Consortium, New Mexico.
| |
Collapse
|
4
|
Latham AP, Tempkin JOB, Otsuka S, Zhang W, Ellenberg J, Sali A. Integrative spatiotemporal modeling of biomolecular processes: application to the assembly of the Nuclear Pore Complex. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.08.06.606842. [PMID: 39149317 PMCID: PMC11326192 DOI: 10.1101/2024.08.06.606842] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 08/17/2024]
Abstract
Dynamic processes involving biomolecules are essential for the function of the cell. Here, we introduce an integrative method for computing models of these processes based on multiple heterogeneous sources of information, including time-resolved experimental data and physical models of dynamic processes. We first compute integrative structure models at fixed time points and then optimally select and connect these snapshots into a series of trajectories that optimize the likelihood of both the snapshots and transitions between them. The method is demonstrated by application to the assembly process of the human Nuclear Pore Complex in the context of the reforming nuclear envelope during mitotic cell division, based on live-cell correlated electron tomography, bulk fluorescence correlation spectroscopy-calibrated quantitative live imaging, and a structural model of the fully-assembled Nuclear Pore Complex. Modeling of the assembly process improves the model precision over static integrative structure modeling alone. The method is applicable to a wide range of time-dependent systems in cell biology, and is available to the broader scientific community through an implementation in the open source Integrative Modeling Platform software.
Collapse
Affiliation(s)
- Andrew P Latham
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, Quantitative Biosciences Institute, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Jeremy O B Tempkin
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, Quantitative Biosciences Institute, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Shotaro Otsuka
- Cell Biology and Biophysics Unit, European Molecular Biology Laboratory, Heidelberg, Germany
| | - Wanlu Zhang
- Cell Biology and Biophysics Unit, European Molecular Biology Laboratory, Heidelberg, Germany
| | - Jan Ellenberg
- Cell Biology and Biophysics Unit, European Molecular Biology Laboratory, Heidelberg, Germany
| | - Andrej Sali
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, Quantitative Biosciences Institute, University of California, San Francisco, San Francisco, CA 94143, USA
| |
Collapse
|
5
|
Wankowicz SA, Ravikumar A, Sharma S, Riley B, Raju A, Hogan DW, Flowers J, van den Bedem H, Keedy DA, Fraser JS. Automated multiconformer model building for X-ray crystallography and cryo-EM. eLife 2024; 12:RP90606. [PMID: 38904665 PMCID: PMC11192534 DOI: 10.7554/elife.90606] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/22/2024] Open
Abstract
In their folded state, biomolecules exchange between multiple conformational states that are crucial for their function. Traditional structural biology methods, such as X-ray crystallography and cryogenic electron microscopy (cryo-EM), produce density maps that are ensemble averages, reflecting molecules in various conformations. Yet, most models derived from these maps explicitly represent only a single conformation, overlooking the complexity of biomolecular structures. To accurately reflect the diversity of biomolecular forms, there is a pressing need to shift toward modeling structural ensembles that mirror the experimental data. However, the challenge of distinguishing signal from noise complicates manual efforts to create these models. In response, we introduce the latest enhancements to qFit, an automated computational strategy designed to incorporate protein conformational heterogeneity into models built into density maps. These algorithmic improvements in qFit are substantiated by superior Rfree and geometry metrics across a wide range of proteins. Importantly, unlike more complex multicopy ensemble models, the multiconformer models produced by qFit can be manually modified in most major model building software (e.g., Coot) and fit can be further improved by refinement using standard pipelines (e.g., Phenix, Refmac, Buster). By reducing the barrier of creating multiconformer models, qFit can foster the development of new hypotheses about the relationship between macromolecular conformational dynamics and function.
Collapse
Affiliation(s)
- Stephanie A Wankowicz
- Department of Bioengineering and Therapeutic Sciences, University of California, San FranciscoSan FranciscoUnited States
| | - Ashraya Ravikumar
- Department of Bioengineering and Therapeutic Sciences, University of California, San FranciscoSan FranciscoUnited States
| | - Shivani Sharma
- Structural Biology Initiative, CUNY Advanced Science Research CenterNew YorkUnited States
- Ph.D. Program in Biology, The Graduate Center, City University of New YorkNew YorkUnited States
| | - Blake Riley
- Structural Biology Initiative, CUNY Advanced Science Research CenterNew YorkUnited States
| | - Akshay Raju
- Structural Biology Initiative, CUNY Advanced Science Research CenterNew YorkUnited States
| | - Daniel W Hogan
- Department of Bioengineering and Therapeutic Sciences, University of California, San FranciscoSan FranciscoUnited States
| | - Jessica Flowers
- Department of Bioengineering and Therapeutic Sciences, University of California, San FranciscoSan FranciscoUnited States
| | - Henry van den Bedem
- Department of Bioengineering and Therapeutic Sciences, University of California, San FranciscoSan FranciscoUnited States
- Atomwise IncSan FranciscoUnited States
| | - Daniel A Keedy
- Structural Biology Initiative, CUNY Advanced Science Research CenterNew YorkUnited States
- Department of Chemistry and Biochemistry, City College of New YorkNew YorkUnited States
- Ph.D. Programs in Biochemistry, Biology and Chemistry, The Graduate Center, City University of New YorkNew YorkUnited States
| | - James S Fraser
- Department of Bioengineering and Therapeutic Sciences, University of California, San FranciscoSan FranciscoUnited States
| |
Collapse
|
6
|
Oriol F, Alberto M, Joachim AP, Patrick G, M BP, Ruben MF, Jaume B, Altair CH, Ferran P, Oriol G, Narcis FF, Baldo O. Structure-based learning to predict and model protein-DNA interactions and transcription-factor co-operativity in cis-regulatory elements. NAR Genom Bioinform 2024; 6:lqae068. [PMID: 38867914 PMCID: PMC11167492 DOI: 10.1093/nargab/lqae068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2023] [Revised: 04/18/2024] [Accepted: 05/23/2024] [Indexed: 06/14/2024] Open
Abstract
Transcription factor (TF) binding is a key component of genomic regulation. There are numerous high-throughput experimental methods to characterize TF-DNA binding specificities. Their application, however, is both laborious and expensive, which makes profiling all TFs challenging. For instance, the binding preferences of ∼25% human TFs remain unknown; they neither have been determined experimentally nor inferred computationally. We introduce a structure-based learning approach to predict the binding preferences of TFs and the automated modelling of TF regulatory complexes. We show the advantage of using our approach over the classical nearest-neighbor prediction in the limits of remote homology. Starting from a TF sequence or structure, we predict binding preferences in the form of motifs that are then used to scan a DNA sequence for occurrences. The best matches are either profiled with a binding score or collected for their subsequent modeling into a higher-order regulatory complex with DNA. Co-operativity is modelled by: (i) the co-localization of TFs and (ii) the structural modeling of protein-protein interactions between TFs and with co-factors. We have applied our approach to automatically model the interferon-β enhanceosome and the pioneering complexes of OCT4, SOX2 (or SOX11) and KLF4 with a nucleosome, which are compared with the experimentally known structures.
Collapse
Affiliation(s)
- Fornes Oriol
- Centre for Molecular Medicine and Therapeutics. BC Children's Hospital Research Institute. Department of Medical Genetics. University of British Columbia, Vancouver, BC V5Z 4H4, Canada
| | - Meseguer Alberto
- Structural Bioinformatics Lab (GRIB-IMIM). Department of Medicine and Life Sciences, Universitat Pompeu Fabra, Barcelona 08005 Catalonia, Spain
| | | | - Gohl Patrick
- Structural Bioinformatics Lab (GRIB-IMIM). Department of Medicine and Life Sciences, Universitat Pompeu Fabra, Barcelona 08005 Catalonia, Spain
| | - Bota Patricia M
- Structural Bioinformatics Lab (GRIB-IMIM). Department of Medicine and Life Sciences, Universitat Pompeu Fabra, Barcelona 08005 Catalonia, Spain
| | - Molina-Fernández Ruben
- Structural Bioinformatics Lab (GRIB-IMIM). Department of Medicine and Life Sciences, Universitat Pompeu Fabra, Barcelona 08005 Catalonia, Spain
| | - Bonet Jaume
- Structural Bioinformatics Lab (GRIB-IMIM). Department of Medicine and Life Sciences, Universitat Pompeu Fabra, Barcelona 08005 Catalonia, Spain
- Laboratory of Protein Design & Immunoengineering. School of Engineering. Ecole Polytechnique Federale de Lausanne. Lausanne 1015, Vaud, Switzerland
| | - Chinchilla-Hernandez Altair
- Live-Cell Structural Biology. Department of Medicine and Life Sciences, Universitat Pompeu Fabra, Barcelona 08005 Catalonia, Spain
| | - Pegenaute Ferran
- Live-Cell Structural Biology. Department of Medicine and Life Sciences, Universitat Pompeu Fabra, Barcelona 08005 Catalonia, Spain
| | - Gallego Oriol
- Live-Cell Structural Biology. Department of Medicine and Life Sciences, Universitat Pompeu Fabra, Barcelona 08005 Catalonia, Spain
| | - Fernandez-Fuentes Narcis
- Institute of Biological, Environmental and Rural Science. Aberystwyth University, SY23 3DA Aberystwyth, UK
| | - Oliva Baldo
- Structural Bioinformatics Lab (GRIB-IMIM). Department of Medicine and Life Sciences, Universitat Pompeu Fabra, Barcelona 08005 Catalonia, Spain
| |
Collapse
|
7
|
Wankowicz SA, Ravikumar A, Sharma S, Riley BT, Raju A, Flowers J, Hogan D, van den Bedem H, Keedy DA, Fraser JS. Uncovering Protein Ensembles: Automated Multiconformer Model Building for X-ray Crystallography and Cryo-EM. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.06.28.546963. [PMID: 37425870 PMCID: PMC10327213 DOI: 10.1101/2023.06.28.546963] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/11/2023]
Abstract
In their folded state, biomolecules exchange between multiple conformational states that are crucial for their function. Traditional structural biology methods, such as X-ray crystallography and cryogenic electron microscopy (cryo-EM), produce density maps that are ensemble averages, reflecting molecules in various conformations. Yet, most models derived from these maps explicitly represent only a single conformation, overlooking the complexity of biomolecular structures. To accurately reflect the diversity of biomolecular forms, there is a pressing need to shift towards modeling structural ensembles that mirror the experimental data. However, the challenge of distinguishing signal from noise complicates manual efforts to create these models. In response, we introduce the latest enhancements to qFit, an automated computational strategy designed to incorporate protein conformational heterogeneity into models built into density maps. These algorithmic improvements in qFit are substantiated by superior R f r e e and geometry metrics across a wide range of proteins. Importantly, unlike more complex multicopy ensemble models, the multiconformer models produced by qFit can be manually modified in most major model building software (e.g. Coot) and fit can be further improved by refinement using standard pipelines (e.g. Phenix, Refmac, Buster). By reducing the barrier of creating multiconformer models, qFit can foster the development of new hypotheses about the relationship between macromolecular conformational dynamics and function.
Collapse
Affiliation(s)
- Stephanie A. Wankowicz
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Ashraya Ravikumar
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Shivani Sharma
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, NY 10031
- Ph.D. Program in Biology, The Graduate Center – City University of New York, New York, NY 10016
| | - Blake T. Riley
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, NY 10031
| | - Akshay Raju
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, NY 10031
| | - Jessica Flowers
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Daniel Hogan
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Henry van den Bedem
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
- Atomwise, Inc., San Francisco, CA, United States
| | - Daniel A. Keedy
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, NY 10031
- Department of Chemistry and Biochemistry, City College of New York, New York, NY 10031
- Ph.D. Programs in Biochemistry, Biology, and Chemistry, The Graduate Center – City University of New York, New York, NY 10016
| | - James S. Fraser
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| |
Collapse
|
8
|
Caparotta M, Perez A. Advancing Molecular Dynamics: Toward Standardization, Integration, and Data Accessibility in Structural Biology. J Phys Chem B 2024; 128:2219-2227. [PMID: 38418288 DOI: 10.1021/acs.jpcb.3c04823] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/01/2024]
Abstract
Molecular dynamics (MD) simulations have become a valuable tool in structural biology, offering insights into complex biological systems that are difficult to obtain through experimental techniques alone. The lack of available data sets and structures in most published computational work has limited other researchers' use of these models. In recent years, the emergence of online sharing platforms and MD database initiatives favor the deposition of ensembles and structures to accompany publications, favoring reuse of the data sets. However, the lack of uniform metadata collection, formats, and what data are deposited limits the impact and its use by different communities that are not necessarily experts in MD. This Perspective highlights the need for standardization and better resource sharing for processing and interpreting MD simulation results, akin to efforts in other areas of structural biology. As the field moves forward, we will see an increase in popularity and benefits of MD-based integrative approaches combining experimental data and simulations through probabilistic reasoning, but these too are limited by uniformity in experimental data availability and choices on how the data are modeled that are not trivial to decipher from papers. Other fields have addressed similar challenges comprehensively by establishing task forces with different degrees of success. The large scope and number of communities to represent the breadth of types of MD simulations complicates a parallel approach that would fit all. Thus, each group typically decides what data and which format to upload on servers like Zenodo. Uploading data with FAIR (findable, accessible, interoperable, reusable) principles in mind including optimal metadata collection will make the data more accessible and actionable by the community. Such a wealth of simulation data will foster method development and infrastructure advancements, thus propelling the field forward.
Collapse
Affiliation(s)
- Marcelo Caparotta
- Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville, Florida 32611, United States
| | - Alberto Perez
- Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville, Florida 32611, United States
| |
Collapse
|
9
|
Mondal A, Lenz S, MacCallum JL, Perez A. Hybrid computational methods combining experimental information with molecular dynamics. Curr Opin Struct Biol 2023; 81:102609. [PMID: 37224642 DOI: 10.1016/j.sbi.2023.102609] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2023] [Revised: 04/12/2023] [Accepted: 04/23/2023] [Indexed: 05/26/2023]
Abstract
A goal of structural biology is to understand how macromolecules carry out their biological roles by identifying their metastable states, mechanisms of action, pathways leading to conformational changes, and the thermodynamic and kinetic relationships between those states. Integrative modeling brings structural insights into systems where traditional structure determination approaches cannot help. We focus on the synergies and challenges of integrative modeling combining experimental data with molecular dynamics simulations.
Collapse
Affiliation(s)
- Arup Mondal
- Quantum Theory Project, Department of Chemistry, University of Florida, Leigh, UK. https://twitter.com/@amondal_chem
| | - Stefan Lenz
- Department of Chemistry, University of Calgary, 2500 University Drive, Canada
| | - Justin L MacCallum
- Department of Chemistry, University of Calgary, 2500 University Drive, Canada. https://twitter.com/@jlmaccal
| | - Alberto Perez
- Quantum Theory Project, Department of Chemistry, University of Florida, Leigh, UK.
| |
Collapse
|
10
|
Du S, Wankowicz SA, Yabukarski F, Doukov T, Herschlag D, Fraser JS. Refinement of multiconformer ensemble models from multi-temperature X-ray diffraction data. Methods Enzymol 2023; 688:223-254. [PMID: 37748828 PMCID: PMC10637719 DOI: 10.1016/bs.mie.2023.06.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/27/2023]
Abstract
Conformational ensembles underlie all protein functions. Thus, acquiring atomic-level ensemble models that accurately represent conformational heterogeneity is vital to deepen our understanding of how proteins work. Modeling ensemble information from X-ray diffraction data has been challenging, as traditional cryo-crystallography restricts conformational variability while minimizing radiation damage. Recent advances have enabled the collection of high quality diffraction data at ambient temperatures, revealing innate conformational heterogeneity and temperature-driven changes. Here, we used diffraction datasets for Proteinase K collected at temperatures ranging from 313 to 363 K to provide a tutorial for the refinement of multiconformer ensemble models. Integrating automated sampling and refinement tools with manual adjustments, we obtained multiconformer models that describe alternative backbone and sidechain conformations, their relative occupancies, and interconnections between conformers. Our models revealed extensive and diverse conformational changes across temperature, including increased bound peptide ligand occupancies, different Ca2+ binding site configurations and altered rotameric distributions. These insights emphasize the value and need for multiconformer model refinement to extract ensemble information from diffraction data and to understand ensemble-function relationships.
Collapse
Affiliation(s)
- Siyuan Du
- Department of Biochemistry, Stanford University, Stanford, CA, United States; Department of Chemistry, Stanford University, Stanford, CA, United States
| | - Stephanie A Wankowicz
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, United States
| | - Filip Yabukarski
- Department of Biochemistry, Stanford University, Stanford, CA, United States; Bristol-Myers Squibb, San Diego, CA, United States
| | - Tzanko Doukov
- Stanford Synchrotron Radiation Lightsource, SLAC National Accelerator Laboratory, Menlo Park, CA, United States
| | - Daniel Herschlag
- Department of Biochemistry, Stanford University, Stanford, CA, United States; Department of Chemical Engineering, Stanford University, Stanford, CA, United States; Stanford ChEM-H, Stanford University, Stanford, CA, United States
| | - James S Fraser
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, United States; Quantitative Biosciences Institute, University of California, San Francisco, CA, United States.
| |
Collapse
|
11
|
Du S, Wankowicz SA, Yabukarski F, Doukov T, Herschlag D, Fraser JS. Refinement of Multiconformer Ensemble Models from Multi-temperature X-ray Diffraction Data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.05.539620. [PMID: 37205593 PMCID: PMC10187334 DOI: 10.1101/2023.05.05.539620] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]
Abstract
Conformational ensembles underlie all protein functions. Thus, acquiring atomic-level ensemble models that accurately represent conformational heterogeneity is vital to deepen our understanding of how proteins work. Modeling ensemble information from X-ray diffraction data has been challenging, as traditional cryo-crystallography restricts conformational variability while minimizing radiation damage. Recent advances have enabled the collection of high quality diffraction data at ambient temperatures, revealing innate conformational heterogeneity and temperature-driven changes. Here, we used diffraction datasets for Proteinase K collected at temperatures ranging from 313 to 363K to provide a tutorial for the refinement of multiconformer ensemble models. Integrating automated sampling and refinement tools with manual adjustments, we obtained multiconformer models that describe alternative backbone and sidechain conformations, their relative occupancies, and interconnections between conformers. Our models revealed extensive and diverse conformational changes across temperature, including increased bound peptide ligand occupancies, different Ca2+ binding site configurations and altered rotameric distributions. These insights emphasize the value and need for multiconformer model refinement to extract ensemble information from diffraction data and to understand ensemble-function relationships.
Collapse
Affiliation(s)
- Siyuan Du
- Department of Biochemistry, Stanford University, Stanford, California 94305, United States
- Department of Chemistry, Stanford University, Stanford, California 94305, United States
| | - Stephanie A. Wankowicz
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, California 94143, United States
| | - Filip Yabukarski
- Department of Biochemistry, Stanford University, Stanford, California 94305, United States
- Bristol-Myers Squibb, San Diego, California 92121, United States
| | - Tzanko Doukov
- Stanford Synchrotron Radiation Lightsource, SLAC National Accelerator Laboratory, Menlo Park, California 94025, United States
| | - Daniel Herschlag
- Department of Biochemistry, Stanford University, Stanford, California 94305, United States
- Department of Chemical Engineering, Stanford University, Stanford, California 94305, United States
- Stanford ChEM-H, Stanford University, Stanford, California 94305, United States
| | - James S. Fraser
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, California 94143, United States
- Quantitative Biosciences Institute, University of California, San Francisco, California 94143, United States
| |
Collapse
|
12
|
Chang L, Mondal A, MacCallum JL, Perez A. CryoFold 2.0: Cryo-EM Structure Determination with MELD. J Phys Chem A 2023; 127:3906-3913. [PMID: 37084537 DOI: 10.1021/acs.jpca.3c01731] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/23/2023]
Abstract
Cryo-electron microscopy data are becoming more prevalent and accessible at higher resolution levels, leading to the development of new computational tools to determine the atomic structure of macromolecules. However, while existing tools adapted from X-ray crystallography are suitable for the highest-resolution maps, new tools are needed for lower-resolution levels and to account for map heterogeneity. In this article, we introduce CryoFold 2.0, an integrative physics-based approach that combines Bayesian inference and the ability to handle multiple data sources with the molecular dynamics flexible fitting (MDFF) approach to determine the structures of macromolecules by using cryo-EM data. CryoFold 2.0 is incorporated into the MELD (modeling employing limited data) plugin, resulting in a pipeline that is more computationally efficient and accurate than running MELD or MDFF alone. The approach requires fewer computational resources and shorter simulation times than the original CryoFold, and it minimizes manual intervention. We demonstrate the effectiveness of the approach on eight different systems, highlighting its various benefits.
Collapse
Affiliation(s)
- Liwei Chang
- Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville, Florida 32611, United States
| | - Arup Mondal
- Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville, Florida 32611, United States
| | - Justin L MacCallum
- Department of Chemistry, University of Calgary, Calgary, AB T2N 1N4, Canada
| | - Alberto Perez
- Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville, Florida 32611, United States
| |
Collapse
|