1
|
Zheng T, Cai S. Recent technical advances in cellular cryo-electron tomography. Int J Biochem Cell Biol 2024; 175:106648. [PMID: 39181502 DOI: 10.1016/j.biocel.2024.106648] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2024] [Revised: 08/20/2024] [Accepted: 08/20/2024] [Indexed: 08/27/2024]
Abstract
Understanding the in situ structure, organization, and interactions of macromolecules is essential for elucidating their functions and mechanisms of action. Cellular cryo-electron tomography (cryo-ET) is a cutting-edge technique that reveals in situ molecular-resolution architectures of macromolecules in their lifelike states. It also provides insights into the three-dimensional distribution of macromolecules and their spatial relationships with various subcellular structures. Thus, cellular cryo-ET bridges the gap between structural biology and cell biology. With rapid advancements, this technique achieved substantial improvements in throughput, automation, and resolution. This review presents the fundamental principles and methodologies of cellular cryo-ET, highlighting recent developments in sample preparation, data collection, and image processing. We also discuss emerging trends and potential future directions. As cellular cryo-ET continues to develop, it is set to play an increasingly vital role in structural cell biology.
Collapse
Affiliation(s)
- Tianyu Zheng
- Department of Chemical Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen 518055, China; Institute for Biological Electron Microscopy, Southern University of Science and Technology, Shenzhen 518055, China
| | - Shujun Cai
- Department of Chemical Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen 518055, China; Institute for Biological Electron Microscopy, Southern University of Science and Technology, Shenzhen 518055, China.
| |
Collapse
|
2
|
Włodarski T, Streit JO, Mitropoulou A, Cabrita LD, Vendruscolo M, Christodoulou J. Bayesian reweighting of biomolecular structural ensembles using heterogeneous cryo-EM maps with the cryoENsemble method. Sci Rep 2024; 14:18149. [PMID: 39103467 PMCID: PMC11300795 DOI: 10.1038/s41598-024-68468-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2024] [Accepted: 07/24/2024] [Indexed: 08/07/2024] Open
Abstract
Cryogenic electron microscopy (cryo-EM) has emerged as a powerful method for the determination of structures of complex biological molecules. The accurate characterisation of the dynamics of such systems, however, remains a challenge. To address this problem, we introduce cryoENsemble, a method that applies Bayesian reweighting to conformational ensembles derived from molecular dynamics simulations to improve their agreement with cryo-EM data, thus enabling the extraction of dynamics information. We illustrate the use of cryoENsemble to determine the dynamics of the ribosome-bound state of the co-translational chaperone trigger factor (TF). We also show that cryoENsemble can assist with the interpretation of low-resolution, noisy or unaccounted regions of cryo-EM maps. Notably, we are able to link an unaccounted part of the cryo-EM map to the presence of another protein (methionine aminopeptidase, or MetAP), rather than to the dynamics of TF, and model its TF-bound state. Based on these results, we anticipate that cryoENsemble will find use for challenging heterogeneous cryo-EM maps for biomolecular systems encompassing dynamic components.
Collapse
Affiliation(s)
- Tomasz Włodarski
- Institute of Structural and Molecular Biology, University College London, Gower Street, London, WC1E 6BT, UK.
- Institute of Biochemistry and Biophysics, Polish Academy of Sciences, Pawinskiego 5a, 02-106, Warsaw, Poland.
| | - Julian O Streit
- Institute of Structural and Molecular Biology, University College London, Gower Street, London, WC1E 6BT, UK
| | - Alkistis Mitropoulou
- Institute of Structural and Molecular Biology, University College London, Gower Street, London, WC1E 6BT, UK
| | - Lisa D Cabrita
- Institute of Structural and Molecular Biology, University College London, Gower Street, London, WC1E 6BT, UK
| | - Michele Vendruscolo
- Centre for Misfolding Diseases, Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, UK
| | - John Christodoulou
- Institute of Structural and Molecular Biology, University College London, Gower Street, London, WC1E 6BT, UK
- Birkbeck College, University of London, Malet Street, London, WC1E 7HX, UK
| |
Collapse
|
3
|
Steffen FD, Cunha RA, Sigel RKO, Börner R. FRET-guided modeling of nucleic acids. Nucleic Acids Res 2024; 52:e59. [PMID: 38869063 PMCID: PMC11260485 DOI: 10.1093/nar/gkae496] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Accepted: 05/29/2024] [Indexed: 06/14/2024] Open
Abstract
The functional diversity of RNAs is encoded in their innate conformational heterogeneity. The combination of single-molecule spectroscopy and computational modeling offers new attractive opportunities to map structural transitions within nucleic acid ensembles. Here, we describe a framework to harmonize single-molecule Förster resonance energy transfer (FRET) measurements with molecular dynamics simulations and de novo structure prediction. Using either all-atom or implicit fluorophore modeling, we recreate FRET experiments in silico, visualize the underlying structural dynamics and quantify the reaction coordinates. Using multiple accessible-contact volumes as a post hoc scoring method for fragment assembly in Rosetta, we demonstrate that FRET can be used to filter a de novo RNA structure prediction ensemble by refuting models that are not compatible with in vitro FRET measurement. We benchmark our FRET-assisted modeling approach on double-labeled DNA strands and validate it against an intrinsically dynamic manganese(II)-binding riboswitch. We show that a FRET coordinate describing the assembly of a four-way junction allows our pipeline to recapitulate the global fold of the riboswitch displayed by the crystal structure. We conclude that computational fluorescence spectroscopy facilitates the interpretability of dynamic structural ensembles and improves the mechanistic understanding of nucleic acid interactions.
Collapse
Affiliation(s)
- Fabio D Steffen
- Department of Chemistry, University of Zurich, Winterthurerstrasse 190, 8057 Zurich, Switzerland
| | - Richard A Cunha
- Department of Chemistry, University of Zurich, Winterthurerstrasse 190, 8057 Zurich, Switzerland
| | - Roland K O Sigel
- Department of Chemistry, University of Zurich, Winterthurerstrasse 190, 8057 Zurich, Switzerland
| | - Richard Börner
- Department of Chemistry, University of Zurich, Winterthurerstrasse 190, 8057 Zurich, Switzerland
| |
Collapse
|
4
|
Hoff SE, Thomasen FE, Lindorff-Larsen K, Bonomi M. Accurate model and ensemble refinement using cryo-electron microscopy maps and Bayesian inference. PLoS Comput Biol 2024; 20:e1012180. [PMID: 39008528 PMCID: PMC11271924 DOI: 10.1371/journal.pcbi.1012180] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2023] [Revised: 07/25/2024] [Accepted: 05/20/2024] [Indexed: 07/17/2024] Open
Abstract
Converting cryo-electron microscopy (cryo-EM) data into high-quality structural models is a challenging problem of outstanding importance. Current refinement methods often generate unbalanced models in which physico-chemical quality is sacrificed for excellent fit to the data. Furthermore, these techniques struggle to represent the conformational heterogeneity averaged out in low-resolution regions of density maps. Here we introduce EMMIVox, a Bayesian inference approach to determine single-structure models as well as structural ensembles from cryo-EM maps. EMMIVox automatically balances experimental information with accurate physico-chemical models of the system and the surrounding environment, including waters, lipids, and ions. Explicit treatment of data correlation and noise as well as inference of accurate B-factors enable determination of structural models and ensembles with both excellent fit to the data and high stereochemical quality, thus outperforming state-of-the-art refinement techniques. EMMIVox represents a flexible approach to determine high-quality structural models that will contribute to advancing our understanding of the molecular mechanisms underlying biological functions.
Collapse
Affiliation(s)
- Samuel E. Hoff
- Institut Pasteur, Université Paris Cité, CNRS UMR 3528, Computational Structural Biology Unit, Paris, France
| | - F. Emil Thomasen
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Kresten Lindorff-Larsen
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Massimiliano Bonomi
- Institut Pasteur, Université Paris Cité, CNRS UMR 3528, Computational Structural Biology Unit, Paris, France
| |
Collapse
|
5
|
Heo L, Feig M. One bead per residue can describe all-atom protein structures. Structure 2024; 32:97-111.e6. [PMID: 38000367 PMCID: PMC10872525 DOI: 10.1016/j.str.2023.10.013] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2023] [Revised: 09/16/2023] [Accepted: 10/30/2023] [Indexed: 11/26/2023]
Abstract
Atomistic resolution is the standard for high-resolution biomolecular structures, but experimental structural data are often at lower resolution. Coarse-grained models are also used extensively in computational studies to reach biologically relevant spatial and temporal scales. This study explores the use of advanced machine learning networks for reconstructing atomistic models from reduced representations. The main finding is that a single bead per amino acid residue allows construction of accurate and stereochemically realistic all-atom structures with minimal loss of information. This suggests that lower resolution representations of proteins may be sufficient for many applications when combined with a machine learning framework that encodes knowledge from known structures. Practical applications include the rapid addition of atomistic detail to low-resolution structures from experiment or computational coarse-grained models. The application of rapid, deterministic all-atom reconstruction within multi-scale frameworks is further demonstrated with a rapid protocol for the generation of accurate models from cryo-EM densities close to experimental structures.
Collapse
Affiliation(s)
- Lim Heo
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824, USA
| | - Michael Feig
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824, USA.
| |
Collapse
|
6
|
Park J, Joung I, Joo K, Lee J. Application of conformational space annealing to the protein structure modeling using cryo-EM maps. J Comput Chem 2023; 44:2332-2346. [PMID: 37585026 DOI: 10.1002/jcc.27200] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2022] [Revised: 04/26/2023] [Accepted: 07/16/2023] [Indexed: 08/17/2023]
Abstract
Conformational space annealing (CSA), a global optimization method, has been applied to various protein structure modeling tasks. In this paper, we applied CSA to the cryo-EM structure modeling task by combining the python subroutine of CSA (PyCSA) and the fast relax (FastRelax) protocol of PyRosetta. Refinement of initial structures generated from two methods, rigid fitting of predicted structures to the Cryo-EM map and de novo protein modeling by tracing the Cryo-EM map, was performed by CSA. In the refinement of the rigid-fitted structures, the final models showed that CSA can generate reliable atomic structures of proteins, even when large movements of protein domains were required. In the de novo modeling case, although the overall structural qualities of the final models were rather dependent on the initial models, the final models generated by CSA showed improved MolProbity scores and cross-correlation coefficients to the maps. These results suggest that CSA can accomplish flexible fitting and refinement together by sampling diverse conformations effectively and thus can be utilized for cryo-EM structure modeling.
Collapse
Affiliation(s)
| | | | - Keehyoung Joo
- Center for Advanced Computations, Korea Institute for Advanced Study, Seoul, South Korea
| | - Jooyoung Lee
- School of Computational Sciences, Korea Institute for Advanced Study, Seoul, South Korea
| |
Collapse
|
7
|
Krieger JM, Sorzano COS, Carazo JM. Scipion-EM-ProDy: A Graphical Interface for the ProDy Python Package within the Scipion Workflow Engine Enabling Integration of Databases, Simulations and Cryo-Electron Microscopy Image Processing. Int J Mol Sci 2023; 24:14245. [PMID: 37762547 PMCID: PMC10532346 DOI: 10.3390/ijms241814245] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2023] [Revised: 09/10/2023] [Accepted: 09/15/2023] [Indexed: 09/29/2023] Open
Abstract
Macromolecular assemblies, such as protein complexes, undergo continuous structural dynamics, including global reconfigurations critical for their function. Two fast analytical methods are widely used to study these global dynamics, namely elastic network model normal mode analysis and principal component analysis of ensembles of structures. These approaches have found wide use in various computational studies, driving the development of complex pipelines in several software packages. One common theme has been conformational sampling through hybrid simulations incorporating all-atom molecular dynamics and global modes of motion. However, wide functionality is only available for experienced programmers with limited capabilities for other users. We have, therefore, integrated one popular and extensively developed software for such analyses, the ProDy Python application programming interface, into the Scipion workflow engine. This enables a wider range of users to access a complete range of macromolecular dynamics pipelines beyond the core functionalities available in its command-line applications and the normal mode wizard in VMD. The new protocols and pipelines can be further expanded and integrated into larger workflows, together with other software packages for cryo-electron microscopy image analysis and molecular simulations. We present the resulting plugin, Scipion-EM-ProDy, in detail, highlighting the rich functionality made available by its development.
Collapse
Affiliation(s)
- James M. Krieger
- Biocomputing Unit, National Centre for Biotechnology (CNB CSIC), Campus Universidad Autónoma de Madrid, Darwin 3, Cantoblanco, 28049 Madrid, Spain
| | | | - Jose Maria Carazo
- Biocomputing Unit, National Centre for Biotechnology (CNB CSIC), Campus Universidad Autónoma de Madrid, Darwin 3, Cantoblanco, 28049 Madrid, Spain
| |
Collapse
|
8
|
Schweke H, Xu Q, Tauriello G, Pantolini L, Schwede T, Cazals F, Lhéritier A, Fernandez-Recio J, Rodríguez-Lumbreras LÁ, Schueler-Furman O, Varga JK, Jiménez-García B, Réau MF, Bonvin A, Savojardo C, Martelli PL, Casadio R, Tubiana J, Wolfson H, Oliva R, Barradas-Bautista D, Ricciardelli T, Cavallo L, Venclovas Č, Olechnovič K, Guerois R, Andreani J, Martin J, Wang X, Kihara D, Marchand A, Correia B, Zou X, Dey S, Dunbrack R, Levy E, Wodak S. Discriminating physiological from non-physiological interfaces in structures of protein complexes: A community-wide study. Proteomics 2023; 23:e2200323. [PMID: 37365936 PMCID: PMC10937251 DOI: 10.1002/pmic.202200323] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2023] [Revised: 05/11/2023] [Accepted: 05/11/2023] [Indexed: 06/28/2023]
Abstract
Reliably scoring and ranking candidate models of protein complexes and assigning their oligomeric state from the structure of the crystal lattice represent outstanding challenges. A community-wide effort was launched to tackle these challenges. The latest resources on protein complexes and interfaces were exploited to derive a benchmark dataset consisting of 1677 homodimer protein crystal structures, including a balanced mix of physiological and non-physiological complexes. The non-physiological complexes in the benchmark were selected to bury a similar or larger interface area than their physiological counterparts, making it more difficult for scoring functions to differentiate between them. Next, 252 functions for scoring protein-protein interfaces previously developed by 13 groups were collected and evaluated for their ability to discriminate between physiological and non-physiological complexes. A simple consensus score generated using the best performing score of each of the 13 groups, and a cross-validated Random Forest (RF) classifier were created. Both approaches showed excellent performance, with an area under the Receiver Operating Characteristic (ROC) curve of 0.93 and 0.94, respectively, outperforming individual scores developed by different groups. Additionally, AlphaFold2 engines recalled the physiological dimers with significantly higher accuracy than the non-physiological set, lending support to the reliability of our benchmark dataset annotations. Optimizing the combined power of interface scoring functions and evaluating it on challenging benchmark datasets appears to be a promising strategy.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | - Julia K. Varga
- Hebrew University of Jerusalem Institute for Medical Research Israel-Canada
| | | | | | | | | | | | | | - Jérôme Tubiana
- Tel Aviv University Blavatnik School of Computer Science
| | - Haim Wolfson
- Tel Aviv University Blavatnik School of Computer Science
| | | | | | | | | | | | | | | | | | | | | | | | | | | | - Xiaoqin Zou
- Dalton Cardiovascular Research Center, Institute for Data Science and Informatics, University of Missouri
| | | | | | | | | |
Collapse
|
9
|
Liu ZH, Zhang O, Teixeira JMC, Li J, Head-Gordon T, Forman-Kay JD. SPyCi-PDB: A modular command-line interface for back-calculating experimental datatypes of protein structures. JOURNAL OF OPEN SOURCE SOFTWARE 2023; 8:4861. [PMID: 38726305 PMCID: PMC11081106 DOI: 10.21105/joss.04861] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 05/12/2024]
Affiliation(s)
- Zi Hao Liu
- Molecular Medicine Program, Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
- Department of Biochemistry, University of Toronto, Toronto, Ontario, M5S 1A8, Canada
| | - Oufan Zhang
- Pitzer Center for Theoretical Chemistry, University of California, Berkeley, California 94720-1460, USA
- Department of Chemistry, University of California, Berkeley, California 94720-1460, USA
| | - João M C Teixeira
- Department of Biomedical Sciences, University of Padova, Padova 35131, Italy
| | - Jie Li
- Pitzer Center for Theoretical Chemistry, University of California, Berkeley, California 94720-1460, USA
- Department of Chemistry, University of California, Berkeley, California 94720-1460, USA
| | - Teresa Head-Gordon
- Pitzer Center for Theoretical Chemistry, University of California, Berkeley, California 94720-1460, USA
- Department of Chemistry, University of California, Berkeley, California 94720-1460, USA
- Department of Chemical and Biomolecular Engineering, University of California, Berkeley, California 94720-1462, USA
- Department of Bioengineering, University of California, Berkeley, California 94720-1762, USA
| | - Julie D Forman-Kay
- Molecular Medicine Program, Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
- Department of Biochemistry, University of Toronto, Toronto, Ontario, M5S 1A8, Canada
| |
Collapse
|
10
|
Flechsig H, Ando T. Protein dynamics by the combination of high-speed AFM and computational modeling. Curr Opin Struct Biol 2023; 80:102591. [PMID: 37075535 DOI: 10.1016/j.sbi.2023.102591] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2022] [Revised: 03/06/2023] [Accepted: 03/08/2023] [Indexed: 04/21/2023]
Abstract
High-speed atomic force microscopy (HS-AFM) allows direct observation of biological molecules in dynamic action. However, HS-AFM has no atomic resolution. This article reviews recent progress of computational methods to infer high-resolution information, including the construction of 3D atomistic structures, from experimentally acquired resolution-limited HS-AFM images.
Collapse
Affiliation(s)
- Holger Flechsig
- Nano Life Science Institute (WPI-NanoLSI), Kanazawa University, Kakuma-machi, Kanazawa, 920-1192, Japan
| | - Toshio Ando
- Nano Life Science Institute (WPI-NanoLSI), Kanazawa University, Kakuma-machi, Kanazawa, 920-1192, Japan.
| |
Collapse
|
11
|
Beton JG, Cragnolini T, Kaleel M, Mulvaney T, Sweeney A, Topf M. Integrating model simulation tools and
cryo‐electron
microscopy. WIRES COMPUTATIONAL MOLECULAR SCIENCE 2022. [DOI: 10.1002/wcms.1642] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Affiliation(s)
- Joseph George Beton
- Centre for Structural Systems Biology (CSSB) Leibniz‐Institut für Virologie (LIV) Hamburg Germany
| | - Tristan Cragnolini
- Institute of Structural and Molecular Biology, Birkbeck and University College London London UK
| | - Manaz Kaleel
- Centre for Structural Systems Biology (CSSB) Leibniz‐Institut für Virologie (LIV) Hamburg Germany
| | - Thomas Mulvaney
- Centre for Structural Systems Biology (CSSB) Leibniz‐Institut für Virologie (LIV) Hamburg Germany
| | - Aaron Sweeney
- Centre for Structural Systems Biology (CSSB) Leibniz‐Institut für Virologie (LIV) Hamburg Germany
| | - Maya Topf
- Centre for Structural Systems Biology (CSSB) Leibniz‐Institut für Virologie (LIV) Hamburg Germany
| |
Collapse
|
12
|
Abstract
Adeno-associated virus (AAV) has a single-stranded DNA genome encapsidated in a small icosahedrally symmetric protein shell with 60 subunits. AAV is the leading delivery vector in emerging gene therapy treatments for inherited disorders, so its structure and molecular interactions with human hosts are of intense interest. A wide array of electron microscopic approaches have been used to visualize the virus and its complexes, depending on the scientific question, technology available, and amenability of the sample. Approaches range from subvolume tomographic analyses of complexes with large and flexible host proteins to detailed analysis of atomic interactions within the virus and with small ligands at resolutions as high as 1.6 Å. Analyses have led to the reclassification of glycan receptors as attachment factors, to structures with a new-found receptor protein, to identification of the epitopes of antibodies, and a new understanding of possible neutralization mechanisms. AAV is now well-enough characterized that it has also become a model system for EM methods development. Heralding a new era, cryo-EM is now also being deployed as an analytic tool in the process development and production quality control of high value pharmaceutical biologics, namely AAV vectors.
Collapse
Affiliation(s)
- Scott
M. Stagg
- Department
of Biological Sciences, Florida State University, Tallahassee, Florida 32306, United States
- Institute
of Molecular Biophysics, Florida State University, Tallahassee, Florida 32306, United States
| | - Craig Yoshioka
- Department
of Biomedical Engineering, Oregon Health
& Science University, Portland Oregon 97239, United States
| | - Omar Davulcu
- Environmental
Molecular Sciences Laboratory, Pacific Northwest
National Laboratory, 3335 Innovation Boulevard, Richland, Washington 99354, United States
| | - Michael S. Chapman
- Department
of Biochemistry, University of Missouri, Columbia, Missouri 65211, United States
| |
Collapse
|
13
|
Alnabati E, Esquivel-Rodriguez J, Terashi G, Kihara D. MarkovFit: Structure Fitting for Protein Complexes in Electron Microscopy Maps Using Markov Random Field. Front Mol Biosci 2022; 9:935411. [PMID: 35959463 PMCID: PMC9358042 DOI: 10.3389/fmolb.2022.935411] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2022] [Accepted: 06/13/2022] [Indexed: 11/13/2022] Open
Abstract
An increasing number of protein complex structures are determined by cryo-electron microscopy (cryo-EM). When individual protein structures have been determined and are available, an important task in structure modeling is to fit the individual structures into the density map. Here, we designed a method that fits the atomic structures of proteins in cryo-EM maps of medium to low resolutions using Markov random fields, which allows probabilistic evaluation of fitted models. The accuracy of our method, MarkovFit, performed better than existing methods on datasets of 31 simulated cryo-EM maps of resolution 10 Å , nine experimentally determined cryo-EM maps of resolution less than 4 Å , and 28 experimentally determined cryo-EM maps of resolution 6 to 20 Å .
Collapse
Affiliation(s)
- Eman Alnabati
- Department of Computer Science, Purdue University, West Lafayette, IN, United States
| | | | - Genki Terashi
- Department of Biological Sciences, Purdue University, West Lafayette, IN, United States
| | - Daisuke Kihara
- Department of Computer Science, Purdue University, West Lafayette, IN, United States
- Department of Biological Sciences, Purdue University, West Lafayette, IN, United States
| |
Collapse
|
14
|
He J, Lin P, Chen J, Cao H, Huang SY. Model building of protein complexes from intermediate-resolution cryo-EM maps with deep learning-guided automatic assembly. Nat Commun 2022; 13:4066. [PMID: 35831370 PMCID: PMC9279371 DOI: 10.1038/s41467-022-31748-9] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2022] [Accepted: 06/30/2022] [Indexed: 12/29/2022] Open
Abstract
Advances in microscopy instruments and image processing algorithms have led to an increasing number of cryo-electron microscopy (cryo-EM) maps. However, building accurate models into intermediate-resolution EM maps remains challenging and labor-intensive. Here, we propose an automatic model building method of multi-chain protein complexes from intermediate-resolution cryo-EM maps, named EMBuild, by integrating AlphaFold structure prediction, FFT-based global fitting, domain-based semi-flexible refinement, and graph-based iterative assembling on the main-chain probability map predicted by a deep convolutional network. EMBuild is extensively evaluated on diverse test sets of 47 single-particle EM maps at 4.0-8.0 Å resolution and 16 subtomogram averaging maps of cryo-ET data at 3.7-9.3 Å resolution, and compared with state-of-the-art approaches. We demonstrate that EMBuild is able to build high-quality complex structures that are comparably accurate to the manually built PDB structures from the cryo-EM maps. These results demonstrate the accuracy and reliability of EMBuild in automatic model building.
Collapse
Affiliation(s)
- Jiahua He
- School of Physics and Key Laboratory of Molecular Biophysics of MOE, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China
| | - Peicong Lin
- School of Physics and Key Laboratory of Molecular Biophysics of MOE, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China
| | - Ji Chen
- School of Physics and Key Laboratory of Molecular Biophysics of MOE, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China
| | - Hong Cao
- School of Physics and Key Laboratory of Molecular Biophysics of MOE, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China
| | - Sheng-You Huang
- School of Physics and Key Laboratory of Molecular Biophysics of MOE, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China.
| |
Collapse
|
15
|
Alnabati E, Terashi G, Kihara D. Protein Structural Modeling for Electron Microscopy Maps Using VESPER and MAINMAST. Curr Protoc 2022; 2:e494. [PMID: 35849043 PMCID: PMC9299282 DOI: 10.1002/cpz1.494] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
An increasing number of protein structures are determined by cryo-electron microscopy (cryo-EM) and stored in the Electron Microscopy Data Bank (EMDB). To interpret determined cryo-EM maps, several methods have been developed that model the tertiary structure of biomolecules, particularly proteins. Here we show how to use two such methods, VESPER and MAINMAST, which were developed in our group. VESPER is a method mainly for two purposes: fitting protein structure models into an EM map and aligning two EM maps locally or globally to capture their similarity. VESPER represents each EM map as a set of vectors pointing toward denser points. By considering matching the directions of vectors, in general, VESPER aligns maps better than conventional methods that only consider local densities of maps. MAINMAST is a de novo protein modeling tool designed for EM maps with resolution of 3-5 Å or better. MAINMAST builds a protein main chain directly from a density map by tracing dense points in an EM map and connecting them using a tree-graph structure. This article describes how to use these two tools using three illustrative modeling examples. © 2022 The Authors. Current Protocols published by Wiley Periodicals LLC. Basic Protocol 1: Protein structure model fitting using VESPER Alternate Protocol: Atomic model fitting using VESPER web server Basic Protocol 2: Protein de novo modeling using MAINMAST.
Collapse
Affiliation(s)
- Eman Alnabati
- Department of Computer SciencePurdue UniversityWest LafayetteIndiana
| | - Genki Terashi
- Department of Biological SciencesPurdue UniversityWest LafayetteIndiana
| | - Daisuke Kihara
- Department of Computer SciencePurdue UniversityWest LafayetteIndiana
- Department of Biological SciencesPurdue UniversityWest LafayetteIndiana
| |
Collapse
|
16
|
Wang C, Anglès F, Balch WE. Triangulating variation in the population to define mechanisms for precision management of genetic disease. Structure 2022; 30:1190-1207.e5. [PMID: 35714602 PMCID: PMC9357173 DOI: 10.1016/j.str.2022.05.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2021] [Revised: 04/18/2022] [Accepted: 05/17/2022] [Indexed: 10/18/2022]
Abstract
To understand mechanistically how the protein fold is shaped by therapeutics to inform precision management of disease, we developed variation-capture (VarC) mapping. VarC triangulates sparse sequence variation information found in the population using Gaussian process regression (GPR)-based machine learning to define the combined pairwise-residue interactions contributing to dynamic protein function in the individual in response to therapeutics. Using VarC mapping, we now reveal the pairwise-residue covariant relationships across the entire protein fold of cystic fibrosis (CF) transmembrane conductance regulator (CFTR) to define the molecular mechanisms of clinically approved CF chemical modulators. We discover an energetically destabilized covariant core containing a di-acidic YKDAD endoplasmic reticulum (ER) exit code that is only weakly corrected by current therapeutics. Our results illustrate that VarC provides a generalizable tool to triangulate information from genetic variation in the population to mechanistically discover therapeutic strategies that guide precision management of the individual.
Collapse
Affiliation(s)
- Chao Wang
- Department of Molecular Medicine, Scripps Research, La Jolla, CA 92037, USA
| | - Frédéric Anglès
- Department of Molecular Medicine, Scripps Research, La Jolla, CA 92037, USA
| | - William E Balch
- Department of Molecular Medicine, Scripps Research, La Jolla, CA 92037, USA.
| |
Collapse
|
17
|
Behkamal B, Naghibzadeh M, Pagnani A, Saberi MR, Al Nasr K. LPTD: a novel linear programming-based topology determination method for cryo-EM maps. Bioinformatics 2022; 38:2734-2741. [PMID: 35561171 PMCID: PMC9306757 DOI: 10.1093/bioinformatics/btac170] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2021] [Revised: 03/01/2022] [Accepted: 03/18/2022] [Indexed: 02/03/2023] Open
Abstract
SUMMARY Topology determination is one of the most important intermediate steps toward building the atomic structure of proteins from their medium-resolution cryo-electron microscopy (cryo-EM) map. The main goal in the topology determination is to identify correct matches (i.e. assignment and direction) between secondary structure elements (SSEs) (α-helices and β-sheets) detected in a protein sequence and cryo-EM density map. Despite many recent advances in molecular biology technologies, the problem remains a challenging issue. To overcome the problem, this article proposes a linear programming-based topology determination (LPTD) method to solve the secondary structure topology problem in three-dimensional geometrical space. Through modeling of the protein's sequence with the aid of extracting highly reliable features and a distance-based scoring function, the secondary structure matching problem is transformed into a complete weighted bipartite graph matching problem. Subsequently, an algorithm based on linear programming is developed as a decision-making strategy to extract the true topology (native topology) between all possible topologies. The proposed automatic framework is verified using 12 experimental and 15 simulated α-β proteins. Results demonstrate that LPTD is highly efficient and extremely fast in such a way that for 77% of cases in the dataset, the native topology has been detected in the first rank topology in <2 s. Besides, this method is able to successfully handle large complex proteins with as many as 65 SSEs. Such a large number of SSEs have never been solved with current tools/methods. AVAILABILITY AND IMPLEMENTATION The LPTD package (source code and data) is publicly available at https://github.com/B-Behkamal/LPTD. Moreover, two test samples as well as the instruction of utilizing the graphical user interface have been provided in the shared readme file. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Bahareh Behkamal
- Department of Computer Engineering, Faculty of Engineering, Ferdowsi University of Mashhad, Mashhad 9177948974, Iran
| | - Mahmoud Naghibzadeh
- Department of Computer Engineering, Faculty of Engineering, Ferdowsi University of Mashhad, Mashhad 9177948974, Iran
| | - Andrea Pagnani
- Department of Applied Science and Technology (DISAT), Politecnico di Torino, Torino I-10129, Italy
- Italian Institute for Genomic Medicine (IIGM), IRCC-Candiolo, Candiolo (TO) I-10060, Italy
- INFN Sezione di Torino, Torino I-10125, Italy
| | - Mohammad Reza Saberi
- Medicinal Chemistry Department, School of Pharmacy, Mashhad University of Medical Sciences, Mashhad 9177899191, Iran
- Bioinformatics Research Group, Mashhad University of Medical Sciences, Mashhad 9177899191, Iran
| | - Kamal Al Nasr
- Department of Computer Science, Tennessee State University, Nashville, TN 37209, USA
| |
Collapse
|
18
|
Meyer NL, Chapman MS. Adeno-associated virus (AAV) cell entry: structural insights. Trends Microbiol 2022; 30:432-451. [PMID: 34711462 PMCID: PMC11225776 DOI: 10.1016/j.tim.2021.09.005] [Citation(s) in RCA: 34] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2021] [Revised: 09/02/2021] [Accepted: 09/14/2021] [Indexed: 02/07/2023]
Abstract
Adeno-associated virus (AAV) is the leading vector in emerging treatments of inherited diseases. Higher transduction efficiencies and cellular specificity are required for broader clinical application, motivating investigations of virus-host molecular interactions during cell entry. High-throughput methods are identifying host proteins more comprehensively, with subsequent molecular studies revealing unanticipated complexity and serotype specificity. Cryogenic electron microscopy (cryo-EM) provides a path towards structural details of these sometimes heterogeneous virus-host complexes, and is poised to illuminate more fully the steps in entry. Here presented, is progress in understanding the distinct steps of glycan attachment, and receptor-mediated entry/trafficking. Comparison with structures of antibody complexes provides new insights on immune neutralization with implications for the design of improved gene therapy vectors.
Collapse
Affiliation(s)
- Nancy L Meyer
- Pacific Northwest Cryo-EM Center, Oregon Health and Science University (OHSU) Center for Spatial Systems Biomedicine, Portland, OR, USA
| | - Michael S Chapman
- Department of Biochemistry, University of Missouri, Columbia, MO 65211, USA.
| |
Collapse
|
19
|
Treder KP, Huang C, Kim JS, Kirkland AI. Applications of deep learning in electron microscopy. Microscopy (Oxf) 2022; 71:i100-i115. [DOI: 10.1093/jmicro/dfab043] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2021] [Revised: 08/30/2021] [Accepted: 11/08/2021] [Indexed: 12/25/2022] Open
Abstract
Abstract
We review the growing use of machine learning in electron microscopy (EM) driven in part by the availability of fast detectors operating at kiloHertz frame rates leading to large data sets that cannot be processed using manually implemented algorithms. We summarize the various network architectures and error metrics that have been applied to a range of EM-related problems including denoising and inpainting. We then provide a review of the application of these in both physical and life sciences, highlighting how conventional networks and training data have been specifically modified for EM.
Collapse
Affiliation(s)
- Kevin P Treder
- Department of Materials, University of Oxford, Oxford, Oxfordshire OX1 3PH, UK
| | - Chen Huang
- Rosalind Franklin Institute, Harwell Research Campus, Didcot, Oxfordshire OX11 0FA, UK
| | - Judy S Kim
- Department of Materials, University of Oxford, Oxford, Oxfordshire OX1 3PH, UK
- Rosalind Franklin Institute, Harwell Research Campus, Didcot, Oxfordshire OX11 0FA, UK
| | - Angus I Kirkland
- Department of Materials, University of Oxford, Oxford, Oxfordshire OX1 3PH, UK
- Rosalind Franklin Institute, Harwell Research Campus, Didcot, Oxfordshire OX11 0FA, UK
| |
Collapse
|
20
|
Neijenhuis T, van Keulen SC, Bonvin AMJJ. Interface refinement of low- to medium-resolution Cryo-EM complexes using HADDOCK2.4. Structure 2022; 30:476-484.e3. [PMID: 35216656 DOI: 10.1016/j.str.2022.02.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2021] [Revised: 11/25/2021] [Accepted: 01/28/2022] [Indexed: 10/19/2022]
Abstract
A wide range of cellular processes requires the formation of multimeric protein complexes. The rise of cryo-electron microscopy (cryo-EM) has enabled the structural characterization of these protein assemblies. The density maps produced can, however, still suffer from limited resolution, impeding the process of resolving structures at atomic resolution. In order to solve this issue, monomers can be fitted into low- to medium-resolution maps. Unfortunately, the models produced frequently contain atomic clashes at the protein-protein interfaces (PPIs), as intermolecular interactions are typically not considered during monomer fitting. Here, we present a refinement approach based on HADDOCK2.4 to remove intermolecular clashes and optimize PPIs. A dataset of 14 cryo-EM complexes was used to test eight protocols. The best-performing protocol, consisting of a semi-flexible simulated annealing refinement with centroid restraints on the monomers, was able to decrease intermolecular atomic clashes by 98% without significantly deteriorating the quality of the cryo-EM density fit.
Collapse
Affiliation(s)
- Tim Neijenhuis
- Computational Structural Biology Group, Bijvoet Center for Biomolecular Research, Science for Life, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, 3584 CH, Utrecht, the Netherlands
| | - Siri C van Keulen
- Computational Structural Biology Group, Bijvoet Center for Biomolecular Research, Science for Life, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, 3584 CH, Utrecht, the Netherlands
| | - Alexandre M J J Bonvin
- Computational Structural Biology Group, Bijvoet Center for Biomolecular Research, Science for Life, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, 3584 CH, Utrecht, the Netherlands.
| |
Collapse
|
21
|
Ravikumar A, Gopnarayan MN, Subramaniam S, Srinivasan N. Comparison of side-chain dispersion in protein structures determined by cryo-EM and X-ray crystallography. IUCRJ 2022; 9:98-103. [PMID: 35059214 PMCID: PMC8733892 DOI: 10.1107/s2052252521011945] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/16/2020] [Accepted: 11/10/2021] [Indexed: 06/14/2023]
Abstract
An evaluation of systematic differences in local structure and conformation in the interior of protein tertiary structures determined by crystallography and by cryo-electron microscopy (cryo-EM) is reported. The expectation is that any consistent differences between the derived atomic models could provide insights into variations in side-chain packing that result from differences in specimens prepared for analysis between these two methods. By computing an atomic packing score, which provides a quantitative measure of clustering of side-chain atoms in the core of the tertiary structures, it is found that, in general, for structures determined by cryo-EM, side chains are more dispersed than in structures determined by X-ray crystallography over a similar resolution range. This trend is also observed in the packing comparison at subunit interfaces. Similar trends were observed in the packing comparison at the core of tertiary structures of the same proteins determined by both X-ray and cryo-EM methods. It is proposed here that the reduced dispersion of side chains in protein crystals could be due to some level of dehydration in 3D crystals prepared for X-ray crystallography and also because the higher rate of freezing of protein samples for cryo-EM may enable preservation of a more native conformation.
Collapse
Affiliation(s)
- Ashraya Ravikumar
- Molecular Biophysics Unit, Indian Institute of Science, Bengaluru, India
| | | | - Sriram Subramaniam
- University of British Columbia, Vancouver, British Columbia V6T 1Z3, Canada
| | | |
Collapse
|
22
|
Kryshtafovych A, Moult J, Albrecht R, Chang GA, Chao K, Fraser A, Greenfield J, Hartmann MD, Herzberg O, Josts I, Leiman PG, Linden SB, Lupas AN, Nelson DC, Rees SD, Shang X, Sokolova ML, Tidow H. Computational models in the service of X-ray and cryo-electron microscopy structure determination. Proteins 2021; 89:1633-1646. [PMID: 34449113 PMCID: PMC8616789 DOI: 10.1002/prot.26223] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2021] [Revised: 08/11/2021] [Accepted: 08/17/2021] [Indexed: 01/20/2023]
Abstract
Critical assessment of structure prediction (CASP) conducts community experiments to determine the state of the art in computing protein structure from amino acid sequence. The process relies on the experimental community providing information about not yet public or about to be solved structures, for use as targets. For some targets, the experimental structure is not solved in time for use in CASP. Calculated structure accuracy improved dramatically in this round, implying that models should now be much more useful for resolving many sorts of experimental difficulties. To test this, selected models for seven unsolved targets were provided to the experimental groups. These models were from the AlphaFold2 group, who overall submitted the most accurate predictions in CASP14. Four targets were solved with the aid of the models, and, additionally, the structure of an already solved target was improved. An a posteriori analysis showed that, in some cases, models from other groups would also be effective. This paper provides accounts of the successful application of models to structure determination, including molecular replacement for X-ray crystallography, backbone tracing and sequence positioning in a cryo-electron microscopy structure, and correction of local features. The results suggest that, in future, there will be greatly increased synergy between computational and experimental approaches to structure determination.
Collapse
Affiliation(s)
| | - John Moult
- Institute for Bioscience and Biotechnology Research, Department of Cell Biology and Molecular genetics, University of Maryland, 9600 Gudelsky Drive, Rockville, MD 20850, USA
| | - Reinhard Albrecht
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, 72076 Tübingen, Germany
| | - Geoffrey A. Chang
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California-San Diego, La Jolla, CA, 92093, USA
- Department of Pharmacology, University of California-San Diego, La Jolla, CA, 92093, USA
| | - Kinlin Chao
- Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, MD 20850, USA
| | - Alec Fraser
- Department of Biochemistry and Molecular Biology, Sealy Center for Structural Biology and Molecular Biophysics (SCSB), The University of Texas Medical Branch at Galveston, TX 77555, USA
| | - Julia Greenfield
- Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, MD 20850, USA
| | - Marcus D. Hartmann
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, 72076 Tübingen, Germany
| | - Osnat Herzberg
- Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, MD 20850, USA
- Department of Chemistry and Biochemistry, University of Maryland, College Park, MD 20742, USA
| | - Inokentijs Josts
- The Hamburg Advanced Research Center for Bioorganic Chemistry (HARBOR) & Department of Chemistry, Institute for Biochemistry and Molecular Biology, University of Hamburg, Luruper Chaussee 149, 22761 Hamburg, Germany
| | - Petr G. Leiman
- Department of Biochemistry and Molecular Biology, Sealy Center for Structural Biology and Molecular Biophysics (SCSB), The University of Texas Medical Branch at Galveston, TX 77555, USA
| | - Sara B. Linden
- Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, MD 20850, USA
| | - Andrei N. Lupas
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, 72076 Tübingen, Germany
| | - Daniel C. Nelson
- Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, MD 20850, USA
- Department of Veterinary Medicine, University of Maryland, College Park, MD 20742, USA
| | - Steven D. Rees
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California-San Diego, La Jolla, CA, 92093, USA
| | - Xiaoran Shang
- Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, MD 20850, USA
| | - Maria L. Sokolova
- Center of Life Sciences, Skolkovo Institute of Science and Technology, Moscow, 121205, Russia
| | - Henning Tidow
- The Hamburg Advanced Research Center for Bioorganic Chemistry (HARBOR) & Department of Chemistry, Institute for Biochemistry and Molecular Biology, University of Hamburg, Luruper Chaussee 149, 22761 Hamburg, Germany
| | | |
Collapse
|
23
|
Integrative structural modeling of macromolecular complexes using Assembline. Nat Protoc 2021; 17:152-176. [PMID: 34845384 DOI: 10.1038/s41596-021-00640-z] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2020] [Accepted: 09/30/2021] [Indexed: 11/08/2022]
Abstract
Integrative modeling enables structure determination of macromolecular complexes by combining data from multiple experimental sources such as X-ray crystallography, electron microscopy or cross-linking mass spectrometry. It is particularly useful for complexes not amenable to high-resolution electron microscopy-complexes that are flexible, heterogeneous or imaged in cells with cryo-electron tomography. We have recently developed an integrative modeling protocol that allowed us to model multi-megadalton complexes as large as the nuclear pore complex. Here, we describe the Assembline software package, which combines multiple programs and libraries with our own algorithms in a streamlined modeling pipeline. Assembline builds ensembles of models satisfying data from atomic structures or homology models, electron microscopy maps and other experimental data, and provides tools for their analysis. Compared with other methods, Assembline enables efficient sampling of conformational space through a multistep procedure, provides new modeling restraints and includes a unique configuration system for setting up the modeling project. Our protocol achieves exhaustive sampling in less than 100-1,000 CPU-hours even for complexes in the megadalton range. For larger complexes, resources available in institutional or public computer clusters are needed and sufficient to run the protocol. We also provide step-by-step instructions for preparing the input, running the core modeling steps and assessing modeling performance at any stage.
Collapse
|
24
|
Dasgupta B, Miyashita O, Uchihashi T, Tama F. Reconstruction of Three-Dimensional Conformations of Bacterial ClpB from High-Speed Atomic-Force-Microscopy Images. Front Mol Biosci 2021; 8:704274. [PMID: 34422905 PMCID: PMC8376356 DOI: 10.3389/fmolb.2021.704274] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2021] [Accepted: 07/13/2021] [Indexed: 11/14/2022] Open
Abstract
ClpB belongs to the cellular disaggretase machinery involved in rescuing misfolded or aggregated proteins during heat or other cellular shocks. The function of this protein relies on the interconversion between different conformations in its native condition. A recent high-speed-atomic-force-microscopy (HS-AFM) experiment on ClpB from Thermus thermophilus shows four predominant conformational classes, namely, open, closed, spiral, and half-spiral. Analyses of AFM images provide only partial structural information regarding the molecular surface, and thus computational modeling of three-dimensional (3D) structures of these conformations should help interpret dynamical events related to ClpB functions. In this study, we reconstruct 3D models of ClpB from HS-AFM images in different conformational classes. We have applied our recently developed computational method based on a low-resolution representation of 3D structure using a Gaussian mixture model, combined with a Monte-Carlo sampling algorithm to optimize the agreement with target AFM images. After conformational sampling, we obtained models that reflect conformational variety embedded within the AFM images. From these reconstructed 3D models, we described, in terms of relative domain arrangement, the different types of ClpB oligomeric conformations observed by HS-AFM experiments. In particular, we highlighted the slippage of the monomeric components around the seam. This study demonstrates that such details of information, necessary for annotating the different conformational states involved in the ClpB function, can be obtained by combining HS-AFM images, even with limited resolution, and computational modeling.
Collapse
Affiliation(s)
- Bhaskar Dasgupta
- Computational Structural Biology Research Team, RIKEN-Center for Computational Science, Kobe, Japan
| | - Osamu Miyashita
- Computational Structural Biology Research Team, RIKEN-Center for Computational Science, Kobe, Japan
| | - Takayuki Uchihashi
- Institute for Glyco-core Research (iGCORE), Nagoya University, Nagoya, Japan.,Exploratory Research Center on Life and Living Systems (ExCELLS), National Institutes of Natural Sciences, Okazaki, Japan.,Department of Physics, Graduate School of Science, Nagoya University, Nagoya, Japan
| | - Florence Tama
- Computational Structural Biology Research Team, RIKEN-Center for Computational Science, Kobe, Japan.,Department of Physics, Graduate School of Science, Nagoya University, Nagoya, Japan.,Institute of Transformative Bio-Molecules, Nagoya University, Nagoya, Japan
| |
Collapse
|
25
|
Masrati G, Landau M, Ben-Tal N, Lupas A, Kosloff M, Kosinski J. Integrative Structural Biology in the Era of Accurate Structure Prediction. J Mol Biol 2021; 433:167127. [PMID: 34224746 DOI: 10.1016/j.jmb.2021.167127] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2021] [Revised: 06/28/2021] [Accepted: 06/28/2021] [Indexed: 11/16/2022]
Abstract
Characterizing the three-dimensional structure of macromolecules is central to understanding their function. Traditionally, structures of proteins and their complexes have been determined using experimental techniques such as X-ray crystallography, NMR, or cryo-electron microscopy-applied individually or in an integrative manner. Meanwhile, however, computational methods for protein structure prediction have been improving their accuracy, gradually, then suddenly, with the breakthrough advance by AlphaFold2, whose models of monomeric proteins are often as accurate as experimental structures. This breakthrough foreshadows a new era of computational methods that can build accurate models for most monomeric proteins. Here, we envision how such accurate modeling methods can combine with experimental structural biology techniques, enhancing integrative structural biology. We highlight the challenges that arise when considering multiple structural conformations, protein complexes, and polymorphic assemblies. These challenges will motivate further developments, both in modeling programs and in methods to solve experimental structures, towards better and quicker investigation of structure-function relationships.
Collapse
Affiliation(s)
- Gal Masrati
- Department of Biochemistry and Molecular Biology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 6997801, Israel
| | - Meytal Landau
- Department of Biology, Technion-Israel Institute of Technology, Haifa 3200003, Israel; European Molecular Biology Laboratory (EMBL), Hamburg 22607, Germany
| | - Nir Ben-Tal
- Department of Biochemistry and Molecular Biology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 6997801, Israel
| | - Andrei Lupas
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, 72076 Tübingen, Germany.
| | - Mickey Kosloff
- Department of Human Biology, Faculty of Natural Sciences, University of Haifa, 199 Aba Khoushy Ave., Mt. Carmel, 3498838 Haifa, Israel.
| | - Jan Kosinski
- European Molecular Biology Laboratory (EMBL), Hamburg 22607, Germany; Centre for Structural Systems Biology (CSSB), Hamburg 22607, Germany; Structural and Computational Biology Unit, European Molecular Biology Laboratory, Meyerhofstrasse 1, 69117 Heidelberg, Germany.
| |
Collapse
|
26
|
Mori T, Terashi G, Matsuoka D, Kihara D, Sugita Y. Efficient Flexible Fitting Refinement with Automatic Error Fixing for De Novo Structure Modeling from Cryo-EM Density Maps. J Chem Inf Model 2021; 61:3516-3528. [PMID: 34142833 PMCID: PMC9282639 DOI: 10.1021/acs.jcim.1c00230] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Structural modeling of proteins from cryo-electron microscopy (cryo-EM) density maps is one of the challenging issues in structural biology. De novo modeling combined with flexible fitting refinement (FFR) has been widely used to build a structure of new proteins. In de novo prediction, artificial conformations containing local structural errors such as chirality errors, cis peptide bonds, and ring penetrations are frequently generated and cannot be easily removed in the subsequent FFR. Moreover, refinement can be significantly suppressed due to the low mobility of atoms inside the protein. To overcome these problems, we propose an efficient scheme for FFR, in which the local structural errors are fixed first, followed by FFR using an iterative simulated annealing (SA) molecular dynamics protocol with the united atom (UA) model in an implicit solvent model; we call this scheme "SAUA-FFR". The best model is selected from multiple flexible fitting runs with various biasing force constants to reduce overfitting. We apply our scheme to the decoys obtained from MAINMAST and demonstrate an improvement of the best model of eight selected proteins in terms of the root-mean-square deviation, MolProbity score, and RWplus score compared to the original scheme of MAINMAST. Fixing the local structural errors can enhance the formation of secondary structures, and the UA model enables progressive refinement compared to the all-atom model owing to its high mobility in the implicit solvent. The SAUA-FFR scheme realizes efficient and accurate protein structure modeling from medium-resolution maps with less overfitting.
Collapse
Affiliation(s)
- Takaharu Mori
- RIKEN Cluster for Pioneering Research, 2-1 Hirosawa, Wako-shi, Saitama 351-0198, Japan
| | - Genki Terashi
- Department of Biological Sciences, Purdue University, West Lafayette, Indiana 47907, United States
| | - Daisuke Matsuoka
- RIKEN Cluster for Pioneering Research, 2-1 Hirosawa, Wako-shi, Saitama 351-0198, Japan
| | - Daisuke Kihara
- Department of Biological Sciences, Purdue University, West Lafayette, Indiana 47907, United States.,Department of Computer Science, Purdue University, West Lafayette, Indiana 47907, United States
| | - Yuji Sugita
- RIKEN Cluster for Pioneering Research, 2-1 Hirosawa, Wako-shi, Saitama 351-0198, Japan.,RIKEN Center for Computational Science, 7-1-26 Minatojima-minamimachi, Chuo-ku, Kobe, Hyogo 650-0047, Japan.,RIKEN Center for Biosystems Dynamics Research, 7-1-26 Minatojima-minamimachi, Chuo-ku, Kobe, Hyogo 650-0047, Japan
| |
Collapse
|
27
|
Träger S, Tamò G, Aydin D, Fonti G, Audagnotto M, Dal Peraro M. CLoNe: automated clustering based on local density neighborhoods for application to biomolecular structural ensembles. Bioinformatics 2021; 37:921-928. [PMID: 32821900 PMCID: PMC8128458 DOI: 10.1093/bioinformatics/btaa742] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2020] [Revised: 07/14/2020] [Accepted: 08/18/2020] [Indexed: 11/14/2022] Open
Abstract
Motivation Proteins are intrinsically dynamic entities. Flexibility sampling methods, such as molecular dynamics or those arising from integrative modeling strategies, are now commonplace and enable the study of molecular conformational landscapes in many contexts. Resulting structural ensembles increase in size as technological and algorithmic advancements take place, making their analysis increasingly demanding. In this regard, cluster analysis remains a go-to approach for their classification. However, many state-of-the-art algorithms are restricted to specific cluster properties. Combined with tedious parameter fine-tuning, cluster analysis of protein structural ensembles suffers from the lack of a generally applicable and easy to use clustering scheme. Results We present CLoNe, an original Python-based clustering scheme that builds on the Density Peaks algorithm of Rodriguez and Laio. CLoNe relies on a probabilistic analysis of local density distributions derived from nearest neighbors to find relevant clusters regardless of cluster shape, size, distribution and amount. We show its capabilities on many toy datasets with properties otherwise dividing state-of-the-art approaches and improves on the original algorithm in key aspects. Applied to structural ensembles, CLoNe was able to extract meaningful conformations from membrane binding events and ligand-binding pocket opening as well as identify dominant dimerization motifs or inter-domain organization. CLoNe additionally saves clusters as individual trajectories for further analysis and provides scripts for automated use with molecular visualization software. Availability and implementation www.epfl.ch/labs/lbm/resources, github.com/LBM-EPFL/CLoNe. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Sylvain Träger
- Institute of Bioengineering, School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne, Lausanne 1025, Switzerland.,Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland
| | - Giorgio Tamò
- Institute of Bioengineering, School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne, Lausanne 1025, Switzerland.,Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland
| | - Deniz Aydin
- Institute of Bioengineering, School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne, Lausanne 1025, Switzerland.,Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland
| | - Giulia Fonti
- Institute of Bioengineering, School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne, Lausanne 1025, Switzerland.,Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland
| | - Martina Audagnotto
- Institute of Bioengineering, School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne, Lausanne 1025, Switzerland.,Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland
| | - Matteo Dal Peraro
- Institute of Bioengineering, School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne, Lausanne 1025, Switzerland.,Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland
| |
Collapse
|
28
|
Malhotra S, Joseph AP, Thiyagalingam J, Topf M. Assessment of protein-protein interfaces in cryo-EM derived assemblies. Nat Commun 2021; 12:3399. [PMID: 34099703 PMCID: PMC8184972 DOI: 10.1038/s41467-021-23692-x] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2020] [Accepted: 04/14/2021] [Indexed: 02/05/2023] Open
Abstract
Structures of macromolecular assemblies derived from cryo-EM maps often contain errors that become more abundant with decreasing resolution. Despite efforts in the cryo-EM community to develop metrics for map and atomistic model validation, thus far, no specific scoring metrics have been applied systematically to assess the interface between the assembly subunits. Here, we comprehensively assessed protein-protein interfaces in macromolecular assemblies derived by cryo-EM. To this end, we developed Protein Interface-score (PI-score), a density-independent machine learning-based metric, trained using the features of protein-protein interfaces in crystal structures. We evaluated 5873 interfaces in 1053 PDB-deposited cryo-EM models (including SARS-CoV-2 complexes), as well as the models submitted to CASP13 cryo-EM targets and the EM model challenge. We further inspected the interfaces associated with low-scores and found that some of those, especially in intermediate-to-low resolution (worse than 4 Å) structures, were not captured by density-based assessment scores. A combined score incorporating PI-score and fit-to-density score showed discriminatory power, allowing our method to provide a powerful complementary assessment tool for the ever-increasing number of complexes solved by cryo-EM.
Collapse
Affiliation(s)
- Sony Malhotra
- grid.4464.20000 0001 2161 2573Institute of Structural and Molecular Biology, Department of Biological Sciences, Birkbeck College, University of London, London, UK ,grid.14467.30Scientific Computing Department, Science and Technology Facilities Council, Didcot, UK
| | - Agnel Praveen Joseph
- grid.14467.30Scientific Computing Department, Science and Technology Facilities Council, Didcot, UK
| | - Jeyan Thiyagalingam
- grid.14467.30Scientific Computing Department, Science and Technology Facilities Council, Didcot, UK
| | - Maya Topf
- grid.4464.20000 0001 2161 2573Institute of Structural and Molecular Biology, Department of Biological Sciences, Birkbeck College, University of London, London, UK ,grid.13648.380000 0001 2180 3484Centre for Structural Systems Biology, Leibniz-Institut für Experimentelle Virologie and Universitätsklinikum Hamburg-Eppendorf (UKE), Hamburg, Germany
| |
Collapse
|
29
|
Pakhrin SC, Shrestha B, Adhikari B, KC DB. Deep Learning-Based Advances in Protein Structure Prediction. Int J Mol Sci 2021; 22:5553. [PMID: 34074028 PMCID: PMC8197379 DOI: 10.3390/ijms22115553] [Citation(s) in RCA: 47] [Impact Index Per Article: 15.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2021] [Revised: 05/12/2021] [Accepted: 05/18/2021] [Indexed: 12/29/2022] Open
Abstract
Obtaining an accurate description of protein structure is a fundamental step toward understanding the underpinning of biology. Although recent advances in experimental approaches have greatly enhanced our capabilities to experimentally determine protein structures, the gap between the number of protein sequences and known protein structures is ever increasing. Computational protein structure prediction is one of the ways to fill this gap. Recently, the protein structure prediction field has witnessed a lot of advances due to Deep Learning (DL)-based approaches as evidenced by the success of AlphaFold2 in the most recent Critical Assessment of protein Structure Prediction (CASP14). In this article, we highlight important milestones and progresses in the field of protein structure prediction due to DL-based methods as observed in CASP experiments. We describe advances in various steps of protein structure prediction pipeline viz. protein contact map prediction, protein distogram prediction, protein real-valued distance prediction, and Quality Assessment/refinement. We also highlight some end-to-end DL-based approaches for protein structure prediction approaches. Additionally, as there have been some recent DL-based advances in protein structure determination using Cryo-Electron (Cryo-EM) microscopy based, we also highlight some of the important progress in the field. Finally, we provide an outlook and possible future research directions for DL-based approaches in the protein structure prediction arena.
Collapse
Affiliation(s)
- Subash C. Pakhrin
- Department of Electrical Engineering and Computer Science, Wichita State University, Wichita, KS 67260, USA;
| | - Bikash Shrestha
- Department of Computer Science, University of Missouri-St. Louis, St. Louis, MO 63121, USA;
| | - Badri Adhikari
- Department of Computer Science, University of Missouri-St. Louis, St. Louis, MO 63121, USA;
| | - Dukka B. KC
- Department of Electrical Engineering and Computer Science, Wichita State University, Wichita, KS 67260, USA;
| |
Collapse
|
30
|
Zhang Y, Krieger J, Mikulska-Ruminska K, Kaynak B, Sorzano COS, Carazo JM, Xing J, Bahar I. State-dependent sequential allostery exhibited by chaperonin TRiC/CCT revealed by network analysis of Cryo-EM maps. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2021; 160:104-120. [PMID: 32866476 PMCID: PMC7914283 DOI: 10.1016/j.pbiomolbio.2020.08.006] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/17/2019] [Revised: 06/25/2020] [Accepted: 08/16/2020] [Indexed: 12/17/2022]
Abstract
The eukaryotic chaperonin TRiC/CCT plays a major role in assisting the folding of many proteins through an ATP-driven allosteric cycle. Recent structures elucidated by cryo-electron microscopy provide a broad view of the conformations visited at various stages of the chaperonin cycle, including a sequential activation of its subunits in response to nucleotide binding. But we lack a thorough mechanistic understanding of the structure-based dynamics and communication properties that underlie the TRiC/CCT machinery. In this study, we present a computational methodology based on elastic network models adapted to cryo-EM density maps to gain a deeper understanding of the structure-encoded allosteric dynamics of this hexadecameric machine. We have analysed several structures of the chaperonin resolved in different states toward mapping its conformational landscape. Our study indicates that the overall architecture intrinsically favours cooperative movements that comply with the structural variabilities observed in experiments. Furthermore, the individual subunits CCT1-CCT8 exhibit state-dependent sequential events at different states of the allosteric cycle. For example, in the ATP-bound state, subunits CCT5 and CCT4 selectively initiate the lid closure motions favoured by the overall architecture; whereas in the apo form of the heteromer, the subunit CCT7 exhibits the highest predisposition to structural change. The changes then propagate through parallel fluxes of allosteric signals to neighbours on both rings. The predicted state-dependent mechanisms of sequential activation provide new insights into TRiC/CCT intra- and inter-ring signal transduction events.
Collapse
Affiliation(s)
- Yan Zhang
- Department of Computational and Systems Biology, University of Pittsburgh, 800 Murdoch Building, 3420 Forbes Avenue, Pittsburgh, PA, 15261, USA
| | - James Krieger
- Department of Computational and Systems Biology, University of Pittsburgh, 800 Murdoch Building, 3420 Forbes Avenue, Pittsburgh, PA, 15261, USA
| | - Karolina Mikulska-Ruminska
- Department of Computational and Systems Biology, University of Pittsburgh, 800 Murdoch Building, 3420 Forbes Avenue, Pittsburgh, PA, 15261, USA
| | - Burak Kaynak
- Department of Computational and Systems Biology, University of Pittsburgh, 800 Murdoch Building, 3420 Forbes Avenue, Pittsburgh, PA, 15261, USA
| | | | - José-María Carazo
- Centro Nacional de Biotecnología (CSIC), Darwin, 3, 28049, Madrid, Spain
| | - Jianhua Xing
- Department of Computational and Systems Biology, University of Pittsburgh, 800 Murdoch Building, 3420 Forbes Avenue, Pittsburgh, PA, 15261, USA
| | - Ivet Bahar
- Department of Computational and Systems Biology, University of Pittsburgh, 800 Murdoch Building, 3420 Forbes Avenue, Pittsburgh, PA, 15261, USA.
| |
Collapse
|
31
|
Sun M, Shen B, Li W, Samir P, Browne CM, Link AJ, Frank J. A Time-Resolved Cryo-EM Study of Saccharomyces cerevisiae 80S Ribosome Protein Composition in Response to a Change in Carbon Source. Proteomics 2020; 21:e2000125. [PMID: 33007145 DOI: 10.1002/pmic.202000125] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2020] [Revised: 08/31/2020] [Indexed: 12/28/2022]
Abstract
The role of the ribosome in the regulation of gene expression has come into increased focus. It is proposed that ribosomes are catalytic engines capable of changing their protein composition in response to environmental stimuli. Time-resolved cryo-electron microscopy (cryo-EM) techniques are employed to identify quantitative changes in the protein composition and structure of the Saccharomyces cerevisiae 80S ribosomes after shifting the carbon source from glucose to glycerol. Using cryo-EM combined with the computational classification approach, it is found that a fraction of the yeast cells' 80S ribosomes lack ribosomal proteins at the entrance and exit sites for tRNAs, including uL16(RPL10), eS1(RPS1), uS11(RPS14A/B), and eS26(RPS26A/B). This fraction increased after a change from glucose to glycerol medium. The quantitative structural analysis supports the hypothesis that ribosomes are dynamic complexes that alter their composition in response to changes in growth or environmental conditions.
Collapse
Affiliation(s)
- Ming Sun
- Department of Biological Sciences, Columbia University, New York, NY, 10027, USA
| | - Bingxin Shen
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY, 10032, USA
| | - Wen Li
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY, 10032, USA
| | - Parimal Samir
- Department of Pathology, Microbiology and Immunology, Vanderbilt University School of Medicine, Nashville, TN, 37235, USA.,Department of Biochemistry, Vanderbilt University School of Medicine, Nashville, TN, 37235, USA
| | - Christopher M Browne
- Department of Pathology, Microbiology and Immunology, Vanderbilt University School of Medicine, Nashville, TN, 37235, USA.,Department of Biochemistry, Vanderbilt University School of Medicine, Nashville, TN, 37235, USA
| | - Andrew J Link
- Department of Pathology, Microbiology and Immunology, Vanderbilt University School of Medicine, Nashville, TN, 37235, USA.,Department of Biochemistry, Vanderbilt University School of Medicine, Nashville, TN, 37235, USA.,Department of Chemistry, Vanderbilt University, Nashville, TN, 37235, USA
| | - Joachim Frank
- Department of Biological Sciences, Columbia University, New York, NY, 10027, USA.,Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY, 10032, USA
| |
Collapse
|
32
|
Quemin ERJ, Machala EA, Vollmer B, Pražák V, Vasishtan D, Rosch R, Grange M, Franken LE, Baker LA, Grünewald K. Cellular Electron Cryo-Tomography to Study Virus-Host Interactions. Annu Rev Virol 2020; 7:239-262. [PMID: 32631159 DOI: 10.1146/annurev-virology-021920-115935] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Viruses are obligatory intracellular parasites that reprogram host cells upon infection to produce viral progeny. Here, we review recent structural insights into virus-host interactions in bacteria, archaea, and eukaryotes unveiled by cellular electron cryo-tomography (cryoET). This advanced three-dimensional imaging technique of vitreous samples in near-native state has matured over the past two decades and proven powerful in revealing molecular mechanisms underlying viral replication. Initial studies were restricted to cell peripheries and typically focused on early infection steps, analyzing surface proteins and viral entry. Recent developments including cryo-thinning techniques, phase-plate imaging, and correlative approaches have been instrumental in also targeting rare events inside infected cells. When combined with advances in dedicated image analyses and processing methods, details of virus assembly and egress at (sub)nanometer resolution were uncovered. Altogether, we provide a historical and technical perspective and discuss future directions and impacts of cryoET for integrative structural cell biology analyses of viruses.
Collapse
Affiliation(s)
- Emmanuelle R J Quemin
- Centre for Structural Systems Biology, Heinrich-Pette-Institute, Leibniz Institute for Experimental Virology, University of Hamburg, D-22607 Hamburg, Germany;
| | - Emily A Machala
- Division of Structural Biology, Wellcome Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, United Kingdom
| | - Benjamin Vollmer
- Centre for Structural Systems Biology, Heinrich-Pette-Institute, Leibniz Institute for Experimental Virology, University of Hamburg, D-22607 Hamburg, Germany;
- Division of Structural Biology, Wellcome Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, United Kingdom
| | - Vojtěch Pražák
- Division of Structural Biology, Wellcome Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, United Kingdom
| | - Daven Vasishtan
- Division of Structural Biology, Wellcome Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, United Kingdom
| | - Rene Rosch
- Centre for Structural Systems Biology, Heinrich-Pette-Institute, Leibniz Institute for Experimental Virology, University of Hamburg, D-22607 Hamburg, Germany;
| | - Michael Grange
- Division of Structural Biology, Wellcome Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, United Kingdom
| | - Linda E Franken
- Centre for Structural Systems Biology, Heinrich-Pette-Institute, Leibniz Institute for Experimental Virology, University of Hamburg, D-22607 Hamburg, Germany;
| | - Lindsay A Baker
- Division of Structural Biology, Wellcome Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, United Kingdom
| | - Kay Grünewald
- Centre for Structural Systems Biology, Heinrich-Pette-Institute, Leibniz Institute for Experimental Virology, University of Hamburg, D-22607 Hamburg, Germany;
- Division of Structural Biology, Wellcome Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, United Kingdom
| |
Collapse
|
33
|
Zhang B, Zhang X, Pearce R, Shen HB, Zhang Y. A New Protocol for Atomic-Level Protein Structure Modeling and Refinement Using Low-to-Medium Resolution Cryo-EM Density Maps. J Mol Biol 2020; 432:5365-5377. [PMID: 32771523 DOI: 10.1016/j.jmb.2020.07.027] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2020] [Revised: 07/14/2020] [Accepted: 07/31/2020] [Indexed: 12/19/2022]
Abstract
The rapid progress of cryo-electron microscopy (cryo-EM) in structural biology has raised an urgent need for robust methods to create and refine atomic-level structural models using low-resolution EM density maps. We propose a new protocol to create initial models using I-TASSER protein structure prediction, followed by EM density map-based rigid-body structure fitting, flexible fragment adjustment and atomic-level structure refinement simulations. The protocol was tested on a large set of 285 non-homologous proteins and generated structural models with correct folds for 260 proteins, where 28% had RMSDs below 2 Å. Compared to other state-of-the-art methods, the major advantage of the proposed pipeline lies in the uniform structure prediction and refinement protocol, as well as the extensive structural re-assembly simulations, which allow for low-to-medium resolution EM density map-guided structure modeling starting from amino acid sequences. Interestingly, the quality of both the image fitting and subsequent structure refinement was found to be strongly correlated with the correctness of the initial I-TASSER models; this is mainly due to the different correlation patterns observed between force field and structural quality for the models with template modeling score (or TM-score, a metric quantifying the similarity of models to the native) above and below a threshold of 0.5. Overall, the results demonstrate a new avenue that is ready to use for large-scale cryo-EM-based structure modeling and atomic-level density map-guided structure refinement.
Collapse
Affiliation(s)
- Biao Zhang
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai, China; Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Xi Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Robin Pearce
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Hong-Bin Shen
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai, China.
| | - Yang Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA; Department of Biological Chemistry, University of Michigan, Ann Arbor, MI 48109, USA.
| |
Collapse
|
34
|
Abriata LA, Dal Peraro M. State-of-the-art web services for de novo protein structure prediction. Brief Bioinform 2020; 22:5870389. [PMID: 34020540 DOI: 10.1093/bib/bbaa139] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2020] [Revised: 06/04/2020] [Accepted: 06/05/2020] [Indexed: 02/06/2023] Open
Abstract
Residue coevolution estimations coupled to machine learning methods are revolutionizing the ability of protein structure prediction approaches to model proteins that lack clear homologous templates in the Protein Data Bank (PDB). This has been patent in the last round of the Critical Assessment of Structure Prediction (CASP), which presented several very good models for the hardest targets. Unfortunately, literature reporting on these advances often lacks digests tailored to lay end users; moreover, some of the top-ranking predictors do not provide webservers that can be used by nonexperts. How can then end users benefit from these advances and correctly interpret the predicted models? Here we review the web resources that biologists can use today to take advantage of these state-of-the-art methods in their research, including not only the best de novo modeling servers but also datasets of models precomputed by experts for structurally uncharacterized protein families. We highlight their features, advantages and pitfalls for predicting structures of proteins without clear templates. We present a broad number of applications that span from driving forward biochemical investigations that lack experimental structures to actually assisting experimental structure determination in X-ray diffraction, cryo-EM and other forms of integrative modeling. We also discuss issues that must be considered by users yet still require further developments, such as global and residue-wise model quality estimates and sources of residue coevolution other than monomeric tertiary structure.
Collapse
Affiliation(s)
- Luciano A Abriata
- Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
| | - Matteo Dal Peraro
- Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
| |
Collapse
|
35
|
Kim DN, Gront D, Sanbonmatsu KY. Practical Considerations for Atomistic Structure Modeling with Cryo-EM Maps. J Chem Inf Model 2020; 60:2436-2442. [PMID: 32422044 PMCID: PMC7891309 DOI: 10.1021/acs.jcim.0c00090] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
We describe common approaches to atomistic structure modeling with single particle analysis derived cryo-EM maps. Several strategies for atomistic model building and atomistic model fitting methods are discussed, including selection criteria and implementation procedures. In covering basic concepts and caveats, this short perspective aims to help facilitate active discussion between scientists at different levels with diverse backgrounds.
Collapse
Affiliation(s)
- Doo Nam Kim
- Computational Biology Team, Biological Science Division, Pacific Northwest National Laboratory, Richland, Washington, 99354, United States
| | - Dominik Gront
- Faculty of Chemistry, Biological and Chemical Research Center, University of Warsaw, Pasteura 1, 02-093 Warsaw, Poland
| | - Karissa Y. Sanbonmatsu
- Theoretical Biology and Biophysics Group, Los Alamos National Laboratory, Los Alamos, New Mexico, 87545, United States
- New Mexico Consortium, Los Alamos, New Mexico, 87544, United States
| |
Collapse
|
36
|
Jagger BR, Kochanek SE, Haldar S, Amaro RE, Mulholland AJ. Multiscale simulation approaches to modeling drug-protein binding. Curr Opin Struct Biol 2020; 61:213-221. [PMID: 32113133 DOI: 10.1016/j.sbi.2020.01.014] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2019] [Revised: 01/16/2020] [Accepted: 01/21/2020] [Indexed: 01/19/2023]
Abstract
Simulations can provide detailed insight into the molecular processes involved in drug action, such as protein-ligand binding, and can therefore be a valuable tool for drug design and development. Processes with a large range of length and timescales may be involved, and understanding these different scales typically requires different types of simulation methodology. Ideally, simulations should be able to connect across scales, to analyze and predict how changes at one scale can influence another. Multiscale simulation methods, which combine different levels of treatment, are an emerging frontier with great potential in this area. Here we review multiscale frameworks of various types, and selected applications to biomolecular systems with a focus on drug-ligand binding.
Collapse
Affiliation(s)
- Benjamin R Jagger
- Department of Chemistry and Biochemistry, University of California San Diego, 9500 Gilman Drive, La Jolla, CA 92093, United States
| | - Sarah E Kochanek
- Department of Chemistry and Biochemistry, University of California San Diego, 9500 Gilman Drive, La Jolla, CA 92093, United States
| | - Susanta Haldar
- Centre for Computational Chemistry, School of Chemistry, University of Bristol, Bristol, BS8 1TS, UK
| | - Rommie E Amaro
- Department of Chemistry and Biochemistry, University of California San Diego, 9500 Gilman Drive, La Jolla, CA 92093, United States.
| | - Adrian J Mulholland
- Centre for Computational Chemistry, School of Chemistry, University of Bristol, Bristol, BS8 1TS, UK.
| |
Collapse
|
37
|
Rodríguez CF, Pal M, Muñoz-Hernandez H, Pearl LH, Llorca O. Modeling of a 14 kDa RUVBL2-Binding Domain with Medium Resolution Cryo-EM Density. J Chem Inf Model 2020; 60:2541-2551. [PMID: 32175735 DOI: 10.1021/acs.jcim.9b01095] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The number of high-resolution structures of protein complexes obtained using cryo-electron microscopy (cryo-EM) is increasing rapidly. Cryo-EM maps of large macromolecular complexes frequently contain regions resolved at different resolution levels, and modeling atomic structures de novo can be difficult for domains determined at worse than 5 Å in the absence of atomic information from other structures. Here we describe the details and step-by-step decisions in the strategy we followed to model the RUVBL2-binding domain (RBD), a 14 kDa domain at the C-terminus of RNA Polymerase II associated protein 3 (RPAP3) for which atomic information was not available. Modeling was performed on a cryo-EM map at 4.0-5.5 Å resolution, integrating information from secondary structure predictions, homology modeling, restraints from cross-linked mass spectrometry, and molecular dynamics (MD) in AMBER. Here, we compare our model with the structure of RBD determined by NMR to evaluate our strategy. We also perform new MD simulations to describe important residues mediating the interaction of RBD with RUVBL2 and analyze their conservation in RBD homologous domains. Our approach and its evaluation can serve as an example to address the analysis of medium resolution regions in cryo-EM maps.
Collapse
Affiliation(s)
- Carlos F Rodríguez
- Structural Biology Programme, Spanish National Cancer Research Centre (CNIO), Melchor Fernández Almagro 3, 28029 Madrid, Spain
| | - Mohinder Pal
- Genome Damage and Stability Centre, School of Life Sciences, University of Sussex, Falmer, Brighton BN1 9RH, U.K
| | - Hugo Muñoz-Hernandez
- Structural Biology Programme, Spanish National Cancer Research Centre (CNIO), Melchor Fernández Almagro 3, 28029 Madrid, Spain
| | - Laurence H Pearl
- Genome Damage and Stability Centre, School of Life Sciences, University of Sussex, Falmer, Brighton BN1 9RH, U.K
| | - Oscar Llorca
- Structural Biology Programme, Spanish National Cancer Research Centre (CNIO), Melchor Fernández Almagro 3, 28029 Madrid, Spain
| |
Collapse
|
38
|
Fraser JS, Lindorff-Larsen K, Bonomi M. What Will Computational Modeling Approaches Have to Say in the Era of Atomistic Cryo-EM Data? J Chem Inf Model 2020; 60:2410-2412. [PMID: 32090567 DOI: 10.1021/acs.jcim.0c00123] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Affiliation(s)
- James S Fraser
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, California 94107, United States
| | - Kresten Lindorff-Larsen
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, DK-2200 Copenhagen, Denmark
| | - Massimiliano Bonomi
- Structural Bioinformatics Unit, Department of Structural Biology and Chemistry; CNRS UMR 3528; C3BI, CNRS USR 3756; Institut Pasteur, 75015 Paris, France
| |
Collapse
|
39
|
Alnabati E, Kihara D. Advances in Structure Modeling Methods for Cryo-Electron Microscopy Maps. Molecules 2019; 25:molecules25010082. [PMID: 31878333 PMCID: PMC6982917 DOI: 10.3390/molecules25010082] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2019] [Revised: 12/20/2019] [Accepted: 12/20/2019] [Indexed: 01/16/2023] Open
Abstract
Cryo-electron microscopy (cryo-EM) has now become a widely used technique for structure determination of macromolecular complexes. For modeling molecular structures from density maps of different resolutions, many algorithms have been developed. These algorithms can be categorized into rigid fitting, flexible fitting, and de novo modeling methods. It is also observed that machine learning (ML) techniques have been increasingly applied following the rapid progress of the ML field. Here, we review these different categories of macromolecule structure modeling methods and discuss their advances over time.
Collapse
Affiliation(s)
- Eman Alnabati
- Department of Computer Science, Purdue University, West Lafayette, IN 47907, USA
| | - Daisuke Kihara
- Department of Computer Science, Purdue University, West Lafayette, IN 47907, USA
- Department of Biological Sciences, Purdue University, West Lafayette, IN 47907, USA
- Correspondence:
| |
Collapse
|
40
|
Klaholz BP. Deriving and refining atomic models in crystallography and cryo-EM: the latest Phenix tools to facilitate structure analysis. Acta Crystallogr D Struct Biol 2019; 75:878-881. [PMID: 31588919 PMCID: PMC6778849 DOI: 10.1107/s2059798319013391] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2019] [Accepted: 09/30/2019] [Indexed: 01/05/2023] Open
Abstract
In structural biology, deriving and refining atomic models into maps obtained from X-ray crystallography or cryo electron microscopy (cryo-EM) is essential for the detailed interpretation of a structure and its functional implications through interactions so that for example hydrogen bonds, drug specificity and associated molecular mechanisms can be analysed. This commentary summarizes the latest features of the Phenix software and also highlights the fact that cryo-EM increasingly contributes to data depositions in the PDB and EMDB.
Collapse
Affiliation(s)
- Bruno P. Klaholz
- Centre for Integrative Biology (CBI), Department of Integrated Structural Biology, IGBMC, CNRS, Inserm, Université de Strasbourg, 1 rue Laurent Fries, Illkirch 67404, France
- Institute of Genetics and of Molecular and Cellular Biology (IGBMC), 1 rue Laurent Fries, Illkirch, France
- Centre National de la Recherche Scientifique (CNRS), UMR 7104, Illkirch, France
- Institut National de la Santé et de la Recherche Médicale (Inserm), U964, Illkirch, France
- Université de Strasbourg, Illkirch, France
| |
Collapse
|