1
|
Poon BK, Terwilliger TC, Adams PD. The Phenix-AlphaFold webservice: Enabling AlphaFold predictions for use in Phenix. Protein Sci 2024; 33:e4992. [PMID: 38647406 PMCID: PMC11034488 DOI: 10.1002/pro.4992] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2023] [Revised: 03/01/2024] [Accepted: 03/31/2024] [Indexed: 04/25/2024]
Abstract
Advances in machine learning have enabled sufficiently accurate predictions of protein structure to be used in macromolecular structure determination with crystallography and cryo-electron microscopy data. The Phenix software suite has AlphaFold predictions integrated into an automated pipeline that can start with an amino acid sequence and data, and automatically perform model-building and refinement to return a protein model fitted into the data. Due to the steep technical requirements of running AlphaFold efficiently, we have implemented a Phenix-AlphaFold webservice that enables all Phenix users to run AlphaFold predictions remotely from the Phenix GUI starting with the official 1.21 release. This webservice will be improved based on how it is used by the research community and the future research directions for Phenix.
Collapse
Affiliation(s)
- Billy K. Poon
- Molecular Biophysics & Integrated Bioimaging DivisionLawrence Berkeley National LaboratoryBerkeleyCaliforniaUSA
| | - Thomas C. Terwilliger
- New Mexico ConsortiumLos AlamosNew MexicoUSA
- Los Alamos National LaboratoryLos AlamosNew MexicoUSA
| | - Paul D. Adams
- Molecular Biophysics & Integrated Bioimaging DivisionLawrence Berkeley National LaboratoryBerkeleyCaliforniaUSA
- Department of BioengineeringUniversity of California, BerkeleyBerkeleyCaliforniaUSA
| |
Collapse
|
2
|
Afonine PV, Grosse-Kunstleve RW, Adams PD, Urzhumtsev A. Bulk-solvent and overall scaling revisited: faster calculations, improved results. Corrigendum. Acta Crystallogr D Struct Biol 2023; 79:S2059798323004825. [PMID: 37338421 DOI: 10.1107/s2059798323004825] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/21/2023] Open
Abstract
Equations in Sections 2.3 and 2.4 of the article by Afonine et al. [Acta Cryst. (2013). D69, 625-634] are corrected.
Collapse
Affiliation(s)
- P V Afonine
- Lawrence Berkeley National Laboratory, One Cyclotron Road, MS64R0121, Berkeley, CA 94720, USA
| | - R W Grosse-Kunstleve
- Lawrence Berkeley National Laboratory, One Cyclotron Road, MS64R0121, Berkeley, CA 94720, USA
| | - P D Adams
- Lawrence Berkeley National Laboratory, One Cyclotron Road, MS64R0121, Berkeley, CA 94720, USA
| | - A Urzhumtsev
- IGBMC, CNRS-INSERM-UdS, 1 Rue Laurent Fries, BP 10142, 67404 Illkirch, France
| |
Collapse
|
3
|
Oeffner RD, Croll TI, Millán C, Poon BK, Schlicksup CJ, Read RJ, Terwilliger TC. Putting AlphaFold models to work with phenix.process_predicted_model and ISOLDE. Acta Crystallogr D Struct Biol 2022; 78:1303-1314. [PMID: 36322415 PMCID: PMC9629492 DOI: 10.1107/s2059798322010026] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2022] [Accepted: 10/13/2022] [Indexed: 11/23/2022] Open
Abstract
AlphaFold has recently become an important tool in providing models for experimental structure determination by X-ray crystallography and cryo-EM. Large parts of the predicted models typically approach the accuracy of experimentally determined structures, although there are frequently local errors and errors in the relative orientations of domains. Importantly, residues in the model of a protein predicted by AlphaFold are tagged with a predicted local distance difference test score, informing users about which regions of the structure are predicted with less confidence. AlphaFold also produces a predicted aligned error matrix indicating its confidence in the relative positions of each pair of residues in the predicted model. The phenix.process_predicted_model tool downweights or removes low-confidence residues and can break a model into confidently predicted domains in preparation for molecular replacement or cryo-EM docking. These confidence metrics are further used in ISOLDE to weight torsion and atom-atom distance restraints, allowing the complete AlphaFold model to be interactively rearranged to match the docked fragments and reducing the need for the rebuilding of connecting regions.
Collapse
Affiliation(s)
- Robert D. Oeffner
- Department of Haematology, University of Cambridge, Cambridge Institute for Medical Research, Cambridge Biomedical Campus, The Keith Peters Building, Hills Road, Cambridge CB2 0XY, United Kingdom
| | - Tristan I. Croll
- Department of Haematology, University of Cambridge, Cambridge Institute for Medical Research, Cambridge Biomedical Campus, The Keith Peters Building, Hills Road, Cambridge CB2 0XY, United Kingdom
| | - Claudia Millán
- Department of Haematology, University of Cambridge, Cambridge Institute for Medical Research, Cambridge Biomedical Campus, The Keith Peters Building, Hills Road, Cambridge CB2 0XY, United Kingdom
| | - Billy K. Poon
- Molecular Biophysics and Integrated Bioimaging, Lawrence Berkeley National Laboratory (LBNL), Building 33R0349, Berkeley, CA 94720-8235, USA
| | - Christopher J. Schlicksup
- Molecular Biophysics and Integrated Bioimaging, Lawrence Berkeley National Laboratory (LBNL), Building 33R0349, Berkeley, CA 94720-8235, USA
| | - Randy J. Read
- Department of Haematology, University of Cambridge, Cambridge Institute for Medical Research, Cambridge Biomedical Campus, The Keith Peters Building, Hills Road, Cambridge CB2 0XY, United Kingdom,Correspondence e-mail: ,
| | - Tom C. Terwilliger
- New Mexico Consortium, Los Alamos National Laboratory, 100 Entrada Drive, Los Alamos, NM 87544, USA,Correspondence e-mail: ,
| |
Collapse
|
4
|
Mikhailovskii O, Xue Y, Skrynnikov NR. Modeling a unit cell: crystallographic refinement procedure using the biomolecular MD simulation platform Amber. IUCrJ 2022; 9:114-133. [PMID: 35059216 PMCID: PMC8733891 DOI: 10.1107/s2052252521011891] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/02/2021] [Accepted: 11/09/2021] [Indexed: 06/14/2023]
Abstract
A procedure has been developed for the refinement of crystallographic protein structures based on the biomolecular simulation program Amber. The procedure constructs a model representing a crystal unit cell, which generally contains multiple protein molecules and is fully hydrated with TIP3P water. Periodic boundary conditions are applied to the cell in order to emulate the crystal lattice. The refinement is conducted in the form of a specially designed short molecular-dynamics run controlled by the Amber ff14SB force field and the maximum-likelihood potential that encodes the structure-factor-based restraints. The new Amber-based refinement procedure has been tested on a set of 84 protein structures. In most cases, the new procedure led to appreciably lower R free values compared with those reported in the original PDB depositions or obtained by means of the industry-standard phenix.refine program. In particular, the new method has the edge in refining low-accuracy scrambled models. It has also been successful in refining a number of molecular-replacement models, including one with an r.m.s.d. of 2.15 Å. In addition, Amber-refined structures consistently show superior MolProbity scores. The new approach offers a highly realistic representation of protein-protein interactions in the crystal, as well as of protein-water interactions. It also offers a realistic representation of protein crystal dynamics (akin to ensemble-refinement schemes). Importantly, the method fully utilizes the information from the available diffraction data, while relying on state-of-the-art molecular-dynamics modeling to assist with those elements of the structure that do not diffract well (for example mobile loops or side chains). Finally, it should be noted that the protocol employs no tunable parameters, and the calculations can be conducted in a matter of several hours on desktop computers equipped with graphical processing units or using a designated web service.
Collapse
Affiliation(s)
- Oleg Mikhailovskii
- Laboratory of Biomolecular NMR, St Petersburg State University, St Petersburg 199034, Russian Federation
- Department of Chemistry, Purdue University, West Lafayette, IN 47907, USA
| | - Yi Xue
- School of Life Sciences, Tsinghua University, Beijing 100084, People's Republic of China
- Beijing Advanced Innovation Center for Structural Biology, Tsinghua University, Beijing 100084, People's Republic of China
- Tsinghua University-Peking University Joint Center for Life Sciences, Tsinghua University, Beijing 100084, People's Republic of China
| | - Nikolai R Skrynnikov
- Laboratory of Biomolecular NMR, St Petersburg State University, St Petersburg 199034, Russian Federation
- Department of Chemistry, Purdue University, West Lafayette, IN 47907, USA
| |
Collapse
|
5
|
Malý M, Diederichs K, Dohnálek J, Kolenko P. PAIREF: paired refinement also for Phenix users. Acta Crystallogr F Struct Biol Commun 2021; 77:226-229. [PMID: 34196613 PMCID: PMC8248825 DOI: 10.1107/s2053230x21006129] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2021] [Accepted: 06/11/2021] [Indexed: 11/10/2022] Open
Abstract
In macromolecular crystallography, paired refinement is generally accepted to be the optimal approach for the determination of the high-resolution cutoff. The software tool PAIREF provides automation of the protocol and associated analysis. Support for phenix.refine as a refinement engine has recently been implemented in the program. This feature is presented here using previously published data for thermolysin. The results demonstrate the importance of the complete cross-validation procedure to obtain a thorough and unbiased insight into the quality of high-resolution data.
Collapse
Affiliation(s)
- Martin Malý
- Faculty of Nuclear Sciences and Physical Engineering, Czech Technical University in Prague, Břehová 7, 115 19 Prague 1, Czech Republic
- Institute of Biotechnology of the Czech Academy of Sciences, BIOCEV, Průmyslová 595, 252 50 Vestec, Czech Republic
| | - Kay Diederichs
- University of Konstanz, Box M647, 78457 Konstanz, Germany
| | - Jan Dohnálek
- Institute of Biotechnology of the Czech Academy of Sciences, BIOCEV, Průmyslová 595, 252 50 Vestec, Czech Republic
| | - Petr Kolenko
- Faculty of Nuclear Sciences and Physical Engineering, Czech Technical University in Prague, Břehová 7, 115 19 Prague 1, Czech Republic
- Institute of Biotechnology of the Czech Academy of Sciences, BIOCEV, Průmyslová 595, 252 50 Vestec, Czech Republic
| |
Collapse
|
6
|
Liebschner D, Afonine PV, Moriarty NW, Poon BK, Chen VB, Adams PD. CERES: a cryo-EM re-refinement system for continuous improvement of deposited models. Acta Crystallogr D Struct Biol 2021; 77:48-61. [PMID: 33404525 PMCID: PMC7787109 DOI: 10.1107/s2059798320015879] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2020] [Accepted: 12/03/2020] [Indexed: 11/10/2022] Open
Abstract
The field of electron cryomicroscopy (cryo-EM) has advanced quickly in recent years as the result of numerous technological and methodological developments. This has led to an increase in the number of atomic structures determined using this method. Recently, several tools for the analysis of cryo-EM data and models have been developed within the Phenix software package, such as phenix.real_space_refine for the refinement of atomic models against real-space maps. Also, new validation metrics have been developed for low-resolution cryo-EM models. To understand the quality of deposited cryo-EM structures and how they might be improved, models deposited in the Protein Data Bank that have map resolutions of better than 5 Å were automatically re-refined using current versions of Phenix tools. The results are available on a publicly accessible web page (https://cci.lbl.gov/ceres). The implementation of a Cryo-EM Re-refinement System (CERES) for the improvement of models deposited in the wwPDB, and the results of the re-refinements, are described. Based on these results, contents are proposed for a `cryo-EM Table 1', which summarizes experimental details and validation metrics in a similar way to `Table 1' in crystallography. The consistent use of robust metrics for the evaluation of cryo-EM models and data should accompany every structure deposition and be reported in scientific publications.
Collapse
Affiliation(s)
- Dorothee Liebschner
- Molecular Biosciences and Integrated Bioimaging, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Pavel V. Afonine
- Molecular Biosciences and Integrated Bioimaging, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Nigel W. Moriarty
- Molecular Biosciences and Integrated Bioimaging, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Billy K. Poon
- Molecular Biosciences and Integrated Bioimaging, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Vincent B. Chen
- Department of Biochemistry, Duke University, Durham, NC 27710, USA
| | - Paul D. Adams
- Molecular Biosciences and Integrated Bioimaging, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
- Department of Bioengineering, University of California Berkeley, Berkeley, CA 94720, USA
| |
Collapse
|
7
|
Sobolev OV, Afonine PV, Moriarty NW, Hekkelman ML, Joosten RP, Perrakis A, Adams PD. A Global Ramachandran Score Identifies Protein Structures with Unlikely Stereochemistry. Structure 2020; 28:1249-1258.e2. [PMID: 32857966 DOI: 10.1016/j.str.2020.08.005] [Citation(s) in RCA: 55] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2020] [Revised: 06/23/2020] [Accepted: 08/07/2020] [Indexed: 12/18/2022]
Abstract
Ramachandran plots report the distribution of the (ϕ, ψ) torsion angles of the protein backbone and are one of the best quality metrics of experimental structure models. Typically, validation software reports the number of residues belonging to "outlier," "allowed," and "favored" regions. While "zero unexplained outliers" can be considered the current "gold standard," this can be misleading if deviations from expected distributions are not considered. We revisited the Ramachandran Z score (Rama-Z), a quality metric introduced more than two decades ago but underutilized. We describe a reimplementation of the Rama-Z score in the Computational Crystallography Toolbox along with an algorithm to estimate its uncertainty for individual models; final implementations are available in Phenix and PDB-REDO. We discuss the interpretation of the Rama-Z score and advocate including it in the validation reports provided by the Protein Data Bank. We also advocate reporting it alongside the outlier/allowed/favored counts in structural publications.
Collapse
|
8
|
Klaholz BP. Deriving and refining atomic models in crystallography and cryo-EM: the latest Phenix tools to facilitate structure analysis. Acta Crystallogr D Struct Biol 2019; 75:878-881. [PMID: 31588919 PMCID: PMC6778849 DOI: 10.1107/s2059798319013391] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2019] [Accepted: 09/30/2019] [Indexed: 01/05/2023] Open
Abstract
In structural biology, deriving and refining atomic models into maps obtained from X-ray crystallography or cryo electron microscopy (cryo-EM) is essential for the detailed interpretation of a structure and its functional implications through interactions so that for example hydrogen bonds, drug specificity and associated molecular mechanisms can be analysed. This commentary summarizes the latest features of the Phenix software and also highlights the fact that cryo-EM increasingly contributes to data depositions in the PDB and EMDB.
Collapse
Affiliation(s)
- Bruno P. Klaholz
- Centre for Integrative Biology (CBI), Department of Integrated Structural Biology, IGBMC, CNRS, Inserm, Université de Strasbourg, 1 rue Laurent Fries, Illkirch 67404, France
- Institute of Genetics and of Molecular and Cellular Biology (IGBMC), 1 rue Laurent Fries, Illkirch, France
- Centre National de la Recherche Scientifique (CNRS), UMR 7104, Illkirch, France
- Institut National de la Santé et de la Recherche Médicale (Inserm), U964, Illkirch, France
- Université de Strasbourg, Illkirch, France
| |
Collapse
|
9
|
Kim DN, Moriarty NW, Kirmizialtin S, Afonine PV, Poon B, Sobolev OV, Adams PD, Sanbonmatsu K. Cryo_fit: Democratization of flexible fitting for cryo-EM. J Struct Biol 2019; 208:1-6. [PMID: 31279069 PMCID: PMC7112765 DOI: 10.1016/j.jsb.2019.05.012] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2019] [Accepted: 05/31/2019] [Indexed: 12/18/2022]
Abstract
Cryo-electron microscopy (cryo-EM) is becoming a method of choice for describing native conformations of biomolecular complexes at high resolution. The rapid growth of cryo-EM in recent years has created a high demand for automated solutions, both in hardware and software. Flexible fitting of atomic models to three-dimensional (3D) cryo-EM reconstructions by molecular dynamics (MD) simulation is a popular technique but often requires technical expertise in computer simulation. This work introduces cryo_fit, a package for the automatic flexible fitting of atomic models in cryo-EM maps using MD simulation. The package is integrated with the Phenix software suite. The module was designed to automate the multiple steps of MD simulation in a reproducible manner, as well as facilitate refinement and validation through Phenix. Through the use of cryo_fit, scientists with little experience in MD simulation can produce high quality atomic models automatically and better exploit the potential of cryo-EM.
Collapse
Affiliation(s)
- Doo Nam Kim
- Theoretical Biology and Biophysics Group, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - Nigel W Moriarty
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA, USA
| | - Serdal Kirmizialtin
- Chemistry Program, Science Division, New York University, Abu Dhabi, United Arab Emirates
| | - Pavel V Afonine
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA, USA
| | - Billy Poon
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA, USA
| | - Oleg V Sobolev
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA, USA
| | - Paul D Adams
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA, USA; Department of Bioengineering, University of California Berkeley, Berkeley, CA, USA
| | - Karissa Sanbonmatsu
- Theoretical Biology and Biophysics Group, Los Alamos National Laboratory, Los Alamos, NM, USA; New Mexico Consortium, Los Alamos, NM, USA.
| |
Collapse
|
10
|
Liebschner D, Afonine PV, Baker ML, Bunkóczi G, Chen VB, Croll TI, Hintze B, Hung LW, Jain S, McCoy AJ, Moriarty NW, Oeffner RD, Poon BK, Prisant MG, Read RJ, Richardson JS, Richardson DC, Sammito MD, Sobolev OV, Stockwell DH, Terwilliger TC, Urzhumtsev AG, Videau LL, Williams CJ, Adams PD. Macromolecular structure determination using X-rays, neutrons and electrons: recent developments in Phenix. Acta Crystallogr D Struct Biol 2019; 75:861-877. [PMID: 31588918 PMCID: PMC6778852 DOI: 10.1107/s2059798319011471] [Citation(s) in RCA: 3184] [Impact Index Per Article: 636.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2019] [Accepted: 08/15/2019] [Indexed: 12/16/2022] Open
Abstract
Diffraction (X-ray, neutron and electron) and electron cryo-microscopy are powerful methods to determine three-dimensional macromolecular structures, which are required to understand biological processes and to develop new therapeutics against diseases. The overall structure-solution workflow is similar for these techniques, but nuances exist because the properties of the reduced experimental data are different. Software tools for structure determination should therefore be tailored for each method. Phenix is a comprehensive software package for macromolecular structure determination that handles data from any of these techniques. Tasks performed with Phenix include data-quality assessment, map improvement, model building, the validation/rebuilding/refinement cycle and deposition. Each tool caters to the type of experimental data. The design of Phenix emphasizes the automation of procedures, where possible, to minimize repetitive and time-consuming manual tasks, while default parameters are chosen to encourage best practice. A graphical user interface provides access to many command-line features of Phenix and streamlines the transition between programs, project tracking and re-running of previous tasks.
Collapse
Affiliation(s)
- Dorothee Liebschner
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Pavel V. Afonine
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Matthew L. Baker
- Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, TX 77030, USA
| | - Gábor Bunkóczi
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Cambridge CB2 0XY, England
| | - Vincent B. Chen
- Department of Biochemistry, Duke University, Durham, NC 27710, USA
| | - Tristan I. Croll
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Cambridge CB2 0XY, England
| | - Bradley Hintze
- Department of Biochemistry, Duke University, Durham, NC 27710, USA
| | - Li-Wei Hung
- Los Alamos National Laboratory, Los Alamos, NM 87545, USA
| | - Swati Jain
- Department of Biochemistry, Duke University, Durham, NC 27710, USA
| | - Airlie J. McCoy
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Cambridge CB2 0XY, England
| | - Nigel W. Moriarty
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Robert D. Oeffner
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Cambridge CB2 0XY, England
| | - Billy K. Poon
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | | | - Randy J. Read
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Cambridge CB2 0XY, England
| | | | | | - Massimo D. Sammito
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Cambridge CB2 0XY, England
| | - Oleg V. Sobolev
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Duncan H. Stockwell
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Cambridge CB2 0XY, England
| | - Thomas C. Terwilliger
- Los Alamos National Laboratory, Los Alamos, NM 87545, USA
- New Mexico Consortium, Los Alamos, NM 87544, USA
| | - Alexandre G. Urzhumtsev
- Centre for Integrative Biology, Institut de Génétique et de Biologie Moléculaire et Cellulaire, CNRS–INSERM–UdS, 67404 Illkirch, France
- Faculté des Sciences et Technologies, Université de Lorraine, BP 239, 54506 Vandoeuvre-lès-Nancy, France
| | | | | | - Paul D. Adams
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
- Department of Bioengineering, University of California Berkeley, Berkeley, CA 94720, USA
| |
Collapse
|
11
|
Liebschner D, Afonine PV, Moriarty NW, Langan P, Adams PD. Evaluation of models determined by neutron diffraction and proposed improvements to their validation and deposition. Acta Crystallogr D Struct Biol 2018; 74:800-813. [PMID: 30082516 PMCID: PMC6079631 DOI: 10.1107/s2059798318004588] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2017] [Accepted: 03/19/2018] [Indexed: 11/10/2022] Open
Abstract
The Protein Data Bank (PDB) contains a growing number of models that have been determined using neutron diffraction or a hybrid method that combines X-ray and neutron diffraction. The advantage of neutron diffraction experiments is that the positions of all atoms can be determined, including H atoms, which are hardly detectable by X-ray diffraction. This allows the determination of protonation states and the assignment of H atoms to water molecules. Because neutrons are scattered differently by hydrogen and its isotope deuterium, neutron diffraction in combination with H/D exchange can provide information on accessibility, dynamics and chemical lability. In this study, the deposited data, models and model-to-data fit for all PDB entries that used neutron diffraction as the source of experimental data have been analysed. In many cases, the reported Rwork and Rfree values were not reproducible. In such cases, the model and data files were analysed to identify the reasons for this mismatch. The issues responsible for the discrepancies are summarized and explained. The analysis unveiled limitations to the annotation, deposition and validation of models and data, and a lack of community-wide accepted standards for the description of neutron models and data, as well as deficiencies in current model refinement tools. Most of the issues identified concern the handling of H atoms. Since the primary use of neutron macromolecular crystallography is to locate and directly visualize H atoms, it is important to address these issues, so that the deposited neutron models allow the retrieval of the maximum amount of information with the smallest effort of manual intervention. A path forward to improving the annotation, validation and deposition of neutron models and hybrid X-ray and neutron models is suggested.
Collapse
Affiliation(s)
- Dorothee Liebschner
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Pavel V. Afonine
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
- Department of Physics and International Centre for Quantum and Molecular Structures, Shanghai University, Shanghai 200444, People’s Republic of China
| | - Nigel W. Moriarty
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Paul Langan
- Neutron Science Directorate, Oak Ridge National Laboratory, PO Box 2008, Oak Ridge, TN 37831, USA
| | - Paul D. Adams
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
- Department of Bioengineering, University of California, Berkeley, CA 94720, USA
| |
Collapse
|
12
|
Hintze BJ, Lewis SM, Richardson JS, Richardson DC. Molprobity's ultimate rotamer-library distributions for model validation. Proteins 2016; 84:1177-89. [PMID: 27018641 DOI: 10.1002/prot.25039] [Citation(s) in RCA: 69] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2016] [Revised: 03/16/2016] [Accepted: 03/18/2016] [Indexed: 12/22/2022]
Abstract
Here we describe the updated MolProbity rotamer-library distributions derived from an order-of-magnitude larger and more stringently quality-filtered dataset of about 8000 (vs. 500) protein chains, and we explain the resulting changes and improvements to model validation as seen by users. To include only side-chains with satisfactory justification for their given conformation, we added residue-specific filters for electron-density value and model-to-density fit. The combined new protocol retains a million residues of data, while cleaning up false-positive noise in the multi- χ datapoint distributions. It enables unambiguous characterization of conformational clusters nearly 1000-fold less frequent than the most common ones. We describe examples of local interactions that favor these rare conformations, including the role of authentic covalent bond-angle deviations in enabling presumably strained side-chain conformations. Further, along with favored and outlier, an allowed category (0.3-2.0% occurrence in reference data) has been added, analogous to Ramachandran validation categories. The new rotamer distributions are used for current rotamer validation in MolProbity and PHENIX, and for rotamer choice in PHENIX model-building and refinement. The multi-dimensional χ distributions and Top8000 reference dataset are freely available on GitHub. These rotamers are termed "ultimate" because data sampling and quality are now fully adequate for this task, and also because we believe the future of conformational validation should integrate side-chain with backbone criteria. Proteins 2016; 84:1177-1189. © 2016 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Bradley J Hintze
- Department of Biochemistry, Duke University, Durham North Carolina 27710
| | - Steven M Lewis
- Department of Biochemistry, Duke University, Durham North Carolina 27710
| | - Jane S Richardson
- Department of Biochemistry, Duke University, Durham North Carolina 27710
| | - David C Richardson
- Department of Biochemistry, Duke University, Durham North Carolina 27710
| |
Collapse
|
13
|
Moriarty NW, Tronrud DE, Adams PD, Karplus PA. A new default restraint library for the protein backbone in Phenix: a conformation-dependent geometry goes mainstream. Acta Crystallogr D Struct Biol 2016; 72:176-9. [PMID: 26894545 PMCID: PMC4756615 DOI: 10.1107/s2059798315022408] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/05/2015] [Accepted: 11/23/2015] [Indexed: 11/12/2022]
Abstract
The default geometry restraints used in Phenix for the protein backbone have been upgraded to account for the known conformation-dependencies of bond angles and lengths. Chemical restraints are a fundamental part of crystallographic protein structure refinement. In response to mounting evidence that conventional restraints have shortcomings, it has previously been documented that using backbone restraints that depend on the protein backbone conformation helps to address these shortcomings and improves the performance of refinements [Moriarty et al. (2014 ▸), FEBS J.281, 4061–4071]. It is important that these improvements be made available to all in the protein crystallography community. Toward this end, a change in the default geometry library used by Phenix is described here. Tests are presented showing that this change will not generate increased numbers of outliers during validation, or deposition in the Protein Data Bank, during the transition period in which some validation tools still use the conventional restraint libraries.
Collapse
Affiliation(s)
- Nigel W Moriarty
- Physical Biosciences, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Dale E Tronrud
- Department of Biochemistry and Biophysics, Oregon State University, Corvallis, OR 97377, USA
| | - Paul D Adams
- Physical Biosciences, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - P Andrew Karplus
- Department of Biochemistry and Biophysics, Oregon State University, Corvallis, OR 97377, USA
| |
Collapse
|