1
|
Accounting for nonuniformity of bulk-solvent: A mosaic model. Protein Sci 2024; 33:e4909. [PMID: 38358136 PMCID: PMC10868464 DOI: 10.1002/pro.4909] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2023] [Revised: 12/08/2023] [Accepted: 01/08/2024] [Indexed: 02/16/2024]
Abstract
A flat mask-based model is almost universally used in macromolecular crystallography to account for disordered (bulk) solvent. This model assumes any voxel of the crystal unit cell that is not occupied by the atomic model is occupied by the solvent. The properties of this solvent are assumed to be exactly the same across the whole volume of the unit cell. While this is a reasonable approximation in practice, there are a number of scenarios where this model becomes suboptimal. In this work, we enumerate several of these scenarios and describe a new generalized approach to modeling the bulk-solvent which we refer to as mosaic bulk-solvent model. The mosaic bulk-solvent model allows nonuniform features of the solvent in the crystal to be accounted for in a computationally efficient way. It is implemented in the computational crystallography toolbox and the Phenix software.
Collapse
|
2
|
Outcomes of the EMDataResource Cryo-EM Ligand Modeling Challenge. RESEARCH SQUARE 2024:rs.3.rs-3864137. [PMID: 38343795 PMCID: PMC10854310 DOI: 10.21203/rs.3.rs-3864137/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/18/2024]
Abstract
The EMDataResource Ligand Model Challenge aimed to assess the reliability and reproducibility of modeling ligands bound to protein and protein/nucleic-acid complexes in cryogenic electron microscopy (cryo-EM) maps determined at near-atomic (1.9-2.5 Å) resolution. Three published maps were selected as targets: E. coli beta-galactosidase with inhibitor, SARS-CoV-2 RNA-dependent RNA polymerase with covalently bound nucleotide analog, and SARS-CoV-2 ion channel ORF3a with bound lipid. Sixty-one models were submitted from 17 independent research groups, each with supporting workflow details. We found that (1) the quality of submitted ligand models and surrounding atoms varied, as judged by visual inspection and quantification of local map quality, model-to-map fit, geometry, energetics, and contact scores, and (2) a composite rather than a single score was needed to assess macromolecule+ligand model quality. These observations lead us to recommend best practices for assessing cryo-EM structures of liganded macromolecules reported at near-atomic resolution.
Collapse
|
3
|
AlphaFold predictions are valuable hypotheses and accelerate but do not replace experimental structure determination. Nat Methods 2024; 21:110-116. [PMID: 38036854 PMCID: PMC10776388 DOI: 10.1038/s41592-023-02087-4] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2023] [Accepted: 10/11/2023] [Indexed: 12/02/2023]
Abstract
Artificial intelligence-based protein structure prediction methods such as AlphaFold have revolutionized structural biology. The accuracies of these predictions vary, however, and they do not take into account ligands, covalent modifications or other environmental factors. Here, we evaluate how well AlphaFold predictions can be expected to describe the structure of a protein by comparing predictions directly with experimental crystallographic maps. In many cases, AlphaFold predictions matched experimental maps remarkably closely. In other cases, even very high-confidence predictions differed from experimental maps on a global scale through distortion and domain orientation, and on a local scale in backbone and side-chain conformation. We suggest considering AlphaFold predictions as exceptionally useful hypotheses. We further suggest that it is important to consider the confidence in prediction when interpreting AlphaFold predictions and to carry out experimental structure determination to verify structural details, particularly those that involve interactions not included in the prediction.
Collapse
|
4
|
Improved joint X-ray and neutron refinement procedure in Phenix. Acta Crystallogr D Struct Biol 2023; 79:1079-1093. [PMID: 37942718 PMCID: PMC10833352 DOI: 10.1107/s2059798323008914] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Accepted: 10/11/2023] [Indexed: 11/10/2023] Open
Abstract
Neutron diffraction is one of the three crystallographic techniques (X-ray, neutron and electron diffraction) used to determine the atomic structures of molecules. Its particular strengths derive from the fact that H (and D) atoms are strong neutron scatterers, meaning that their positions, and thus protonation states, can be derived from crystallographic maps. However, because of technical limitations and experimental obstacles, the quality of neutron diffraction data is typically much poorer (completeness, resolution and signal to noise) than that of X-ray diffraction data for the same sample. Further, refinement is more complex as it usually requires additional parameters to describe the H (and D) atoms. The increase in the number of parameters may be mitigated by using the `riding hydrogen' refinement strategy, in which the positions of H atoms without a rotational degree of freedom are inferred from their neighboring heavy atoms. However, this does not address the issues related to poor data quality. Therefore, neutron structure determination often relies on the presence of an X-ray data set for joint X-ray and neutron (XN) refinement. In this approach, the X-ray data serve to compensate for the deficiencies of the neutron diffraction data by refining one model simultaneously against the X-ray and neutron data sets. To be applicable, it is assumed that both data sets are highly isomorphous, and preferably collected from the same crystals and at the same temperature. However, the approach has a number of limitations that are discussed in this work by comparing four separately re-refined neutron models. To address the limitations, a new method for joint XN refinement is introduced that optimizes two different models against the different data sets. This approach is tested using neutron models and data deposited in the Protein Data Bank. The efficacy of refining models with H atoms as riding or as individual atoms is also investigated.
Collapse
|
5
|
Overall protein structure quality assessment using hydrogen-bonding parameters. Acta Crystallogr D Struct Biol 2023:S2059798323005077. [PMID: 37431759 PMCID: PMC10394671 DOI: 10.1107/s2059798323005077] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/12/2023] Open
Abstract
Atomic model refinement at low resolution is often a challenging task. This is mostly because the experimental data are not sufficiently detailed to be described by atomic models. To make refinement practical and ensure that a refined atomic model is geometrically meaningful, additional information needs to be used such as restraints on Ramachandran plot distributions or residue side-chain rotameric states. However, using Ramachandran plots or rotameric states as refinement targets diminishes the validating power of these tools. Therefore, finding additional model-validation criteria that are not used or are difficult to use as refinement goals is desirable. Hydrogen bonds are one of the important noncovalent interactions that shape and maintain protein structure. These interactions can be characterized by a specific geometry of hydrogen donor and acceptor atoms. Systematic analysis of these geometries performed for quality-filtered high-resolution models of proteins from the Protein Data Bank shows that they have a distinct and a conserved distribution. Here, it is demonstrated how this information can be used for atomic model validation.
Collapse
|
6
|
Bulk-solvent and overall scaling revisited: faster calculations, improved results. Corrigendum. Acta Crystallogr D Struct Biol 2023; 79:S2059798323004825. [PMID: 37338421 DOI: 10.1107/s2059798323004825] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/21/2023] Open
Abstract
Equations in Sections 2.3 and 2.4 of the article by Afonine et al. [Acta Cryst. (2013). D69, 625-634] are corrected.
Collapse
|
7
|
Efficient structure-factor modeling for crystals with multiple components. Acta Crystallogr A Found Adv 2023:S205327332300356X. [PMID: 37338214 DOI: 10.1107/s205327332300356x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/21/2023] Open
Abstract
Diffraction intensities from a crystallographic experiment include contributions from the entire unit cell of the crystal: the macromolecule, the solvent around it and eventually other compounds. These contributions cannot typically be well described by an atomic model alone, i.e. using point scatterers. Indeed, entities such as disordered (bulk) solvent, semi-ordered solvent (e.g. lipid belts in membrane proteins, ligands, ion channels) and disordered polymer loops require other types of modeling than a collection of individual atoms. This results in the model structure factors containing multiple contributions. Most macromolecular applications assume two-component structure factors: one component arising from the atomic model and the second one describing the bulk solvent. A more accurate and detailed modeling of the disordered regions of the crystal will naturally require more than two components in the structure factors, which presents algorithmic and computational challenges. Here an efficient solution of this problem is proposed. All algorithms described in this work have been implemented in the computational crystallography toolbox (CCTBX) and are also available within Phenix software. These algorithms are rather general and do not use any assumptions about molecule type or size nor about those of its components.
Collapse
|
8
|
Accelerating crystal structure determination with iterative AlphaFold prediction. Acta Crystallogr D Struct Biol 2023; 79:234-244. [PMID: 36876433 PMCID: PMC9986801 DOI: 10.1107/s205979832300102x] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2022] [Accepted: 02/03/2023] [Indexed: 02/28/2023] Open
Abstract
Experimental structure determination can be accelerated with artificial intelligence (AI)-based structure-prediction methods such as AlphaFold. Here, an automatic procedure requiring only sequence information and crystallographic data is presented that uses AlphaFold predictions to produce an electron-density map and a structural model. Iterating through cycles of structure prediction is a key element of this procedure: a predicted model rebuilt in one cycle is used as a template for prediction in the next cycle. This procedure was applied to X-ray data for 215 structures released by the Protein Data Bank in a recent six-month period. In 87% of cases our procedure yielded a model with at least 50% of Cα atoms matching those in the deposited models within 2 Å. Predictions from the iterative template-guided prediction procedure were more accurate than those obtained without templates. It is concluded that AlphaFold predictions obtained based on sequence information alone are usually accurate enough to solve the crystallographic phase problem with molecular replacement, and a general strategy for macromolecular structure determination that includes AI-based prediction both as a starting point and as a method of model optimization is suggested.
Collapse
|
9
|
Abstract
Machine-learning prediction algorithms such as AlphaFold and RoseTTAFold can create remarkably accurate protein models, but these models usually have some regions that are predicted with low confidence or poor accuracy. We hypothesized that by implicitly including new experimental information such as a density map, a greater portion of a model could be predicted accurately, and that this might synergistically improve parts of the model that were not fully addressed by either machine learning or experiment alone. An iterative procedure was developed in which AlphaFold models are automatically rebuilt on the basis of experimental density maps and the rebuilt models are used as templates in new AlphaFold predictions. We show that including experimental information improves prediction beyond the improvement obtained with simple rebuilding guided by the experimental data. This procedure for AlphaFold modeling with density has been incorporated into an automated procedure for interpretation of crystallographic and electron cryo-microscopy maps.
Collapse
|
10
|
Protein identification from electron cryomicroscopy maps by automated model building and side-chain matching. Acta Crystallogr D Struct Biol 2021; 77:457-462. [PMID: 33825706 PMCID: PMC8025881 DOI: 10.1107/s2059798321001765] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2020] [Accepted: 02/12/2021] [Indexed: 11/10/2022] Open
Abstract
Using single-particle electron cryo-microscopy (cryo-EM), it is possible to obtain multiple reconstructions showing the 3D structures of proteins imaged as a mixture. Here, it is shown that automatic map interpretation based on such reconstructions can be used to create atomic models of proteins as well as to match the proteins to the correct sequences and thereby to identify them. This procedure was tested using two proteins previously identified from a mixture at resolutions of 3.2 Å, as well as using 91 deposited maps with resolutions between 2 and 4.5 Å. The approach is found to be highly effective for maps obtained at resolutions of 3.5 Å and better, and to have some utility at resolutions as low as 4 Å.
Collapse
|
11
|
Detection of translational noncrystallographic symmetry in Patterson functions. Acta Crystallogr D Struct Biol 2021; 77:131-141. [PMID: 33559603 PMCID: PMC7869901 DOI: 10.1107/s2059798320016836] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2019] [Accepted: 12/31/2020] [Indexed: 12/04/2022] Open
Abstract
Detection of translational noncrystallographic symmetry (TNCS) can be critical for success in crystallographic phasing, particularly when molecular-replacement models are poor or anomalous phasing information is weak. If the correct TNCS is detected then expected intensity factors for each reflection can be refined, so that the maximum-likelihood functions underlying molecular replacement and single-wavelength anomalous dispersion use appropriate structure-factor normalization and variance terms. Here, an analysis of a curated database of protein structures from the Protein Data Bank to investigate how TNCS manifests in the Patterson function is described. These studies informed an algorithm for the detection of TNCS, which includes a method for detecting the number of vectors involved in any commensurate modulation (the TNCS order). The algorithm generates a ranked list of possible TNCS associations in the asymmetric unit for exploration during structure solution.
Collapse
|
12
|
Cryo-EM model validation recommendations based on outcomes of the 2019 EMDataResource challenge. Nat Methods 2021; 18:156-164. [PMID: 33542514 PMCID: PMC7864804 DOI: 10.1038/s41592-020-01051-w] [Citation(s) in RCA: 57] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2020] [Accepted: 12/21/2020] [Indexed: 01/30/2023]
Abstract
This paper describes outcomes of the 2019 Cryo-EM Model Challenge. The goals were to (1) assess the quality of models that can be produced from cryogenic electron microscopy (cryo-EM) maps using current modeling software, (2) evaluate reproducibility of modeling results from different software developers and users and (3) compare performance of current metrics used for model evaluation, particularly Fit-to-Map metrics, with focus on near-atomic resolution. Our findings demonstrate the relatively high accuracy and reproducibility of cryo-EM models derived by 13 participating teams from four benchmark maps, including three forming a resolution series (1.8 to 3.1 Å). The results permit specific recommendations to be made about validating near-atomic cryo-EM structures both in the context of individual experiments and structure data archives such as the Protein Data Bank. We recommend the adoption of multiple scoring parameters to provide full and objective annotation and assessment of the model, reflective of the observed cryo-EM map density.
Collapse
|
13
|
CERES: a cryo-EM re-refinement system for continuous improvement of deposited models. Acta Crystallogr D Struct Biol 2021; 77:48-61. [PMID: 33404525 PMCID: PMC7787109 DOI: 10.1107/s2059798320015879] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2020] [Accepted: 12/03/2020] [Indexed: 11/10/2022] Open
Abstract
The field of electron cryomicroscopy (cryo-EM) has advanced quickly in recent years as the result of numerous technological and methodological developments. This has led to an increase in the number of atomic structures determined using this method. Recently, several tools for the analysis of cryo-EM data and models have been developed within the Phenix software package, such as phenix.real_space_refine for the refinement of atomic models against real-space maps. Also, new validation metrics have been developed for low-resolution cryo-EM models. To understand the quality of deposited cryo-EM structures and how they might be improved, models deposited in the Protein Data Bank that have map resolutions of better than 5 Å were automatically re-refined using current versions of Phenix tools. The results are available on a publicly accessible web page (https://cci.lbl.gov/ceres). The implementation of a Cryo-EM Re-refinement System (CERES) for the improvement of models deposited in the wwPDB, and the results of the re-refinements, are described. Based on these results, contents are proposed for a `cryo-EM Table 1', which summarizes experimental details and validation metrics in a similar way to `Table 1' in crystallography. The consistent use of robust metrics for the evaluation of cryo-EM models and data should accompany every structure deposition and be reported in scientific publications.
Collapse
|
14
|
Real-space quantum-based refinement for cryo-EM: Q|R#3. ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY 2020; 76:1184-1191. [PMID: 33263324 DOI: 10.1107/s2059798320013194] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/25/2020] [Accepted: 09/29/2020] [Indexed: 11/10/2022]
Abstract
Electron cryo-microscopy (cryo-EM) is rapidly becoming a major competitor to X-ray crystallography, especially for large structures that are difficult or impossible to crystallize. While recent spectacular technological improvements have led to significantly higher resolution three-dimensional reconstructions, the average quality of cryo-EM maps is still at the low-resolution end of the range compared with crystallography. A long-standing challenge for atomic model refinement has been the production of stereochemically meaningful models for this resolution regime. Here, it is demonstrated that including accurate model geometry restraints derived from ab initio quantum-chemical calculations (HF-D3/6-31G) can improve the refinement of an example structure (chain A of PDB entry 3j63). The robustness of the procedure is tested for additional structures with up to 7000 atoms (PDB entry 3a5x and chain C of PDB entry 5fn5) using the less expensive semi-empirical (GFN1-xTB) model. The necessary algorithms enabling real-space quantum refinement have been implemented in the latest version of qr.refine and are described here.
Collapse
|
15
|
Abstract
Density modification uses expectations about features of a map such as a flat solvent and expected distributions of density in the region of the macromolecule to improve individual Fourier terms representing the map. This process transfers information from one part of a map to another and can improve the accuracy of a map. Here, the assumptions behind density modification for maps from electron cryomicroscopy are examined and a procedure is presented that allows the incorporation of model-based information. Density modification works best in cases where unfiltered, unmasked maps with clear boundaries between the macromolecule and solvent are visible, and where there is substantial noise in the map, both in the region of the macromolecule and the solvent. It also is most effective if the characteristics of the map are relatively constant within regions of the macromolecule and the solvent. Model-based information can be used to improve density modification, but model bias can in principle occur. Here, model bias is reduced by using ensemble models that allow an estimation of model uncertainty. A test of model bias is presented that suggests that even if the expected density in a region of a map is specified incorrectly by using an incorrect model, the incorrect expectations do not strongly affect the final map.
Collapse
|
16
|
What are the current limits on determination of protonation state using neutron macromolecular crystallography? Methods Enzymol 2020; 634:225-255. [PMID: 32093835 PMCID: PMC7571246 DOI: 10.1016/bs.mie.2020.01.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/30/2024]
Abstract
The rate of deposition of models determined by neutron diffraction, or a hybrid approach that combines X-ray and neutron diffraction, has increased in recent years. The benefit of neutron diffraction is that hydrogen atom (H) positions are detectable, allowing for the determination of protonation state and water molecule orientation. This study analyses all neutron models deposited in the Protein Data Bank to date, focusing on protonation state and properties of H (or deuterium, D) atoms as well as the details of water molecules. In particular, clashes and hydrogen bonds involving H or D atoms are investigated. As water molecules are typically the least reproducible part of a structural model, their positions in neutron models were compared to those in homologous high-resolution X-ray structures. For models determined by joint refinement against X-ray and neutron data, the water structure comparison was also carried out for models re-refined against the X-ray data alone. The homologues have generally fewer conserved water molecules where X-ray only was used and the positions of equivalent waters vary more than in the case of the hybrid X-ray model. As neutron diffraction data are generally less complete than X-ray data, the influence of neutron data completeness on nuclear density maps was also analyzed. We observe and discuss systematic map quality deterioration as result of data incompleteness.
Collapse
|
17
|
Implementation of the riding hydrogen model in CCTBX to support the next generation of X-ray and neutron joint refinement in Phenix. Methods Enzymol 2020; 634:177-199. [PMID: 32093832 PMCID: PMC7574815 DOI: 10.1016/bs.mie.2020.01.007] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
A fundamental prerequisite for implementing new procedures of atomic model refinement against neutron diffraction data is the efficient handling of hydrogen atoms. The riding hydrogen model, which constrains hydrogen atom parameters to those of the non-hydrogen atoms, is a plausible parameterization for refinements. This work describes the implementation of the riding hydrogen model in the Computational Crystallography Toolbox and in Phenix. Riding hydrogen atoms can be found in several different configurations that are characterized by specific geometries. For each configuration, the hydrogen atom parameterization and the expressions for the gradients of refinement target function with respect to non-hydrogen parameters are described.
Collapse
|
18
|
Including crystallographic symmetry in quantum-based refinement: Q|R#2. ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY 2020; 76:41-50. [PMID: 31909742 DOI: 10.1107/s2059798319015122] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/25/2019] [Accepted: 11/08/2019] [Indexed: 11/11/2022]
Abstract
Three-dimensional structure models refined using low-resolution data from crystallographic or electron cryo-microscopy experiments can benefit from high-quality restraints derived from quantum-chemical methods. However, nonperiodic atom-centered quantum-chemistry codes do not inherently account for nearest-neighbor interactions of crystallographic symmetry-related copies in a satisfactory way. Here, these nearest-neighbor effects have been included in the model by expanding to a super-cell and then truncating the super-cell to only include residues from neighboring cells that are interacting with the asymmetric unit. In this way, the fragmentation approach can adequately and efficiently include nearest-neighbor effects. It has previously been shown that a moderately sized X-ray structure can be treated using quantum methods if a fragmentation approach is applied. In this study, a target protein (PDB entry 4gif) was partitioned into a number of large fragments. The use of large fragments (typically hundreds of atoms) is tractable when a GPU-based package such as TeraChem is employed or cheaper (semi-empirical) methods are used. The QM calculations were run at the HF-D3/6-31G level. The models refined using a recently developed semi-empirical method (GFN2-xTB) were compared and contrasted. To validate the refinement procedure for a non-P1 structure, a standard set of crystallographic metrics were used. The robustness of the implementation is shown by refining 13 additional protein models across multiple space groups and a summary of the refinement metrics is presented.
Collapse
|
19
|
Cryo-EM map interpretation and protein model-building using iterative map segmentation. Protein Sci 2019; 29:87-99. [PMID: 31599033 PMCID: PMC6933853 DOI: 10.1002/pro.3740] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2019] [Revised: 09/30/2019] [Accepted: 10/01/2019] [Indexed: 11/17/2022]
Abstract
A procedure for building protein chains into maps produced by single‐particle electron cryo‐microscopy (cryo‐EM) is described. The procedure is similar to the way an experienced structural biologist might analyze a map, focusing first on secondary structure elements such as helices and sheets, then varying the contour level to identify connections between these elements. Since the high density in a map typically follows the main‐chain of the protein, the main‐chain connection between secondary structure elements can often be identified as the unbranched path between them with the highest minimum value along the path. This chain‐tracing procedure is then combined with finding side‐chain positions based on the presence of density extending away from the main path of the chain, allowing generation of a Cα model. The Cα model is converted to an all‐atom model and is refined against the map. We show that this procedure is as effective as other existing methods for interpretation of cryo‐EM maps and that it is considerably faster and produces models with fewer chain breaks than our previous methods that were based on approaches developed for crystallographic maps.
Collapse
|
20
|
Abstract
Cryo-electron microscopy (cryo-EM) is becoming a method of choice for describing native conformations of biomolecular complexes at high resolution. The rapid growth of cryo-EM in recent years has created a high demand for automated solutions, both in hardware and software. Flexible fitting of atomic models to three-dimensional (3D) cryo-EM reconstructions by molecular dynamics (MD) simulation is a popular technique but often requires technical expertise in computer simulation. This work introduces cryo_fit, a package for the automatic flexible fitting of atomic models in cryo-EM maps using MD simulation. The package is integrated with the Phenix software suite. The module was designed to automate the multiple steps of MD simulation in a reproducible manner, as well as facilitate refinement and validation through Phenix. Through the use of cryo_fit, scientists with little experience in MD simulation can produce high quality atomic models automatically and better exploit the potential of cryo-EM.
Collapse
|
21
|
Macromolecular structure determination using X-rays, neutrons and electrons: recent developments in Phenix. Acta Crystallogr D Struct Biol 2019; 75:861-877. [PMID: 31588918 PMCID: PMC6778852 DOI: 10.1107/s2059798319011471] [Citation(s) in RCA: 3184] [Impact Index Per Article: 636.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2019] [Accepted: 08/15/2019] [Indexed: 12/16/2022] Open
Abstract
Diffraction (X-ray, neutron and electron) and electron cryo-microscopy are powerful methods to determine three-dimensional macromolecular structures, which are required to understand biological processes and to develop new therapeutics against diseases. The overall structure-solution workflow is similar for these techniques, but nuances exist because the properties of the reduced experimental data are different. Software tools for structure determination should therefore be tailored for each method. Phenix is a comprehensive software package for macromolecular structure determination that handles data from any of these techniques. Tasks performed with Phenix include data-quality assessment, map improvement, model building, the validation/rebuilding/refinement cycle and deposition. Each tool caters to the type of experimental data. The design of Phenix emphasizes the automation of procedures, where possible, to minimize repetitive and time-consuming manual tasks, while default parameters are chosen to encourage best practice. A graphical user interface provides access to many command-line features of Phenix and streamlines the transition between programs, project tracking and re-running of previous tasks.
Collapse
|
22
|
Efficient real-space refinement for cryo-EM and crystallography. Acta Crystallogr A Found Adv 2019. [DOI: 10.1107/s2053273319094701] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
|
23
|
Updated validation and deposition tools in the Phenix GUI. Acta Crystallogr A Found Adv 2019. [DOI: 10.1107/s0108767319096703] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022] Open
|
24
|
Announcing mandatory submission of PDBx/mmCIF format files for crystallographic depositions to the Protein Data Bank (PDB). Acta Crystallogr D Struct Biol 2019; 75:451-454. [PMID: 30988261 PMCID: PMC6465986 DOI: 10.1107/s2059798319004522] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2019] [Accepted: 04/03/2019] [Indexed: 11/10/2022] Open
Abstract
This letter announces that PDBx/mmCIF format files will become mandatory for crystallographic depositions to the Protein Data Bank (PDB).
Collapse
|
25
|
A fully automatic method yielding initial models from high-resolution cryo-electron microscopy maps. Nat Methods 2018; 15:905-908. [PMID: 30377346 PMCID: PMC6214191 DOI: 10.1038/s41592-018-0173-1] [Citation(s) in RCA: 101] [Impact Index Per Article: 16.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2018] [Accepted: 08/06/2018] [Indexed: 01/31/2023]
Abstract
We report a fully automated procedure for the optimization and interpretation of reconstructions from cryo-electron microscopy (cryo-EM) data, available in Phenix as phenix.map_to_model. We applied our approach to 476 datasets with resolution of 4.5 Å or better, including reconstructions of 47 ribosomes and 32 other protein-RNA complexes. The median fraction of residues in the deposited structures reproduced automatically was 71% for reconstructions determined at resolutions of 3 Å or better and 47% for those at resolutions worse than 3 Å.
Collapse
|
26
|
Map segmentation, automated model-building and their application to the Cryo-EM Model Challenge. J Struct Biol 2018; 204:338-343. [PMID: 30063987 PMCID: PMC6163059 DOI: 10.1016/j.jsb.2018.07.016] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2018] [Revised: 07/11/2018] [Accepted: 07/27/2018] [Indexed: 11/27/2022]
Abstract
A recently-developed method for identifying a compact, contiguous region representing the unique part of a density map was applied to 218 Cryo-EM maps with resolutions of 4.5 Å or better. The key elements of the segmentation procedure are (1) identification of all regions of density above a threshold and (2) choice of a unique set of these regions, taking symmetry into consideration, that maximize connectivity and compactness. This segmentation approach was then combined with tools for automated map sharpening and model-building to generate models for the 12 maps in the 2016 Cryo-EM Model Challenge in a fully automated manner. The resulting models have completeness from 24% to 82% and RMS distances from reference interpretations of 0.6 Å-2.1 Å.
Collapse
|
27
|
New tools for the analysis and validation of cryo-EM maps and atomic models. Acta Crystallogr D Struct Biol 2018; 74:814-840. [PMID: 30198894 PMCID: PMC6130467 DOI: 10.1107/s2059798318009324] [Citation(s) in RCA: 430] [Impact Index Per Article: 71.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2017] [Accepted: 06/27/2018] [Indexed: 11/25/2022] Open
Abstract
Recent advances in the field of electron cryomicroscopy (cryo-EM) have resulted in a rapidly increasing number of atomic models of biomacromolecules that have been solved using this technique and deposited in the Protein Data Bank and the Electron Microscopy Data Bank. Similar to macromolecular crystallography, validation tools for these models and maps are required. While some of these validation tools may be borrowed from crystallography, new methods specifically designed for cryo-EM validation are required. Here, new computational methods and tools implemented in PHENIX are discussed, including d99 to estimate resolution, phenix.auto_sharpen to improve maps and phenix.mtriage to analyze cryo-EM maps. It is suggested that cryo-EM half-maps and masks should be deposited to facilitate the evaluation and validation of cryo-EM-derived atomic models and maps. The application of these tools to deposited cryo-EM atomic models and maps is also presented.
Collapse
|
28
|
The expected log-likelihood gain for decision making in molecular replacement. Acta Crystallogr A Found Adv 2018. [DOI: 10.1107/s2053273318089027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022] Open
|
29
|
Evaluation of models determined by neutron diffraction and proposed improvements to their validation and deposition. Acta Crystallogr D Struct Biol 2018; 74:800-813. [PMID: 30082516 PMCID: PMC6079631 DOI: 10.1107/s2059798318004588] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2017] [Accepted: 03/19/2018] [Indexed: 11/10/2022] Open
Abstract
The Protein Data Bank (PDB) contains a growing number of models that have been determined using neutron diffraction or a hybrid method that combines X-ray and neutron diffraction. The advantage of neutron diffraction experiments is that the positions of all atoms can be determined, including H atoms, which are hardly detectable by X-ray diffraction. This allows the determination of protonation states and the assignment of H atoms to water molecules. Because neutrons are scattered differently by hydrogen and its isotope deuterium, neutron diffraction in combination with H/D exchange can provide information on accessibility, dynamics and chemical lability. In this study, the deposited data, models and model-to-data fit for all PDB entries that used neutron diffraction as the source of experimental data have been analysed. In many cases, the reported Rwork and Rfree values were not reproducible. In such cases, the model and data files were analysed to identify the reasons for this mismatch. The issues responsible for the discrepancies are summarized and explained. The analysis unveiled limitations to the annotation, deposition and validation of models and data, and a lack of community-wide accepted standards for the description of neutron models and data, as well as deficiencies in current model refinement tools. Most of the issues identified concern the handling of H atoms. Since the primary use of neutron macromolecular crystallography is to locate and directly visualize H atoms, it is important to address these issues, so that the deposited neutron models allow the retrieval of the maximum amount of information with the smallest effort of manual intervention. A path forward to improving the annotation, validation and deposition of neutron models and hybrid X-ray and neutron models is suggested.
Collapse
|
30
|
Polder maps: improving OMIT maps for ligand building and validation. Acta Crystallogr A Found Adv 2018. [DOI: 10.1107/s0108767318096927] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
|
31
|
Better bulk-solvent models can improve model-to-data fit. Acta Crystallogr A Found Adv 2018. [DOI: 10.1107/s0108767318095740] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022] Open
|
32
|
From deep TLS validation to ensembles of atomic models built from elemental motions. II. Analysis of TLS refinement results by explicit interpretation. Acta Crystallogr D Struct Biol 2018; 74:621-631. [PMID: 29968672 PMCID: PMC6038382 DOI: 10.1107/s2059798318005764] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2017] [Accepted: 04/12/2018] [Indexed: 12/19/2022] Open
Abstract
TLS modelling was developed by Schomaker and Trueblood to describe atomic displacement parameters through concerted (rigid-body) harmonic motions of an atomic group [Schomaker & Trueblood (1968), Acta Cryst. B24, 63-76]. The results of a TLS refinement are T, L and S matrices that provide individual anisotropic atomic displacement parameters (ADPs) for all atoms belonging to the group. These ADPs can be calculated analytically using a formula that relates the elements of the TLS matrices to atomic parameters. Alternatively, ADPs can be obtained numerically from the parameters of concerted atomic motions corresponding to the TLS matrices. Both procedures are expected to produce the same ADP values and therefore can be used to assess the results of TLS refinement. Here, the implementation of this approach in PHENIX is described and several illustrations, including the use of all models from the PDB that have been subjected to TLS refinement, are provided.
Collapse
|
33
|
Automated map sharpening by maximization of detail and connectivity. Acta Crystallogr D Struct Biol 2018; 74:545-559. [PMID: 29872005 PMCID: PMC6096490 DOI: 10.1107/s2059798318004655] [Citation(s) in RCA: 156] [Impact Index Per Article: 26.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2017] [Accepted: 03/21/2018] [Indexed: 01/18/2023] Open
Abstract
An algorithm for automatic map sharpening is presented that is based on optimization of the detail and connectivity of the sharpened map. The detail in the map is reflected in the surface area of an iso-contour surface that contains a fixed fraction of the volume of the map, where a map with high level of detail has a high surface area. The connectivity of the sharpened map is reflected in the number of connected regions defined by the same iso-contour surfaces, where a map with high connectivity has a small number of connected regions. By combining these two measures in a metric termed the `adjusted surface area', map quality can be evaluated in an automated fashion. This metric was used to choose optimal map-sharpening parameters without reference to a model or other interpretations of the map. Map sharpening by optimization of the adjusted surface area can be carried out for a map as a whole or it can be carried out locally, yielding a locally sharpened map. To evaluate the performance of various approaches, a simple metric based on map-model correlation that can reproduce visual choices of optimally sharpened maps was used. The map-model correlation is calculated using a model with B factors (atomic displacement factors; ADPs) set to zero. This model-based metric was used to evaluate map sharpening and to evaluate map-sharpening approaches, and it was found that optimization of the adjusted surface area can be an effective tool for map sharpening.
Collapse
|
34
|
Real-space refinement in PHENIX for cryo-EM and crystallography. Acta Crystallogr D Struct Biol 2018; 74:531-544. [PMID: 29872004 PMCID: PMC6096492 DOI: 10.1107/s2059798318006551] [Citation(s) in RCA: 1486] [Impact Index Per Article: 247.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2018] [Accepted: 04/27/2018] [Indexed: 02/23/2023] Open
Abstract
This article describes the implementation of real-space refinement in the phenix.real_space_refine program from the PHENIX suite. The use of a simplified refinement target function enables very fast calculation, which in turn makes it possible to identify optimal data-restraint weights as part of routine refinements with little runtime cost. Refinement of atomic models against low-resolution data benefits from the inclusion of as much additional information as is available. In addition to standard restraints on covalent geometry, phenix.real_space_refine makes use of extra information such as secondary-structure and rotamer-specific restraints, as well as restraints or constraints on internal molecular symmetry. The re-refinement of 385 cryo-EM-derived models available in the Protein Data Bank at resolutions of 6 Å or better shows significant improvement of the models and of the fit of these models to the target maps.
Collapse
|
35
|
On the application of the expected log-likelihood gain to decision making in molecular replacement. Acta Crystallogr D Struct Biol 2018; 74:245-255. [PMID: 29652252 PMCID: PMC5892874 DOI: 10.1107/s2059798318004357] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2017] [Accepted: 03/14/2018] [Indexed: 11/18/2022] Open
Abstract
Molecular-replacement phasing of macromolecular crystal structures is often fast, but if a molecular-replacement solution is not immediately obtained the crystallographer must judge whether to pursue molecular replacement or to attempt experimental phasing as the quickest path to structure solution. The introduction of the expected log-likelihood gain [eLLG; McCoy et al. (2017), Proc. Natl Acad. Sci. USA, 114, 3637-3641] has given the crystallographer a powerful new tool to aid in making this decision. The eLLG is the log-likelihood gain on intensity [LLGI; Read & McCoy (2016), Acta Cryst. D72, 375-387] expected from a correctly placed model. It is calculated as a sum over the reflections of a function dependent on the fraction of the scattering for which the model accounts, the estimated model coordinate error and the measurement errors in the data. It is shown how the eLLG may be used to answer the question `can I solve my structure by molecular replacement?'. However, this is only the most obvious of the applications of the eLLG. It is also discussed how the eLLG may be used to determine the search order and minimal data requirements for obtaining a molecular-replacement solution using a given model, and for decision making in fragment-based molecular replacement, single-atom molecular replacement and likelihood-guided model pruning.
Collapse
|
36
|
|
37
|
DiSCaMB: a software library for aspherical atom model X-ray scattering factor calculations with CPUs and GPUs. J Appl Crystallogr 2018; 51:193-199. [PMID: 29507550 PMCID: PMC5822993 DOI: 10.1107/s1600576717015825] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2017] [Accepted: 10/30/2017] [Indexed: 11/10/2022] Open
Abstract
It has been recently established that the accuracy of structural parameters from X-ray refinement of crystal structures can be improved by using a bank of aspherical pseudoatoms instead of the classical spherical model of atomic form factors. This comes, however, at the cost of increased complexity of the underlying calculations. In order to facilitate the adoption of this more advanced electron density model by the broader community of crystallographers, a new software implementation called DiSCaMB, 'densities in structural chemistry and molecular biology', has been developed. It addresses the challenge of providing for high performance on modern computing architectures. With parallelization options for both multi-core processors and graphics processing units (using CUDA), the library features calculation of X-ray scattering factors and their derivatives with respect to structural parameters, gives access to intermediate steps of the scattering factor calculations (thus allowing for experimentation with modifications of the underlying electron density model), and provides tools for basic structural crystallographic operations. Permissively (MIT) licensed, DiSCaMB is an open-source C++ library that can be embedded in both academic and commercial tools for X-ray structure refinement.
Collapse
|
38
|
Solving the scalability issue in quantum-based refinement: Q|R#1. Acta Crystallogr D Struct Biol 2017; 73:1020-1028. [PMID: 29199981 PMCID: PMC5713877 DOI: 10.1107/s2059798317016746] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2017] [Accepted: 11/20/2017] [Indexed: 12/27/2022] Open
Abstract
Accurately refining biomacromolecules using a quantum-chemical method is challenging because the cost of a quantum-chemical calculation scales approximately as nm, where n is the number of atoms and m (≥3) is based on the quantum method of choice. This fundamental problem means that quantum-chemical calculations become intractable when the size of the system requires more computational resources than are available. In the development of the software package called Q|R, this issue is referred to as Q|R#1. A divide-and-conquer approach has been developed that fragments the atomic model into small manageable pieces in order to solve Q|R#1. Firstly, the atomic model of a crystal structure is analyzed to detect noncovalent interactions between residues, and the results of the analysis are represented as an interaction graph. Secondly, a graph-clustering algorithm is used to partition the interaction graph into a set of clusters in such a way as to minimize disruption to the noncovalent interaction network. Thirdly, the environment surrounding each individual cluster is analyzed and any residue that is interacting with a particular cluster is assigned to the buffer region of that particular cluster. A fragment is defined as a cluster plus its buffer region. The gradients for all atoms from each of the fragments are computed, and only the gradients from each cluster are combined to create the total gradients. A quantum-based refinement is carried out using the total gradients as chemical restraints. In order to validate this interaction graph-based fragmentation approach in Q|R, the entire atomic model of an amyloid cross-β spine crystal structure (PDB entry 2oNA) was refined.
Collapse
|
39
|
Interactive comparison and remediation of collections of macromolecular structures. Protein Sci 2017; 27:182-194. [PMID: 28901593 DOI: 10.1002/pro.3296] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2017] [Revised: 09/08/2017] [Accepted: 09/11/2017] [Indexed: 11/09/2022]
Abstract
Often similar structures need to be compared to reveal local differences throughout the entire model or between related copies within the model. Therefore, a program to compare multiple structures and enable correction any differences not supported by the density map was written within the Phenix framework (Adams et al., Acta Cryst 2010; D66:213-221). This program, called Structure Comparison, can also be used for structures with multiple copies of the same protein chain in the asymmetric unit, that is, as a result of non-crystallographic symmetry (NCS). Structure Comparison was designed to interface with Coot(Emsley et al., Acta Cryst 2010; D66:486-501) and PyMOL(DeLano, PyMOL 0.99; 2002) to facilitate comparison of large numbers of related structures. Structure Comparison analyzes collections of protein structures using several metrics, such as the rotamer conformation of equivalent residues, displays the results in tabular form and allows superimposed protein chains and density maps to be quickly inspected and edited (via the tools in Coot) for consistency, completeness and correctness.
Collapse
|
40
|
Phasing strategies II – molecular replacement. Acta Crystallogr A Found Adv 2017. [DOI: 10.1107/s0108767317096192] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
|
41
|
Polder maps: improving OMIT maps by excluding bulk solvent. Acta Crystallogr D Struct Biol 2017; 73:148-157. [PMID: 28177311 PMCID: PMC5297918 DOI: 10.1107/s2059798316018210] [Citation(s) in RCA: 423] [Impact Index Per Article: 60.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2016] [Accepted: 11/14/2016] [Indexed: 11/10/2022] Open
Abstract
The crystallographic maps that are routinely used during the structure-solution workflow are almost always model-biased because model information is used for their calculation. As these maps are also used to validate the atomic models that result from model building and refinement, this constitutes an immediate problem: anything added to the model will manifest itself in the map and thus hinder the validation. OMIT maps are a common tool to verify the presence of atoms in the model. The simplest way to compute an OMIT map is to exclude the atoms in question from the structure, update the corresponding structure factors and compute a residual map. It is then expected that if these atoms are present in the crystal structure, the electron density for the omitted atoms will be seen as positive features in this map. This, however, is complicated by the flat bulk-solvent model which is almost universally used in modern crystallographic refinement programs. This model postulates constant electron density at any voxel of the unit-cell volume that is not occupied by the atomic model. Consequently, if the density arising from the omitted atoms is weak then the bulk-solvent model may obscure it further. A possible solution to this problem is to prevent bulk solvent from entering the selected OMIT regions, which may improve the interpretative power of residual maps. This approach is called a polder (OMIT) map. Polder OMIT maps can be particularly useful for displaying weak densities of ligands, solvent molecules, side chains, alternative conformations and residues both in terminal regions and in loops. The tools described in this manuscript have been implemented and are available in PHENIX.
Collapse
|
42
|
Abstract
Quantum-based refinement utilizes chemical restraints derived from quantum-chemical methods instead of the standard parameterized library-based restraints used in refinement packages. The motivation is twofold: firstly, the restraints have the potential to be more accurate, and secondly, the restraints can be more easily applied to new molecules such as drugs or novel cofactors. Here, a new project called Q|R aimed at developing quantum-based refinement of biomacromolecules is under active development by researchers at Shanghai University together with PHENIX developers. The central focus of this long-term project is to develop software that is built on top of open-source components. A development version of Q|R was used to compare quantum-based refinements with standard refinement using a small model system.
Collapse
|
43
|
Structure of photosystem II and substrate binding at room temperature. Nature 2016; 540:453-457. [PMID: 27871088 DOI: 10.1038/nature20161] [Citation(s) in RCA: 281] [Impact Index Per Article: 35.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2016] [Accepted: 10/14/2016] [Indexed: 12/16/2022]
Abstract
Light-induced oxidation of water by photosystem II (PS II) in plants, algae and cyanobacteria has generated most of the dioxygen in the atmosphere. PS II, a membrane-bound multi-subunit pigment protein complex, couples the one-electron photochemistry at the reaction centre with the four-electron redox chemistry of water oxidation at the Mn4CaO5 cluster in the oxygen-evolving complex (OEC). Under illumination, the OEC cycles through five intermediate S-states (S0 to S4), in which S1 is the dark-stable state and S3 is the last semi-stable state before O-O bond formation and O2 evolution. A detailed understanding of the O-O bond formation mechanism remains a challenge, and will require elucidation of both the structures of the OEC in the different S-states and the binding of the two substrate waters to the catalytic site. Here we report the use of femtosecond pulses from an X-ray free electron laser (XFEL) to obtain damage-free, room temperature structures of dark-adapted (S1), two-flash illuminated (2F; S3-enriched), and ammonia-bound two-flash illuminated (2F-NH3; S3-enriched) PS II. Although the recent 1.95 Å resolution structure of PS II at cryogenic temperature using an XFEL provided a damage-free view of the S1 state, measurements at room temperature are required to study the structural landscape of proteins under functional conditions, and also for in situ advancement of the S-states. To investigate the water-binding site(s), ammonia, a water analogue, has been used as a marker, as it binds to the Mn4CaO5 cluster in the S2 and S3 states. Since the ammonia-bound OEC is active, the ammonia-binding Mn site is not a substrate water site. This approach, together with a comparison of the native dark and 2F states, is used to discriminate between proposed O-O bond formation mechanisms.
Collapse
|
44
|
From deep TLS validation to ensembles of atomic models built from elemental motions. Addenda and corrigendum. ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY 2016; 72:1073-1075. [PMID: 27599739 PMCID: PMC5013599 DOI: 10.1107/s2059798316013048] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/24/2016] [Accepted: 08/12/2016] [Indexed: 11/24/2022]
Abstract
Some key parts of the algorithm for interpretation of TLS matrices in terms of elemental atomic motions and corresponding ensembles of atomic models described in the article by Urzhumtsev et al. [(2015) Acta Cryst. D71, 1668–1683] are clarified and developed, and a reference on a wrong model is corrected. Researcher feedback has indicated that in Urzhumtsev et al. [(2015) Acta Cryst. D71, 1668–1683] clarification of key parts of the algorithm for interpretation of TLS matrices in terms of elemental atomic motions and corresponding ensembles of atomic models is required. Also, it has been brought to the attention of the authors that the incorrect PDB code was reported for one of test models. These issues are addressed in this article.
Collapse
|
45
|
New bulk-solvent model improves model-to-data fit and facilitates map interpretation. Acta Crystallogr A Found Adv 2016. [DOI: 10.1107/s2053273316099642] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
|
46
|
Predicting X-ray diffuse scattering from translation-libration-screw structural ensembles. ACTA CRYSTALLOGRAPHICA. SECTION D, BIOLOGICAL CRYSTALLOGRAPHY 2015; 71:1657-67. [PMID: 26249347 PMCID: PMC4528799 DOI: 10.1107/s1399004715007415] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/17/2014] [Accepted: 04/15/2015] [Indexed: 01/01/2023]
Abstract
Identifying the intramolecular motions of proteins and nucleic acids is a major challenge in macromolecular X-ray crystallography. Because Bragg diffraction describes the average positional distribution of crystalline atoms with imperfect precision, the resulting electron density can be compatible with multiple models of motion. Diffuse X-ray scattering can reduce this degeneracy by reporting on correlated atomic displacements. Although recent technological advances are increasing the potential to accurately measure diffuse scattering, computational modeling and validation tools are still needed to quantify the agreement between experimental data and different parameterizations of crystalline disorder. A new tool, phenix.diffuse, addresses this need by employing Guinier's equation to calculate diffuse scattering from Protein Data Bank (PDB)-formatted structural ensembles. As an example case, phenix.diffuse is applied to translation-libration-screw (TLS) refinement, which models rigid-body displacement for segments of the macromolecule. To enable the calculation of diffuse scattering from TLS-refined structures, phenix.tls_as_xyz builds multi-model PDB files that sample the underlying T, L and S tensors. In the glycerophosphodiesterase GpdQ, alternative TLS-group partitioning and different motional correlations between groups yield markedly dissimilar diffuse scattering maps with distinct implications for molecular mechanism and allostery. These methods demonstrate how, in principle, X-ray diffuse scattering could extend macromolecular structural refinement, validation and analysis.
Collapse
|
47
|
From deep TLS validation to ensembles of atomic models built from elemental motions. ACTA CRYSTALLOGRAPHICA. SECTION D, BIOLOGICAL CRYSTALLOGRAPHY 2015; 71:1668-83. [PMID: 26249348 PMCID: PMC4528800 DOI: 10.1107/s1399004715011426] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/18/2014] [Accepted: 06/12/2015] [Indexed: 02/06/2023]
Abstract
The translation-libration-screw model first introduced by Cruickshank, Schomaker and Trueblood describes the concerted motions of atomic groups. Using TLS models can improve the agreement between calculated and experimental diffraction data. Because the T, L and S matrices describe a combination of atomic vibrations and librations, TLS models can also potentially shed light on molecular mechanisms involving correlated motions. However, this use of TLS models in mechanistic studies is hampered by the difficulties in translating the results of refinement into molecular movement or a structural ensemble. To convert the matrices into a constituent molecular movement, the matrix elements must satisfy several conditions. Refining the T, L and S matrix elements as independent parameters without taking these conditions into account may result in matrices that do not represent concerted molecular movements. Here, a mathematical framework and the computational tools to analyze TLS matrices, resulting in either explicit decomposition into descriptions of the underlying motions or a report of broken conditions, are described. The description of valid underlying motions can then be output as a structural ensemble. All methods are implemented as part of the PHENIX project.
Collapse
|
48
|
Programming new geometry restraints: parallelity of atomic groups. J Appl Crystallogr 2015; 48:1130-1141. [PMID: 26306091 PMCID: PMC4520290 DOI: 10.1107/s1600576715010432] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2015] [Accepted: 05/31/2015] [Indexed: 11/17/2022] Open
Abstract
Details are described of the calculation of new parallelity restraints recently introduced in cctbx and PHENIX. Improvements in structural biology methods, in particular crystallography and cryo-electron microscopy, have created an increased demand for the refinement of atomic models against low-resolution experimental data. One way to compensate for the lack of high-resolution experimental data is to use a priori information about model geometry that can be utilized in refinement in the form of stereochemical restraints or constraints. Here, the definition and calculation of the restraints that can be imposed on planar atomic groups, in particular the angle between such groups, are described. Detailed derivations of the restraint targets and their gradients are provided so that they can be readily implemented in other contexts. Practical implementations of the restraints, and of associated data structures, in the Computational Crystallography Toolbox (cctbx) are presented.
Collapse
|
49
|
The solvent component of macromolecular crystals. ACTA CRYSTALLOGRAPHICA. SECTION D, BIOLOGICAL CRYSTALLOGRAPHY 2015; 71:1023-38. [PMID: 25945568 PMCID: PMC4427195 DOI: 10.1107/s1399004715006045] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/05/2014] [Accepted: 03/25/2015] [Indexed: 11/10/2022]
Abstract
The mother liquor from which a biomolecular crystal is grown will contain water, buffer molecules, native ligands and cofactors, crystallization precipitants and additives, various metal ions, and often small-molecule ligands or inhibitors. On average, about half the volume of a biomolecular crystal consists of this mother liquor, whose components form the disordered bulk solvent. Its scattering contributions can be exploited in initial phasing and must be included in crystal structure refinement as a bulk-solvent model. Concomitantly, distinct electron density originating from ordered solvent components must be correctly identified and represented as part of the atomic crystal structure model. Herein, are reviewed (i) probabilistic bulk-solvent content estimates, (ii) the use of bulk-solvent density modification in phase improvement, (iii) bulk-solvent models and refinement of bulk-solvent contributions and (iv) modelling and validation of ordered solvent constituents. A brief summary is provided of current tools for bulk-solvent analysis and refinement, as well as of modelling, refinement and analysis of ordered solvent components, including small-molecule ligands.
Collapse
|
50
|
FEM: feature-enhanced map. ACTA CRYSTALLOGRAPHICA. SECTION D, BIOLOGICAL CRYSTALLOGRAPHY 2015; 71:646-66. [PMID: 25760612 PMCID: PMC4356370 DOI: 10.1107/s1399004714028132] [Citation(s) in RCA: 132] [Impact Index Per Article: 14.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/10/2014] [Accepted: 12/25/2014] [Indexed: 11/29/2022]
Abstract
A method is presented that modifies a 2mFobs - DFmodel σA-weighted map such that the resulting map can strengthen a weak signal, if present, and can reduce model bias and noise. The method consists of first randomizing the starting map and filling in missing reflections using multiple methods. This is followed by restricting the map to regions with convincing density and the application of sharpening. The final map is then created by combining a series of histogram-equalized intermediate maps. In the test cases shown, the maps produced in this way are found to have increased interpretability and decreased model bias compared with the starting 2mFobs - DFmodel σA-weighted map.
Collapse
|