1
|
AlphaFold illuminates half of the dark human proteins. Curr Opin Struct Biol 2022; 74:102372. [PMID: 35439658 PMCID: PMC10669925 DOI: 10.1016/j.sbi.2022.102372] [Citation(s) in RCA: 31] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2021] [Revised: 03/02/2022] [Accepted: 03/13/2022] [Indexed: 01/05/2023]
Abstract
We investigate the use of confidence scores to evaluate the accuracy of a given AlphaFold (AF2) protein model for drug discovery. Prediction of accuracy is improved by not considering confidence scores below 80 due to the effects of disorder. On a set of recent crystal structures, 95% are likely to have accurate folds. Conformational discordance in the training set has a much more significant effect on accuracy than sequence divergence. We propose criteria for models and residues that are possibly useful for virtual screening. Based on these criteria, AF2 provides models for half of understudied (dark) human proteins and two-thirds of residues in those models.
Collapse
|
2
|
Abstract
In this chapter, we introduce the main components of the Legume Information System ( https://legumeinfo.org ) and several associated resources. Additionally, we provide an example of their use by exploring a biological question: is there a common molecular basis, across legume species, that underlies the photoperiod-mediated transition from vegetative to reproductive development, that is, days to flowering? The Legume Information System (LIS) holds genetic and genomic data for a large number of crop and model legumes and provides a set of online bioinformatic tools designed to help biologists address questions and tasks related to legume biology. Such tasks include identifying the molecular basis of agronomic traits; identifying orthologs/syntelogs for known genes; determining gene expression patterns; accessing genomic datasets; identifying markers for breeding work; and identifying genetic similarities and differences among selected accessions. LIS integrates with other legume-focused informatics resources such as SoyBase ( https://soybase.org ), PeanutBase ( https://peanutbase.org ), and projects of the Legume Federation ( https://legumefederation.org ).
Collapse
|
3
|
California condor microbiomes: Bacterial variety and functional properties in captive-bred individuals. PLoS One 2019; 14:e0225858. [PMID: 31825977 PMCID: PMC6905524 DOI: 10.1371/journal.pone.0225858] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2019] [Accepted: 11/13/2019] [Indexed: 12/02/2022] Open
Abstract
Around the world, scavenging birds such as vultures and condors have been experiencing drastic population declines. Scavenging birds have a distinct digestive process to deal with higher amounts of bacteria in their primary diet of carcasses in varying levels of decay. These observations motivate us to present an analysis of captive and healthy California condor (Gymnogyps californianus) microbiomes to characterize a population raised together under similar conditions. Shotgun metagenomic DNA sequences were analyzed from fecal and cloacal samples of captive birds. Classification of shotgun DNA sequence data with peptide signatures using the Sequedex package provided both phylogenetic and functional profiles, as well as individually annotated reads for targeted confirmatory analysis. We observed bacterial species previously associated with birds and gut microbiomes, including both virulent and opportunistic pathogens such as Clostridium perfringens, Propionibacterium acnes, Shigella flexneri, and Fusobacterium mortiferum, common flora such as Lactobacillus johnsonii, Lactobacillus ruminus, and Bacteroides vulgatus, and mucosal microbes such as Delftia acidovorans, Stenotrophomonas maltophilia, and Corynebacterium falsnii. Classification using shotgun metagenomic reads from phylogenetic marker genes was consistent with, and more specific than, analysis based on 16S rDNA data. Classification of samples based on either phylogenetic or functional profiles of genomic fragments differentiated three types of samples: fecal, mature cloacal and immature cloacal, with immature birds having approximately 40% higher diversity of microbes.
Collapse
|
4
|
California Condor Microbiomes. Biophys J 2017. [DOI: 10.1016/j.bpj.2016.11.1534] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Open
|
5
|
Constructing rigorous and broad biosurveillance networks for detecting emerging zoonotic outbreaks. PLoS One 2015; 10:e0124037. [PMID: 25946164 PMCID: PMC4422680 DOI: 10.1371/journal.pone.0124037] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2014] [Accepted: 03/10/2015] [Indexed: 11/19/2022] Open
Abstract
Determining optimal surveillance networks for an emerging pathogen is difficult since it is not known beforehand what the characteristics of a pathogen will be or where it will emerge. The resources for surveillance of infectious diseases in animals and wildlife are often limited and mathematical modeling can play a supporting role in examining a wide range of scenarios of pathogen spread. We demonstrate how a hierarchy of mathematical and statistical tools can be used in surveillance planning help guide successful surveillance and mitigation policies for a wide range of zoonotic pathogens. The model forecasts can help clarify the complexities of potential scenarios, and optimize biosurveillance programs for rapidly detecting infectious diseases. Using the highly pathogenic zoonotic H5N1 avian influenza 2006-2007 epidemic in Nigeria as an example, we determined the risk for infection for localized areas in an outbreak and designed biosurveillance stations that are effective for different pathogen strains and a range of possible outbreak locations. We created a general multi-scale, multi-host stochastic SEIR epidemiological network model, with both short and long-range movement, to simulate the spread of an infectious disease through Nigerian human, poultry, backyard duck, and wild bird populations. We chose parameter ranges specific to avian influenza (but not to a particular strain) and used a Latin hypercube sample experimental design to investigate epidemic predictions in a thousand simulations. We ranked the risk of local regions by the number of times they became infected in the ensemble of simulations. These spatial statistics were then complied into a potential risk map of infection. Finally, we validated the results with a known outbreak, using spatial analysis of all the simulation runs to show the progression matched closely with the observed location of the farms infected in the 2006-2007 epidemic.
Collapse
|
6
|
Rapid phylogenetic and functional classification of short genomic fragments with signature peptides. BMC Res Notes 2012; 5:460. [PMID: 22925230 PMCID: PMC3772700 DOI: 10.1186/1756-0500-5-460] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2012] [Accepted: 08/08/2012] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Classification is difficult for shotgun metagenomics data from environments such as soils, where the diversity of sequences is high and where reference sequences from close relatives may not exist. Approaches based on sequence-similarity scores must deal with the confounding effects that inheritance and functional pressures exert on the relation between scores and phylogenetic distance, while approaches based on sequence alignment and tree-building are typically limited to a small fraction of gene families. We describe an approach based on finding one or more exact matches between a read and a precomputed set of peptide 10-mers. RESULTS At even the largest phylogenetic distances, thousands of 10-mer peptide exact matches can be found between pairs of bacterial genomes. Genes that share one or more peptide 10-mers typically have high reciprocal BLAST scores. Among a set of 403 representative bacterial genomes, some 20 million 10-mer peptides were found to be shared. We assign each of these peptides as a signature of a particular node in a phylogenetic reference tree based on the RNA polymerase genes. We classify the phylogeny of a genomic fragment (e.g., read) at the most specific node on the reference tree that is consistent with the phylogeny of observed signature peptides it contains. Using both synthetic data from four newly-sequenced soil-bacterium genomes and ten real soil metagenomics data sets, we demonstrate a sensitivity and specificity comparable to that of the MEGAN metagenomics analysis package using BLASTX against the NR database. Phylogenetic and functional similarity metrics applied to real metagenomics data indicates a signal-to-noise ratio of approximately 400 for distinguishing among environments. Our method assigns ~6.6 Gbp/hr on a single CPU, compared with 25 kbp/hr for methods based on BLASTX against the NR database. CONCLUSIONS Classification by exact matching against a precomputed list of signature peptides provides comparable results to existing techniques for reads longer than about 300 bp and does not degrade severely with shorter reads. Orders of magnitude faster than existing methods, the approach is suitable now for inclusion in analysis pipelines and appears to be extensible in several different directions.
Collapse
|
7
|
Recurrent perineal hernia repair: a novel approach. Hernia 2011; 17:141-4. [PMID: 21584817 DOI: 10.1007/s10029-011-0821-1] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2010] [Accepted: 04/14/2011] [Indexed: 01/26/2023]
Abstract
PURPOSE Postoperative perineal hernias are rare complications from procedures, which compromise the pelvic floor, mainly abdominoperineal resection, proctocolectomy, and partial or total pelvic exenteration. Surgical repair can be accomplished through abdominal, laparoscopic, or transperineal approaches. METHODS We present a case report of a 70-year-old man who underwent two prior operations for recurrent perineal hernia and was ultimately successfully treated with a third operation, a synthetic mesh redo procedure that utilized a synthetic mesh system marketed for women with pelvic organ prolapse. RESULTS Although there is no "gold standard" for perineal hernia repair, our patient had multiple surgeries employing a variety of approaches. Final success was achieved using a mesh system with improved fixation to secure pelvic ligaments, using an exclusive perineal approach. Now, more than five years following the final surgery, the patient remains symptom free with no clinical evidence of perineal hernia recurrence. CONCLUSIONS To date, this is the only report of using this mesh system in a male. The advantages of using this mesh system are (1) exclusive perineal approach without the accompanying risks of abdominal or laparoscopic approach; (2) improved fixation of mesh to secure pelvic ligaments; and (3) lightweight, flexible, and large mesh shape that can easily be trimmed to allow versatility in procedures.
Collapse
|
8
|
Bayesian correlated MAD phasing. ACTA CRYSTALLOGRAPHICA SECTION D: BIOLOGICAL CRYSTALLOGRAPHY 2005; 53:571-9. [PMID: 15299888 DOI: 10.1107/s0907444997005398] [Citation(s) in RCA: 59] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
A Bayesian treatment for phase calculation in the multiwavelength anomalous diffraction (MAD) technique is presented. This approach explicitly treats effects of errors correlated among measurements at different wavelengths and between Bijvoet pairs. The resulting method, which is called Bayesian correlated MAD phasing, gives proper statistical consideration to all data and does not give special treatment to data from a particular wavelength. Results obtained using Bayesian correlated MAD phasing and two other strategies on both a model test case and on data obtained in two actual MAD experiments are compared. Although all procedures performed well when the completeness of the data was high, it is shown that Bayesian correlated MAD phasing is more robust with respect to incompleteness of data than the other methods are. At 60% completeness the improvement over other methods for the examples given was nearly 50% in the correlation coefficients, and made a substantial difference in the interpretability of an electron-density map.
Collapse
|
9
|
Bayesian weighting for macromolecular crystallographic refinement. ACTA CRYSTALLOGRAPHICA SECTION D: BIOLOGICAL CRYSTALLOGRAPHY 2005; 52:743-8. [PMID: 15299638 DOI: 10.1107/s0907444996001473] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/30/2023]
Abstract
A simple weighting scheme for atomic refinement is discussed. The approach, called 'Bayesian weighting', is designed to be robust with respect to the bias that arises from the incomplete nature of the atomic model, which in macromolecular crystallography is typically quite serious. Bayesian weights are based on the mean-squared residual errors over shells of resolution, with centric and acentric reflections considered separately and with allowances made for experimental uncertainties. Use of Bayesian weighting is shown in test cases typical for macromolecular crystallography to improve the accuracy of the refined coordinates when compared with schemes employing unit weights or experimental variances.
Collapse
|
10
|
Difference refinement: obtaining differences between two related structures. ACTA CRYSTALLOGRAPHICA SECTION D: BIOLOGICAL CRYSTALLOGRAPHY 2005; 51:609-18. [PMID: 15299790 DOI: 10.1107/s0907444994013247] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
There are many examples in macromolecular crystallography where interest focuses on the differences between a previously determined 'native' structure and a nearly isomorphous 'variant'. In such cases, a useful approach to atomic refinement of the variant structure is through weighted least-squares minimization of the residual between the observed and calculated differences in amplitudes of structure factors, a strategy first used in the refinement of deoxycobalt hemoglobin [Fermi, Perutz, Dickinson & Chien (1982). J. Mol. Biol. 155, 495-505] and termed 'difference refinement'. For cases in which the modeling errors for the native and variant structures are correlated, theoretical arguments indicate that difference refinement should lead to improved estimates of structural differences when compared with conventional independent refinement. Tests employing simulated peptide data sets and real data from a wild-type protein and a mutant show that difference refinement can substantially reduce errors in the differences between structures when compared with independent refinement. The algorithm is very easy to implement and does not increase the computational demands of refinement.
Collapse
|
11
|
Abstract
The TB Structural Genomics Consortium is an organization devoted to encouraging, coordinating, and facilitating the determination and analysis of structures of proteins from Mycobacterium tuberculosis. The Consortium members hope to work together with other M. tuberculosis researchers to identify M. tuberculosis proteins for which structural information could provide important biological information, to analyze and interpret structures of M. tuberculosis proteins, and to work collaboratively to test ideas about M. tuberculosis protein function that are suggested by structure or related to structural information. This review describes the TB Structural Genomics Consortium and some of the proteins for which the Consortium is in the progress of determining three-dimensional structures.
Collapse
|
12
|
Crystal structure of beta-D-xylosidase from Thermoanaerobacterium saccharolyticum, a family 39 glycoside hydrolase. J Mol Biol 2004; 335:155-65. [PMID: 14659747 DOI: 10.1016/j.jmb.2003.10.026] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
1,4-beta-D-Xylan is the major component of plant cell-wall hemicelluloses. beta-D-Xylosidases are involved in the breakdown of xylans into xylose and belong to families 3, 39, 43, 52, and 54 of glycoside hydrolases. Here, we report the first crystal structure of a member of family 39 glycoside hydrolase, i.e. beta-D-xylosidase from Thermoanaerobacterium saccharolyticum strain B6A-RI. This study also represents the first structure of any beta-xylosidase of the above five glycoside hydrolase families. Each monomer of T. saccharolyticum beta-xylosidase comprises three distinct domains; a catalytic domain of the canonical (beta/alpha)(8)-barrel fold, a beta-sandwich domain, and a small alpha-helical domain. We have determined the structure in two forms: D-xylose-bound enzyme and a covalent 2-deoxy-2-fluoro-alpha-D-xylosyl-enzyme intermediate complex, thus providing two snapshots in the reaction pathway. This study provides structural evidence for the proposed double displacement mechanism that involves a covalent intermediate. Furthermore, it reveals possible functional roles for His228 as the auxiliary acid/base and Glu323 as a key residue in substrate recognition.
Collapse
|
13
|
|
14
|
Abstract
Structural genomics has the ambitious goal of delivering three-dimensional structural information on a genome-wide scale. Yet only a small fraction of natural proteins are suitable for structure determination because of bottlenecks such as poor expression, aggregation, and misfolding of proteins, and difficulties in solubilization and crystallization. We propose to overcome these bottlenecks by producing soluble, highly expressed proteins that are derived from and closely related to their natural homologs. Here we demonstrate the utility of this approach by using a green fluorescent protein (GFP) folding reporter assay to evolve an enzymatically active, soluble variant of a hyperthermophilic protein that is normally insoluble when expressed in Escherichia coli, and determining its structure by X-ray crystallography. Analysis of the structure provides insight into the substrate specificity of the enzyme and the improved solubility of the variant.
Collapse
|
15
|
The TB structural genomics consortium: providing a structural foundation for drug discovery. CURRENT DRUG TARGETS. INFECTIOUS DISORDERS 2002; 2:121-41. [PMID: 12462144 DOI: 10.2174/1568005023342551] [Citation(s) in RCA: 62] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Structural genomics, the large-scale determination of protein structures, promises to provide a broad structural foundation for drug discovery. The tuberculosis (TB) Structural Genomics Consortium is devoted to encouraging, coordinating, and facilitating the determination of structures of proteins from Mycobacterium tuberculosis and hopes to determine 400 TB protein structures over 5 years. The Consortium has determined structures of 28 proteins from TB to date. These protein structures are already providing a basis for drug discovery efforts.
Collapse
|
16
|
Crystal structure and functional analysis of the SurE protein identify a novel phosphatase family. NATURE STRUCTURAL BIOLOGY 2001; 8:789-94. [PMID: 11524683 DOI: 10.1038/nsb0901-789] [Citation(s) in RCA: 152] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Homologs of the Escherichia coli surE gene are present in many eubacteria and archaea. Despite the evolutionary conservation, little information is available on the structure and function of their gene products. We have determined the crystal structure of the SurE protein from Thermotoga maritima. The structure reveals the dimeric arrangement of the subunits and an active site around a bound metal ion. We also demonstrate that the SurE protein exhibits a divalent metal ion-dependent phosphatase activity that is inhibited by vanadate or tungstate. In the vanadate- and tungstate-complexed structures, the inhibitors bind adjacent to the divalent metal ion. Our structural and functional analyses identify the SurE proteins as a novel family of metal ion-dependent phosphatases.
Collapse
|
17
|
Abstract
The transport of protons across membranes is an important process in cellular bioenergetics. The light-driven proton pump bacteriorhodopsin is the best-characterized protein providing this function. Photon energy is absorbed by the chromophore retinal, covalently bound to Lys 216 via a protonated Schiff base. The light-induced all-trans to 13-cis isomerization of the retinal results in deprotonation of the Schiff base followed by alterations in protonatable groups within bacteriorhodopsin. The changed force field induces changes, even in the tertiary structure, which are necessary for proton pumping. The recent report of a high-resolution X-ray crystal structure for the late M intermediate of a mutant bacteriorhopsin (with Asp 96-->Asn) displays the structure of a proton pathway highly disturbed by the mutation. To observe an unperturbed proton pathway, we determined the structure of the late M intermediate of wild-type bacteriorhodopsin (2.25 A resolution). The cytoplasmic side of our M2 structure shows a water net that allows proton transfer from the proton donor group Asp 96 towards the Schiff base. An enlarged cavity system above Asp 96 is observed, which facilitates the de- and reprotonation of this group by fluctuating water molecules in the last part of the cycle.
Collapse
|
18
|
Abstract
The genome projects are changing biology by providing the genetic blueprints of entire organisms. The blueprints are tantalizing but we cannot deduce everything we need to know from them, including the structures and detailed functions of proteins. In this paper we describe an approach for obtaining structural information about proteins on a genomic scale. We describe how structural and functional information might eventually be put together to form a basis for describing life at many levels. We then describe how structural information fits into this picture and classes of proteins for which structural information would be useful in a genomic context. We conclude with a proposal for an initiative to determine protein structures on a very large scale.
Collapse
|
19
|
Abstract
Members of the cytochrome P450 superfamily catalyze the addition of molecular oxygen to nonactivated hydrocarbons at physiological temperature-a reaction that requires high temperature to proceed in the absence of a catalyst. Structures were obtained for three intermediates in the hydroxylation reaction of camphor by P450cam with trapping techniques and cryocrystallography. The structure of the ferrous dioxygen adduct of P450cam was determined with 0.91 angstrom wavelength x-rays; irradiation with 1.5 angstrom x-rays results in breakdown of the dioxygen molecule to an intermediate that would be consistent with an oxyferryl species. The structures show conformational changes in several important residues and reveal a network of bound water molecules that may provide the protons needed for the reaction.
Collapse
|
20
|
The role of cavities in protein dynamics: crystal structure of a photolytic intermediate of a mutant myoglobin. Proc Natl Acad Sci U S A 2000; 97:2058-63. [PMID: 10681426 PMCID: PMC15753 DOI: 10.1073/pnas.040459697] [Citation(s) in RCA: 121] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We determined the structure of the photolytic intermediate of a sperm whale myoglobin (Mb) mutant called Mb-YQR [Leu-(B10)-->Tyr; His(E7)-->Gln; Thr(E10)-->Arg] to 1.4-A resolution by ultra-low temperature (20 K) x-ray diffraction. Starting with the CO complex, illumination leads to photolysis of the Fe-CO bond, and migration of the photolyzed carbon monoxide (CO*) to a niche in the protein 8.1 A from the heme iron; this cavity corresponds to that hosting an atom of Xe when the crystal is equilibrated with xenon gas at 7 atmospheres [Tilton, R. F., Jr., Kuntz, I. D. & Petsko, G. A. (1984) Biochemistry 23, 2849-2857]. The site occupied by CO* corresponds to that predicted by molecular dynamics simulations previously carried out to account for the NO geminate rebinding of Mb-YQR observed in laser photolysis experiments at room temperature. This secondary docking site differs from the primary docking site identified by previous crystallographic studies on the photolyzed intermediate of wild-type sperm whale Mb performed at cryogenic temperatures [Teng et al. (1994) Nat. Struct. Biol. 1, 701-705] and room temperature [Srajer et al. (1996) Science 274, 1726-1729]. Our experiment shows that the pathway of a small molecule in its trajectory through a protein may be modified by site-directed mutagenesis, and that migration within the protein matrix to the active site involves a limited number of pre-existing cavities identified in the interior space of the protein.
Collapse
|
21
|
Abstract
Small molecules such as NO, O2, CO or H2 are important biological ligands that bind to metalloproteins to function crucially in processes such as signal transduction, respiration and catalysis. A key issue for understanding the regulation of reaction mechanisms in these systems is whether ligands gain access to the binding sites through specific channels and docking sites, or by random diffusion through the protein matrix. A model system for studying this issue is myoglobin, a simple haem protein. Myoglobin has been studied extensively by spectroscopy, crystallography, computation and theory. It serves as an aid to oxygen diffusion but also binds carbon monoxide, a byproduct of endogenous haem catabolism. Molecular dynamics simulations, random mutagenesis and flash photolysis studies indicate that ligand migration occurs through a limited number of pathways involving docking sites. Here we report the 1.4 A resolution crystal structure of a ligand-binding intermediate in carbonmonoxy myoglobin that may have far-reaching implications for understanding the dynamics of ligand binding and catalysis.
Collapse
|
22
|
Evaluation of macromolecular electron-density map quality using the correlation of local r.m.s. density. ACTA CRYSTALLOGRAPHICA SECTION D: BIOLOGICAL CRYSTALLOGRAPHY 1999; 55:1872-7. [PMID: 10531485 PMCID: PMC2745882 DOI: 10.1107/s090744499901029x] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
It has recently been shown that the standard deviation of local r.m. s. electron density is a good indicator of the presence of distinct regions of solvent and protein in macromolecular electron-density maps [Terwilliger & Berendzen (1999). Acta Cryst. D55, 501-505]. Here, it is demonstrated that a complementary measure, the correlation of local r.m.s. density in adjacent regions on the unit cell, is also a good measure of the presence of distinct solvent and protein regions. The correlation of local r.m.s. density is essentially a measure of how contiguous the solvent (and protein) regions are in the electron-density map. This statistic can be calculated in real space or in reciprocal space and has potential uses in evaluation of heavy-atom solutions in the MIR and MAD methods as well as for evaluation of trial phase sets in ab initio phasing procedures.
Collapse
|
23
|
Abstract
We have used x-ray crystallography to determine the structures of sperm whale myoglobin (Mb) in four different ligation states (unligated, ferric aquomet, oxygenated, and carbonmonoxygenated) to a resolution of better than 1.2 A. Data collection and analysis were performed in as much the same way as possible to reduce model bias in differences between structures. The structural differences among the ligation states are much smaller than previously estimated, with differences of <0.25 A root-mean-square deviation among all atoms. One structural parameter previously thought to vary among the ligation states, the proximal histidine (His-93) azimuthal angle, is nearly identical in all the ferrous complexes, although the tilt of the proximal histidine is different in the unligated form. There are significant differences, however, in the heme geometry, in the position of the heme in the pocket, and in the distal histidine (His-64) conformations. In the CO complex the majority conformation of ligand is at an angle of 18 +/- 3 degrees with respect to the heme plane, with a geometry similar to that seen in encumbered model compounds; this angle is significantly smaller than reported previously by crystallographic studies on monoclinic Mb crystals, but still significantly larger than observed by photoselection. The distal histidine in unligated Mb and in the dioxygenated complex is best described as having two conformations. Two similar conformations are observed in MbCO, in addition to another conformation that has been seen previously in low-pH structures where His-64 is doubly protonated. We suggest that these conformations of the distal histidine correspond to the different conformational substates of MbCO and MbO(2) seen in vibrational spectra. Full-matrix refinement provides uncertainty estimates of important structural parameters. Anisotropic refinement yields information about correlated disorder of atoms; we find that the proximal (F) helix and heme move approximately as rigid bodies, but that the distal (E) helix does not.
Collapse
|
24
|
Abstract
Formation of the chromophore of green fluorescent protein (GFP) depends on the correct folding of the protein. We constructed a "folding reporter" vector, in which a test protein is expressed as an N-terminal fusion with GFP. Using a test panel of 20 proteins, we demonstrated that the fluorescence of Escherichia coli cells expressing such GFP fusions is related to the productive folding of the upstream protein domains expressed alone. We used this fluorescent indicator of protein folding to evolve proteins that are normally prone to aggregation during expression in E. coli into closely related proteins that fold robustly and are fully soluble and functional. This approach to improving protein folding does not require functional assays for the protein of interest and provides a simple route to improving protein folding and expression by directed evolution.
Collapse
|
25
|
Automated MAD and MIR structure solution. ACTA CRYSTALLOGRAPHICA. SECTION D, BIOLOGICAL CRYSTALLOGRAPHY 1999; 55:849-61. [PMID: 10089316 PMCID: PMC2746121 DOI: 10.1107/s0907444999000839] [Citation(s) in RCA: 2717] [Impact Index Per Article: 108.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/14/1998] [Accepted: 01/15/1999] [Indexed: 11/13/2022]
Abstract
Obtaining an electron-density map from X-ray diffraction data can be difficult and time-consuming even after the data have been collected, largely because MIR and MAD structure determinations currently require many subjective evaluations of the qualities of trial heavy-atom partial structures before a correct heavy-atom solution is obtained. A set of criteria for evaluating the quality of heavy-atom partial solutions in macromolecular crystallography have been developed. These have allowed the conversion of the crystal structure-solution process into an optimization problem and have allowed its automation. The SOLVE software has been used to solve MAD data sets with as many as 52 selenium sites in the asymmetric unit. The automated structure-solution process developed is a major step towards the fully automated structure-determination, model-building and refinement procedure which is needed for genomic scale structure determinations.
Collapse
|
26
|
Discrimination of solvent from protein regions in native Fouriers as a means of evaluating heavy-atom solutions in the MIR and MAD methods. ACTA CRYSTALLOGRAPHICA SECTION D: BIOLOGICAL CRYSTALLOGRAPHY 1999; 55:501-5. [PMID: 10089362 PMCID: PMC2746120 DOI: 10.1107/s0907444998012657] [Citation(s) in RCA: 44] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
An automated examination of the native Fourier is tested as a means of evaluation of a heavy-atom solution in MAD and MIR methods for macromolecular crystallography. It is found that the presence of distinct regions of high and low density variation in electron-density maps is a good indicator of the correctness of a heavy-atom solution in the MIR and MAD methods. The method can be used to evaluate heavy-atom solutions during MAD and MIR structure solutions and to determine the handedness of the structure if anomalous data have been measured.
Collapse
|
27
|
Abstract
BACKGROUND Translation initiation factor 5A (IF-5A) is reported to be involved in the first step of peptide bond formation in translation, to be involved in cell-cycle regulation and to be a cofactor for the Rev and Rex transactivator proteins of human immunodeficiency virus-1 and T-cell leukemia virus I, respectively. IF-5A contains an unusual amino acid, hypusine (N-epsilon-(4-aminobutyl-2-hydroxy)lysine), that is required for its function. The first step in the post-translational modification of lysine to hypusine is catalyzed by the enzyme deoxyhypusine synthase, the structure of which has been published recently. RESULTS IF-5A from the archebacterium Pyrobaculum aerophilum has been heterologously expressed in Escherichia coli with selenomethionine substitution. The crystal structure of IF-5A has been determined by multiwavelength anomalous diffraction and refined to 1.75 A. Unmodified P. aerophilum IF-5A is found to be a beta structure with two domains and three separate hydrophobic cores. CONCLUSIONS The lysine (Lys42) that is post-translationally modified by deoxyhypusine synthase is found at one end of the IF-5A molecule in an turn between beta strands beta4 and beta5; this lysine residue is freely solvent accessible. The C-terminal domain is found to be homologous to the cold-shock protein CspA of E. coli, which has a well characterized RNA-binding fold, suggesting that IF-5A is involved in RNA binding.
Collapse
|
28
|
Abstract
The recent sequencing of many complete genomes, combined with the development of methods that allow rapid structure determination for many proteins, has changed the way in which protein structure determinations can be approached. One-by-one determinations of individual protein structures will soon be augmented by class-directed structure analyses in which a group of proteins is targeted and structures of representative members are determined and used to represent the entire group. Such a shift in approach would be the foundation for a broad protein structure initiative targeting classes of proteins important for biotechnology and for a fundamental understanding of protein function.
Collapse
|
29
|
Abstract
A first real glance at the structural, spectral and temporal interplay that constitutes the photocycle of the photoactive yellow protein (PYP) has been obtained from a combination of time-resolved crystallography with mutational analysis and spectroscopic studies.
Collapse
|
30
|
Bayesian Difference Refinement. ACTA CRYSTALLOGRAPHICA SECTION D: BIOLOGICAL CRYSTALLOGRAPHY 1996; 52:1004-11. [PMID: 15299610 DOI: 10.1107/s0907444996006725] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Interest in a pair of highly isomorphous structures often focuses on the differences between them. In cases where substantial correlated model errors exist or where there are differences in the quality of the two experimental data sets (cases quite common in macromolecular crystallography), independent refinement of the two structures does not lead to the most accurate estimate of the differences between them. An alternative procedure that has proven effective in some such cases is difference refinement, in which the residual between observed and calculated differences in structure-factor amplitudes between the two structures is minimized. A Bayesian approach has been used to extend the range of applicability of difference refinement to cases where there is only partial correlation in model errors and where the overlap between the data sets is limited. The resulting method, Bayesian difference refinement, uses residuals to be minimized that vary smoothly between difference refinement and independent refinement. When the errors in the two structural models are very similar, difference refinement is used; when they are very different, independent refinement is used; and when they are partially correlated, a combination of the two is used. The procedure is very simple to apply and does not significantly increase the computational demands of refinement.
Collapse
|
31
|
Intermediates in the reaction pathway of cytochrome P450cam. Acta Crystallogr A 1996. [DOI: 10.1107/s0108767396097085] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
|
32
|
Correlated Phasing of Multiple Isomorphous Replacement Data. ACTA CRYSTALLOGRAPHICA SECTION D: BIOLOGICAL CRYSTALLOGRAPHY 1996; 52:749-57. [PMID: 15299639 DOI: 10.1107/s0907444996000832] [Citation(s) in RCA: 51] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Substantial highly correlated differences sometimes exist between a series of heavy-atom derivatives of a macromolecule and the native structure. Use of such a series of derivatives for phase determination by multiple isomorphous replacement (MIR) has been difficult because MIR analysis has treated errors as independent. A simple Bayesian approach has been used to derive probability distributions for the phase in the case where a group of MIR derivatives have correlated errors. The utility of the resulting 'correlated-phasing' method has been examined by applying it to both simulated and real MIR data sets that contain sizeable correlated errors and it has been found that it can dramatically improve MIR phase estimates in these cases. Correlated phasing is applicable to situations where derivatives exhibit substantial correlated changes in protein conformation or crystal packing or where correlated errors in heavy-atom models are large. Correlated phasing does not substantially increase the complexity of phase computation and is suitable for routine use.
Collapse
|
33
|
Abstract
Myoglobin is a globular haem protein that reversibly binds ligands such as O2 and CO. Single photons of visible light can break the covalent bond between CO and the haem iron in carbon-monoxy-myoglobin (MbCO) and thus form an unstable intermediate, Mb*CO, with the CO inside the protein. The ensuing rebinding process has been extensively studied as a model for the interplay of dynamics, structure and function in protein reactions. We have used X-ray crystallography at liquid-helium temperatures to determine the structure of Mb*CO to a resolution of 1.5 A. The photodissociated CO lies on top of the haem pyrrole ring C. Comparison with the CO-bound and unligated myoglobin structures reveals that on photodissociation of the CO, the haem 'domes', the iron moves partially out of the haem plane, the iron-proximal histidine bonds is compressed, the F helix is strained and the distal histidine swings towards the outside of the ligand-binding pocket.
Collapse
|
34
|
Abstract
Ligand binding to heme proteins is studied by using flash photolysis over wide ranges in time (100 ns-1 ks) and temperature (10-320 K). Below about 200 K in 75% glycerol/water solvent, ligand rebinding occurs from the heme pocket and is nonexponential in time. The kinetics is explained by a distribution, g(H), of the enthalpic barrier of height H between the pocket and the bound state. Above 170 K rebinding slows markedly. Previously we interpreted the slowing as a "matrix process" resulting from the ligand entering the protein matrix before rebinding. Experiments on band III, an inhomogeneously broadened charge-transfer band near 760 nm (approximately 13,000 cm-1) in the photolyzed state (Mb*) of (carbonmonoxy)myoglobin (MbCO), force us to reinterpret the data. Kinetic hole-burning measurements on band III in Mb* establish a relation between the position of a homogeneous component of band III and the barrier H. Since band III is red-shifted by 116 cm-1 in Mb* compared with Mb, the relation implies that the barrier in relaxed Mb is 12 kJ/mol higher than in Mb*. The slowing of the rebinding kinetics above 170 K hence is caused by the relaxation Mb*----Mb, as suggested by Agmon and Hopfield [(1983) J. Chem. Phys. 79, 2042-2053]. This conclusion is supported by a fit to the rebinding data between 160 and 290 K which indicates that the entire distribution g(H) shifts. Above about 200 K, equilibrium fluctuations among conformational substates open pathways for the ligands through the protein matrix and also narrow the rate distribution. The protein relaxations and fluctuations are nonexponential in time and non-Arrhenius in temperature, suggesting a collective nature for these protein motions. The relaxation Mb*----Mb is essentially independent of the solvent viscosity, implying that this motion involves internal parts of the protein. The protein fluctuations responsible for the opening of the pathways, however, depend strongly on the solvent viscosity, suggesting that a large part of the protein participates. While the detailed studies concern MbCO, similar data have been obtained for MbO2 and CO binding to the beta chains of human hemoglobin and hemoglobin Zürich. The results show that protein dynamics is essential for protein function and that the association coefficient for binding from the solvent at physiological temperatures in all these heme proteins is governed by the barrier at the heme.
Collapse
|
35
|
Abstract
A relaxation method that measures the derivative of a population with respect to temperature is introduced and used to study the recombination of CO to sperm whale myoglobin after a photolyzing flash. Measurement of the geminate process in the infrared CO-stretch bands shows distributed activation enthalpies with different distributions for each band, transitions between two bands that correspond to photolyzed ligands, and kinetic hole burning. The data are well described by gaussian enthalpy distributions; the results match and complement those of isothermal methods. The temperature-derivative technique is further used to explore the recombination of CO from outside the heme pocket.
Collapse
|
36
|
Ligand binding to synthetic mutant myoglobin (His-E7----Gly): role of the distal histidine. Proc Natl Acad Sci U S A 1988; 85:8497-501. [PMID: 3186740 PMCID: PMC282485 DOI: 10.1073/pnas.85.22.8497] [Citation(s) in RCA: 66] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
Low-temperature flash photolysis with IR and visible spectroscopy was used to probe the influence of the distal histidine His-64(E7) of sperm-whale myoglobin (Mb) on the orientation of bound carbon monoxide (CO) and on the kinetics of CO rebinding. The synthesis and high-level expression of a sperm-whale myoglobin gene in Escherichia coli permits the efficient substitution of the distal histidine through site-directed mutagenesis. Substitution of His-E7 with glycine [GlyE7]Mb bound with CO (CO[GlyE7]Mb) results in one broad bound-CO IR stretch band, v(C-O), centered at 1973 cm-1 at 10 K, in contrast to three distinct bands for native and synthetic wild-type MbCO at 1966, 1945, and 1929 cm-1. After flash photolysis at 10 K, the unbound state of CO[GlyE7]Mb exhibits two CO stretch bands, whereas MbCO has three. Fourier transform IR spectroscopy measurements of the linear dichroism after photoselective flash photolysis of CO bound to [GlyE7]Mb at 10 K reveals the bound CO to be oriented at an angle of alpha = 20 degrees +/- 2 degrees with respect to the heme normal. Flash photolysis data from 10 to 300 K provide evidence for a larger distal pocket and a smaller enthalpy barrier (by approximately 4 kJ/mol) for [GlyE7]MbCO as compared with wild-type MbCO. These results reinforce the notion that the dominant control of the binding step at the heme iron comes from the proximal side through the protein structure.
Collapse
|
37
|
Abstract
The infrared stretching bands of carboxymyoglobin (MbCO) and the rebinding of CO to Mb after photodissociation have been studied in the temperature range 10-300 K in a variety of solvents. Four stretching bands imply that MbCO can exist in four substates, A0-A3. The temperature dependences of the intensities of the four bands yield the relative binding enthalpies and and entropies. The integrated absorbances and pH dependences of the bands permit identification of the substates with the conformations observed in the X-ray data (Kuriyan et al., J. Mol. Biol. 192 (1986) 133). At low pH, A0 is hydrogen-bonded to His E7. The substates A0-A3 interconvert above about 180 K in a 75% glycerol/water solvent and above 270 K in buffered water. No major interconversion is seen at any temperature if MbCO is embedded in a solid polyvinyl alcohol matrix. The dependence of the transition on solvent characteristics is explained as a slaved glass transition. After photodissociation at low temperature the CO is in the heme pocket B. The resulting CO stretching bands which are identified as B substates are blue-shifted from those of the A substates. At 40 K, rebinding after flash photolysis has been studied in the Soret, the near-infrared, and the integrated A and B substates. All data lie on the same rebinding curve and demonstrate that rebinding is nonexponential in time from at least 100 ns to 100 ks. No evidence for discrete exponentials is found. Flash photolysis with monitoring in the infrared region shows four different pathways within the pocket B to the bound substates Ai. Rebinding in each of the four pathways B----A is nonexponential in time to at least 10 ks and the four pathways have different kinetics below 180 K. From the time and temperature dependence of the rebinding, activation enthalpy distributions g(HBA) and preexponentials ABA are extracted. No pumping from one A substate to another, or one B substate to another, is observed below the transition temperature of about 180 K. If MbCO is exposed to intense white light for 10-10(3) s before being fully photolyzed by a laser flash, the amplitude of the long-lived states increases. The effect is explained in terms of a hierarchy of substates and substate symmetry breaking. The characteristics of the CO stretching bands and of the rebinding processes in the heme pocket depend strongly on the external parameters of solvent, pH and pressure. This sensitivity suggests possible control mechanisms for protein reactions.
Collapse
|
38
|
Abstract
After photodissociation of carbon monoxide bound to myoglobin, the protein relaxes to the deoxy equilibrium structure in a quake-like motion. Investigation of the proteinquake and of related intramolecular equilibrium motions shows that states and motions have a hierarchical glass-like structure.
Collapse
|
39
|
Abstract
The magnetic susceptibility of photodissociated carbon monoxy myoglobin has been measured over the temperature range from 1.7 to 25 K at 10 and 50 kG with a superconducting susceptometer. The spin and the crystal field parameters of the iron ion were extracted by a spin Hamiltonian approach. Under equivalent conditions the magnetic susceptibility of deoxy myoglobin was measured. In both experiments the CO-bound protein was used as a diamagnetic reference. Above about 5 K the metastable photolysed state and the equilibrium deoxy form of myoglobin are magnetically indistinguishable and can be fitted with S = 2 and g = 2. The transition from spin 0 to spin 2 and the conformational changes known to accompany the electronic change thus also occur after photolysis at low temperature. At temperatures below 5 K, differences become apparent, indicating a somewhat smaller zero-field splitting in the photoproduct as compared to the ligand-free state at equilibrium. In qualitative agreement with observations made by other techniques, the data imply that even at 1.7 K substantial structural relaxation occurs in the heme region of myoglobin after photodissociation. The results are important for the interpretation of the ligand binding kinetics after flash photolysis at low temperature and contribute to the understanding of the relationship between electronic structure and function in heme proteins.
Collapse
|