1
|
Jointly modeling deep mutational scans identifies shifted mutational effects among SARS-CoV-2 spike homologs. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.31.551037. [PMID: 37577604 PMCID: PMC10418112 DOI: 10.1101/2023.07.31.551037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/15/2023]
Abstract
Deep mutational scanning (DMS) is a high-throughput experimental technique that measures the effects of thousands of mutations to a protein. These experiments can be performed on multiple homologs of a protein or on the same protein selected under multiple conditions. It is often of biological interest to identify mutations with shifted effects across homologs or conditions. However, it is challenging to determine if observed shifts arise from biological signal or experimental noise. Here, we describe a method for jointly inferring mutational effects across multiple DMS experiments while also identifying mutations that have shifted in their effects among experiments. A key aspect of our method is to regularize the inferred shifts, so that they are nonzero only when strongly supported by the data. We apply this method to DMS experiments that measure how mutations to spike proteins from SARS-CoV-2 variants (Delta, Omicron BA.1, and Omicron BA.2) affect cell entry. Most mutational effects are conserved between these spike homologs, but a fraction have markedly shifted. We experimentally validate a subset of the mutations inferred to have shifted effects, and confirm differences of > 1,000-fold in the impact of the same mutation on spike-mediated viral infection across spikes from different SARS-CoV-2 variants. Overall, our work establishes a general approach for comparing sets of DMS experiments to identify biologically important shifts in mutational effects.
Collapse
|
2
|
Abstract
Proteins need to selectively interact with specific targets among a multitude of similar molecules in the cell. However, despite a firm physical understanding of binding interactions, we lack a general theory of how proteins evolve high specificity. Here, we present such a model that combines chemistry, mechanics, and genetics and explains how their interplay governs the evolution of specific protein–ligand interactions. The model shows that there are many routes to achieving molecular discrimination—by varying degrees of flexibility and shape/chemistry complementarity—but the key ingredient is precision. Harder discrimination tasks require more collective and precise coaction of structure, forces, and movements. Proteins can achieve this through correlated mutations extending far from a binding site, which fine-tune the localized interaction with the ligand. Thus, the solution of more complicated tasks is enabled by increasing the protein size, and proteins become more evolvable and robust when they are larger than the bare minimum required for discrimination. The model makes testable, specific predictions about the role of flexibility and shape mismatch in discrimination, and how evolution can independently tune affinity and specificity. Thus, the proposed theory of specific binding addresses the natural question of “why are proteins so big?”. A possible answer is that molecular discrimination is often a hard task best performed by adding more layers to the protein.
Collapse
|
3
|
Abstract
An accurate estimation of the Protein Space size, in light of the factors that govern it, is a long-standing problem and of paramount importance in evolutionary biology, since it determines the nature of protein evolvability. A simple analysis will enable us to, firstly, reduce an unrealistic Protein Space size of ~ 10130 sequences, for a 100-residues polypeptide chain, to ~ 109 functional proteins and, secondly, estimate a robust average-mutation rate per amino acid (ξ ~ 1.23) and infer from it, in light of the protein marginal stability, that only a fraction of the sequence will be available at any one time for a functional protein to evolve. Although this result does not solve the Protein Space vastness problem frames it in a more rational one and illustrates the impact of the marginal stability on protein evolvability.
Collapse
|
4
|
Ancestral AlaX editing enzymes for control of genetic code fidelity are not tRNA-specific. J Biol Chem 2015; 290:10495-503. [PMID: 25724653 DOI: 10.1074/jbc.m115.640060] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2015] [Indexed: 01/15/2023] Open
Abstract
Accurate protein synthesis requires the hydrolytic editing of tRNAs incorrectly aminoacylated by aminoacyl-tRNA synthetases (ARSs). Recognition of cognate tRNAs by ARS is less error-prone than amino acid recognition, and, consequently, editing domains are generally believed to act only on the tRNAs cognate to their related ARSs. For example, the AlaX family of editing domains, including the editing domain of alanyl-tRNA synthetase and the related free-standing trans-editing AlaX enzymes, are thought to specifically act on tRNA(Ala), whereas the editing domains of threonyl-tRNA synthetases are specific for tRNA(Thr). Here we show that, contrary to this belief, AlaX-S, the smallest of the extant AlaX enzymes, deacylates Ser-tRNA(Thr) in addition to Ser-tRNA(Ala) and that a single residue is important to determine this behavior. Our data indicate that promiscuous forms of AlaX are ancestral to tRNA-specific AlaXs. We propose that former AlaX domains were used to maintain translational fidelity in earlier stages of genetic code evolution when mis-serylation of several tRNAs was possible.
Collapse
|
5
|
The Streptomyces coelicolor lipoate-protein ligase is a circularly permuted version of the Escherichia coli enzyme composed of discrete interacting domains. J Biol Chem 2015; 290:7280-90. [PMID: 25631049 DOI: 10.1074/jbc.m114.626879] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open
Abstract
Lipoate-protein ligases are used to scavenge lipoic acid from the environment and attach the coenzyme to its cognate proteins, which are generally the E2 components of the 2-oxoacid dehydrogenases. The enzymes use ATP to activate lipoate to its adenylate, lipoyl-AMP, which remains tightly bound in the active site. This mixed anhydride is attacked by the ϵ-amino group of a specific lysine present on a highly conserved acceptor protein domain, resulting in the amide-linked coenzyme. The Streptomyces coelicolor genome encodes only a single putative lipoate ligase. However, this protein had only low sequence identity (<25%) to the lipoate ligases of demonstrated activity and appears to be a circularly permuted version of the known lipoate ligase proteins in that the canonical C-terminal domain seems to have been transposed to the N terminus. We tested the activity of this protein both by in vivo complementation of an Escherichia coli ligase-deficient strain and by in vitro assays. Moreover, when the domains were rearranged into a protein that mimicked the arrangement found in the canonical lipoate ligases, the enzyme retained complementation activity. Finally, when the two domains were separated into two proteins, both domain-containing proteins were required for complementation and catalysis of the overall ligase reaction in vitro. However, only the large domain-containing protein was required for transfer of lipoate from the lipoyl-AMP intermediate to the acceptor proteins, whereas both domain-containing proteins were required to form lipoyl-AMP.
Collapse
|
6
|
Abstract
The matrix metalloproteinases (MMPs) are a family of secreted soluble or membrane-anchored multimodular peptidases regularly found in several paralogous copies in animals and plants, where they have multiple functions. The minimal consensus domain architecture comprises a signal peptide, a 60-90-residue globular prodomain with a conserved sequence motif including a cysteine engaged in "cysteine-switch" or "Velcro" mediated latency, and a catalytic domain. Karilysin, from the human periodontopathogen Tannerella forsythia, is the only bacterial MMP to have been characterized biochemically to date. It shares with eukaryotic forms the catalytic domain but none of the flanking domains. Instead of the consensus MMP prodomain, it features a 14-residue propeptide, the shortest reported for a metallopeptidase, which lacks cysteines. Here we determined the structure of a prokarilysin fragment encompassing the propeptide and the catalytic domain, and found that the former runs across the cleft in the opposite direction to a bound substrate and inhibits the latter through an "aspartate-switch" mechanism. This finding is reminiscent of latency maintenance in the otherwise unrelated astacin and fragilysin metallopeptidase families. In addition, in vivo and biochemical assays showed that the propeptide contributes to protein folding and stability. Our analysis of prokarilysin reveals a novel mechanism of latency and activation in MMPs. Finally, our findings support the view that the karilysin catalytic domain was co-opted by competent bacteria through horizontal gene transfer from a eukaryotic source, and later evolved in a specific bacterial environment.
Collapse
|
7
|
Nucleotide insertions and deletions complement point mutations to massively expand the diversity created by somatic hypermutation of antibodies. J Biol Chem 2014; 289:33557-67. [PMID: 25320089 DOI: 10.1074/jbc.m114.607176] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
During somatic hypermutation (SHM), deamination of cytidine by activation-induced cytidine deaminase and subsequent DNA repair generates mutations within immunoglobulin V-regions. Nucleotide insertions and deletions (indels) have recently been shown to be critical for the evolution of antibody binding. Affinity maturation of 53 antibodies using in vitro SHM in a non-B cell context was compared with mutation patterns observed for SHM in vivo. The origin and frequency of indels seen during in vitro maturation were similar to that in vivo. Indels are localized to CDRs, and secondary mutations within insertions further optimize antigen binding. Structural determination of an antibody matured in vitro and comparison with human-derived antibodies containing insertions reveal conserved patterns of antibody maturation. These findings indicate that activation-induced cytidine deaminase acting on V-region sequences is sufficient to initiate authentic formation of indels in vitro and in vivo and that point mutations, indel formation, and clonal selection form a robust tripartite system for antibody evolution.
Collapse
|
8
|
Coevolution of the ATPase ClpV, the sheath proteins TssB and TssC, and the accessory protein TagJ/HsiE1 distinguishes type VI secretion classes. J Biol Chem 2014; 289:33032-43. [PMID: 25305017 PMCID: PMC4239648 DOI: 10.1074/jbc.m114.600510] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
The type VI secretion system (T6SS) is a bacterial nanomachine for the transport of effector molecules into prokaryotic and eukaryotic cells. It involves the assembly of a tubular structure composed of TssB and TssC that is similar to the tail sheath of bacteriophages. The sheath contracts to provide the energy needed for effector delivery. The AAA+ ATPase ClpV disassembles the contracted sheath, which resets the systems for reassembly of an extended sheath that is ready to fire again. This mechanism is crucial for T6SS function. In Vibrio cholerae, ClpV binds the N terminus of TssC within a hydrophobic groove. In this study, we resolved the crystal structure of the N-terminal domain of Pseudomonas aeruginosa ClpV1 and observed structural alterations in the hydrophobic groove. The modification in the ClpV1 groove is matched by a change in the N terminus of TssC, suggesting the existence of distinct T6SS classes. An accessory T6SS component, TagJ/HsiE, exists predominantly in one of the classes. Using bacterial two-hybrid approaches, we showed that the P. aeruginosa homolog HsiE1 interacts strongly with ClpV1. We then resolved the crystal structure of HsiE1 in complex with the N terminus of HsiB1, a TssB homolog and component of the contractile sheath. Phylogenetic analysis confirmed that these differences distinguish T6SS classes that resulted from a functional co-evolution between TssB, TssC, TagJ/HsiE, and ClpV. The interaction of TagJ/HsiE with the sheath as well as with ClpV suggests an alternative mode of disassembly in which HsiE recruits the ATPase to the sheath.
Collapse
|
9
|
Abstract
Chaperones assist protein folding by preventing unproductive protein aggregation in the cell. In Escherichia coli, chaperonin GroEL/GroES (GroE) is the only indispensable chaperone and is absolutely required for the de novo folding of at least ∼60 proteins. We previously found that several orthologs of the obligate GroE substrates in Ureaplasma urealyticum, which lacks the groE gene in the genome, are E. coli GroE-independent folders, despite their significant sequence identities. Here, we investigated the key features that define the GroE dependence. Chimera or random mutagenesis analyses revealed that independent multiple point mutations, and even single mutations, were sufficient to confer GroE dependence on the Ureaplasma MetK. Strikingly, the GroE dependence was well correlated with the propensity to form protein aggregates during folding. The results reveal the delicate balance between GroE dependence and independence. The function of GroE to buffering the aggregation-prone mutations plays a role in maintaining higher genetic diversity of proteins.
Collapse
|
10
|
Abstract
Urzymes are catalysts derived from invariant cores of protein superfamilies. Urzymes from both aminoacyl-tRNA synthetase classes possess sophisticated catalytic mechanisms: pre-steady state bursts, significant transition-state stabilization of both amino acid activation, and tRNA acylation. However, they have insufficient specificity to ensure a fully developed genetic code, suggesting that they participated in synthesizing statistical proteins. They represent a robust experimental platform from which to articulate and test hypotheses both about their own ancestors and about how they, in turn, evolved into modern enzymes. They help reshape numerous paradigms from the RNA World hypothesis to protein structure databases and allostery.
Collapse
|
11
|
New insights about enzyme evolution from large scale studies of sequence and structure relationships. J Biol Chem 2014; 289:30221-30228. [PMID: 25210038 PMCID: PMC4215206 DOI: 10.1074/jbc.r114.569350] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
Understanding how enzymes have evolved offers clues about their structure-function relationships and mechanisms. Here, we describe evolution of functionally diverse enzyme superfamilies, each representing a large set of sequences that evolved from a common ancestor and that retain conserved features of their structures and active sites. Using several examples, we describe the different structural strategies nature has used to evolve new reaction and substrate specificities in each unique superfamily. The results provide insight about enzyme evolution that is not easily obtained from studies of one or only a few enzymes.
Collapse
|
12
|
Abstract
The role of evolutionary pressure on the chemical step catalyzed by enzymes is somewhat enigmatic, in part because chemistry is not rate-limiting for many optimized systems. Herein, we present studies that examine various aspects of the evolutionary relationship between protein dynamics and the chemical step in two paradigmatic enzyme families, dihydrofolate reductases and alcohol dehydrogenases. Molecular details of both convergent and divergent evolution are beginning to emerge. The findings suggest that protein dynamics across an entire enzyme can play a role in adaptation to differing physiological conditions. The growing tool kit of kinetics, kinetic isotope effects, molecular biology, biophysics, and bioinformatics provides means to link evolutionary changes in structure-dynamics function to the vibrational and conformational states of each protein.
Collapse
|
13
|
Abstract
Catalytic promiscuity and substrate ambiguity are keys to evolvability, which in turn is pivotal to the successful acquisition of novel biological functions. Action on multiple substrates (substrate ambiguity) can be harnessed for performance of functions in the cell that supersede catalysis of a single metabolite. These functions include proofreading, scavenging of nutrients, removal of antimetabolites, balancing of metabolite pools, and establishing system redundancy. In this review, we present examples of enzymes that perform these cellular roles by leveraging substrate ambiguity and then present the structural features that support both specificity and ambiguity. We focus on the phosphatases of the haloalkanoate dehalogenase superfamily and the thioesterases of the hotdog fold superfamily.
Collapse
|
14
|
Escherichia coli D-malate dehydrogenase, a generalist enzyme active in the leucine biosynthesis pathway. J Biol Chem 2014; 289:29086-96. [PMID: 25160617 DOI: 10.1074/jbc.m114.595363] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
The enzymes of the β-decarboxylating dehydrogenase superfamily catalyze the oxidative decarboxylation of D-malate-based substrates with various specificities. Here, we show that, in addition to its natural function affording bacterial growth on D-malate as a carbon source, the D-malate dehydrogenase of Escherichia coli (EcDmlA) naturally expressed from its chromosomal gene is capable of complementing leucine auxotrophy in a leuB(-) strain lacking the paralogous isopropylmalate dehydrogenase enzyme. To our knowledge, this is the first example of an enzyme that contributes with a physiologically relevant level of activity to two distinct pathways of the core metabolism while expressed from its chromosomal locus. EcDmlA features relatively high catalytic activity on at least three different substrates (L(+)-tartrate, D-malate, and 3-isopropylmalate). Because of these properties both in vivo and in vitro, EcDmlA may be defined as a generalist enzyme. Phylogenetic analysis highlights an ancient origin of DmlA, indicating that the enzyme has maintained its generalist character throughout evolution. We discuss the implication of these findings for protein evolution.
Collapse
|
15
|
Physiological IgM class catalytic antibodies selective for transthyretin amyloid. J Biol Chem 2014; 289:13243-58. [PMID: 24648510 PMCID: PMC4036335 DOI: 10.1074/jbc.m114.557231] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2014] [Revised: 03/13/2014] [Indexed: 01/10/2023] Open
Abstract
Peptide bond-hydrolyzing catalytic antibodies (catabodies) could degrade toxic proteins, but acquired immunity principles have not provided evidence for beneficial catabodies. Transthyretin (TTR) forms misfolded β-sheet aggregates responsible for age-associated amyloidosis. We describe nucleophilic catabodies from healthy humans without amyloidosis that degraded misfolded TTR (misTTR) without reactivity to the physiological tetrameric TTR (phyTTR). IgM class B cell receptors specifically recognized the electrophilic analog of misTTR but not phyTTR. IgM but not IgG class antibodies hydrolyzed the particulate and soluble misTTR species. No misTTR-IgM binding was detected. The IgMs accounted for essentially all of the misTTR hydrolytic activity of unfractionated human serum. The IgMs did not degrade non-amyloidogenic, non-superantigenic proteins. Individual monoclonal IgMs (mIgMs) expressed variable misTTR hydrolytic rates and differing oligoreactivity directed to amyloid β peptide and microbial superantigen proteins. A subset of the mIgMs was monoreactive for misTTR. Excess misTTR was dissolved by a hydrolytic mIgM. The studies reveal a novel antibody property, the innate ability of IgMs to selectively degrade and dissolve toxic misTTR species as a first line immune function.
Collapse
|
16
|
Abstract
Intein-mediated protein splicing raises questions and creates opportunities in many scientific areas. Evolutionary biologists question whether inteins are primordial enzymes or simply selfish elements, whereas biochemists seek to understand how inteins work. Synthetic chemists exploit inteins in the semisynthesis of proteins with or without nonribosomal modifications, whereas biotechnologists use modified inteins in an ever increasing variety of applications. The four minireviews in this series explore these themes. The first minireview focuses on the evolution and biological function of inteins, whereas the second describes the mechanisms that underlie the remarkable ability of inteins to perform complex sets of choreographed enzymatic reactions. The third explores the relationship between the three-dimensional structure and dynamics of inteins and their biochemical capabilities. The fourth describes intein applications that have moved beyond simple technology development to utilizing inteins in more sophisticated applications, such as biosensors for identifying ligands of human hormone receptors or improved methods of biofuel and plant-based sugar production.
Collapse
|
17
|
A Golgi-localized mannosidase (MAN1B1) plays a non-enzymatic gatekeeper role in protein biosynthetic quality control. J Biol Chem 2014; 289:11844-11858. [PMID: 24627495 DOI: 10.1074/jbc.m114.552091] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
Conformation-based disorders are manifested at the level of protein structure, necessitating an accurate understanding of how misfolded proteins are processed by the cellular proteostasis network. Asparagine-linked glycosylation plays important roles for protein quality control within the secretory pathway. The suspected role for the MAN1B1 gene product MAN1B1, also known as ER mannosidase I, is to function within the ER similar to the yeast ortholog Mns1p, which removes a terminal mannose unit to initiate a glycan-based ER-associated degradation (ERAD) signal. However, we recently discovered that MAN1B1 localizes to the Golgi complex in human cells and uncovered its participation in ERAD substrate retention, retrieval to the ER, and subsequent degradation from this organelle. The objective of the current study was to further characterize the contribution of MAN1B1 as part of a Golgi-based quality control network. Multiple lines of experimental evidence support a model in which neither the mannosidase activity nor catalytic domain is essential for the retention or degradation of the misfolded ERAD substrate Null Hong Kong. Instead, a highly conserved, vertebrate-specific non-enzymatic decapeptide sequence in the luminal stem domain plays a significant role in controlling the fate of overexpressed Null Hong Kong. Together, these findings define a new functional paradigm in which Golgi-localized MAN1B1 can play a mannosidase-independent gatekeeper role in the proteostasis network of higher eukaryotes.
Collapse
|
18
|
Structure of the chicken CD3εδ/γ heterodimer and its assembly with the αβT cell receptor. J Biol Chem 2014; 289:8240-51. [PMID: 24488493 DOI: 10.1074/jbc.m113.544965] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023] Open
Abstract
In mammals, the αβT cell receptor (TCR) signaling complex is composed of a TCRαβ heterodimer that is noncovalently coupled to three dimeric signaling molecules, CD3εδ, CD3εγ, and CD3ζζ. The nature of the TCR signaling complex and subunit arrangement in different species remains unclear however. Here we present a structural and biochemical analysis of the more primitive ancestral form of the TCR signaling complex found in chickens. In contrast to mammals, chickens do not express separate CD3δ and CD3γ chains but instead encode a single hybrid chain, termed CD3δ/γ, that is capable of pairing with CD3ε. The NMR structure of the chicken CD3εδ/γ heterodimer revealed a unique dimer interface that results in a heterodimer with considerable deviation from the distinct side-by-side architecture found in human and murine CD3εδ and CD3εγ. The chicken CD3εδ/γ heterodimer also contains a unique molecular surface, with the vast majority of surface-exposed, nonconserved residues being clustered to a single face of the heterodimer. Using an in vitro biochemical assay, we demonstrate that CD3εδ/γ can assemble with both chicken TCRα and TCRβ via conserved polar transmembrane sites. Moreover, analogous to the human TCR signaling complex, the presence of two copies of CD3εδ/γ is required for ζζ assembly. These data provide insight into the evolution of this critical receptor signaling apparatus.
Collapse
|
19
|
Structure and identification of a pterin dehydratase-like protein as a ribulose-bisphosphate carboxylase/oxygenase (RuBisCO) assembly factor in the α-carboxysome. J Biol Chem 2014; 289:7973-81. [PMID: 24459150 DOI: 10.1074/jbc.m113.531236] [Citation(s) in RCA: 45] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Carboxysomes are proteinaceous bacterial microcompartments that increase the efficiency of the rate-limiting step in carbon fixation by sequestering reaction substrates. Typically, α-carboxysomes are genetically encoded as a single operon expressing the structural proteins and the encapsulated enzymes of the microcompartment. In addition, depending on phylogeny, as many as 13 other genes are found to co-occur near or within α-carboxysome operons. One of these genes codes for a protein with distant homology to pterin-4α-carbinolamine dehydratase (PCD) enzymes. It is present in all α-carboxysome containing bacteria and has homologs in algae and higher plants. Canonical PCDs play an important role in amino acid hydroxylation, a reaction not associated with carbon fixation. We determined the crystal structure of an α-carboxysome PCD-like protein from the chemoautotrophic bacterium Thiomonas intermedia K12, at 1.3-Å resolution. The protein retains a three-dimensional fold similar to canonical PCDs, although the prominent active site cleft present in PCD enzymes is disrupted in the α-carboxysome PCD-like protein. Using a cell-based complementation assay, we tested the PCD-like proteins from T. intermedia and two additional bacteria, and found no evidence for PCD enzymatic activity. However, we discovered that heterologous co-expression of the PCD-like protein from Halothiobacillus neapolitanus with RuBisCO and GroELS in Escherichia coli increased the amount of soluble, assembled RuBisCO recovered from cell lysates compared with co-expression of RuBisCO with GroELS alone. We conclude that this conserved PCD-like protein, renamed here α-carboxysome RuBisCO assembly factor (or acRAF), is a novel RuBisCO chaperone integral to α-carboxysome function.
Collapse
|
20
|
Glycerol dehydrogenase plays a dual role in glycerol metabolism and 2,3-butanediol formation in Klebsiella pneumoniae. J Biol Chem 2014; 289:6080-90. [PMID: 24429283 DOI: 10.1074/jbc.m113.525535] [Citation(s) in RCA: 51] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Glycerol dehydrogenase (GDH) is an important polyol dehydrogenase for glycerol metabolism in diverse microorganisms and for value-added utilization of glycerol in the industry. Two GDHs from Klebsiella pneumoniae, DhaD and GldA, were expressed in Escherichia coli, purified and characterized for substrate specificity and kinetic parameters. Both DhaD and GldA could catalyze the interconversion of (3R)-acetoin/(2R,3R)-2,3-butanediol or (3S)-acetoin/meso-2,3-butanediol, in addition to glycerol oxidation. Although purified GldA appeared more active than DhaD, in vivo inactivation and quantitation of their respective mRNAs indicate that dhaD is highly induced by glycerol and plays a dual role in glycerol metabolism and 2,3-butanediol formation. Complementation in K. pneumoniae further confirmed the dual role of DhaD. Promiscuity of DhaD may have vital physiological consequences for K. pneumoniae growing on glycerol, which include balancing the intracellular NADH/NAD(+) ratio, preventing acidification, and storing carbon and energy. According to the kinetic response of DhaD to modified NADH concentrations, DhaD appears to show positive homotropic interaction with NADH, suggesting that the physiological role could be regulated by intracellular NADH levels. The co-existence of two functional GDH enzymes might be due to a gene duplication event. We propose that whereas DhaD is specialized for glycerol utilization, GldA plays a role in backup compensation and can turn into a more proficient catalyst to promote a survival advantage to the organism. Revelation of the dual role of DhaD could further the understanding of mechanisms responsible for enzyme evolution through promiscuity, and guide metabolic engineering methods of glycerol metabolism.
Collapse
|
21
|
A novel family of human leukocyte antigen class II receptors may have its origin in archaic human species. J Biol Chem 2014; 289:639-53. [PMID: 24214983 PMCID: PMC3887193 DOI: 10.1074/jbc.m113.515767] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2013] [Revised: 11/08/2013] [Indexed: 01/10/2023] Open
Abstract
HLA class II α and β chains form receptors for antigen presentation to CD4(+) T cells. Numerous pairings of class II α and β subunits from the wide range of haplotypes and isotypes may form, but most of these combinations, in particular those produced by isotype mixing, yielded mismatched dimers. It is unclear how selection of functional receptors is achieved. At the atomic level, it is not known which interactions of class II residues regulate selection of matched αβ heterodimers and the evolutionary origin of matched isotype mixed dimer formation. In this study we investigated assembly of isotype-mixed HLA class II α and β heterodimers. Assembly and carbohydrate maturation of various HLA-class II isotype-mixed α and β subunits was dependent on the groove binding section of the invariant chain (Ii). By mutation of polymorphic DPβ sequences, we identified two motifs, Lys-69 and GGPM-(84-87), that are engaged in Ii-dependent assembly of DPβ with DRα. We identified five members of a family of DPβ chains containing Lys-69 and GGPM 84-87, which assemble with DRα. The Lys/GGPM motif is present in the DPβ sequence of the Neanderthal genome, and this ancient sequence is related to the human allele DPB1*0401. By site-directed mutagenesis, we inspected Neanderthal amino acid residues that differ from the DPB1*0401 allele and aimed to determine whether matched heterodimers are formed by assembly of DPβ mutants with DRα. Because the *0401 allele is rare in the sub-Saharan population but frequent in the European population, it may have arisen in modern humans by admixture with Neanderthals in Europe.
Collapse
|
22
|
Tracing primordial protein evolution through structurally guided stepwise segment elongation. J Biol Chem 2013; 289:3394-404. [PMID: 24356963 DOI: 10.1074/jbc.m113.530592] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
The understanding of how primordial proteins emerged has been a fundamental and longstanding issue in biology and biochemistry. For a better understanding of primordial protein evolution, we synthesized an artificial protein on the basis of an evolutionary hypothesis, segment-based elongation starting from an autonomously foldable short peptide. A 10-residue protein, chignolin, the smallest foldable polypeptide ever reported, was used as a structural support to facilitate higher structural organization and gain-of-function in the development of an artificial protein. Repetitive cycles of segment elongation and subsequent phage display selection successfully produced a 25-residue protein, termed AF.2A1, with nanomolar affinity against the Fc region of immunoglobulin G. AF.2A1 shows exquisite molecular recognition ability such that it can distinguish conformational differences of the same molecule. The structure determined by NMR measurements demonstrated that AF.2A1 forms a globular protein-like conformation with the chignolin-derived β-hairpin and a tryptophan-mediated hydrophobic core. Using sequence analysis and a mutation study, we discovered that the structural organization and gain-of-function emerged from the vicinity of the chignolin segment, revealing that the structural support served as the core in both structural and functional development. Here, we propose an evolutionary model for primordial proteins in which a foldable segment serves as the evolving core to facilitate structural and functional evolution. This study provides insights into primordial protein evolution and also presents a novel methodology for designing small sized proteins useful for industrial and pharmaceutical applications.
Collapse
|
23
|
Preservation of protein dynamics in dihydrofolate reductase evolution. J Biol Chem 2013; 288:35961-8. [PMID: 24158440 PMCID: PMC3861645 DOI: 10.1074/jbc.m113.507632] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2013] [Revised: 10/09/2013] [Indexed: 11/06/2022] Open
Abstract
The hydride transfer reaction catalyzed by dihydrofolate reductase (DHFR) is a model for examining how protein dynamics contribute to enzymatic function. The relationship between functional motions and enzyme evolution has attracted significant attention. Recent studies on N23PP Escherichia coli DHFR (ecDHFR) mutant, designed to resemble parts of the human enzyme, indicated a reduced single turnover rate. NMR relaxation dispersion experiments with that enzyme showed rigidification of millisecond Met-20 loop motions (Bhabha, G., Lee, J., Ekiert, D. C., Gam, J., Wilson, I. A., Dyson, H. J., Benkovic, S. J., and Wright, P. E. (2011) Science 332, 234-238). A more recent study of this mutant, however, indicated that fast motions along the reaction coordinate are actually more dispersed than for wild-type ecDHFR (WT). Furthermore, a double mutant (N23PP/G51PEKN) that better mimics the human enzyme seems to restore both the single turnover rates and narrow distribution of fast dynamics (Liu, C. T., Hanoian, P., French, T. H., Hammes-Schiffer, S., and Benkovic, S. J. (2013) Proc. Natl. Acad. Sci. U.S.A. 110, 10159-11064). Here, we measured intrinsic kinetic isotope effects for both N23PP and N23PP/G51PEKN double mutant DHFRs over a temperature range. The findings indicate that although the C-H→C transfer and dynamics along the reaction coordinate are impaired in the altered N23PP mutant, both seem to be restored in the N23PP/G51PEKN double mutant. This indicates that the evolution of G51PEKN, although remote from the Met-20 loop, alleviated the loop rigidification that would have been caused by N23PP, enabling WT-like H-tunneling. The correlation between the calculated dynamics, the nature of C-H→C transfer, and a phylogenetic analysis of DHFR sequences are consistent with evolutionary preservation of the protein dynamics to enable H-tunneling from well reorganized active sites.
Collapse
|
24
|
Abstract
ATP-dependent proteases are responsible for most energy-dependent protein degradation across all species. Proteases initially bind an unstructured region on a substrate and then translocate along the polypeptide chain, unfolding and degrading protein domains as they are encountered. Although this process is normally processive, resulting in the complete degradation of substrate proteins to small peptides, some substrates are released prematurely. Regions of low sequence complexity within the substrate such as the glycine-rich region (GRR) from p105 or glycine-alanine repeats (GAr) from the EBNA1 (Epstein-Barr virus nuclear antigen-1) protein, can trigger partial degradation and fragment release. Loss of processivity could be due to inability to hold on to the substrate (faster release) or inability to unfold and degrade a substrate domain (slower unfolding). I previously showed that the GRR slows domain unfolding by the proteasome (Kraut, D. A., Israeli, E., Schrader, E. K., Patil, A., Nakai, K., Nanavati, D., Inobe, T., and Matouschek, A. (2012) ACS Chem. Biol. 7, 1444-1453). In contrast, a recently published study concluded that GArs increase the rate of substrate release from ClpXP, a bacterial ATP-dependent protease (Too, P. H., Erales, J., Simen, J. D., Marjanovic, A., and Coffino, P. (2013) J. Biol. Chem. 288, 13243-13257). Here, I show that these apparently contradictory results can be reconciled through a reanalysis of the ClpXP GAr data. This reanalysis shows that, as with the proteasome, low complexity sequences in substrates slow their unfolding and degradation by ClpXP, with little effect on release rates. Thus, despite their evolutionary distance and limited sequence identity, both ClpXP and the proteasome share a common mechanism by which substrate sequences regulate the processivity of degradation.
Collapse
|
25
|
Full implementation of the genetic code by tryptophanyl-tRNA synthetase requires intermodular coupling. J Biol Chem 2013; 288:34736-45. [PMID: 24142809 DOI: 10.1074/jbc.m113.510958] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
Tryptophanyl-tRNA Synthetase (TrpRS) Urzyme (fragments A and C), a 130-residue construct containing only secondary structures positioning the HIGH and KMSKS active site signatures and the specificity helix, accelerates tRNA(Trp) aminoacylation with ∼10-fold specificity toward tryptophan, relative to structurally related tyrosine. We proposed that including the 76-residue connecting peptide 1 insertion (Fragment B) might enhance tryptophan affinity and hence amino acid specificity, because that subdomain constrains the orientation of the specificity helix. We test that hypothesis by characterizing two new constructs: the catalytic domain (fragments A-C) and the Urzyme supplemented with the anticodon-binding domain (fragments A, C, and D). The three constructs, together with the full-length enzyme (fragments A-D), comprise a factorial experiment from which we deduce individual and combined contributions of the two modules to the steady-state kinetics parameters for tryptophan-dependent (32)PPi exchange, specificity for tryptophan versus tyrosine, and aminoacylation of tRNA(Trp). Factorial design directly measures the energetic coupling between the two more recent modules in the contemporary enzyme and demonstrates its functionality. Combining the TrpRS Urzyme individually in cis with each module affords an analysis of long term evolution of amino acid specificity and tRNA aminoacylation, both essential for expanding the genetic code. Either module significantly enhances tryptophan activation but unexpectedly eliminates amino acid specificity for tryptophan, relative to tyrosine, and significantly reduces tRNA aminoacylation. Exclusive dependence of both enhanced functionalities of full-length TrpRS on interdomain coupling energies between the two new modules argues that independent recruitment of connecting peptide 1 and the anticodon-binding domain during evolutionary development of Urzymes would have entailed significant losses of fitness.
Collapse
|
26
|
Structural and functional evolution of positively selected sites in pine glutathione S-transferase enzyme family. J Biol Chem 2013; 288:24441-51. [PMID: 23846689 DOI: 10.1074/jbc.m113.456863] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Phylogenetic analyses have identified positive selection as an important driver of protein evolution, both structural and functional. However, the lack of appropriate combined functional and structural assays has generally hindered attempts to elucidate patterns of positively selected sites and their effects on enzyme activity and substrate specificity. In this study we investigated the evolutionary divergence of the glutathione S-transferase (GST) family in Pinus tabuliformis, a pine that is widely distributed from northern to central China, including cold temperate and drought-stressed regions. GSTs play important roles in plant stress tolerance and detoxification. We cloned 44 GST genes from P. tabuliformis and found that 26 of the 44 belong to the largest (Tau) class of GSTs and are differentially expressed across tissues and developmental stages. Substitution models identified five positively selected sites in the Tau GSTs. To examine the functional significance of these positively selected sites, we applied protein structural modeling and site-directed mutagenesis. We found that four of the five positively selected sites significantly affect the enzyme activity and specificity; thus their variation broadens the GST family substrate spectrum. In addition, positive selection has mainly acted on secondary substrate binding sites or sites close to (but not directly at) the primary substrate binding site; thus their variation enables the acquisition of new catalytic functions without compromising the protein primary biochemical properties. Our study sheds light on selective aspects of the functional and structural divergence of the GST family in pine and other organisms.
Collapse
|
27
|
Danio rerio αE-catenin is a monomeric F-actin binding protein with distinct properties from Mus musculus αE-catenin. J Biol Chem 2013; 288:22324-32. [PMID: 23788645 DOI: 10.1074/jbc.m113.458406] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
Abstract
It is unknown whether homologs of the cadherin·catenin complex have conserved structures and functions across the Metazoa. Mammalian αE-catenin is an allosterically regulated actin-binding protein that binds the cadherin·β-catenin complex as a monomer and whose dimerization potentiates F-actin association. We tested whether these functional properties are conserved in another vertebrate, the zebrafish Danio rerio. Here we show, despite 90% sequence identity, that Danio rerio and Mus musculus αE-catenin have striking functional differences. We demonstrate that D. rerio αE-catenin is monomeric by size exclusion chromatography, native PAGE, and small angle x-ray scattering. D. rerio αE-catenin binds F-actin in cosedimentation assays as a monomer and as an α/β-catenin heterodimer complex. D. rerio αE-catenin also bundles F-actin, as shown by negative stained transmission electron microscopy, and does not inhibit Arp2/3 complex-mediated actin nucleation in bulk polymerization assays. Thus, core properties of α-catenin function, F-actin and β-catenin binding, are conserved between mouse and zebrafish. We speculate that unique regulatory properties have evolved to match specific developmental requirements.
Collapse
|
28
|
Abstract
ADAMDEC1 (Decysin-1) is a putative ADAM (a disintegrin and metalloprotease)-like metalloprotease with an unknown physiological role, selectively expressed in mature dendritic cells and macrophages. When compared with other members of the ADAM family, ADAMDEC1 displays some unusual features. It lacks the auxiliary cysteine-rich, EGF, and transmembrane domains, as well as the cytoplasmic tail. The active site of ADAMDEC1 is unique by being the only mammalian ADAM protease with a non-histidine zinc ligand, having an aspartic acid residue instead. Here we demonstrate that ADAMDEC1, despite these unique features, functions as an active metalloprotease. Thus, ADAMDEC1 is secreted as a mature, glycosylated, and proteolytically active metalloprotease, capable of cleaving macromolecular substrates. In the recombinant form, three of the four potential N-linked glycosylation sites are modified by carbohydrate attachment. Substitution of basic residues at the predicted proprotein convertase cleavage site blocks proprotein processing, revealing both specific ADAMDEC1-dependent and specific ADAMDEC1-independent cleavage of the prodomain. The pro-form of ADAMDEC1 does not have proteolytic activity, demonstrating that the prodomain of ADAMDEC1, like in other members of the ADAM family, confers catalytic latency. Interestingly, the proteolytic activity of mature ADAMDEC1 can be significantly enhanced when a canonical ADAM active site with three zinc-coordinating histidine residues is introduced.
Collapse
|
29
|
Structure prediction and analysis of DNA transposon and LINE retrotransposon proteins. J Biol Chem 2013; 288:16127-38. [PMID: 23530042 PMCID: PMC3668768 DOI: 10.1074/jbc.m113.451500] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2013] [Revised: 03/21/2013] [Indexed: 01/15/2023] Open
Abstract
Despite the considerable amount of research on transposable elements, no large-scale structural analyses of the TE proteome have been performed so far. We predicted the structures of hundreds of proteins from a representative set of DNA and LINE transposable elements and used the obtained structural data to provide the first general structural characterization of TE proteins and to estimate the frequency of TE domestication and horizontal transfer events. We show that 1) ORF1 and Gag proteins of retrotransposons contain high amounts of structural disorder; thus, despite their very low conservation, the presence of disordered regions and probably their chaperone function is conserved. 2) The distribution of SCOP classes in DNA transposons and LINEs indicates that the proteins of DNA transposons are more ancient, containing folds that already existed when the first cellular organisms appeared. 3) DNA transposon proteins have lower contact order than randomly selected reference proteins, indicating rapid folding, most likely to avoid protein aggregation. 4) Structure-based searches for TE homologs indicate that the overall frequency of TE domestication events is low, whereas we found a relatively high number of cases where horizontal transfer, frequently involving parasites, is the most likely explanation for the observed homology.
Collapse
|
30
|
Simultaneous surface display and secretion of proteins from mammalian cells facilitate efficient in vitro selection and maturation of antibodies. J Biol Chem 2013; 288:19861-9. [PMID: 23689374 DOI: 10.1074/jbc.m113.452482] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023] Open
Abstract
A mammalian expression system has been developed that permits simultaneous cell surface display and secretion of the same protein through alternate splicing of pre-mRNA. This enables a flexible system for in vitro protein evolution in mammalian cells where the displayed protein phenotype remains linked to genotype, but with the advantage of soluble protein also being produced without the requirement for any further recloning to allow a wide range of assays, including biophysical and cell-based functional assays, to be used during the selection process. This system has been used for the simultaneous surface presentation and secretion of IgG during antibody discovery and maturation. Presentation and secretion of monomeric Fab can also be achieved to minimize avidity effects. Manipulation of the splice donor site sequence enables control of the relative amounts of cell surface and secreted antibody. Multi-domain proteins may be presented and secreted in different formats to enable flexibility in experimental design, and secreted proteins may be produced with epitope tags to facilitate high-throughput testing. This system is particularly useful in the context of in situ mutagenesis, as in the case of in vitro somatic hypermutation.
Collapse
|
31
|
The evolutionary rewiring of the ribosomal protein transcription pathway modifies the interaction of transcription factor heteromer Ifh1-Fhl1 (interacts with forkhead 1-forkhead-like 1) with the DNA-binding specificity element. J Biol Chem 2013; 288:17508-19. [PMID: 23625919 DOI: 10.1074/jbc.m112.436683] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open
Abstract
The genes encoding the ribosomal proteins of fungi form a regulon whose expression is enhanced under good growth conditions and down-regulated under starvation conditions. The fungal pathogen Candida albicans contains an evolutionarily ancient control circuit for this regulon where a heteromer made up of the transcription regulators Ifh1 (interacts with Forkhead 1) and Fhl1 (Forkhead-like 1) is targeted to the ribosomal protein genes by the DNA binding factor Tbf1. In the more recently evolved circuit in the model yeast Saccharomyces cerevisiae (Sc), the generalist repressor-activator protein Rap1 now directs the Ifh1-Fhl1 module to the ribosomal protein genes. Even though overall sequence similarity is low for the respective Fhl1 and Ifh1 subunits, in both species, the Ifh1 protein links to the Forkhead-associated domain of Fhl1 through its FHB domain. Intriguingly, correlated with the transition to the Rap1-regulated circuit, the Sc-Ifh1 contains a Rap1 binding domain that is not present in the C. albicans protein. Because no extensive common sequences are found in Tbf1 and Rap1, it appears that these targeting proteins must connect to the Ifh1-Fhl1 module in distinct ways. Two-hybrid and co-immunoprecipitation analysis has been used to show that in C. albicans Tbf1 is linked to the heterodimer through direct association with Fhl1. By contrast, in S. cerevisiae, the linkage of the heteromer to Rap1 occurs through Ifh1. Thus, in the ascomycetes, the Ifh1-Fhl1 heterodimer has reconfigured its protein associations to remain connected to the ribosomal protein regulon despite rewiring of the targeting transcription factor from Tbf1 to Rap1.
Collapse
|
32
|
Phosphatidylinositol transfer proteins: sequence motifs in structural and evolutionary analyses. ACTA ACUST UNITED AC 2010; 3:65-77. [PMID: 27429707 DOI: 10.4236/jbise.2010.31010] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
Phosphatidylinositol transfer proteins (PITP) are a family of monomeric proteins that bind and transfer phosphatidylinositol and phosphatidylcholine between membrane compartments. They are required for production of inositol and diacylglycerol second messengers, and are found in most metazoan organisms. While PITPs are known to carry out crucial cell-signaling roles in many organisms, the structure, function and evolution of the majority of family members remains unexplored; primarily because the ubiquity and diversity of the family thwarts traditional methods of global alignment. To surmount this obstacle, we instead took a novel approach, using MEME and a parsimony-based analysis to create a cladogram of conserved sequence motifs in 56 PITP family proteins from 26 species. In keeping with previous functional annotations, three clades were supported within our evolutionary analysis; two classes of soluble proteins and a class of membrane-associated proteins. By, focusing on conserved regions, the analysis allowed for in depth queries regarding possible functional roles of PITP proteins in both intra- and extra- cellular signaling.
Collapse
|