1
|
The Functional Significance of High Cysteine Content in Eye Lens γ-Crystallins. Biomolecules 2024; 14:594. [PMID: 38786000 PMCID: PMC11118217 DOI: 10.3390/biom14050594] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/28/2024] [Revised: 05/07/2024] [Accepted: 05/14/2024] [Indexed: 05/25/2024] Open
Abstract
Cataract disease is strongly associated with progressively accumulating oxidative damage to the extremely long-lived crystallin proteins of the lens. Cysteine oxidation affects crystallin folding, interactions, and light-scattering aggregation especially strongly due to the formation of disulfide bridges. Minimizing crystallin aggregation is crucial for lifelong lens transparency, so one might expect the ubiquitous lens crystallin superfamilies (α and βγ) to contain little cysteine. Yet, the Cys content of γ-crystallins is well above the average for human proteins. We review literature relevant to this longstanding puzzle and take advantage of expanding genomic databases and improved machine learning tools for protein structure prediction to investigate it further. We observe remarkably low Cys conservation in the βγ-crystallin superfamily; however, in γ-crystallin, the spatial positioning of Cys residues is clearly fine-tuned by evolution. We propose that the requirements of long-term lens transparency and high lens optical power impose competing evolutionary pressures on lens βγ-crystallins, leading to distinct adaptations: high Cys content in γ-crystallins but low in βB-crystallins. Aquatic species need more powerful lenses than terrestrial ones, which explains the high methionine content of many fish γ- (and even β-) crystallins. Finally, we discuss synergies between sulfur-containing and aromatic residues in crystallins and suggest future experimental directions.
|
2
|
Trehalose synthases from the subfamily GH13_16 involved in α-glucan biosynthesis - a focus on their maltokinase domain. Int J Biol Macromol 2024; 268:131680. [PMID: 38641282 DOI: 10.1016/j.ijbiomac.2024.131680] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/21/2024] [Revised: 04/16/2024] [Accepted: 04/16/2024] [Indexed: 04/21/2024]
Abstract
The subfamily GH13_16 trehalose synthase (TreS) converts maltose to trehalose and vice versa. Typically, it consists of three domains, but it may contain a C-terminal extension exhibiting clear sequence features of a maltokinase (MaK). The present in silico study focused on collecting naturally fused TreS-MaKs and analysing them in detail by bioinformatics. A total of 3354 unique sequences were compared, comprising 1900 single TreSs, 1426 fused TreS-MaKs and 28 single MaKs. Fused TreS-MaKs were divided into five groups: those with a standard MaK, and those with mutations in the maltose-binding site, of the catalytic nucleophile, of the general acid/base, or of both catalytic residues. Sequence logos bearing the best conserved sequence regions were prepared for both TreSs and MaKs in an effort to find unique sequence features. In addition, linkers connecting the TreS and MaK parts in the fused enzymes were analysed. This analysis revealed that MaKs in fused enzymes have extended N-terminal regions compared to single MaKs. Finally, the evolutionary relationships were demonstrated by phylogenetic trees of TreS parts from single TreSs and fused TreS-MaKs from the same organism as well as of single TreSs existing in multiple isoforms in the same organism.
|
3
|
Significance of Histidine Hydrogen-Deuterium Exchange Mass Spectrometry in Protein Structural Biology. BIOLOGY 2024; 13:37. [PMID: 38248468 PMCID: PMC10813008 DOI: 10.3390/biology13010037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/08/2023] [Revised: 01/04/2024] [Accepted: 01/06/2024] [Indexed: 01/23/2024]
Abstract
Histidine residues play crucial roles in shaping the function and structure of proteins due to their unique ability to act as both acids and bases. In other words, they can serve as proton donors and acceptors at physiological pH. This exceptional property is attributed to the side-chain imidazole ring of histidine residues. Consequently, determining the acid-base dissociation constant (Ka) of histidine imidazole rings in proteins often yields valuable insights into protein functions. Significant efforts have been dedicated to measuring the pKa values of histidine residues in various proteins, with nuclear magnetic resonance (NMR) spectroscopy being the most commonly used technique. However, NMR-based methods encounter challenges in assigning signals to individual imidazole rings and require substantial amounts of protein. To address these issues, a mass-spectrometry-based method known as histidine hydrogen-deuterium exchange mass spectrometry (His-HDX-MS) has been developed. This technique not only determines the pKa values of histidine imidazole groups but also quantifies their solvent accessibility. His-HDX-MS has proven effective across diverse proteins. This review aims to clarify the fundamental principles of His-HDX-MS, detail the experimental workflow, explain data analysis procedures and provide guidance for interpreting the obtained results.
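As a brief illustration of why the imidazole pKa matters, the sketch below applies the Henderson-Hasselbalch relation to estimate the protonated fraction of a histidine side chain at a given pH. It is not taken from the review; the function name and the pKa of 6.5 in the demo loop are illustrative assumptions, not measured values.

```python
def protonated_fraction(ph: float, pka: float) -> float:
    """Fraction of imidazole rings in the protonated (acid) form.

    Henderson-Hasselbalch: [HA] / ([HA] + [A-]) = 1 / (1 + 10**(pH - pKa)).
    """
    return 1.0 / (1.0 + 10.0 ** (ph - pka))


if __name__ == "__main__":
    # pKa = 6.5 is an assumed, illustrative value for a histidine side chain.
    for ph in (5.0, 6.5, 7.4):
        print(f"pH {ph}: protonated fraction = {protonated_fraction(ph, 6.5):.3f}")
```

When pH equals pKa, the fraction is exactly one half, which is why a small shift in a residue's pKa near physiological pH can change its dominant protonation state.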
|
4
|
Amino acid intake strategies define pluripotent cell states. Nat Metab 2024; 6:127-140. [PMID: 38172382 PMCID: PMC10842923 DOI: 10.1038/s42255-023-00940-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/04/2022] [Accepted: 11/07/2023] [Indexed: 01/05/2024]
Abstract
Mammalian preimplantation development is associated with marked metabolic robustness, and embryos can develop under a wide variety of nutrient conditions, including even the complete absence of soluble amino acids. Here we show that mouse embryonic stem cells (ESCs) capture the unique metabolic state of preimplantation embryos and proliferate in the absence of several essential amino acids. Amino acid independence is enabled by constitutive uptake of exogenous protein through macropinocytosis, alongside a robust lysosomal digestive system. Following transition to more committed states, ESCs reduce digestion of extracellular protein and instead become reliant on exogenous amino acids. Accordingly, amino acid withdrawal selects for ESCs that mimic the preimplantation epiblast. More broadly, we find that all lineages of preimplantation blastocysts exhibit constitutive macropinocytic protein uptake and digestion. Taken together, these results highlight exogenous protein uptake and digestion as an intrinsic feature of preimplantation development and provide insight into the catabolic strategies that enable embryos to sustain viability before implantation.
|
5
|
Somatic mutations of MLL4/COMPASS induce cytoplasmic localization providing molecular insight into cancer prognosis and treatment. Proc Natl Acad Sci U S A 2023; 120:e2310063120. [PMID: 38113256 PMCID: PMC10756272 DOI: 10.1073/pnas.2310063120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/14/2023] [Accepted: 11/17/2023] [Indexed: 12/21/2023] Open
Abstract
Cancer genome sequencing consortiums have recently catalogued an abundance of somatic mutations, across a wide range of human cancers, in the chromatin-modifying enzymes that regulate gene expression. Defining the molecular mechanisms underlying the potentially oncogenic functions of these epigenetic mutations could serve as the basis for precision medicine approaches to cancer therapy. MLL4, encoded by the KMT2D gene, which is highly mutated in a large number of human cancers, is a key histone lysine monomethyltransferase within the Complex of Proteins Associated with Set1 (COMPASS) family that regulates gene expression through enhancer function, potentially functioning as a tumor suppressor. We report that the KMT2D mutations that cause MLL4 protein truncation also alter MLL4's subcellular localization, resulting in loss-of-function in the nucleus and gain-of-function in the cytoplasm. We demonstrate that isogenic correction of a KMT2D truncation mutation rescues the aberrant localization phenotype and restores multiple regulatory functions of MLL4, including COMPASS integrity/stabilization, histone H3K4 mono-methylation, enhancer activation, and therefore transcriptional regulation. Moreover, isogenic correction diminishes the sensitivity of KMT2D-mutated cancer cells to targeted metabolic inhibition. Using immunohistochemistry, we identified that cytoplasmic MLL4 is unique to the tissue of bladder cancer patients with KMT2D truncation mutations. Using a preclinical carcinogen model of bladder cancer in mouse, we demonstrate that truncated cytoplasmic MLL4 predicts response to targeted metabolic inhibition therapy for bladder cancer and could be developed as a biomarker for KMT2D-mutated cancers. We also highlight the broader potential for prognosis, patient stratification and treatment decision-making based on KMT2D mutation status in MLL4 truncation-relevant diseases, including human cancers and Kabuki Syndrome.
|
6
|
Comparison of cysteine content in whole proteomes across the three domains of life. PLoS One 2023; 18:e0294268. [PMID: 37956129 PMCID: PMC10642813 DOI: 10.1371/journal.pone.0294268] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/19/2023] [Accepted: 10/29/2023] [Indexed: 11/15/2023] Open
Abstract
An empirical observation suggests that Giardia lamblia proteins have larger cysteine content than their counterparts in other organisms. As this parasite lacks conventional antioxidant stress systems, it is generally accepted that high cysteine content helps G. lamblia cope with oxygen toxicity, a strategy apparently shared by other organisms. Here, we question whether the high cysteine content in some organisms is genuine or just a simple assumption based on singular observations. To this end, we analyzed the cysteine content in 78 proteomes of organisms spanning the three domains of life. The results indicate that the cysteine content in eukaryota is approximately double that in archaea and bacteria, with G. lamblia among the highest. Atypical cysteine contents were found in a few organisms correlating with specific environmental conditions, supporting the evolutionary amino acid-level selection of amino acid composition.
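A proteome-level cysteine content of this kind can be computed along the following lines. This is a minimal sketch, not the authors' pipeline: the function name is hypothetical, and the input is assumed to be an iterable of one-letter amino acid sequences already parsed from a proteome file.

```python
from typing import Iterable


def cysteine_fraction(sequences: Iterable[str]) -> float:
    """Return the fraction of all residues in the given sequences that are Cys."""
    total_residues = 0
    cysteines = 0
    for seq in sequences:
        seq = seq.upper()
        total_residues += len(seq)
        cysteines += seq.count("C")
    if total_residues == 0:
        raise ValueError("no residues supplied")
    return cysteines / total_residues


if __name__ == "__main__":
    # Toy sequences for illustration only, not real proteins.
    toy_proteome = ["ACDC", "GGGG"]
    print(f"Cys content: {100 * cysteine_fraction(toy_proteome):.1f}%")
```

Pooling residues across all proteins weights long proteins more heavily than averaging per-protein fractions would; which convention a given study uses should be checked against its methods.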
|
7
|
Main Factors Shaping Amino Acid Usage Across Evolution. J Mol Evol 2023:10.1007/s00239-023-10120-5. [PMID: 37264211 DOI: 10.1007/s00239-023-10120-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/22/2023] [Accepted: 05/17/2023] [Indexed: 06/03/2023]
Abstract
The standard genetic code determines that in most species, including viruses, 20 amino acids are coded by 61 codons, while the other three codons are stop triplets. Considering the whole proteome, each species features its own amino acid frequencies; given the slow rate of change, closely related species display similar GC content and amino acid usage. In contrast, distantly related species display different amino acid frequencies. Furthermore, within certain multicellular species, such as mammals, intragenomic differences in the usage of amino acids are evident. In this communication, we summarize some of the most prominent and well-established factors that determine the differences found in amino acid usage, both across evolution and intragenomically.
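The codon arithmetic in the first sentence can be checked directly. The short sketch below enumerates all nucleotide triplets and separates the three stop codons of the standard genetic code (written here in the DNA alphabet); the variable names are illustrative.

```python
from itertools import product

# Stop triplets of the standard genetic code, in DNA notation.
STOP_CODONS = {"TAA", "TAG", "TGA"}

# All 4**3 = 64 possible triplets over the four nucleotides.
all_codons = ["".join(triplet) for triplet in product("ACGT", repeat=3)]

# Sense codons are the triplets that encode an amino acid.
sense_codons = [codon for codon in all_codons if codon not in STOP_CODONS]

if __name__ == "__main__":
    print(len(all_codons), "codons in total")
    print(len(sense_codons), "sense codons")
    print(len(STOP_CODONS), "stop codons")
```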
|
8
|
An overview of descriptors to capture protein properties - Tools and perspectives in the context of QSAR modeling. Comput Struct Biotechnol J 2023; 21:3234-3247. [PMID: 38213891 PMCID: PMC10781719 DOI: 10.1016/j.csbj.2023.05.022] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 03/13/2023] [Revised: 05/23/2023] [Accepted: 05/23/2023] [Indexed: 01/13/2024] Open
Abstract
Proteins are important ingredients in food and feed, they are the active components of many pharmaceutical products, and they are necessary, in the form of enzymes, for the success of many technical processes. However, production can be challenging, especially when using heterologous host cells such as bacteria to express and assemble recombinant mammalian proteins. The manufacturability of proteins can be hindered by low solubility, a tendency to aggregate, or inefficient purification. Tools such as in silico protein engineering and models that predict separation criteria can overcome these issues but usually require the complex shape and surface properties of proteins to be represented by a small number of quantitative numeric values known as descriptors, as similarly used to capture the features of small molecules. Here, we review the current status of protein descriptors, especially for application in quantitative structure activity relationship (QSAR) models. First, we describe the complexity of proteins and the properties that descriptors must accommodate. Then we introduce descriptors of shape and surface properties that quantify the global and local features of proteins. Finally, we highlight the current limitations of protein descriptors and propose strategies for the derivation of novel protein descriptors that are more informative.
|
9
|
A Structure-Based Mechanism for the Denaturing Action of Urea, Guanidinium Ion and Thiocyanate Ion. BIOLOGY 2022; 11:biology11121764. [PMID: 36552273 PMCID: PMC9775367 DOI: 10.3390/biology11121764] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 10/31/2022] [Revised: 11/28/2022] [Accepted: 12/02/2022] [Indexed: 12/12/2022]
Abstract
An exhaustive analysis of all the protein structures deposited in the Protein Data Bank, performed here, has allowed the identification of hundreds of protein-bound urea molecules and the structural characterization of their binding sites. It emerged that, even though urea molecules are largely involved in hydrogen bonds with both backbone and side chains, they are also able to make van der Waals contacts with nonpolar moieties. As similar findings have previously been reported for guanidinium and thiocyanate, this observation suggests that promiscuity is a general property of protein denaturants. The present data provide strong support for a mechanism based on direct protein-denaturant interactions, with a denaturant binding model of equal and independent sites. In this general framework, our investigations also offer some insights into the different denaturing power of urea compared to guanidinium/thiocyanate.
|
10
|
Exonic splicing code and protein binding sites for calcium. Nucleic Acids Res 2022; 50:5493-5512. [PMID: 35474482 PMCID: PMC9177970 DOI: 10.1093/nar/gkac270] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 01/26/2022] [Revised: 04/01/2022] [Accepted: 04/05/2022] [Indexed: 11/12/2022] Open
Abstract
Auxiliary splicing sequences in exons, known as enhancers (ESEs) and silencers (ESSs), have been subject to strong selection pressures at the RNA and protein level. The protein component of this splicing code is substantial, recently estimated at ∼50% of the total information within ESEs, but remains poorly understood. The ESE/ESS profiles were previously associated with the Irving-Williams (I-W) stability series for divalent metals, suggesting that the ESE/ESS evolution was shaped by metal binding sites. Here, we have examined splicing activities of exonic sequences that encode protein binding sites for Ca2+, a weak binder in the I-W affinity order. We found that predicted exon inclusion levels for the EF-hand motifs and for Ca2+-binding residues in nonEF-hand proteins were higher than for average exons. For canonical EF-hands, the increase was centred on the EF-hand chelation loop and, in particular, on Ca2+-coordinating residues, with a 1>12>3∼5>9 hierarchy in the 12-codon loop consensus and usage bias at codons 1 and 12. The same hierarchy but a lower increase was observed for noncanonical EF-hands, except for S100 proteins. EF-hand loops preferentially accumulated exon splits in two clusters, one located in their N-terminal halves and the other around codon 12. Using splicing assays and published crosslinking and immunoprecipitation data, we identify candidate trans-acting factors that preferentially bind conserved GA-rich motifs encoding negatively charged amino acids in the loops. Together, these data provide evidence for the high capacity of codons for Ca2+-coordinating residues to be retained in mature transcripts, facilitating their exon-level expansion during eukaryotic evolution.
|
11
|
A simple model of protein cold denaturation. Chem Phys Lett 2022. [DOI: 10.1016/j.cplett.2022.139504] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/24/2022]
|
12
|
A Protein Data Bank survey of multimodal binding of thiocyanate to proteins: Evidence for thiocyanate promiscuity. Int J Biol Macromol 2022; 208:29-36. [PMID: 35259436 DOI: 10.1016/j.ijbiomac.2022.03.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/21/2021] [Revised: 02/16/2022] [Accepted: 03/02/2022] [Indexed: 11/28/2022]
Abstract
Over the last century and a half, myriad studies have demonstrated that Hofmeister ions have a major impact on protein stability and solubility. Nevertheless, the definition of the physico-chemical basis of their activity has proved to be highly challenging and controversial. Here, by exploiting the enormous information content of the Protein Data Bank, we explored the binding to proteins of thiocyanate, the anion of the series exerting the highest solubilization/destabilization effects. The survey, which led to the identification and characterization of 712 thiocyanate binding sites, provides a comprehensive and atomic-level view of the varied interactions that the ion forms with proteins. The inspection of these sites highlights a limited tendency of thiocyanate to interact with structured water molecules, in line with the reported poor hydration of the ion. On the other hand, thiocyanate interacts with nonpolar protein moieties, especially the backbone Cα atom. In as many as 104 cases, the ion exclusively makes nonpolar contacts. In conclusion, these findings suggest that the ability of thiocyanate to bind all types of exposed protein patches may lead to the formation of a negatively charged electrostatic barrier that could prevent protein-protein aggregation and promote protein solubility. Moreover, the denaturing action of thiocyanate may be ascribed to its ability to establish multiple attractive interactions with protein surfaces.
|
13
|
A thermodynamic atlas of proteomes reveals energetic innovation across the tree of life. Mol Biol Evol 2022; 39:6509521. [PMID: 35038744 PMCID: PMC8896757 DOI: 10.1093/molbev/msac010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/12/2022] Open
Abstract
Protein stability is a fundamental molecular property enabling organisms to adapt to their biological niches. How this is facilitated and whether there are kingdom-specific or more general universal strategies is not known. A principal obstacle to addressing this issue is that the vast majority of proteins lack annotation, specifically thermodynamic annotation, beyond the amino acid and chromosome information derived from genome sequencing. To address this gap and facilitate future investigation into large-scale patterns of protein stability and dynamics within and between organisms, we applied a unique ensemble-based thermodynamic characterization of protein folds to a substantial portion of extant sequenced genomes. Using this approach, we compiled a database resource focused on the position-specific variation in protein stability. Interrogation of the database reveals that 1) domains of life exhibit distinguishing thermodynamic features, with eukaryotes particularly different from both archaea and bacteria, 2) the optimal growth temperature of an organism is proportional to the average apolar enthalpy of its proteome, 3) intrinsic disorder content is also proportional to the apolar enthalpy (but unexpectedly not the predicted stability at 25 °C), and 4) secondary structure and global stability information of individual proteins is extractable. We hypothesize that wider access to residue-specific thermodynamic information of proteomes will result in deeper understanding of mechanisms driving functional adaptation and protein evolution. Our database is free for download at https://afc-science.github.io/thermo-env-atlas/.
|
14
|
PEGylation Increases the Strength of a Nearby NH-π Hydrogen Bond in the WW Domain. Biochemistry 2021; 60:2064-2070. [PMID: 34137579 DOI: 10.1021/acs.biochem.1c00132] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/30/2022]
Abstract
Here we show that an NH-π interaction between a highly conserved Asn and a nearby Trp stabilizes the WW domain of the human protein Pin1. The strength of this NH-π interaction depends on the structure of the arene, with NH-π interactions involving Trp or naphthylalanine being substantially more stabilizing than those involving Tyr or Phe. Calculations suggest arene size and polarizability are key structural determinants of NH-π interaction strength. Methylation or PEGylation of the Asn side-chain amide nitrogen each strengthens the associated NH-π interaction, though likely for different reasons. We hypothesize that methylation introduces steric clashes that destabilize conformations in which the NH-π interaction is not possible, whereas PEGylation strengthens the NH-π interaction via localized desolvation of the protein surface.
|
15
|
Comparison of Three Glycoproteomic Methods for the Analysis of the Secretome of CHO Cells Treated with 1,3,4-O-Bu3ManNAc. Bioengineering (Basel) 2020; 7:bioengineering7040144. [PMID: 33182731 PMCID: PMC7712478 DOI: 10.3390/bioengineering7040144] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 09/22/2020] [Revised: 10/24/2020] [Accepted: 11/06/2020] [Indexed: 01/08/2023] Open
Abstract
Comprehensive analysis of the glycoproteome is critical due to the importance of glycosylation to many aspects of protein function. The tremendous complexity of this post-translational modification, however, makes it difficult to adequately characterize the glycoproteome using any single method. To overcome this pitfall, in this report we compared three glycoproteomic analysis methods by characterizing N-linked glycosites in the secretome of Chinese hamster ovary (CHO) cells: first, the recently developed N-linked glycans and glycosite-containing peptides (NGAG) chemoenzymatic method; second, solid-phase extraction of N-linked glycoproteins (SPEG); and third, hydrophilic interaction liquid chromatography (HILIC). Interestingly, the glycosites identified by SPEG and HILIC overlapped considerably whereas NGAG identified many glycosites not observed in the other two methods. Further, utilizing enhanced intact glycopeptide identification afforded by the NGAG workflow, we found that the sugar analog 1,3,4-O-Bu3ManNAc, a "high flux" metabolic precursor for sialic acid biosynthesis, increased sialylation of secreted proteins including recombinant human erythropoietin (rhEPO).
|
16
|
Sequence Characterization and Molecular Modeling of Clinically Relevant Variants of the SARS-CoV-2 Main Protease. Biochemistry 2020; 59:3741-3756. [PMID: 32931703 PMCID: PMC7518256 DOI: 10.1021/acs.biochem.0c00462] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Received: 06/02/2020] [Revised: 09/12/2020] [Indexed: 02/08/2023]
Abstract
The SARS-CoV-2 main protease (Mpro) is essential to viral replication and cleaves highly specific substrate sequences, making it an obvious target for inhibitor design. However, as for any virus, SARS-CoV-2 is subject to constant neutral drift and selection pressure, with new Mpro mutations arising over time. Identification and structural characterization of Mpro variants is thus critical for robust inhibitor design. Here we report sequence analysis, structure predictions, and molecular modeling for seventy-nine Mpro variants, constituting all clinically observed mutations in this protein as of April 29, 2020. Residue substitution is widely distributed, with some tendency toward larger and more hydrophobic residues. Modeling and protein structure network analysis suggest differences in cohesion and active site flexibility, revealing patterns in viral evolution that have relevance for drug discovery.
|
17
|
Sequence characterization and molecular modeling of clinically relevant variants of the SARS-CoV-2 main protease. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2020:2020.05.15.097493. [PMID: 32511408 PMCID: PMC7263555 DOI: 10.1101/2020.05.15.097493] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 04/28/2023]
Abstract
The SARS-CoV-2 main protease (Mpro) is essential to viral replication and cleaves highly specific substrate sequences, making it an obvious target for inhibitor design. However, as for any virus, SARS-CoV-2 is subject to constant selection pressure, with new Mpro mutations arising over time. Identification and structural characterization of Mpro variants is thus critical for robust inhibitor design. Here we report sequence analysis, structure predictions, and molecular modeling for seventy-nine Mpro variants, constituting all clinically observed mutations in this protein as of April 29, 2020. Residue substitution is widely distributed, with some tendency toward larger and more hydrophobic residues. Modeling and protein structure network analysis suggest differences in cohesion and active site flexibility, revealing patterns in viral evolution that have relevance for drug discovery.
|
18
|
A new class of disordered elements controls DNA replication through initiator self-assembly. eLife 2019; 8:e48562. [PMID: 31560342 PMCID: PMC6764820 DOI: 10.7554/elife.48562] [Citation(s) in RCA: 73] [Impact Index Per Article: 14.6] [Received: 05/17/2019] [Accepted: 08/14/2019] [Indexed: 12/11/2022] Open
Abstract
The initiation of DNA replication in metazoans occurs at thousands of chromosomal sites known as origins. At each origin, the Origin Recognition Complex (ORC), Cdc6, and Cdt1 co-assemble to load the Mcm2-7 replicative helicase onto chromatin. Current replication models envisage a linear arrangement of isolated origins functioning autonomously; the extent of inter-origin organization and communication is unknown. Here, we report that the replication initiation machinery of D. melanogaster unexpectedly undergoes liquid-liquid phase separation (LLPS) upon binding DNA in vitro. We find that ORC, Cdc6, and Cdt1 contain intrinsically disordered regions (IDRs) that drive LLPS and constitute a new class of phase separating elements. Initiator IDRs are shown to regulate multiple functions, including chromosome recruitment, initiator-specific co-assembly, and Mcm2-7 loading. These data help explain how CDK activity controls replication initiation and suggest that replication programs are subject to higher-order levels of inter-origin organization.
|
19
|
Abstract
Background: Cells contain a diverse array of proteins with different molecular weights and isoelectric points (pI). The molecular weight and pI of a protein play an important role in determining its molecular biochemical function. Therefore, it is important to understand in detail the molecular weight and pI of plant proteins. Results: A proteome-wide analysis of plant proteomes from 145 species revealed a pI range of 1.99 (epsin) to 13.96 (hypothetical protein). The molecular mass of the plant proteins varied from 0.54 to 2236.8 kDa. A putative Type-I polyketide synthase (22244 amino acids) in Volvox carteri was found to be the largest protein in the plant kingdom. However, Type-I polyketide synthase was not found in higher plant species. Titin (806.46 kDa) and misin/midasin (730.02 kDa) were the largest proteins identified in higher plant species. The pI and molecular weight of the plant proteins showed a trimodal distribution. An acidic pI (56.44% of proteins) was found to be predominant over a basic pI (43.34% of proteins), and the abundance of acidic pI proteins was higher in unicellular algae species relative to multicellular higher plants. In contrast, the seaweed Porphyra umbilicalis possesses a higher proportion of basic pI proteins (70.09%). Plant proteomes were also found to contain selenocysteine (Sec), an amino acid that was found only in the lower eukaryotic aquatic plant lineage. Amino acid composition analysis showed that Leu was a high-abundance and Trp a low-abundance amino acid in the plant proteome. Additionally, the plant proteomes also contain the ambiguous amino acids Xaa (unknown), Asx (asparagine or aspartic acid), Glx (glutamine or glutamic acid), and Xle (leucine or isoleucine). Conclusion: The diverse molecular weight and isoelectric point range of the plant proteome will be helpful for understanding its biochemical and functional aspects. The presence of selenocysteine proteins in lower eukaryotic organisms is of interest, and their expression in a higher plant system can help us to understand their functional role. Electronic supplementary material: The online version of this article (10.1186/s12864-019-5983-8) contains supplementary material, which is available to authorized users.
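For readers unfamiliar with how a molecular mass is derived from a sequence, the sketch below sums average residue masses and adds one water for the free termini. It is an illustration only, not the authors' workflow: the function name is hypothetical, the residue masses are commonly tabulated average values, and only the 20 standard amino acids are handled (ambiguous codes such as Xaa, Asx, Glx and Xle, and selenocysteine, raise an error).

```python
# Average residue masses in daltons (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528  # added once for the N-terminal H and C-terminal OH


def molecular_mass_da(sequence: str) -> float:
    """Average molecular mass (Da) of an unmodified linear peptide or protein."""
    sequence = sequence.upper()
    unknown = set(sequence) - RESIDUE_MASS.keys()
    if unknown:
        raise ValueError(f"unsupported residue codes: {sorted(unknown)}")
    return sum(RESIDUE_MASS[aa] for aa in sequence) + WATER_MASS


if __name__ == "__main__":
    # Toy peptide for illustration only.
    print(f"{molecular_mass_da('GAVLC') / 1000:.3f} kDa")
```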
|
20
|
Abstract
Liquid chromatography (LC) prefractionation is often implemented to increase proteomic coverage; however, while effective, this approach is laborious, requires considerable sample amount, and can be cumbersome. We describe how interfacing a recently described high-field asymmetric waveform ion mobility spectrometry (FAIMS) device between a nanoelectrospray ionization (nanoESI) emitter and an Orbitrap hybrid mass spectrometer (MS) enables the collection of single-shot proteomic data with comparable depth to that of conventional two-dimensional LC approaches. This next generation FAIMS device incorporates improved ion sampling at the ESI-FAIMS interface, increased electric field strength, and a helium-free ion transport gas. With fast internal compensation voltage (CV) stepping (25 ms/transition), multiple unique gas-phase fractions may be analyzed simultaneously over the course of an MS analysis. We have comprehensively demonstrated how this device performs for bottom-up proteomics experiments as well as characterized the effects of peptide charge state, mass loading, analysis time, and additional variables. We also offer recommendations for the number of CVs and which CVs to use for different lengths of experiments. Internal CV stepping experiments increase protein identifications from a single-shot experiment to >8000, from over 100 000 peptide identifications in as little as 5 h. In single-shot 4 h label-free quantitation (LFQ) experiments of a human cell line, we quantified 7818 proteins with FAIMS using intra-analysis CV switching compared to 6809 without FAIMS. Single-shot FAIMS results also compare favorably with LC fractionation experiments. A 6 h single-shot FAIMS experiment generates 8007 protein identifications, while four fractions analyzed for 1.5 h each produce 7776 protein identifications.
|