1
|
The C-terminal disordered loop domain of Apc8 unlocks APC/C mitotic activation. Cell Rep 2024; 43:114262. [PMID: 38776225 DOI: 10.1016/j.celrep.2024.114262] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Revised: 04/16/2024] [Accepted: 05/07/2024] [Indexed: 05/24/2024] Open
Abstract
The anaphase-promoting complex/cyclosome (APC/C) is a critical and tightly regulated E3 ligase that orchestrates the cellular life cycle by controlling the degradation of cell cycle regulators. An intriguing feature of this complex is an autoinhibition mechanism: an intrinsically disordered loop domain, Apc1-300L, blocks Cdc20 coactivator binding, yet phosphorylation of Apc1-300L counteracts this autoinhibition. Many such disordered loops within APC/C remain unexplored. Our systematic analysis of loop-deficient APC/C mutants uncovered a pivotal role for Apc8's C-terminal loop (Apc8-L) in mitotic activation. Apc8-L directly recruits the CDK adaptor protein, Xe-p9/Cks2, positioning the Xe-p9-CDK-CycB complex near Apc1-300L. This stimulates the phosphorylation and removal of Apc1-300L, prompting the formation of active APC/CCdc20. Strikingly, without both Apc8-L and Apc3-L, the APC/C is rendered inactive during mitosis, highlighting Apc8-L's synergistic role with other loops and kinases. This study broadens our understanding of the intricate dynamics in APC/C regulation and provides insights on the regulation of macromolecular complexes.
Collapse
|
2
|
CDK1-PP2A-B55 interplay ensures cell cycle oscillation via Apc1-loop 300. Cell Rep 2024; 43:114155. [PMID: 38678563 DOI: 10.1016/j.celrep.2024.114155] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Revised: 03/12/2024] [Accepted: 04/10/2024] [Indexed: 05/01/2024] Open
Abstract
Cell cycle control relies on a delicate balance of phosphorylation with CDK1 and phosphatases like PP1 and PP2A-B55. Yet, identifying the primary substrate responsible for cell cycle oscillations remains a challenge. We uncover the pivotal role of phospho-regulation in the anaphase-promoting complex/cyclosome (APC/C), particularly through the Apc1-loop300 domain (Apc1-300L), orchestrated by CDK1 and PP2A-B55. Premature activation of PP2A-B55 during mitosis, induced by Greatwall kinase depletion, leads to Apc1-300L dephosphorylation, stalling APC/C activity and delaying Cyclin B degradation. This effect can be counteracted using the B55-specific inhibitor pEnsa or by removing Apc1-300L. We also show Cdc20's dynamic APC/C interaction across cell cycle stages, but dephosphorylation of Apc1-300L specifically inhibits further Cdc20 recruitment. Our study underscores APC/C's central role in cell cycle oscillation, identifying it as a primary substrate regulated by the CDK-PP2A partnership.
Collapse
|
3
|
PepPre: Promote Peptide Identification Using Accurate and Comprehensive Precursors. J Proteome Res 2024; 23:574-584. [PMID: 38157563 DOI: 10.1021/acs.jproteome.3c00293] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2024]
Abstract
Accurate and comprehensive peptide precursor ions are crucial to tandem mass-spectrometry-based peptide identification. An identification engine can derive great advantages from the search space reduction enabled by credible and detailed precursors. Furthermore, by considering multiple precursors per spectrum, both the number of identifications and the spectrum explainability can be substantially improved. Here, we introduce PepPre, which detects precursors by decomposing peaks into multiple isotope clusters using linear programming methods. The detected precursors are scored and ranked, and the high-scoring ones are used for subsequent peptide identification. PepPre is evaluated both on regular and cross-linked peptide data sets and compared with 11 methods. The experimental results show that PepPre achieves a remarkable increase of 203% in PSM and 68% in peptide identifications compared to instrument software for regular peptides and 99% in PSM and 27% in peptide pair identifications for cross-linked peptides, surpassing the performance of all other evaluated methods. In addition to the increased identification numbers, further credibility evaluations evidence the reliability of the identified results. Moreover, by widening the isolation window of data acquisition from 2 to 8 Th, with PepPre, an engine is able to identify at least 64% more PSMs, thereby demonstrating the potential advantages of wide-window data acquisition. PepPre is open-source and available at http://peppre.ctarn.io.
Collapse
|
4
|
Structural mechanism of extranucleosomal DNA readout by the INO80 complex. SCIENCE ADVANCES 2022; 8:eadd3189. [PMID: 36490333 PMCID: PMC9733932 DOI: 10.1126/sciadv.add3189] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]
Abstract
The nucleosomal landscape of chromatin depends on the concerted action of chromatin remodelers. The INO80 remodeler specifically places nucleosomes at the boundary of gene regulatory elements, which is proposed to be the result of an ATP-dependent nucleosome sliding activity that is regulated by extranucleosomal DNA features. Here, we use cryo-electron microscopy and functional assays to reveal how INO80 binds and is regulated by extranucleosomal DNA. Structures of the regulatory A-module bound to DNA clarify the mechanism of linker DNA binding. The A-module is connected to the motor unit via an HSA/post-HSA lever element to chemomechanically couple the motor and linker DNA sensing. Two notable sites of curved DNA recognition by coordinated action of the four actin/actin-related proteins and the motor suggest how sliding by INO80 can be regulated by extranucleosomal DNA features. Last, the structures clarify the recruitment of YY1/Ies4 subunits and reveal deep architectural similarities between the regulatory modules of INO80 and SWI/SNF complexes.
Collapse
|
5
|
Mechanistic basis for Sgo1-mediated centromere localization and function of the CPC. J Cell Biol 2022; 221:213318. [PMID: 35776132 PMCID: PMC9253516 DOI: 10.1083/jcb.202108156] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2021] [Revised: 04/08/2022] [Accepted: 06/13/2022] [Indexed: 11/24/2022] Open
Abstract
Centromere association of the chromosomal passenger complex (CPC; Borealin-Survivin-INCENP-Aurora B) and Sgo1 is crucial for chromosome biorientation, a process essential for error-free chromosome segregation. Phosphorylated histone H3 Thr3 (H3T3ph; directly recognized by Survivin) and histone H2A Thr120 (H2AT120ph; indirectly recognized via Sgo1), together with CPC’s intrinsic nucleosome-binding ability, facilitate CPC centromere recruitment. However, the molecular basis for CPC–Sgo1 binding and how their physical interaction influences CPC centromere localization are lacking. Here, using an integrative structure-function approach, we show that the “histone H3-like” Sgo1 N-terminal tail-Survivin BIR domain interaction acts as a hotspot essential for CPC–Sgo1 assembly, while downstream Sgo1 residues and Borealin contribute for high-affinity binding. Disrupting Sgo1–Survivin interaction abolished CPC–Sgo1 assembly and perturbed CPC centromere localization and function. Our findings reveal that Sgo1 and H3T3ph use the same surface on Survivin to bind CPC. Hence, it is likely that these interactions take place in a spatiotemporally restricted manner, providing a rationale for the Sgo1-mediated “kinetochore-proximal” CPC centromere pool.
Collapse
|
6
|
Leveraging Parameter Dependencies in High-Field Asymmetric Waveform Ion-Mobility Spectrometry and Size Exclusion Chromatography for Proteome-wide Cross-Linking Mass Spectrometry. Anal Chem 2022; 94:4627-4634. [PMID: 35276035 PMCID: PMC8943524 DOI: 10.1021/acs.analchem.1c04373] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2021] [Accepted: 12/21/2021] [Indexed: 11/29/2022]
Abstract
Ion-mobility spectrometry shows great promise to tackle analytically challenging research questions by adding another separation dimension to liquid chromatography-mass spectrometry. The understanding of how analyte properties influence ion mobility has increased through recent studies, but no clear rationale for the design of customized experimental settings has emerged. Here, we leverage machine learning to deepen our understanding of field asymmetric waveform ion-mobility spectrometry for the analysis of cross-linked peptides. Knowing that predominantly m/z and then the size and charge state of an analyte influence the separation, we found ideal compensation voltages correlating with the size exclusion chromatography fraction number. The effect of this relationship on the analytical depth can be substantial as exploiting it allowed us to almost double unique residue pair detections in a proteome-wide cross-linking experiment. Other applications involving liquid- and gas-phase separation may also benefit from considering such parameter dependencies.
Collapse
|
7
|
Abstract
Cross-linking combined with mass spectrometry (XL-MS) provides a wealth of information about the three-dimensional (3D) structure of proteins and their interactions. We introduce MaxLynx, a novel computational proteomics workflow for XL-MS integrated into the MaxQuant environment. It is applicable to noncleavable and MS-cleavable cross-linkers. For both, we have generalized the Andromeda peptide database search engine to efficiently identify cross-linked peptides. For noncleavable peptides, we implemented a novel dipeptide Andromeda score, which is the basis for a computationally efficient N-squared search engine. Additionally, partial scores summarize the evidence for the two constituents of the dipeptide individually. A posterior error probability (PEP) based on total and partial scores is used to control false discovery rates (FDRs). For MS-cleavable cross-linkers, a score of signature peaks is combined with the conventional Andromeda score on the cleavage products. The MaxQuant 3D peak detection was improved to ensure more accurate determination of the monoisotopic peak of isotope patterns for heavy molecules, which cross-linked peptides typically are. A wide selection of filtering parameters can replace the manual filtering of identifications, which is often necessary when using other pipelines. On benchmark data sets of synthetic peptides, MaxLynx outperforms all other tested software on data for both types of cross-linkers and on a proteome-wide data set of cross-linked Drosophila melanogaster cell lysate. The workflow also supports ion mobility-enhanced MS data. MaxLynx runs on Windows and Linux, contains an interactive viewer for displaying annotated cross-linked spectra, and is freely available at https://www.maxquant.org/.
Collapse
|
8
|
Progress in methodologies and quality-control strategies in protein cross-linking mass spectrometry. Proteomics 2021; 21:e2100145. [PMID: 34647422 DOI: 10.1002/pmic.202100145] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2021] [Accepted: 10/04/2021] [Indexed: 11/10/2022]
Abstract
Deciphering the interaction networks and structural dynamics of proteins is pivotal to better understanding their biological functions. Cross-linking mass spectrometry (XL-MS) is a powerful and increasingly popular technology that provides information about protein-protein interactions and their structural constraints for individual proteins and multiprotein complexes on a proteome-scale. In this review, we first assess the coverage and depth of the XL-MS technique by utilizing publicly available datasets. We then delve into the progress in XL-MS experimental and computational methodologies and examine different quality-control strategies reported in the literature. Finally, we discuss the progress in XL-MS applications along with the scope for future improvements.
Collapse
|
9
|
Evaluation of mass spectrometry MS/MS spectra for the presence of isopeptide crosslinked peptides. PLoS One 2021; 16:e0254450. [PMID: 34242352 PMCID: PMC8270460 DOI: 10.1371/journal.pone.0254450] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2021] [Accepted: 06/25/2021] [Indexed: 12/14/2022] Open
Abstract
Isopeptide crosslinked proteins can be the product of transglutaminase or of exposure to organophosphorus toxicants (OP). Transglutaminase links glutamine to lysine with loss of ammonia. OP toxicants induce a link between glutamic acid and lysine with loss of water. Our goal was to establish criteria to distinguish real from false isopeptide crosslinks reported by software searches of mass spectrometry data. We used fragmentation spectra of tryptic peptides from MAP-rich tubulin Sus scrofa as a test system for detection of naturally-occurring isopeptide crosslinks. Data were analyzed with Protein Prospector. Criteria for the assignments included the presence of at least 1 crosslink specific product ion, fragment ions from both peptides, Protein Prospector scores ≥20, and best fit of the MS/MS data to the crosslinked peptide as opposed to a linear peptide. Out of 301,364 spectra, 15 potential transglutaminase-type crosslinked peptide candidates were identified. Manual evaluation of these MS/MS spectra reduced the number to 1 valid crosslink between Q112 of NFH and K368 of Tau. Immunopurification with anti-isopeptide 81D1C2 confirmed that MAP-rich tubulin contained only one isopeptide. Support for this isopeptide bond was obtained by showing that transglutaminase was capable of incorporating dansyl-aminohexyl -QQIV into K368. A model of the KIETHK-QLEAHNR isopeptide was synthesized with the aid of transglutaminase. MS/MS spectra of the model validated our interpretation of the native isopeptide. An OP-induced isopeptide bond between K163 of tubulin alpha-1A and E158 of tubulin beta-4B was induced by treating MAP-rich tubulin with 100 μM chlorpyrifos oxon. This crosslink was supported by the criteria described above and by the presence of diethoxyphospho-lysine 163 in the tubulin alpha-1A peptide. The information obtained in this work is valuable for future studies that aim to understand why exposure to OP is associated with increased risk of neurodegenerative disease.
Collapse
|
10
|
Reliable identification of protein-protein interactions by crosslinking mass spectrometry. Nat Commun 2021; 12:3564. [PMID: 34117231 PMCID: PMC8196013 DOI: 10.1038/s41467-021-23666-z] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2020] [Accepted: 04/28/2021] [Indexed: 11/09/2022] Open
Abstract
Protein-protein interactions govern most cellular pathways and processes, and multiple technologies have emerged to systematically map them. Assessing the error of interaction networks has been a challenge. Crosslinking mass spectrometry is currently widening its scope from structural analyses of purified multi-protein complexes towards systems-wide analyses of protein-protein interactions (PPIs). Using a carefully controlled large-scale analysis of Escherichia coli cell lysate, we demonstrate that false-discovery rates (FDR) for PPIs identified by crosslinking mass spectrometry can be reliably estimated. We present an interaction network comprising 590 PPIs at 1% decoy-based PPI-FDR. The structural information included in this network localises the binding site of the hitherto uncharacterised protein YacL to near the DNA exit tunnel on the RNA polymerase.
Collapse
|
11
|
Retention time prediction using neural networks increases identifications in crosslinking mass spectrometry. Nat Commun 2021; 12:3237. [PMID: 34050149 PMCID: PMC8163845 DOI: 10.1038/s41467-021-23441-0] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2020] [Accepted: 04/26/2021] [Indexed: 12/13/2022] Open
Abstract
Crosslinking mass spectrometry has developed into a robust technique that is increasingly used to investigate the interactomes of organelles and cells. However, the incomplete and noisy information in the mass spectra of crosslinked peptides limits the numbers of protein-protein interactions that can be confidently identified. Here, we leverage chromatographic retention time information to aid the identification of crosslinked peptides from mass spectra. Our Siamese machine learning model xiRT achieves highly accurate retention time predictions of crosslinked peptides in a multi-dimensional separation of crosslinked E. coli lysate. Importantly, supplementing the search engine score with retention time features leads to a substantial increase in protein-protein interactions without affecting confidence. This approach is not limited to cell lysates and multi-dimensional separation but also improves considerably the analysis of crosslinked multiprotein complexes with a single chromatographic dimension. Retention times are a powerful complement to mass spectrometric information to increase the sensitivity of crosslinking mass spectrometry analyses.
Collapse
|
12
|
Abstract
Cross-linking mass spectrometry (XL-MS) has become a powerful technique that enables insights into protein structures and protein interactions. The development of cleavable cross-linkers has further promoted XL-MS through search space reduction, thereby allowing for proteome-wide studies. These new analysis possibilities foster the development of new cross-linkers, which not every search engine can deal with out of the box. In addition, some search engines for XL-MS data also struggle with the validation of identified cross-linked peptides, that is, false discovery rate (FDR) estimation, as FDR calculation is hampered by the fact that not only one but two peptides in a single spectrum have to be correct. We here present our new search engine, MS Annika, which can identify cross-linked peptides in MS2 spectra from a wide variety of cleavable cross-linkers. We show that MS Annika provides realistic estimates of FDRs without the need of arbitrary score cutoffs, being able to provide on average 44% more identifications at a similar or better true FDR than comparable tools. In addition, MS Annika can be used on proteome-wide studies due to fast, parallelized processing and provides a way to visualize the identified cross-links in protein 3D structures.
Collapse
|
13
|
Identification-Free Control over the Precursor Isotopic Mass Misassignment in Orbitrap-Based Proteomics. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2021; 32:218-224. [PMID: 33119294 DOI: 10.1021/jasms.0c00281] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Selection of a precursor ion from a peptide isotopic cluster to obtain a fragmentation mass spectrum is a crucial step in data-dependent proteome analysis. However, the monoisotopic mass assignment performed in this step is often an issue confronted by the data acquisition software of hybrid Orbitrap FTMS that is most widely used in proteomics. To address the problem, many data processing tools, such as raw data converters and search engines, have optional accounting for the precursor mass shift due to the isotopic error. These solutions require additional data preprocessing steps and lead to an increase in the search space, thus making the analysis longer and/or less reliable. In this work, we processed 100 Orbitrap-based LC-MS/MS runs from 10 publicly available data sets to examine the rate of precursor isotope misassignment. The effect from taking the isotope error into account during the search on the number of identified peptides varied in a wide range from 0 to 33%. Thus, it may be tempting to spend extra time before or during a search to account for the mass assignment issue. Alternatively, this effect can be predicted a priori using an identification-free metric, which can be a part of data quality control software. Based on the results obtained in this work, we propose such a metric be further added into the visual and intuitive quality control software, viQC, developed previously and available at https://github.com/lisavetasol/viQC. It takes about a minute to calculate and plot nine quality metrics, including the proposed one for typical proteome analysis.
Collapse
|
14
|
Purified Smc5/6 Complex Exhibits DNA Substrate Recognition and Compaction. Mol Cell 2020; 80:1039-1054.e6. [PMID: 33301732 PMCID: PMC7758880 DOI: 10.1016/j.molcel.2020.11.012] [Citation(s) in RCA: 36] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2019] [Revised: 10/12/2020] [Accepted: 11/04/2020] [Indexed: 02/03/2023]
Abstract
Eukaryotic SMC complexes, cohesin, condensin, and Smc5/6, use ATP hydrolysis to power a plethora of functions requiring organization and restructuring of eukaryotic chromosomes in interphase and during mitosis. The Smc5/6 mechanism of action and its activity on DNA are largely unknown. Here we purified the budding yeast Smc5/6 holocomplex and characterized its core biochemical and biophysical activities. Purified Smc5/6 exhibits DNA-dependent ATP hydrolysis and SUMO E3 ligase activity. We show that Smc5/6 binds DNA topologically with affinity for supercoiled and catenated DNA templates. Employing single-molecule assays to analyze the functional and dynamic characteristics of Smc5/6 bound to DNA, we show that Smc5/6 locks DNA plectonemes and can compact DNA in an ATP-dependent manner. These results demonstrate that the Smc5/6 complex recognizes DNA tertiary structures involving juxtaposed helices and might modulate DNA topology by plectoneme stabilization and local compaction.
Collapse
|
15
|
Abstract
![]()
Development
of new reagents for protein cross-linking is constantly
ongoing. The chemical formulas for the linker adducts formed by these
reagents are usually deduced from expert knowledge and then validated
by mass spectrometry. Clearly, it would be more rigorous to infer
the chemical compositions of the adducts directly from the data without
any prior assumptions on their chemistries. Unfortunately, the analysis
tools that are currently available to detect chemical modifications
on linear peptides are not applicable to the case of two cross-linked
peptides. Here, we show that an adaptation of the open search strategy
that works on linear peptides can be used to characterize cross-link
modifications in pairs of peptides. We benchmark our approach by correctly
inferring the linker masses of two well-known reagents, DSS and formaldehyde,
to accuracies of a few parts per million. We then investigate the
cross-linking chemistries of two poorly characterized reagents: EMCS
and glutaraldehyde. In the case of EMCS, we find that the expected
cross-linking chemistry is accompanied by a competing chemistry that
targets other amino acid types. In the case of glutaraldehyde, we
find that the chemical formula of the dominant linker is C5H4, which indicates a ringed aromatic structure. These
results demonstrate how, with very little effort, our approach can
yield nontrivial insights to better characterize new cross-linkers.
Collapse
|
16
|
Open Database Searching Enables the Identification and Comparison of Bacterial Glycoproteomes without Defining Glycan Compositions Prior to Searching. Mol Cell Proteomics 2020. [PMID: 32576591 DOI: 10.1101/2020.04.21.052845] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/15/2023] Open
Abstract
Mass spectrometry has become an indispensable tool for the characterization of glycosylation across biological systems. Our ability to generate rich fragmentation of glycopeptides has dramatically improved over the last decade yet our informatic approaches still lag behind. Although glycoproteomic informatics approaches using glycan databases have attracted considerable attention, database independent approaches have not. This has significantly limited high throughput studies of unusual or atypical glycosylation events such as those observed in bacteria. As such, computational approaches to examine bacterial glycosylation and identify chemically diverse glycans are desperately needed. Here we describe the use of wide-tolerance (up to 2000 Da) open searching as a means to rapidly examine bacterial glycoproteomes. We benchmarked this approach using N-linked glycopeptides of Campylobacter fetus subsp. fetus as well as O-linked glycopeptides of Acinetobacter baumannii and Burkholderia cenocepacia revealing glycopeptides modified with a range of glycans can be readily identified without defining the glycan masses before database searching. Using this approach, we demonstrate how wide tolerance searching can be used to compare glycan use across bacterial species by examining the glycoproteomes of eight Burkholderia species (B. pseudomallei; B. multivorans; B. dolosa; B. humptydooensis; B. ubonensis, B. anthina; B. diffusa; B. pseudomultivorans). Finally, we demonstrate how open searching enables the identification of low frequency glycoforms based on shared modified peptides sequences. Combined, these results show that open searching is a robust computational approach for the determination of glycan diversity within bacterial proteomes.
Collapse
|
17
|
Open Database Searching Enables the Identification and Comparison of Bacterial Glycoproteomes without Defining Glycan Compositions Prior to Searching. Mol Cell Proteomics 2020; 19:1561-1574. [PMID: 32576591 PMCID: PMC8143609 DOI: 10.1074/mcp.tir120.002100] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2020] [Revised: 05/27/2020] [Indexed: 12/23/2022] Open
Abstract
Mass spectrometry has become an indispensable tool for the characterization of glycosylation across biological systems. Our ability to generate rich fragmentation of glycopeptides has dramatically improved over the last decade yet our informatic approaches still lag behind. Although glycoproteomic informatics approaches using glycan databases have attracted considerable attention, database independent approaches have not. This has significantly limited high throughput studies of unusual or atypical glycosylation events such as those observed in bacteria. As such, computational approaches to examine bacterial glycosylation and identify chemically diverse glycans are desperately needed. Here we describe the use of wide-tolerance (up to 2000 Da) open searching as a means to rapidly examine bacterial glycoproteomes. We benchmarked this approach using N-linked glycopeptides of Campylobacter fetus subsp. fetus as well as O-linked glycopeptides of Acinetobacter baumannii and Burkholderia cenocepacia revealing glycopeptides modified with a range of glycans can be readily identified without defining the glycan masses before database searching. Using this approach, we demonstrate how wide tolerance searching can be used to compare glycan use across bacterial species by examining the glycoproteomes of eight Burkholderia species (B. pseudomallei; B. multivorans; B. dolosa; B. humptydooensis; B. ubonensis, B. anthina; B. diffusa; B. pseudomultivorans). Finally, we demonstrate how open searching enables the identification of low frequency glycoforms based on shared modified peptides sequences. Combined, these results show that open searching is a robust computational approach for the determination of glycan diversity within bacterial proteomes.
Collapse
|
18
|
Mass spectrometry reveals the chemistry of formaldehyde cross-linking in structured proteins. Nat Commun 2020; 11:3128. [PMID: 32561732 PMCID: PMC7305180 DOI: 10.1038/s41467-020-16935-w] [Citation(s) in RCA: 48] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2020] [Accepted: 06/02/2020] [Indexed: 12/17/2022] Open
Abstract
Whole-cell cross-linking coupled to mass spectrometry is one of the few tools that can probe protein–protein interactions in intact cells. A very attractive reagent for this purpose is formaldehyde, a small molecule which is known to rapidly penetrate into all cellular compartments and to preserve the protein structure. In light of these benefits, it is surprising that identification of formaldehyde cross-links by mass spectrometry has so far been unsuccessful. Here we report mass spectrometry data that reveal formaldehyde cross-links to be the dimerization product of two formaldehyde-induced amino acid modifications. By integrating the revised mechanism into a customized search algorithm, we identify hundreds of cross-links from in situ formaldehyde fixation of human cells. Interestingly, many of the cross-links could not be mapped onto known atomic structures, and thus provide new structural insights. These findings enhance the use of formaldehyde cross-linking and mass spectrometry for structural studies. Formaldehyde (FA) is a popular cross-linking reagent, but applying it for cross-linking mass spectrometry (XLMS) has been largely unsuccessful. Here, the authors show that cross-links in structured proteins are the product of two FA molecules and identify hundreds of FA cross-links by XLMS in vitro and in situ.
Collapse
|
19
|
MaXLinker: Proteome-wide Cross-link Identifications with High Specificity and Sensitivity. Mol Cell Proteomics 2020; 19:554-568. [PMID: 31839598 PMCID: PMC7050104 DOI: 10.1074/mcp.tir119.001847] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2019] [Indexed: 11/06/2022] Open
Abstract
Protein-protein interactions play a vital role in nearly all cellular functions. Hence, understanding their interaction patterns and three-dimensional structural conformations can provide crucial insights about various biological processes and underlying molecular mechanisms for many disease phenotypes. Cross-linking mass spectrometry (XL-MS) has the unique capability to detect protein-protein interactions at a large scale along with spatial constraints between interaction partners. The inception of MS-cleavable cross-linkers enabled the MS2-MS3 XL-MS acquisition strategy that provides cross-link information from both MS2 and MS3 level. However, the current cross-link search algorithm available for MS2-MS3 strategy follows a "MS2-centric" approach and suffers from a high rate of mis-identified cross-links. We demonstrate the problem using two new quality assessment metrics ["fraction of mis-identifications" (FMI) and "fraction of interprotein cross-links from known interactions" (FKI)]. We then address this problem, by designing a novel "MS3-centric" approach for cross-link identification and implementing it as a search engine named MaXLinker. MaXLinker outperforms the currently popular search engine with a lower mis-identification rate, and higher sensitivity and specificity. Moreover, we performed human proteome-wide cross-linking mass spectrometry using K562 cells. Employing MaXLinker, we identified a comprehensive set of 9319 unique cross-links at 1% false discovery rate, comprising 8051 intraprotein and 1268 interprotein cross-links. Finally, we experimentally validated the quality of a large number of novel interactions identified in our study, providing a conclusive evidence for MaXLinker's robust performance.
Collapse
|
20
|
FANCD2-FANCI is a clamp stabilized on DNA by monoubiquitination of FANCD2 during DNA repair. Nat Struct Mol Biol 2020; 27:240-248. [PMID: 32066963 PMCID: PMC7067600 DOI: 10.1038/s41594-020-0380-1] [Citation(s) in RCA: 70] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2019] [Accepted: 01/14/2020] [Indexed: 01/18/2023]
Abstract
Vertebrate DNA crosslink repair excises toxic replication-blocking DNA crosslinks. Numerous factors involved in crosslink repair have been identified, and mutations in their corresponding genes cause Fanconi anemia (FA). A key step in crosslink repair is monoubiquitination of the FANCD2-FANCI heterodimer, which then recruits nucleases to remove the DNA lesion. Here, we use cryo-EM to determine the structures of recombinant chicken FANCD2 and FANCI complexes. FANCD2-FANCI adopts a closed conformation when the FANCD2 subunit is monoubiquitinated, creating a channel that encloses double-stranded DNA (dsDNA). Ubiquitin is positioned at the interface of FANCD2 and FANCI, where it acts as a covalent molecular pin to trap the complex on DNA. In contrast, isolated FANCD2 is a homodimer that is unable to bind DNA, suggestive of an autoinhibitory mechanism that prevents premature activation. Together, our work suggests that FANCD2-FANCI is a clamp that is locked onto DNA by ubiquitin, with distinct interfaces that may recruit other DNA repair factors.
Collapse
|
21
|
Abstract
The field of structural biology is increasingly focusing on studying proteins in situ, i.e., in their greater biological context. Cross-linking mass spectrometry (CLMS) is contributing to this effort, typically through the use of mass spectrometry (MS)-cleavable cross-linkers. Here, we apply the popular noncleavable cross-linker disuccinimidyl suberate (DSS) to human mitochondria and identify 5518 distance restraints between protein residues. Each distance restraint on proteins or their interactions provides structural information within mitochondria. Comparing these restraints to protein data bank (PDB)-deposited structures and comparative models reveals novel protein conformations. Our data suggest, among others, substrates and protein flexibility of mitochondrial heat shock proteins. Through this study, we bring forward two central points for the progression of CLMS towards large-scale in situ structural biology: First, clustered conflicts of cross-link data reveal in situ protein conformation states in contrast to error-rich individual conflicts. Second, noncleavable cross-linkers are compatible with proteome-wide studies.
Collapse
|
22
|
Borealin-nucleosome interaction secures chromosome association of the chromosomal passenger complex. J Cell Biol 2019; 218:3912-3925. [PMID: 31570499 PMCID: PMC6891087 DOI: 10.1083/jcb.201905040] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2019] [Revised: 08/30/2019] [Accepted: 09/16/2019] [Indexed: 11/23/2022] Open
Abstract
Chromosome association of the chromosomal passenger complex (CPC; consisting of Borealin, Survivin, INCENP, and the Aurora B kinase) is essential to achieve error-free chromosome segregation during cell division. Hence, understanding the mechanisms driving the chromosome association of the CPC is of paramount importance. Here using a multifaceted approach, we show that the CPC binds nucleosomes through a multivalent interaction predominantly involving Borealin. Strikingly, Survivin, previously suggested to target the CPC to centromeres, failed to bind nucleosomes on its own and requires Borealin and INCENP for its binding. Disrupting Borealin-nucleosome interactions excluded the CPC from chromosomes and caused chromosome congression defects. We also show that Borealin-mediated chromosome association of the CPC is critical for Haspin- and Bub1-mediated centromere enrichment of the CPC and works upstream of the latter. Our work thus establishes Borealin as a master regulator determining the chromosome association and function of the CPC.
Collapse
|
23
|
Assessment of chemical-crosslink-assisted protein structure modeling in CASP13. Proteins 2019; 87:1283-1297. [PMID: 31569265 PMCID: PMC6851497 DOI: 10.1002/prot.25816] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2019] [Revised: 08/08/2019] [Accepted: 09/13/2019] [Indexed: 12/22/2022]
Abstract
With the advance of experimental procedures obtaining chemical crosslinking information is becoming a fast and routine practice. Information on crosslinks can greatly enhance the accuracy of protein structure modeling. Here, we review the current state of the art in modeling protein structures with the assistance of experimentally determined chemical crosslinks within the framework of the 13th meeting of Critical Assessment of Structure Prediction approaches. This largest-to-date blind assessment reveals benefits of using data assistance in difficult to model protein structure prediction cases. However, in a broader context, it also suggests that with the unprecedented advance in accuracy to predict contacts in recent years, experimental crosslinks will be useful only if their specificity and accuracy further improved and they are better integrated into computational workflows.
Collapse
|
24
|
Abstract
The Fanconi anaemia (FA) pathway repairs DNA damage caused by endogenous and chemotherapy-induced DNA crosslinks, and responds to replication stress1,2. Genetic inactivation of this pathway by mutation of genes encoding FA complementation group (FANC) proteins impairs development, prevents blood production and promotes cancer1,3. The key molecular step in the FA pathway is the monoubiquitination of a pseudosymmetric heterodimer of FANCD2-FANCI4,5 by the FA core complex-a megadalton multiprotein E3 ubiquitin ligase6,7. Monoubiquitinated FANCD2 then recruits additional protein factors to remove the DNA crosslink or to stabilize the stalled replication fork. A molecular structure of the FA core complex would explain how it acts to maintain genome stability. Here we reconstituted an active, recombinant FA core complex, and used cryo-electron microscopy and mass spectrometry to determine its structure. The FA core complex comprises two central dimers of the FANCB and FA-associated protein of 100 kDa (FAAP100) subunits, flanked by two copies of the RING finger subunit, FANCL. These two heterotrimers act as a scaffold to assemble the remaining five subunits, resulting in an extended asymmetric structure. Destabilization of the scaffold would disrupt the entire complex, resulting in a non-functional FA pathway. Thus, the structure provides a mechanistic basis for the low numbers of patients with mutations in FANCB, FANCL and FAAP100. Despite a lack of sequence homology, FANCB and FAAP100 adopt similar structures. The two FANCL subunits are in different conformations at opposite ends of the complex, suggesting that each FANCL has a distinct role. This structural and functional asymmetry of dimeric RING finger domains may be a general feature of E3 ligases. The cryo-electron microscopy structure of the FA core complex provides a foundation for a detailed understanding of its E3 ubiquitin ligase activity and DNA interstrand crosslink repair.
Collapse
|
25
|
Lamin A molecular compression and sliding as mechanisms behind nucleoskeleton elasticity. Nat Commun 2019; 10:3056. [PMID: 31296869 PMCID: PMC6624373 DOI: 10.1038/s41467-019-11063-6] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2018] [Accepted: 06/20/2019] [Indexed: 11/16/2022] Open
Abstract
Lamin A is a nuclear intermediate filament protein critical for nuclear architecture and mechanics and mutated in a wide range of human diseases. Yet little is known about the molecular architecture of lamins and mechanisms of their assembly. Here we use SILAC cross-linking mass spectrometry to determine interactions within lamin dimers and between dimers in higher-order polymers. We find evidence for a compression mechanism where coiled coils in the lamin A rod can slide onto each other to contract rod length, likely driven by a wide range of electrostatic interactions with the flexible linkers between coiled coils. Similar interactions occur with unstructured regions flanking the rod domain during oligomeric assembly. Mutations linked to human disease block these interactions, suggesting that this spring-like contraction can explain in part the dynamic mechanical stretch and flexibility properties of the lamin polymer and other intermediate filament networks.
Collapse
|
26
|
A Simple Cross-Linking/Mass Spectrometry Workflow for Studying System-wide Protein Interactions. Anal Chem 2019; 91:10236-10244. [DOI: 10.1021/acs.analchem.9b02372] [Citation(s) in RCA: 86] [Impact Index Per Article: 17.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
|
27
|
Abstract
Structural maintenance of chromosomes (SMC)-kleisin complexes organize chromosomal DNAs in all domains of life, with key roles in chromosome segregation, DNA repair and regulation of gene expression. They function through the entrapment and active translocation of DNA, but the underlying conformational changes are largely unclear. Using structural biology, mass spectrometry and cross-linking, we investigated the architecture of two evolutionarily distant SMC-kleisin complexes: MukBEF from Escherichia coli, and cohesin from Saccharomyces cerevisiae. We show that both contain a dynamic coiled-coil discontinuity, the elbow, near the middle of their arms that permits a folded conformation. Bending at the elbow brings into proximity the hinge dimerization domain and the head-kleisin module, situated at opposite ends of the arms. Our findings favour SMC activity models that include a large conformational change in the arms, such as a relative movement between DNA contact sites during DNA loading and translocation.
Collapse
|
28
|
Protein Dynamics in Solution by Quantitative Crosslinking/Mass Spectrometry. Trends Biochem Sci 2018; 43:908-920. [PMID: 30318267 PMCID: PMC6240160 DOI: 10.1016/j.tibs.2018.09.003] [Citation(s) in RCA: 37] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2018] [Revised: 08/20/2018] [Accepted: 09/12/2018] [Indexed: 01/09/2023]
Abstract
The dynamics of protein structures and their interactions are responsible for many cellular processes. The rearrangements and interactions of proteins, which are often transient, occur in solution and may require a biological environment that is difficult to maintain in traditional structural biological approaches. Quantitative crosslinking/mass spectrometry (QCLMS) has emerged as an excellent method to fill this gap. Numerous recent applications of the technique have demonstrated that protein dynamics can now be studied in solution at sufficient resolution to gain valuable biological insights, suggesting that extending these investigations to native environments is possible. These breakthroughs have been based on the maturation of CLMS at large, and its recent fusion with quantitative proteomics. We provide here an overview of the current state of the technique, the available workflows and their applications, and remaining challenges. In-solution dynamics of protein structures and their interactions can be studied by QCLMS. Successful applications of QCLMS provide insights into multiple different biological processes. Recent advances in QCLMS allow analyses in the context of native cellular environments, including living cells. Alternative workflows allow researchers to tailor the analysis to their biological question. Progress in data processing now offers this technique to researchers with limited initial expertise in crosslinking and quantitative proteomics.
Collapse
|