Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

Total Articles

212
(from Reference Citation Analysis)

Article PDFs (59)

Cited by > 0 (200)

Searched Name

Alexandre M. J. J. Bonvin

Ranked By

Results Analysis

Year Published Analysis
Article Type Analysis
Publication Title Analysis
Category Analysis

Results Analysis

Indexed Articles

Year Published

Show more Refine

Article Statistics

Refine

Publication Titles

Show more Refine

Grant Agencies

Show more Refine

Category

Show more Refine

Number	Citation Analysis
1	ARCTIC-3D: automatic retrieval and clustering of interfaces in complexes from 3D structural information. Commun Biol 2024;7:49. [PMID: 38184711 PMCID: PMC10771469 DOI: 10.1038/s42003-023-05718-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2023] [Accepted: 12/18/2023] [Indexed: 01/08/2024] Open Abstract The formation of a stable complex between proteins lies at the core of a wide variety of biological processes and has been the focus of countless experiments. The huge amount of information contained in the protein structural interactome in the Protein Data Bank can now be used to characterise and classify the existing biological interfaces. We here introduce ARCTIC-3D, a fast and user-friendly data mining and clustering software to retrieve data and rationalise the interface information associated with the protein input data. We demonstrate its use by various examples ranging from showing the increased interaction complexity of eukaryotic proteins, 20% of which on average have more than 3 different interfaces compared to only 10% for prokaryotes, to associating different functions to different interfaces. In the context of modelling biomolecular assemblies, we introduce the concept of "recognition entropy", related to the number of possible interfaces of the components of a protein-protein complex, which we demonstrate to correlate with the modelling difficulty in classical docking approaches. The identified interface clusters can also be used to generate various combinations of interface-specific restraints for integrative modelling. The ARCTIC-3D software is freely available at github.com/haddocking/arctic3d and can be accessed as a web-service at wenmr.science.uu.nl/arctic3d. Collapse Key Words molecular modelling protein analysis Collapse MESH Headings Cluster Analysis Data Mining Databases, Protein Entropy Prokaryotic Cells Collapse Grants 823830 EC \| Horizon 2020 Framework Programme (EU Framework Programme for Research and Innovation H2020) 101017567 EC \| Horizon 2020 Framework Programme (EU Framework Programme for Research and Innovation H2020) 027.020.G13 Netherlands eScience Center (eScience Center) Collapse
2	DeepRank-GNN-esm: a graph neural network for scoring protein-protein models using protein language model. BIOINFORMATICS ADVANCES 2024;4:vbad191. [PMID: 38213822 PMCID: PMC10782804 DOI: 10.1093/bioadv/vbad191] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/04/2023] [Revised: 12/19/2023] [Indexed: 01/13/2024] Abstract Motivation Protein-Protein interactions (PPIs) play critical roles in numerous cellular processes. By modelling the 3D structures of the correspond protein complexes valuable insights can be obtained, providing, e.g. starting points for drug and protein design. One challenge in the modelling process is however the identification of near-native models from the large pool of generated models. To this end we have previously developed DeepRank-GNN, a graph neural network that integrates structural and sequence information to enable effective pattern learning at PPI interfaces. Its main features are related to the Position Specific Scoring Matrices (PSSMs), which are computationally expensive to generate, significantly limits the algorithm's usability. Results We introduce here DeepRank-GNN-esm that includes as additional features protein language model embeddings from the ESM-2 model. We show that the ESM-2 embeddings can actually replace the PSSM features at no cost in-, or even better performance on two PPI-related tasks: scoring docking poses and detecting crystal artifacts. This new DeepRank version bypasses thus the need of generating PSSM, greatly improving the usability of the software and opening new application opportunities for systems for which PSSM profiles cannot be obtained or are irrelevant (e.g. antibody-antigen complexes). Availability and implementation DeepRank-GNN-esm is freely available from https://github.com/DeepRank/DeepRank-GNN-esm. Collapse Key Words Collapse MESH Headings Collapse Grants European Union Horizon 2020 BioExcel China Scholarship Council Collapse
3	Impact of AlphaFold on structure prediction of protein complexes: The CASP15-CAPRI experiment. Proteins 2023;91:1658-1683. [PMID: 37905971 PMCID: PMC10841881 DOI: 10.1002/prot.26609] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Revised: 09/22/2023] [Accepted: 09/28/2023] [Indexed: 11/02/2023] Abstract We present the results for CAPRI Round 54, the 5th joint CASP-CAPRI protein assembly prediction challenge. The Round offered 37 targets, including 14 homodimers, 3 homo-trimers, 13 heterodimers including 3 antibody-antigen complexes, and 7 large assemblies. On average ~70 CASP and CAPRI predictor groups, including more than 20 automatics servers, submitted models for each target. A total of 21 941 models submitted by these groups and by 15 CAPRI scorer groups were evaluated using the CAPRI model quality measures and the DockQ score consolidating these measures. The prediction performance was quantified by a weighted score based on the number of models of acceptable quality or higher submitted by each group among their five best models. Results show substantial progress achieved across a significant fraction of the 60+ participating groups. High-quality models were produced for about 40% of the targets compared to 8% two years earlier. This remarkable improvement is due to the wide use of the AlphaFold2 and AlphaFold2-Multimer software and the confidence metrics they provide. Notably, expanded sampling of candidate solutions by manipulating these deep learning inference engines, enriching multiple sequence alignments, or integration of advanced modeling tools, enabled top performing groups to exceed the performance of a standard AlphaFold2-Multimer version used as a yard stick. This notwithstanding, performance remained poor for complexes with antibodies and nanobodies, where evolutionary relationships between the binding partners are lacking, and for complexes featuring conformational flexibility, clearly indicating that the prediction of protein complexes remains a challenging problem. Collapse Key Words AlphaFold CAPRI CASP blind prediction deep learning protein assemblies protein complexes protein-protein interaction Collapse MESH Headings Protein Interaction Mapping/methods Protein Conformation Protein Binding Molecular Docking Simulation Algorithms Computational Biology/methods Software Collapse Grants R35 GM136409 NIGMS NIH HHS R01 GM140098 NIGMS NIH HHS FC001003 Arthritis Research UK R35 GM141881 NIGMS NIH HHS R35 GM118078 NIGMS NIH HHS R35 GM124952 NIGMS NIH HHS FC0001003 Cancer Research UK Wellcome Trust FC001003 Wellcome Trust R35 GM144083 NIGMS NIH HHS R01 GM146340 NIGMS NIH HHS R01 GM123055 NIGMS NIH HHS R43 GM144992 NIGMS NIH HHS T32 GM132024 NIGMS NIH HHS R01 GM133840 NIGMS NIH HHS R01 GM093123 NIGMS NIH HHS Francis Crick Institute Cancer Research UK UK Medical Research Council Wellcome Trust Netherlands e‐Science Center US National Institutes of Health National Institute of Health National Natural Science Foundation of China European Molecular Biology Organization (EMBO) Marie Skłodowska‐Curie National Science Foundation National Institute of General Medical Sciences NIH/NIGMS National Science Center of Poland (Narodowe Centrum Nauki) (NCN) Research Council of Lithuania Knut and Alice Wallenberg Foundation (KAW) Swedish Research Council Fundamental Research Funds for the Central Universities of China Collapse
4	Improving the quality of co-evolution intermolecular contact prediction with DisVis. Proteins 2023;91:1407-1416. [PMID: 37237441 DOI: 10.1002/prot.26514] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Revised: 03/29/2023] [Accepted: 04/19/2023] [Indexed: 05/28/2023] Abstract The steep rise in protein sequences and structures has paved the way for bioinformatics approaches to predict residue-residue interactions in protein complexes. Multiple sequence alignments are commonly used in contact predictions to identify co-evolving residues. These contacts, however, often include false positives (FPs), which may impair their use to predict three dimensional structures of biomolecular complexes and affect the accuracy of the generated models. Previously, we have developed DisVis to identify FP in mass spectrometry cross-linking data. DisVis allows to assess the accessible interaction space between two proteins consistent with a set of distance restraints. Here, we investigate if a similar approach could be applied to co-evolution predicted contacts in order to improve their precision prior to using them for modeling. We analyze co-evolution contact predictions with DisVis for a set of 26 protein-protein complexes. The DisVis-reranked and the original co-evolution contacts are then used to model the complexes with our integrative docking software HADDOCK using different filtering scenarios. Our results show that HADDOCK is robust with respect to the precision of the predicted contacts due to the 50% random contact removal during docking and can enhance the quality of docking predictions when combined with DisVis filtering for low precision contact data. DisVis can thus have a beneficial effect on low quality data, but overall HADDOCK can accommodate FP restraints without negatively impacting the quality of the resulting models. Other more precision-sensitive docking protocols might, however, benefit from the increased precision of the predicted contacts after DisVis filtering. Collapse Key Words HADDOCK docking false positive filtering intermolecular residue-residue contacts protein-protein complexes Collapse MESH Headings Software Computational Biology/methods Collapse Grants Collapse
5	An antibiotic from an uncultured bacterium binds to an immutable target. Cell 2023;186:4059-4073.e27. [PMID: 37611581 DOI: 10.1016/j.cell.2023.07.038] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2022] [Revised: 06/01/2023] [Accepted: 07/28/2023] [Indexed: 08/25/2023] Abstract Antimicrobial resistance is a leading mortality factor worldwide. Here, we report the discovery of clovibactin, an antibiotic isolated from uncultured soil bacteria. Clovibactin efficiently kills drug-resistant Gram-positive bacterial pathogens without detectable resistance. Using biochemical assays, solid-state nuclear magnetic resonance, and atomic force microscopy, we dissect its mode of action. Clovibactin blocks cell wall synthesis by targeting pyrophosphate of multiple essential peptidoglycan precursors (C55PP, lipid II, and lipid IIIWTA). Clovibactin uses an unusual hydrophobic interface to tightly wrap around pyrophosphate but bypasses the variable structural elements of precursors, accounting for the lack of resistance. Selective and efficient target binding is achieved by the sequestration of precursors into supramolecular fibrils that only form on bacterial membranes that contain lipid-anchored pyrophosphate groups. This potent antibiotic holds the promise of enabling the design of improved therapeutics that kill bacterial pathogens without resistance development. Collapse Key Words animal models antibiotic resistance antibiotics atomic force microscopy autolysis cell wall infection lipid II mechanism of action peptidoglycan solid-state NMR uncultured bacteria Collapse MESH Headings Anti-Bacterial Agents/isolation & purification Anti-Bacterial Agents/pharmacology Bacteria Biological Assay Diphosphates Soil Microbiology Collapse Grants P01 AI118687 NIAID NIH HHS R01 AI170962 NIAID NIH HHS Collapse
6	MetaScore: A Novel Machine-Learning-Based Approach to Improve Traditional Scoring Functions for Scoring Protein-Protein Docking Conformations. Biomolecules 2023;13:121. [PMID: 36671507 PMCID: PMC9855734 DOI: 10.3390/biom13010121] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2022] [Revised: 12/22/2022] [Accepted: 12/26/2022] [Indexed: 01/11/2023] Open Abstract Protein-protein interactions play a ubiquitous role in biological function. Knowledge of the three-dimensional (3D) structures of the complexes they form is essential for understanding the structural basis of those interactions and how they orchestrate key cellular processes. Computational docking has become an indispensable alternative to the expensive and time-consuming experimental approaches for determining the 3D structures of protein complexes. Despite recent progress, identifying near-native models from a large set of conformations sampled by docking-the so-called scoring problem-still has considerable room for improvement. We present MetaScore, a new machine-learning-based approach to improve the scoring of docked conformations. MetaScore utilizes a random forest (RF) classifier trained to distinguish near-native from non-native conformations using their protein-protein interfacial features. The features include physicochemical properties, energy terms, interaction-propensity-based features, geometric properties, interface topology features, evolutionary conservation, and also scores produced by traditional scoring functions (SFs). MetaScore scores docked conformations by simply averaging the score produced by the RF classifier with that produced by any traditional SF. We demonstrate that (i) MetaScore consistently outperforms each of the nine traditional SFs included in this work in terms of success rate and hit rate evaluated over conformations ranked among the top 10; (ii) an ensemble method, MetaScore-Ensemble, that combines 10 variants of MetaScore obtained by combining the RF score with each of the traditional SFs outperforms each of the MetaScore variants. We conclude that the performance of traditional SFs can be improved upon by using machine learning to judiciously leverage protein-protein interfacial features and by using ensemble methods to combine multiple scoring functions. Collapse Key Words machine learning method combination protein–protein docking scoring functions Collapse MESH Headings Proteins/chemistry Protein Binding Ligands Protein Conformation Machine Learning Collapse Grants 027016G04 Netherlands eScience Center 1640834 National Science Foundation 1636795 National Science Foundation UL1 TR000127 NCATS NIH HHS 675728 European H2020 e-Infrastructure grant BioExcel 722.014.005 Veni 823830 European H2020 e-Infrastructure grant BioExcel UL1 TR002014 NCATS NIH HHS 1518732 National Science Foundation Collapse
7	Information-Driven Antibody-Antigen Modelling with HADDOCK. Methods Mol Biol 2023;2552:267-282. [PMID: 36346597 DOI: 10.1007/978-1-0716-2609-2_14] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/16/2023] Abstract In the recent years, therapeutic use of antibodies has seen a huge growth, "due to their inherent proprieties and technological advances in the methods used to study and characterize them. Effective design and engineering of antibodies for therapeutic purposes are heavily dependent on knowledge of the structural principles that regulate antibody-antigen interactions. Several experimental techniques such as X-ray crystallography, cryo-electron microscopy, NMR, or mutagenesis analysis can be applied, but these are usually expensive and time-consuming. Therefore computational approaches like molecular docking may offer a valuable alternative for the characterization of antibody-antigen complexes.Here we describe a protocol for the prediction of the 3D structure of antibody-antigen complexes using the integrative modelling platform HADDOCK. The protocol consists of (1) the identification of the antibody residues belonging to the hypervariable loops which are known to be crucial for the binding and can be used to guide the docking and (2) the detailed steps to perform docking with the HADDOCK 2.4 webserver following different strategies depending on the availability of information about epitope residues. Collapse Key Words Antibody HADDOCK Information-driven docking Collapse MESH Headings Molecular Docking Simulation Cryoelectron Microscopy Crystallography, X-Ray Magnetic Resonance Spectroscopy Antigen-Antibody Complex/chemistry Protein Binding Collapse Grants Collapse
8	DeepRank-GNN: a graph neural network framework to learn patterns in protein-protein interfaces. Bioinformatics 2022;39:6845451. [PMID: 36420989 PMCID: PMC9805592 DOI: 10.1093/bioinformatics/btac759] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2021] [Revised: 10/19/2022] [Accepted: 11/23/2022] [Indexed: 11/25/2022] Open Abstract MOTIVATION Gaining structural insights into the protein-protein interactome is essential to understand biological phenomena and extract knowledge for rational drug design or protein engineering. We have previously developed DeepRank, a deep-learning framework to facilitate pattern learning from protein-protein interfaces using convolutional neural network (CNN) approaches. However, CNN is not rotation invariant and data augmentation is required to desensitize the network to the input data orientation which dramatically impairs the computation performance. Representing protein-protein complexes as atomic- or residue-scale rotation invariant graphs instead enables using graph neural networks (GNN) approaches, bypassing those limitations. RESULTS We have developed DeepRank-GNN, a framework that converts protein-protein interfaces from PDB 3D coordinates files into graphs that are further provided to a pre-defined or user-defined GNN architecture to learn problem-specific interaction patterns. DeepRank-GNN is designed to be highly modularizable, easily customized and is wrapped into a user-friendly python3 package. Here, we showcase DeepRank-GNN's performance on two applications using a dedicated graph interaction neural network: (i) the scoring of docking poses and (ii) the discriminating of biological and crystal interfaces. In addition to the highly competitive performance obtained in those tasks as compared to state-of-the-art methods, we show a significant improvement in speed and storage requirement using DeepRank-GNN as compared to DeepRank. AVAILABILITY AND IMPLEMENTATION DeepRank-GNN is freely available from https://github.com/DeepRank/DeepRank-GNN. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
9	Pathogen-sugar interactions revealed by universal saturation transfer analysis. Science 2022;377:eabm3125. [PMID: 35737812 DOI: 10.1126/science.abm3125] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Abstract Many pathogens exploit host cell-surface glycans. However, precise analyses of glycan ligands binding with heavily modified pathogen proteins can be confounded by overlapping sugar signals and/or compounded with known experimental constraints. Universal saturation transfer analysis (uSTA) builds on existing nuclear magnetic resonance spectroscopy to provide an automated workflow for quantitating protein-ligand interactions. uSTA reveals that early-pandemic, B-origin-lineage severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) spike trimer binds sialoside sugars in an "end-on" manner. uSTA-guided modeling and a high-resolution cryo-electron microscopy structure implicate the spike N-terminal domain (NTD) and confirm end-on binding. This finding rationalizes the effect of NTD mutations that abolish sugar binding in SARS-CoV-2 variants of concern. Together with genetic variance analyses in early pandemic patient cohorts, this binding implicates a sialylated polylactosamine motif found on tetraantennary N-linked glycoproteins deep in the human lung as potentially relevant to virulence and/or zoonosis. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
10	Cyclization and Docking Protocol for Cyclic Peptide-Protein Modeling Using HADDOCK2.4. J Chem Theory Comput 2022;18:4027-4040. [PMID: 35652781 PMCID: PMC9202357 DOI: 10.1021/acs.jctc.2c00075] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Abstract An emerging class of therapeutic molecules are cyclic peptides with over 40 cyclic peptide drugs currently in clinical use. Their mode of action is, however, not fully understood, impeding rational drug design. Computational techniques could positively impact their design, but modeling them and their interactions remains challenging due to their cyclic nature and their flexibility. This study presents a step-by-step protocol for generating cyclic peptide conformations and docking them to their protein target using HADDOCK2.4. A dataset of 30 cyclic peptide-protein complexes was used to optimize both cyclization and docking protocols. It supports peptides cyclized via an N- and C-terminus peptide bond and/or a disulfide bond. An ensemble of cyclic peptide conformations is then used in HADDOCK to dock them onto their target protein using knowledge of the binding site on the protein side to drive the modeling. The presented protocol predicts at least one acceptable model according to the critical assessment of prediction of interaction criteria for each complex of the dataset when the top 10 HADDOCK-ranked single structures are considered (100% success rate top 10) both in the bound and unbound docking scenarios. Moreover, its performance in both bound and fully unbound docking is similar to the state-of-the-art software in the field, Autodock CrankPep. The presented cyclization and docking protocol should make HADDOCK a valuable tool for rational cyclic peptide-based drug design and high-throughput screening. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
11	Interface refinement of low- to medium-resolution Cryo-EM complexes using HADDOCK2.4. Structure 2022;30:476-484.e3. [PMID: 35216656 DOI: 10.1016/j.str.2022.02.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2021] [Revised: 11/25/2021] [Accepted: 01/28/2022] [Indexed: 10/19/2022] Abstract A wide range of cellular processes requires the formation of multimeric protein complexes. The rise of cryo-electron microscopy (cryo-EM) has enabled the structural characterization of these protein assemblies. The density maps produced can, however, still suffer from limited resolution, impeding the process of resolving structures at atomic resolution. In order to solve this issue, monomers can be fitted into low- to medium-resolution maps. Unfortunately, the models produced frequently contain atomic clashes at the protein-protein interfaces (PPIs), as intermolecular interactions are typically not considered during monomer fitting. Here, we present a refinement approach based on HADDOCK2.4 to remove intermolecular clashes and optimize PPIs. A dataset of 14 cryo-EM complexes was used to test eight protocols. The best-performing protocol, consisting of a semi-flexible simulated annealing refinement with centroid restraints on the monomers, was able to decrease intermolecular atomic clashes by 98% without significantly deteriorating the quality of the cryo-EM density fit. Collapse Key Words HADDOCK atomic clashes electron density map interface quality protein-protein interfaces refinement Collapse MESH Headings Collapse Grants Collapse
12	DeepRank: a deep learning framework for data mining 3D protein-protein interfaces. Nat Commun 2021;12:7068. [PMID: 34862392 PMCID: PMC8642403 DOI: 10.1038/s41467-021-27396-0] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2021] [Accepted: 11/12/2021] [Indexed: 11/08/2022] Open Abstract Three-dimensional (3D) structures of protein complexes provide fundamental information to decipher biological processes at the molecular scale. The vast amount of experimentally and computationally resolved protein-protein interfaces (PPIs) offers the possibility of training deep learning models to aid the predictions of their biological relevance. We present here DeepRank, a general, configurable deep learning framework for data mining PPIs using 3D convolutional neural networks (CNNs). DeepRank maps features of PPIs onto 3D grids and trains a user-specified CNN on these 3D grids. DeepRank allows for efficient training of 3D CNNs with data sets containing millions of PPIs and supports both classification and regression. We demonstrate the performance of DeepRank on two distinct challenges: The classification of biological versus crystallographic PPIs, and the ranking of docking models. For both problems DeepRank is competitive with, or outperforms, state-of-the-art methods, demonstrating the versatility of the framework for research in structural biology. Collapse Key Words data mining machine learning protein structure predictions software molecular modelling Collapse MESH Headings Crystallography Data Mining/methods Datasets as Topic Deep Learning Molecular Docking Simulation Protein Interaction Domains and Motifs Protein Interaction Mapping/methods Protein Interaction Maps Collapse Grants Netherlands eScience Center (eScience Center) SURF Open Lab “Machine learning enhanced HPC applications” grant (AB/FA/10573) Radboud Universitair Medisch Centrum (Radboudumc) EC \| Horizon 2020 Framework Programme (EU Framework Programme for Research and Innovation H2020) Collapse
13	Prediction of protein assemblies, the next frontier: The CASP14-CAPRI experiment. Proteins 2021;89:1800-1823. [PMID: 34453465 PMCID: PMC8616814 DOI: 10.1002/prot.26222] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2021] [Revised: 07/24/2021] [Accepted: 08/05/2021] [Indexed: 12/19/2022] Abstract We present the results for CAPRI Round 50, the fourth joint CASP-CAPRI protein assembly prediction challenge. The Round comprised a total of twelve targets, including six dimers, three trimers, and three higher-order oligomers. Four of these were easy targets, for which good structural templates were available either for the full assembly, or for the main interfaces (of the higher-order oligomers). Eight were difficult targets for which only distantly related templates were found for the individual subunits. Twenty-five CAPRI groups including eight automatic servers submitted ~1250 models per target. Twenty groups including six servers participated in the CAPRI scoring challenge submitted ~190 models per target. The accuracy of the predicted models was evaluated using the classical CAPRI criteria. The prediction performance was measured by a weighted scoring scheme that takes into account the number of models of acceptable quality or higher submitted by each group as part of their five top-ranking models. Compared to the previous CASP-CAPRI challenge, top performing groups submitted such models for a larger fraction (70-75%) of the targets in this Round, but fewer of these models were of high accuracy. Scorer groups achieved stronger performance with more groups submitting correct models for 70-80% of the targets or achieving high accuracy predictions. Servers performed less well in general, except for the MDOCKPP and LZERD servers, who performed on par with human groups. In addition to these results, major advances in methodology are discussed, providing an informative overview of where the prediction of protein assemblies currently stands. Collapse Key Words CAPRI CASP blind prediction docking oligomeric state protein assemblies protein complexes protein docking protein-protein interaction template-based modeling Collapse MESH Headings Binding Sites Computational Biology/methods Models, Molecular Molecular Docking Simulation Protein Interaction Domains and Motifs Proteins/chemistry Proteins/metabolism Sequence Analysis, Protein Software Collapse Grants R35 GM136409 NIGMS NIH HHS Wellcome Trust R01 GM074255 NIGMS NIH HHS R21 GM127952 NIGMS NIH HHS R35 GM118078 NIGMS NIH HHS R35 GM124952 NIGMS NIH HHS Medical Research Council R01 HL142301 NHLBI NIH HHS FC001003 Cancer Research UK T32 GM132024 NIGMS NIH HHS RM1 GM135136 NIGMS NIH HHS R01 GM109980 NIGMS NIH HHS 10748 Cancer Research UK R01 GM078221 NIGMS NIH HHS R01 GM123055 NIGMS NIH HHS R01 GM133840 NIGMS NIH HHS R01 GM093123 NIGMS NIH HHS Collapse
14	Emergence and spread of SARS-CoV-2 lineage B.1.620 with variant of concern-like mutations and deletions. Nat Commun 2021;12:5769. [PMID: 34599175 PMCID: PMC8486757 DOI: 10.1038/s41467-021-26055-8] [Citation(s) in RCA: 36] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2021] [Accepted: 09/15/2021] [Indexed: 12/12/2022] Open Abstract Distinct SARS-CoV-2 lineages, discovered through various genomic surveillance initiatives, have emerged during the pandemic following unprecedented reductions in worldwide human mobility. We here describe a SARS-CoV-2 lineage - designated B.1.620 - discovered in Lithuania and carrying many mutations and deletions in the spike protein shared with widespread variants of concern (VOCs), including E484K, S477N and deletions HV69Δ, Y144Δ, and LLA241/243Δ. As well as documenting the suite of mutations this lineage carries, we also describe its potential to be resistant to neutralising antibodies, accompanying travel histories for a subset of European cases, evidence of local B.1.620 transmission in Europe with a focus on Lithuania, and significance of its prevalence in Central Africa owing to recent genome sequencing efforts there. We make a case for its likely Central African origin using advanced phylogeographic inference methodologies incorporating recorded travel histories of infected travellers. Collapse Key Words molecular evolution phylogenetics pathogens viral infection Collapse MESH Headings Africa, Central/epidemiology Antibodies, Neutralizing/immunology COVID-19/epidemiology COVID-19/transmission COVID-19/virology Europe/epidemiology Humans Immune Evasion/genetics Mutation Phylogeny Phylogeography SARS-CoV-2/classification SARS-CoV-2/genetics SARS-CoV-2/immunology Spike Glycoprotein, Coronavirus/genetics Travel/statistics & numerical data Collapse Grants 001 World Health Organization Collapse
15	Shape-Restrained Modeling of Protein-Small-Molecule Complexes with High Ambiguity Driven DOCKing. J Chem Inf Model 2021;61:4807-4818. [PMID: 34436890 PMCID: PMC8479858 DOI: 10.1021/acs.jcim.1c00796] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023] Abstract Small-molecule docking remains one of the most valuable computational techniques for the structure prediction of protein-small-molecule complexes. It allows us to study the interactions between compounds and the protein receptors they target at atomic detail in a timely and efficient manner. Here, we present a new protocol in HADDOCK (High Ambiguity Driven DOCKing), our integrative modeling platform, which incorporates homology information for both receptor and compounds. It makes use of HADDOCK's unique ability to integrate information in the simulation to drive it toward conformations, which agree with the provided data. The focal point is the use of shape restraints derived from homologous compounds bound to the target receptors. We have developed two protocols: in the first, the shape is composed of dummy atom beads based on the position of the heavy atoms of the homologous template compound, whereas in the second, the shape is additionally annotated with pharmacophore data for some or all beads. For both protocols, ambiguous distance restraints are subsequently defined between those beads and the heavy atoms of the ligand to be docked. We have benchmarked the performance of these protocols with a fully unbound version of the widely used DUD-E (Database of Useful Decoys-Enhanced) dataset. In this unbound docking scenario, our template/shape-based docking protocol reaches an overall success rate of 81% when a reliable template can be identified (which was the case for 99 out of 102 complexes in the DUD-E dataset), which is close to the best results reported for bound docking on the DUD-E dataset. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
16	Native or Non-Native Protein-Protein Docking Models? Molecular Dynamics to the Rescue. J Chem Theory Comput 2021;17:5944-5954. [PMID: 34342983 PMCID: PMC8444332 DOI: 10.1021/acs.jctc.1c00336] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2021] [Indexed: 11/29/2022] Abstract Molecular docking excels at creating a plethora of potential models of protein-protein complexes. To correctly distinguish the favorable, native-like models from the remaining ones remains, however, a challenge. We assessed here if a protocol based on molecular dynamics (MD) simulations would allow distinguishing native from non-native models to complement scoring functions used in docking. To this end, the first models for 25 protein-protein complexes were generated using HADDOCK. Next, MD simulations complemented with machine learning were used to discriminate between native and non-native complexes based on a combination of metrics reporting on the stability of the initial models. Native models showed higher stability in almost all measured properties, including the key ones used for scoring in the Critical Assessment of PRedicted Interaction (CAPRI) competition, namely the positional root mean square deviations and fraction of native contacts from the initial docked model. A random forest classifier was trained, reaching a 0.85 accuracy in correctly distinguishing native from non-native complexes. Reasonably modest simulation lengths of the order of 50-100 ns are sufficient to reach this accuracy, which makes this approach applicable in practice. Collapse Key Words Collapse MESH Headings Algorithms Machine Learning Molecular Docking Simulation Protein Interaction Mapping Proteins/chemistry Collapse Grants Horizon 2020 Framework Programme Collapse
17	Structural Biology in the Clouds: The WeNMR-EOSC Ecosystem. Front Mol Biosci 2021;8:729513. [PMID: 34395534 PMCID: PMC8356364 DOI: 10.3389/fmolb.2021.729513] [Citation(s) in RCA: 252] [Impact Index Per Article: 84.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2021] [Accepted: 07/13/2021] [Indexed: 12/05/2022] Open Abstract Structural biology aims at characterizing the structural and dynamic properties of biological macromolecules at atomic details. Gaining insight into three dimensional structures of biomolecules and their interactions is critical for understanding the vast majority of cellular processes, with direct applications in health and food sciences. Since 2010, the WeNMR project (www.wenmr.eu) has implemented numerous web-based services to facilitate the use of advanced computational tools by researchers in the field, using the high throughput computing infrastructure provided by EGI. These services have been further developed in subsequent initiatives under H2020 projects and are now operating as Thematic Services in the European Open Science Cloud portal (www.eosc-portal.eu), sending >12 millions of jobs and using around 4,000 CPU-years per year. Here we review 10 years of successful e-infrastructure solutions serving a large worldwide community of over 23,000 users to date, providing them with user-friendly, web-based solutions that run complex workflows in structural biology. The current set of active WeNMR portals are described, together with the complex backend machinery that allows distributed computing resources to be harvested efficiently. Collapse Key Words distributed computing e-infrastructure structural biology web portal web services Collapse MESH Headings Collapse Grants Collapse
18	50 years of PBD: a catalyst in structural biology. Nat Methods 2021;18:448-449. [PMID: 33963336 DOI: 10.1038/s41592-021-01138-y] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Abstract Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
19	Integrating quantitative proteomics with accurate genome profiling of transcription factors by greenCUT&RUN. Nucleic Acids Res 2021;49:e49. [PMID: 33524153 PMCID: PMC8136828 DOI: 10.1093/nar/gkab038] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2020] [Revised: 01/08/2021] [Accepted: 01/16/2021] [Indexed: 11/14/2022] Open Abstract Genome-wide localization of chromatin and transcription regulators can be detected by a variety of techniques. Here, we describe a novel method ‘greenCUT&RUN’ for genome-wide profiling of transcription regulators, which has a very high sensitivity, resolution, accuracy and reproducibility, whilst assuring specificity. Our strategy begins with tagging of the protein of interest with GFP and utilizes a GFP-specific nanobody fused to MNase to profile genome-wide binding events. By using a GFP-nanobody the greenCUT&RUN approach eliminates antibody dependency and variability. Robust genomic profiles were obtained with greenCUT&RUN, which are accurate and unbiased towards open chromatin. By integrating greenCUT&RUN with nanobody-based affinity purification mass spectrometry, ‘piggy-back’ DNA binding events can be identified on a genomic scale. The unique design of greenCUT&RUN grants target protein flexibility and yields high resolution footprints. In addition, greenCUT&RUN allows rapid profiling of mutants of chromatin and transcription proteins. In conclusion, greenCUT&RUN is a widely applicable and versatile genome-mapping technique. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
20	Information-driven modeling of biomolecular complexes. Curr Opin Struct Biol 2021;70:70-77. [PMID: 34139639 DOI: 10.1016/j.sbi.2021.05.003] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2021] [Accepted: 05/10/2021] [Indexed: 11/15/2022] Abstract Proteins play crucial roles in every cellular process by interacting with each other, nucleic acids, metabolites, and other molecules. The resulting assemblies can be very large and intricate and pose challenges to experimental methods. In the current era of integrative modeling, it is often only by a combination of various experimental techniques and computations that three-dimensional models of those molecular machines can be obtained. Among the various computational approaches available, molecular docking is often the method of choice when it comes to predicting three-dimensional structures of complexes. Docking can generate particularly accurate models when taking into account the available information on the complex of interest. We review here the use of experimental and bioinformatics data in protein-protein docking, describing recent software developments and highlighting applications for the modeling of antibody-antigen complexes and membrane protein complexes, and the use of evolutionary and shape information. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
21	MENSAdb: a thorough structural analysis of membrane protein dimers. Database (Oxford) 2021;2021:baab013. [PMID: 33822911 PMCID: PMC8023553 DOI: 10.1093/database/baab013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2020] [Revised: 01/19/2021] [Accepted: 03/01/2021] [Indexed: 11/14/2022] Abstract Membrane proteins (MPs) are key players in a variety of different cellular processes and constitute the target of around 60% of all Food and Drug Administration-approved drugs. Despite their importance, there is still a massive lack of relevant structural, biochemical and mechanistic information mainly due to their localization within the lipid bilayer. To help fulfil this gap, we developed the MEmbrane protein dimer Novel Structure Analyser database (MENSAdb). This interactive web application summarizes the evolutionary and physicochemical properties of dimeric MPs to expand the available knowledge on the fundamental principles underlying their formation. Currently, MENSAdb contains features of 167 unique MPs (63% homo- and 37% heterodimers) and brings insights into the conservation of residues, accessible solvent area descriptors, average B-factors, intermolecular contacts at 2.5 Å and 4.0 Å distance cut-offs, hydrophobic contacts, hydrogen bonds, salt bridges, π-π stacking, T-stacking and cation-π interactions. The regular update and organization of all these data into a unique platform will allow a broad community of researchers to collect and analyse a large number of features efficiently, thus facilitating their use in the development of prediction models associated with MPs. Database URL: http://www.moreiralab.com/resources/mensadb. Collapse Key Words Collapse MESH Headings Databases, Protein Hydrogen Bonding Lipid Bilayers Membrane Proteins/genetics Collapse Grants Nederlandse Organisatie voor Wetenschappelijk Onderzoek European Regional Development Fund Fundação para a Ciência e a Tecnologia Collapse
22	proABC-2: PRediction of AntiBody contacts v2 and its application to information-driven docking. Bioinformatics 2021;36:5107-5108. [PMID: 32683441 PMCID: PMC7755408 DOI: 10.1093/bioinformatics/btaa644] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2020] [Revised: 06/10/2020] [Accepted: 07/13/2020] [Indexed: 11/26/2022] Open Abstract Motivation Monoclonal antibodies are essential tools in the contemporary therapeutic armory. Understanding how these recognize their antigen is a fundamental step in their rational design and engineering. The rising amount of publicly available data is catalyzing the development of computational approaches able to offer valuable, faster and cheaper alternatives to classical experimental methodologies used for the study of antibody–antigen complexes. Results Here, we present proABC-2, an update of the original random-forest antibody paratope predictor, based on a convolutional neural network algorithm. We also demonstrate how the predictions can be fruitfully used to drive the docking in HADDOCK. Availability and implementation The proABC-2 server is freely available at: https://wenmr.science.uu.nl/proabc2/. Supplementary information Supplementary data are available at Bioinformatics online. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
23	Characterization of nucleosome sediments for protein interaction studies by solid-state NMR spectroscopy. MAGNETIC RESONANCE (GOTTINGEN, GERMANY) 2021;2:187-202. [PMID: 35647606 PMCID: PMC9135053 DOI: 10.5194/mr-2-187-2021] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/25/2023] Abstract Regulation of DNA-templated processes such as gene transcription and DNA repair depend on the interaction of a wide range of proteins with the nucleosome, the fundamental building block of chromatin. Both solution and solid-state NMR spectroscopy have become an attractive approach to study the dynamics and interactions of nucleosomes, despite their high molecular weight of ~ 200 kDa. For solid-state NMR (ssNMR) studies, dilute solutions of nucleosomes are converted to a dense phase by sedimentation or precipitation. Since nucleosomes are known to self-associate, these dense phases may induce extensive interactions between nucleosomes, which could interfere with protein-binding studies. Here, we characterized the packing of nucleosomes in the dense phase created by sedimentation using NMR and small-angle X-ray scattering (SAXS) experiments. We found that nucleosome sediments are gels with variable degrees of solidity, have nucleosome concentration close to that found in crystals, and are stable for weeks under high-speed magic angle spinning (MAS). Furthermore, SAXS data recorded on recovered sediments indicate that there is no pronounced long-range ordering of nucleosomes in the sediment. Finally, we show that the sedimentation approach can also be used to study low-affinity protein interactions with the nucleosome. Together, our results give new insights into the sample characteristics of nucleosome sediments for ssNMR studies and illustrate the broad applicability of sedimentation-based NMR studies. Collapse Key Words Collapse MESH Headings Collapse Grants R01 GM125195 NIGMS NIH HHS R01 CA252707 NCI NIH HHS R01 GM135671 NIGMS NIH HHS R01 HL151334 NHLBI NIH HHS R01 AG067664 NIA NIH HHS Collapse
24	Control over the fibrillization yield by varying the oligomeric nucleation propensities of self-assembling peptides. Commun Chem 2020;3:164. [PMID: 36703336 PMCID: PMC9814929 DOI: 10.1038/s42004-020-00417-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2020] [Accepted: 10/19/2020] [Indexed: 01/29/2023] Open Abstract Self-assembling peptides are an exemplary class of supramolecular biomaterials of broad biomedical utility. Mechanistic studies on the peptide self-assembly demonstrated the importance of the oligomeric intermediates towards the properties of the supramolecular biomaterials being formed. In this study, we demonstrate how the overall yield of the supramolecular assemblies are moderated through subtle molecular changes in the peptide monomers. This strategy is exemplified with a set of surfactant-like peptides (SLPs) with different β-sheet propensities and charged residues flanking the aggregation domains. By integrating different techniques, we show that these molecular changes can alter both the nucleation propensity of the oligomeric intermediates and the thermodynamic stability of the fibril structures. We demonstrate that the amount of assembled nanofibers are critically defined by the oligomeric nucleation propensities. Our findings offer guidance on designing self-assembling peptides for different biomedical applications, as well as insights into the role of protein gatekeeper sequences in preventing amyloidosis. Collapse Key Words peptides solid-state nmr self-assembly bioinspired materials Collapse MESH Headings Collapse Grants Collapse
25	PDB-tools web: A user-friendly interface for the manipulation of PDB files. Proteins 2020;89:330-335. [PMID: 33111403 PMCID: PMC7855443 DOI: 10.1002/prot.26018] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2020] [Revised: 10/20/2020] [Accepted: 10/26/2020] [Indexed: 01/06/2023] Abstract The Protein Data Bank (PDB) file format remains a popular format used and supported by many software to represent coordinates of macromolecular structures. It however suffers from drawbacks such as error‐prone manual editing. Because of that, various software toolkits have been developed to facilitate its editing and manipulation, but, to date, there is no online tool available for this purpose. Here we present PDB‐Tools Web, a flexible online service for manipulating PDB files. It offers a rich and user‐friendly graphical user interface that allows users to mix‐and‐match more than 40 individual tools from the pdb‐tools suite. Those can be combined in a few clicks to perform complex pipelines, which can be saved and uploaded. The resulting processed PDB files can be visualized online and downloaded. The web server is freely available at https://wenmr.science.uu.nl/pdbtools. Collapse Key Words PDB bioinformatics structural biology web server Collapse MESH Headings Collapse Grants Collapse
26	A click-flipped enzyme substrate boosts the performance of the diagnostic screening for Hunter syndrome. Chem Sci 2020;11:12671-12676. [PMID: 34094461 PMCID: PMC8163285 DOI: 10.1039/d0sc04696e] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2020] [Accepted: 10/23/2020] [Indexed: 11/23/2022] Open Abstract We report on the unexpected finding that click modification of iduronyl azides results in a conformational flip of the pyranose ring, which led to the development of a new strategy for the design of superior enzyme substrates for the diagnostic assaying of iduronate-2-sulfatase (I2S), a lysosomal enzyme related to Hunter syndrome. Synthetic substrates are essential in testing newborns for metabolic disorders to enable early initiation of therapy. Our click-flipped iduronyl triazole showed a remarkably better performance with I2S than commonly used O-iduronates. We found that both O- and triazole-linked substrates are accepted by the enzyme, irrespective of their different conformations, but only the O-linked product inhibits the activity of I2S. Thus, in the long reaction times required for clinical assays, the triazole substrate substantially outperforms the O-iduronate. Applying our click-flipped substrate to assay I2S in dried blood spots sampled from affected patients and random newborns significantly increased the confidence in discriminating between these groups, clearly indicating the potential of the click-flip strategy to control the biomolecular function of carbohydrates. Collapse Key Words Collapse MESH Headings Collapse Grants Österreichische Forschungsförderungsgesellschaft Nederlandse Organisatie voor Wetenschappelijk Onderzoek Collapse
27	Understanding Docking Complexes of Macromolecules Using HADDOCK: The Synergy between Experimental Data and Computations. Bio Protoc 2020;10:e3793. [PMID: 33659447 PMCID: PMC7842552 DOI: 10.21769/bioprotoc.3793] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2019] [Revised: 08/17/2020] [Accepted: 09/01/2020] [Indexed: 11/02/2022] Open Abstract This protocol illustrates the modelling of a protein-peptide complex using the synergic combination of in silico analysis and experimental results. To this end, we use the integrative modelling software HADDOCK, which possesses the powerful ability to incorporate experimental data, such as NMR Chemical Shift Perturbations and biochemical protein-peptide interaction data, as restraints to guide the docking process. Based on the modelling results, a rational mutagenesis approach is used to validate the generated models. The experimental results allow to select a final structural model best representing the bona fide protein-peptide complex. The described protocol can also be applied to model protein-protein complexes. There is no size limit for the macromolecular complexes that can be characterized by HADDOCK as long as the 3D structures of the individual components are available. Collapse Key Words Biomolecular interactions HADDOCK HCN channels ITC Integrative Modelling Macromolecular complex Molecular docking NMR Rational mutagenesis TRIP8b Collapse MESH Headings Collapse Grants Collapse
28	Computational approaches to therapeutic antibody design: established methods and emerging trends. Brief Bioinform 2020;21:1549-1567. [PMID: 31626279 PMCID: PMC7947987 DOI: 10.1093/bib/bbz095] [Citation(s) in RCA: 101] [Impact Index Per Article: 25.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2019] [Revised: 06/07/2019] [Accepted: 07/05/2019] [Indexed: 12/31/2022] Open Abstract Antibodies are proteins that recognize the molecular surfaces of potentially noxious molecules to mount an adaptive immune response or, in the case of autoimmune diseases, molecules that are part of healthy cells and tissues. Due to their binding versatility, antibodies are currently the largest class of biotherapeutics, with five monoclonal antibodies ranked in the top 10 blockbuster drugs. Computational advances in protein modelling and design can have a tangible impact on antibody-based therapeutic development. Antibody-specific computational protocols currently benefit from an increasing volume of data provided by next generation sequencing and application to related drug modalities based on traditional antibodies, such as nanobodies. Here we present a structured overview of available databases, methods and emerging trends in computational antibody analysis and contextualize them towards the engineering of candidate antibody therapeutics. Collapse Key Words antibody–antigen complexes databases docking homology modelling therapeutic antibodies Collapse MESH Headings Antibodies, Monoclonal/chemistry Antibodies, Monoclonal/immunology Antibodies, Monoclonal/therapeutic use Computational Biology/methods Databases, Protein Molecular Docking Simulation Protein Conformation Collapse Grants European Union Horizon 2020 BioExcel EOSC-hub Simons Foundation Collapse
29	LightDock goes information-driven. Bioinformatics 2020;36:950-952. [PMID: 31418773 PMCID: PMC7005597 DOI: 10.1093/bioinformatics/btz642] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2019] [Revised: 08/09/2019] [Accepted: 08/14/2019] [Indexed: 11/13/2022] Open Abstract MOTIVATION The use of experimental information has been demonstrated to increase the success rate of computational macromolecular docking. Many methods use information to post-filter the simulation output while others drive the simulation based on experimental restraints, which can become problematic for more complex scenarios such as multiple binding interfaces. RESULTS We present a novel method for including interface information into protein docking simulations within the LightDock framework. Prior to the simulation, irrelevant regions from the receptor are excluded for sampling (filter of initial swarms) and initial ligand poses are pre-oriented based on ligand input information. We demonstrate the applicability of this approach on the new 55 cases of the Protein-Protein Docking Benchmark 5, using different amounts of information. Even with incomplete or incorrect information, a significant improvement in performance is obtained compared to blind ab initio docking. AVAILABILITY AND IMPLEMENTATION The software is supported and freely available from https://github.com/brianjimenez/lightdock and analysis data from https://github.com/brianjimenez/lightdock_bm5. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
30	Editorial: Multiscale Modeling From Macromolecules to Cell: Opportunities and Challenges of Biomolecular Simulations. Front Mol Biosci 2020;7:194. [PMID: 33005628 PMCID: PMC7484804 DOI: 10.3389/fmolb.2020.00194] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Key Words] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2020] [Accepted: 07/21/2020] [Indexed: 01/29/2023] Open Abstract Collapse Key Words advanced sampling methods bioinformatics coarse grained models macro-biomolecules molecular crowding molecular dynamics simulations multiscale modeling system biology Collapse MESH Headings Collapse Grants R01 EY027440 NEI NIH HHS Collapse
31	Inhibition of the integrated stress response by viral proteins that block p-eIF2-eIF2B association. Nat Microbiol 2020;5:1361-1373. [PMID: 32690955 DOI: 10.1038/s41564-020-0759-0] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2019] [Accepted: 06/22/2020] [Indexed: 11/09/2022] Abstract Eukaryotic cells, when exposed to environmental or internal stress, activate the integrated stress response (ISR) to restore homeostasis and promote cell survival. Specific stress stimuli prompt dedicated stress kinases to phosphorylate eukaryotic initiation factor 2 (eIF2). Phosphorylated eIF2 (p-eIF2) in turn sequesters the eIF2-specific guanine exchange factor eIF2B to block eIF2 recycling, thereby halting translation initiation and reducing global protein synthesis. To circumvent stress-induced translational shutdown, viruses encode ISR antagonists. Those identified so far prevent or reverse eIF2 phosphorylation. We now describe two viral proteins-one from a coronavirus and the other from a picornavirus-that have independently acquired the ability to counteract the ISR at its very core by acting as a competitive inhibitor of p-eIF2-eIF2B interaction. This allows continued formation of the eIF2-GTP-Met-tRNAi ternary complex and unabated global translation at high p-eIF2 levels that would otherwise cause translational arrest. We conclude that eIF2 and p-eIF2 differ in their interaction with eIF2B to such effect that p-eIF2-eIF2B association can be selectively inhibited. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
32	Large-scale prediction of binding affinity in protein-small ligand complexes: the PRODIGY-LIG web server. Bioinformatics 2020;35:1585-1587. [PMID: 31051038 DOI: 10.1093/bioinformatics/bty816] [Citation(s) in RCA: 105] [Impact Index Per Article: 26.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2018] [Revised: 08/15/2018] [Accepted: 09/19/2018] [Indexed: 11/14/2022] Open Abstract SUMMARY Recently we published PROtein binDIng enerGY (PRODIGY), a web-server for the prediction of binding affinity in protein-protein complexes. By using a combination of simple structural properties, such as the residue-contacts made at the interface, PRODIGY has demonstrated a top performance compared with other state-of-the-art predictors in the literature. Here we present an extension of it, named PRODIGY-LIG, aimed at the prediction of affinity in protein-small ligand complexes. The predictive method, properly readapted for small ligand by making use of atomic instead of residue contacts, has been successfully applied for the blind prediction of 102 protein-ligand complexes during the D3R Grand Challenge 2. PRODIGY-LIG has the advantage of being simple, generic and applicable to any kind of protein-ligand complex. It provides an automatic, fast and user-friendly tool ensuring broad accessibility. AVAILABILITY AND IMPLEMENTATION PRODIGY-LIG is freely available without registration requirements at http://milou.science.uu.nl/services/PRODIGY-LIG. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
33	A community proposal to integrate structural bioinformatics activities in ELIXIR (3D-Bioinfo Community). F1000Res 2020;9. [PMID: 32566135 PMCID: PMC7284151 DOI: 10.12688/f1000research.20559.1] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 03/05/2020] [Indexed: 12/11/2022] Open Abstract Structural bioinformatics provides the scientific methods and tools to analyse, archive, validate, and present the biomolecular structure data generated by the structural biology community. It also provides an important link with the genomics community, as structural bioinformaticians also use the extensive sequence data to predict protein structures and their functional sites. A very broad and active community of structural bioinformaticians exists across Europe, and 3D-Bioinfo will establish formal platforms to address their needs and better integrate their activities and initiatives. Our mission will be to strengthen the ties with the structural biology research communities in Europe covering life sciences, as well as chemistry and physics and to bridge the gap between these researchers in order to fully realize the potential of structural bioinformatics. Our Community will also undertake dedicated educational, training and outreach efforts to facilitate this, bringing new insights and thus facilitating the development of much needed innovative applications e.g. for human health, drug and protein design. Our combined efforts will be of critical importance to keep the European research efforts competitive in this respect. Here we highlight the major European contributions to the field of structural bioinformatics, the most pressing challenges remaining and how Europe-wide interactions, enabled by ELIXIR and its platforms, will help in addressing these challenges and in coordinating structural bioinformatics resources across Europe. In particular, we present recent activities and future plans to consolidate an ELIXIR 3D-Bioinfo Community in structural bioinformatics and propose means to develop better links across the community. These include building new consortia, organising workshops to establish data standards and seeking community agreement on benchmark data sets and strategies. We also highlight existing and planned collaborations with other ELIXIR Communities and other European infrastructures, such as the structural biology community supported by Instruct-ERIC, with whom we have synergies and overlapping common interests. Collapse Key Words ELIXIR Instruct-ERIC biomolecular structure nucleic acids structure protein structure structural bioinformatics Collapse MESH Headings Collapse Grants Collapse
34	iScore: a novel graph kernel-based function for scoring protein-protein docking models. Bioinformatics 2020;36:112-121. [PMID: 31199455 PMCID: PMC6956772 DOI: 10.1093/bioinformatics/btz496] [Citation(s) in RCA: 42] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2018] [Revised: 05/08/2019] [Accepted: 06/11/2019] [Indexed: 11/12/2022] Open Abstract MOTIVATION Protein complexes play critical roles in many aspects of biological functions. Three-dimensional (3D) structures of protein complexes are critical for gaining insights into structural bases of interactions and their roles in the biomolecular pathways that orchestrate key cellular processes. Because of the expense and effort associated with experimental determinations of 3D protein complex structures, computational docking has evolved as a valuable tool to predict 3D structures of biomolecular complexes. Despite recent progress, reliably distinguishing near-native docking conformations from a large number of candidate conformations, the so-called scoring problem, remains a major challenge. RESULTS Here we present iScore, a novel approach to scoring docked conformations that combines HADDOCK energy terms with a score obtained using a graph representation of the protein-protein interfaces and a measure of evolutionary conservation. It achieves a scoring performance competitive with, or superior to, that of state-of-the-art scoring functions on two independent datasets: (i) Docking software-specific models and (ii) the CAPRI score set generated by a wide variety of docking approaches (i.e. docking software-non-specific). iScore ranks among the top scoring approaches on the CAPRI score set (13 targets) when compared with the 37 scoring groups in CAPRI. The results demonstrate the utility of combining evolutionary, topological and energetic information for scoring docked conformations. This work represents the first successful demonstration of graph kernels to protein interfaces for effective discrimination of near-native and non-native conformations of protein complexes. AVAILABILITY AND IMPLEMENTATION The iScore code is freely available from Github: https://github.com/DeepRank/iScore (DOI: 10.5281/zenodo.2630567). And the docking models used are available from SBGrid: https://data.sbgrid.org/dataset/684). SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online. Collapse Key Words Collapse MESH Headings Algorithms Computational Biology/methods Molecular Docking Simulation/methods Protein Binding Protein Conformation Proteins/chemistry Proteins/metabolism Software Collapse Grants UL1 TR000127 NCATS NIH HHS UL1 TR002014 NCATS NIH HHS Collapse
35	Protein-Protein Modeling Using Cryo-EM Restraints. Methods Mol Biol 2020;2112:145-162. [PMID: 32006284 DOI: 10.1007/978-1-0716-0270-6_11] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/10/2023] Abstract Recent improvements in cryo-electron microscopy (cryo-EM) in the past few years are now allowing to observe molecular complexes at atomic resolution. As a consequence, numerous structures derived from cryo-EM are now available in the Protein Data Bank. However, if for some complexes atomic resolution is reached, this is not true for all. This is also the case in cryo-electron tomography where the achievable resolution is still limited. Furthermore the resolution in a cryo-EM map is not a constant, with often outer regions being of lower resolution, possibly linked to conformational variability. Although those low- to medium-resolution EM maps (or regions thereof) cannot directly provide atomic structure of large molecular complexes, they provide valuable information to model the individual components and their assembly into them. Most approaches for this kind of modeling are performing rigid fitting of the individual components into the EM density map. While this would appear an obvious option, they ignore key aspects of molecular recognition, the energetics and flexibility of the interfaces. Moreover, this often restricts the modeling to a unique source of data, the EM density map.In this chapter, we describe a protocol where an EM map is used as restraint in HADDOCK to guide the modeling process. In the first step, rigid-body fitting is performed with PowerFit in order to identify the most likely locations of the molecules into the map. These are then used as centroids to which distance restraints are defined from the center of mass of the components of the complex for the initial rigid-body docking. The EM density is then directly used as an additional restraint energy term, which can be combined with all the other types of data supported by HADDOCK. This protocol relies on the new version 2.4 of both the HADDOCK webserver and software. Preparation steps consisting of cropping the EM map and rigid-body fitting of the atomic structure are explained. Then, the EM-driven docking protocol using HADDOCK is illustrated. Collapse Key Words Biomolecular interactions Cryo-EM data Flexibility HADDOCK Information-driven docking Molecular modeling Collapse MESH Headings Cryoelectron Microscopy/methods Databases, Protein Molecular Docking Simulation/methods Protein Conformation Protein Interaction Domains and Motifs Proteins/chemistry Software Collapse Grants Collapse
36	An overview of data-driven HADDOCK strategies in CAPRI rounds 38-45. Proteins 2019;88:1029-1036. [PMID: 31886559 DOI: 10.1002/prot.25869] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2019] [Revised: 12/17/2019] [Accepted: 12/26/2019] [Indexed: 01/18/2023] Abstract Our information-driven docking approach HADDOCK has demonstrated a sustained performance since the start of its participation to CAPRI. This is due, in part, to its ability to integrate data into the modeling process, and to the robustness of its scoring function. We participated in CAPRI both as server and manual predictors. In CAPRI rounds 38-45, we have used various strategies depending on the available information. These ranged from imposing restraints to a few residues identified from literature as being important for the interaction, to binding pockets identified from homologous complexes or template-based refinement/CA-CA restraint-guided docking from identified templates. When relevant, symmetry restraints were used to limit the conformational sampling. We also tested for a large decamer target a new implementation of the MARTINI coarse-grained force field in HADDOCK. Overall, we obtained acceptable or better predictions for 13 and 11 server and manual submissions, respectively, out of the 22 interfaces. Our server performance (acceptable or higher-quality models when considering the top 10) was better (59%) than the manual (50%) one, in which we typically experiment with various combinations of protocols and data sources. Again, our simple scoring function based on a linear combination of intermolecular van der Waals and electrostatic energies and an empirical desolvation term demonstrated a good performance in the scoring experiment with a 63% success rate across all 22 interfaces. An analysis of model quality indicates that, while we are consistently performing well in generating acceptable models, there is room for improvement for generating/identifying higher quality models. Collapse Key Words biomolecular interactions complexes integrative modeling prediction scoring Collapse MESH Headings Collapse Grants Collapse
37	Blind prediction of homo- and hetero-protein complexes: The CASP13-CAPRI experiment. Proteins 2019;87:1200-1221. [PMID: 31612567 PMCID: PMC7274794 DOI: 10.1002/prot.25838] [Citation(s) in RCA: 79] [Impact Index Per Article: 15.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2019] [Revised: 09/26/2019] [Accepted: 09/27/2019] [Indexed: 12/28/2022] Abstract We present the results for CAPRI Round 46, the third joint CASP-CAPRI protein assembly prediction challenge. The Round comprised a total of 20 targets including 14 homo-oligomers and 6 heterocomplexes. Eight of the homo-oligomer targets and one heterodimer comprised proteins that could be readily modeled using templates from the Protein Data Bank, often available for the full assembly. The remaining 11 targets comprised 5 homodimers, 3 heterodimers, and two higher-order assemblies. These were more difficult to model, as their prediction mainly involved "ab-initio" docking of subunit models derived from distantly related templates. A total of ~30 CAPRI groups, including 9 automatic servers, submitted on average ~2000 models per target. About 17 groups participated in the CAPRI scoring rounds, offered for most targets, submitting ~170 models per target. The prediction performance, measured by the fraction of models of acceptable quality or higher submitted across all predictors groups, was very good to excellent for the nine easy targets. Poorer performance was achieved by predictors for the 11 difficult targets, with medium and high quality models submitted for only 3 of these targets. A similar performance "gap" was displayed by scorer groups, highlighting yet again the unmet challenge of modeling the conformational changes of the protein components that occur upon binding or that must be accounted for in template-based modeling. Our analysis also indicates that residues in binding interfaces were less well predicted in this set of targets than in previous Rounds, providing useful insights for directions of future improvements. Collapse Key Words CAPRI CASP blind prediction docking oligomeric state protein assemblies protein complexes protein-protein interaction template-based modeling Collapse MESH Headings Algorithms Binding Sites/genetics Computational Biology Databases, Protein Models, Molecular Protein Binding/genetics Protein Conformation Protein Interaction Mapping Proteins/chemistry Proteins/genetics Proteins/ultrastructure Software Structural Homology, Protein Collapse Grants R35GM124952 NIH HHS R01GM074255 NIH HHS R01 GM074255 NIGMS NIH HHS R21 GM127952 NIGMS NIH HHS R01 GM123055 NIGMS NIH HHS R01GM123055 NIH HHS R35 GM118078 NIGMS NIH HHS R35 GM124952 NIGMS NIH HHS FC001003 Cancer Research UK Wellcome Trust FC001003 Wellcome Trust R01 GM093123 NIGMS NIH HHS R01 GM109980 NIGMS NIH HHS 10748 Cancer Research UK T32 AI125186 NIAID NIH HHS FC001003 Medical Research Council Collapse
38	Modeling Antibody-Antigen Complexes by Information-Driven Docking. Structure 2019;28:119-129.e2. [PMID: 31727476 DOI: 10.1016/j.str.2019.10.011] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2019] [Revised: 07/03/2019] [Accepted: 10/18/2019] [Indexed: 10/25/2022] Abstract Antibodies are Y-shaped proteins essential for immune response. Their capability to recognize antigens with high specificity makes them excellent therapeutic targets. Understanding the structural basis of antibody-antigen interactions is therefore crucial for improving our ability to design efficient biological drugs. Computational approaches such as molecular docking are providing a valuable and fast alternative to experimental structural characterization for these complexes. We investigate here how information about complementarity-determining regions and binding epitopes can be used to drive the modeling process, and present a comparative study of four different docking software suites (ClusPro, LightDock, ZDOCK, and HADDOCK) providing specific options for antibody-antigen modeling. Their performance on a dataset of 16 complexes is reported. HADDOCK, which includes information to drive the docking, is shown to perform best in terms of both success rate and quality of the generated models in both the presence and absence of information about the epitope on the antigen. Collapse Key Words ClusPro H3 modeling HADDOCK LightDock ZDOCK antibody binding sites conformational changes docking Collapse MESH Headings Collapse Grants Collapse
39	Sharing Data from Molecular Simulations. J Chem Inf Model 2019;59:4093-4099. [PMID: 31525920 DOI: 10.1021/acs.jcim.9b00665] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Abstract Given the need for modern researchers to produce open, reproducible scientific output, the lack of standards and best practices for sharing data and workflows used to produce and analyze molecular dynamics (MD) simulations has become an important issue in the field. There are now multiple well-established packages to perform molecular dynamics simulations, often highly tuned for exploiting specific classes of hardware, each with strong communities surrounding them, but with very limited interoperability/transferability options. Thus, the choice of the software package often dictates the workflow for both simulation production and analysis. The level of detail in documenting the workflows and analysis code varies greatly in published work, hindering reproducibility of the reported results and the ability for other researchers to build on these studies. An increasing number of researchers are motivated to make their data available, but many challenges remain in order to effectively share and reuse simulation data. To discuss these and other issues related to best practices in the field in general, we organized a workshop in November 2018 ( https://bioexcel.eu/events/workshop-on-sharing-data-from-molecular-simulations/ ). Here, we present a brief overview of this workshop and topics discussed. We hope this effort will spark further conversation in the MD community to pave the way toward more open, interoperable, and reproducible outputs coming from research studies using MD simulations. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
40	Less Is More: Coarse-Grained Integrative Modeling of Large Biomolecular Assemblies with HADDOCK. J Chem Theory Comput 2019;15:6358-6367. [PMID: 31539250 PMCID: PMC6854652 DOI: 10.1021/acs.jctc.9b00310] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Abstract Predicting the 3D structure of protein interactions remains a challenge in the field of computational structural biology. This is in part due to difficulties in sampling the complex energy landscape of multiple interacting flexible polypeptide chains. Coarse-graining approaches, which reduce the number of degrees of freedom of the system, help address this limitation by smoothing the energy landscape, allowing an easier identification of the global energy minimum. They also accelerate the calculations, allowing for modeling larger assemblies. Here, we present the implementation of the MARTINI coarse-grained force field for proteins into HADDOCK, our integrative modeling platform. Docking and refinement are performed at the coarse-grained level, and the resulting models are then converted back to atomistic resolution through a distance restraints-guided morphing procedure. Our protocol, tested on the largest complexes of the protein docking benchmark 5, shows an overall ∼7-fold speed increase compared to standard all-atom calculations, while maintaining a similar accuracy and yielding substantially more near-native solutions. To showcase the potential of our method, we performed simultaneous 7 body docking to model the 1:6 KaiC-KaiB complex, integrating mutagenesis and hydrogen/deuterium exchange data from mass spectrometry with symmetry restraints, and validated the resulting models against a recently published cryo-EM structure. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
41	MARTINI-Based Protein-DNA Coarse-Grained HADDOCKing. Front Mol Biosci 2019;6:102. [PMID: 31632986 PMCID: PMC6779769 DOI: 10.3389/fmolb.2019.00102] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2019] [Accepted: 09/17/2019] [Indexed: 11/13/2022] Open Abstract Modeling biomolecular assemblies is an important field in computational structural biology. The inherent complexity of their energy landscape and the computational cost associated with modeling large and complex assemblies are major drawbacks for integrative modeling approaches. The so-called coarse-graining approaches, which reduce the degrees of freedom of the system by grouping several atoms into larger “pseudo-atoms,” have been shown to alleviate some of those limitations, facilitating the identification of the global energy minima assumed to correspond to the native state of the complex, while making the calculations more efficient. Here, we describe and assess the implementation of the MARTINI force field for DNA into HADDOCK, our integrative modeling platform. We combine it with our previous implementation for protein-protein coarse-grained docking, enabling coarse-grained modeling of protein-nucleic acid complexes. The system is modeled using MARTINI topologies and interaction parameters during the rigid body docking and semi-flexible refinement stages of HADDOCK, and the resulting models are then converted back to atomistic resolution by an atom-to-bead distance restraints-guided protocol. We first demonstrate the performance of this protocol using 44 complexes from the protein-DNA docking benchmark, which shows an overall ~6-fold speed increase and maintains similar accuracy as compared to standard atomistic calculations. As a proof of concept, we then model the interaction between the PRC1 and the nucleosome (a former CAPRI target in round 31), using the same information available at the time the target was offered, and compare all-atom and coarse-grained models. Collapse Key Words biomolecular complexes coarse-graining docking force field nucleic acids Collapse MESH Headings Collapse Grants Collapse
42	Pre- and post-docking sampling of conformational changes using ClustENM and HADDOCK for protein-protein and protein-DNA systems. Proteins 2019;88:292-306. [PMID: 31441121 PMCID: PMC6973081 DOI: 10.1002/prot.25802] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2019] [Revised: 08/15/2019] [Accepted: 08/19/2019] [Indexed: 02/01/2023] Abstract Incorporating the dynamic nature of biomolecules in the modeling of their complexes is a challenge, especially when the extent and direction of the conformational changes taking place upon binding is unknown. Estimating whether the binding of a biomolecule to its partner(s) occurs in a conformational state accessible to its unbound form (“conformational selection”) and/or the binding process induces conformational changes (“induced‐fit”) is another challenge. We propose here a method combining conformational sampling using ClustENM—an elastic network‐based modeling procedure—with docking using HADDOCK, in a framework that incorporates conformational selection and induced‐fit effects upon binding. The extent of the applied deformation is estimated from its energetical costs, inspired from mechanical tensile testing on materials. We applied our pre‐ and post‐docking sampling of conformational changes to the flexible multidomain protein‐protein docking benchmark and a subset of the protein‐DNA docking benchmark. Our ClustENM‐HADDOCK approach produced acceptable to medium quality models in 7/11 and 5/6 cases for the protein‐protein and protein‐DNA complexes, respectively. The conformational selection (sampling prior to docking) has the highest impact on the quality of the docked models for the protein‐protein complexes. The induced‐fit stage of the pipeline (post‐sampling), however, improved the quality of the final models for the protein‐DNA complexes. Compared to previously described strategies to handle conformational changes, ClustENM‐HADDOCK performs better than two‐body docking in protein‐protein cases but worse than a flexible multidomain docking approach. However, it does show a better or similar performance compared to previous protein‐DNA docking approaches, which makes it a suitable alternative. Collapse Key Words biomolecular complexes conformational flexibility elastic network modeling Collapse MESH Headings Collapse Grants Collapse
43	The structural details of the interaction of single-stranded DNA binding protein hSSB2 (NABP1/OBFC2A) with UV-damaged DNA. Proteins 2019;88:319-326. [PMID: 31443132 DOI: 10.1002/prot.25806] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2019] [Revised: 08/02/2019] [Accepted: 08/19/2019] [Indexed: 12/17/2022] Abstract Single-stranded DNA-binding proteins (SSBs) are required for all known DNA metabolic events such as DNA replication, recombination and repair. While a wealth of structural and functional data is available on the essential human SSB, hSSB1 (NABP2/OBFC2B), the close homolog hSSB2 (NABP1/OBFC2A) remains relatively uncharacterized. Both SSBs possess a well-structured OB (oligonucleotide/oligosaccharide-binding) domain that is able to recognize single-stranded DNA (ssDNA) followed by a flexible carboxyl-tail implicated in the interaction with other proteins. Despite the high sequence similarity of the OB domain, several recent studies have revealed distinct functional differences between hSSB1 and hSSB2. In this study, we show that hSSB2 is able to recognize cyclobutane pyrimidine dimers (CPD) that form in cellular DNA as a consequence of UV damage. Using a combination of biolayer interferometry and NMR, we determine the molecular details of the binding of the OB domain of hSSB2 to CPD-containing ssDNA, confirming the role of four key aromatic residues in hSSB2 (W59, Y78, W82, and Y89) that are also conserved in hSSB1. Our structural data thus demonstrate that ssDNA recognition by the OB fold of hSSB2 is highly similar to hSSB1, indicating that one SSB may be able to replace the other in any initial ssDNA binding event. However, any subsequent recruitment of other repair proteins most likely depends on the divergent carboxyl-tail and as such is likely to be different between hSSB1 and hSSB2. Collapse Key Words DNA repair NABP1 NMR OBFC2A SSB hSSB2 Collapse MESH Headings Collapse Grants Collapse
44	Folding Then Binding vs Folding Through Binding in Macrocyclic Peptide Inhibitors of Human Pancreatic α-Amylase. ACS Chem Biol 2019;14:1751-1759. [PMID: 31241898 PMCID: PMC6700688 DOI: 10.1021/acschembio.9b00290] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2019] [Accepted: 06/26/2019] [Indexed: 12/14/2022] Abstract De novo macrocyclic peptides, derived using selection technologies such as phage and mRNA display, present unique and unexpected solutions to challenging biological problems. This is due in part to their unusual folds, which are able to present side chains in ways not available to canonical structures such as α-helices and β-sheets. Despite much recent interest in these molecules, their folding and binding behavior remains poorly characterized. In this work, we present cocrystallization, docking, and solution NMR structures of three de novo macrocyclic peptides that all bind as competitive inhibitors with single-digit nanomolar Ki to the active site of human pancreatic α-amylase. We show that a short stably folded motif in one of these is nucleated by internal hydrophobic interactions in an otherwise dynamic conformation in solution. Comparison of the solution structures with a target-bound structure from docking indicates that stabilization of the bound conformation is provided through interactions with the target protein after binding. These three structures also reveal a surprising functional convergence to present a motif of a single arginine sandwiched between two aromatic residues in the interactions of the peptide with the key catalytic residues of the enzyme, despite little to no other structural homology. Our results suggest that intramolecular hydrophobic interactions are important for priming binding of small macrocyclic peptides to their target and that high rigidity is not necessary for high affinity. Collapse Key Words Collapse MESH Headings Catalytic Domain Crystallization Enzyme Inhibitors/metabolism Humans Molecular Docking Simulation Pancreatic alpha-Amylases/antagonists & inhibitors Pancreatic alpha-Amylases/chemistry Pancreatic alpha-Amylases/metabolism Peptides, Cyclic/metabolism Protein Binding Protein Conformation Protein Folding Collapse Grants P41 GM103393 NIGMS NIH HHS Collapse
45	Holo-like and Druggable Protein Conformations from Enhanced Sampling of Binding Pocket Volume and Shape. J Chem Inf Model 2019;59:1515-1528. [PMID: 30883122 DOI: 10.1021/acs.jcim.8b00730] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Abstract Understanding molecular recognition of small molecules by proteins in atomistic detail is key for drug design. Molecular docking is a widely used computational method to mimic ligand-protein association in silico. However, predicting conformational changes occurring in proteins upon ligand binding is still a major challenge. Ensemble docking approaches address this issue by considering a set of different conformations of the protein obtained either experimentally or from computer simulations, e.g., molecular dynamics. However, holo structures prone to host (the correct) ligands are generally poorly sampled by standard molecular dynamics simulations of the apo protein. In order to address this limitation, we introduce a computational approach based on metadynamics simulations called ensemble docking with enhanced sampling of pocket shape (EDES) that allows holo-like conformations of proteins to be generated by exploiting only their apo structures. This is achieved by defining a set of collective variables that effectively sample different shapes of the binding site, ultimately mimicking the steric effect due to the ligand. We assessed the method on three challenging proteins undergoing different extents of conformational changes upon ligand binding. In all cases our protocol generates a significant fraction of structures featuring a low RMSD from the experimental holo geometry. Moreover, ensemble docking calculations using those conformations yielded in all cases native-like poses among the top-ranked ones. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
46	West-Life: A Virtual Research Environment for structural biology. JOURNAL OF STRUCTURAL BIOLOGY-X 2019;1:100006. [PMID: 32647812 PMCID: PMC7337051 DOI: 10.1016/j.yjsbx.2019.100006] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Abstract • Data processing and data management services for structural biology. • Enhancements to existing web services for structure solution and analysis. • New pipelines to link these services into more complex higher-level workflows. • New data management facilities. • Making the benefits of European e-Infrastructures more accessible to structural biologists. The West-Life project (https://about.west-life.eu/) is a Horizon 2020 project funded by the European Commission to provide data processing and data management services for the international community of structural biologists, and in particular to support integrative experimental approaches within the field of structural biology. It has developed enhancements to existing web services for structure solution and analysis, created new pipelines to link these services into more complex higher-level workflows, and added new data management facilities. Through this work it has striven to make the benefits of European e-Infrastructures more accessible to life-science researchers in general and structural biologists in particular. Collapse Key Words Cloud computing Data management Grid computing Structural biology Virtual Research Environment Collapse MESH Headings Collapse Grants Collapse
47	Natural helix 9 mutants of PPARγ differently affect its transcriptional activity. Mol Metab 2019;20:115-127. [PMID: 30595551 PMCID: PMC6358588 DOI: 10.1016/j.molmet.2018.12.005] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/22/2018] [Revised: 12/05/2018] [Accepted: 12/11/2018] [Indexed: 12/17/2022] Open Abstract OBJECTIVE The nuclear receptor PPARγ is the master regulator of adipocyte differentiation, distribution, and function. In addition, PPARγ induces terminal differentiation of several epithelial cell lineages, including colon epithelia. Loss-of-function mutations in PPARG result in familial partial lipodystrophy subtype 3 (FPDL3), a rare condition characterized by aberrant adipose tissue distribution and severe metabolic complications, including diabetes. Mutations in PPARG have also been reported in sporadic colorectal cancers, but the significance of these mutations is unclear. Studying these natural PPARG mutations provides valuable insights into structure-function relationships in the PPARγ protein. We functionally characterized a novel FPLD3-associated PPARγ L451P mutation in helix 9 of the ligand binding domain (LBD). Interestingly, substitution of the adjacent amino acid K450 was previously reported in a human colon carcinoma cell line. METHODS We performed a detailed side-by-side functional comparison of these two PPARγ mutants. RESULTS PPARγ L451P shows multiple intermolecular defects, including impaired cofactor binding and reduced RXRα heterodimerisation and subsequent DNA binding, but not in DBD-LBD interdomain communication. The K450Q mutant displays none of these functional defects. Other colon cancer-associated PPARγ mutants displayed diverse phenotypes, ranging from complete loss of activity to wildtype activity. CONCLUSIONS Amino acid changes in helix 9 can differently affect LBD integrity and function. In addition, FPLD3-associated PPARγ mutations consistently cause intra- and/or intermolecular defects; colon cancer-associated PPARγ mutations on the other hand may play a role in colon cancer onset and progression, but this is not due to their effects on the most well-studied functional characteristics of PPARγ. Collapse Key Words Adipose tissue FPLD3 Lipodystrophy Nuclear receptor PPARG Collapse MESH Headings Adult Binding Sites Cell Line, Tumor Colorectal Neoplasms/genetics Colorectal Neoplasms/pathology Female HEK293 Cells Humans Lipodystrophy, Familial Partial/genetics Lipodystrophy, Familial Partial/pathology Mutation, Missense PPAR gamma/chemistry PPAR gamma/genetics PPAR gamma/metabolism Phenotype Protein Multimerization Collapse Grants Wellcome Trust Collapse
48	Finding the ΔΔ G spot: Are predictors of binding affinity changes upon mutations in protein–protein interactions ready for it? WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2019. [DOI: 10.1002/wcms.1410] [Citation(s) in RCA: 43] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Abstract Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
49	Protein-ligand pose and affinity prediction: Lessons from D3R Grand Challenge 3. J Comput Aided Mol Des 2019;33:83-91. [PMID: 30128928 PMCID: PMC6373529 DOI: 10.1007/s10822-018-0148-4] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2018] [Accepted: 08/09/2018] [Indexed: 12/30/2022] Abstract We report the performance of HADDOCK in the 2018 iteration of the Grand Challenge organised by the D3R consortium. Building on the findings of our participation in last year's challenge, we significantly improved our pose prediction protocol which resulted in a mean RMSD for the top scoring pose of 3.04 and 2.67 Å for the cross-docking and self-docking experiments respectively, which corresponds to an overall success rate of 63% and 71% when considering the top1 and top5 models respectively. This performance ranks HADDOCK as the 6th and 3rd best performing group (excluding multiple submissions from a same group) out of a total of 44 and 47 submissions respectively. Our ligand-based binding affinity predictor is the 3rd best predictor overall, behind only the two leading structure-based implementations, and the best ligand-based one with a Kendall's Tau correlation of 0.36 for the Cathepsin challenge. It also performed well in the classification part of the Kinase challenges, with Matthews Correlation Coefficients of 0.49 (ranked 1st), 0.39 (ranked 4th) and 0.21 (ranked 4th) for the JAK2, vEGFR2 and p38a targets respectively. Through our participation in last year's competition we came to the conclusion that template selection is of critical importance for the successful outcome of the docking. This year we have made improvements in two additional areas of importance: ligand conformer selection and initial positioning, which have been key to our excellent pose prediction performance this year. Collapse Key Words Binding affinity D3R Docking Drug Design Data Resource HADDOCK Ranking Collapse MESH Headings Binding Sites Cathepsins/chemistry Computer-Aided Design Crystallography, X-Ray Databases, Protein Drug Design Ligands Molecular Conformation Molecular Docking Simulation/methods Protein Binding Protein Kinases/chemistry Thermodynamics Collapse Grants European H2020 e-Infrastructure Nederlandse Organisatie voor Wetenschappelijk Onderzoek Collapse
50	pdb-tools: a swiss army knife for molecular structures. F1000Res 2018;7:1961. [PMID: 30705752 PMCID: PMC6343223 DOI: 10.12688/f1000research.17456.1] [Citation(s) in RCA: 64] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 12/12/2018] [Indexed: 11/20/2022] Open Abstract The pdb-tools are a collection of Python scripts for working with molecular structure data in the Protein Data Bank (PDB) format. They allow users to edit, convert, and validate PDB files, from the command-line, in a simple but efficient manner. The pdb-tools are implemented in Python, without any external dependencies, and are freely available under the open-source Apache License at https://github.com/haddocking/pdb-tools/ and on PyPI. Collapse Key Words PDB Python bioinformatics chemistry macromolecules protein structure structural biology Collapse MESH Headings Collapse Grants Collapse