1
|
ARCTIC-3D: automatic retrieval and clustering of interfaces in complexes from 3D structural information. Commun Biol 2024; 7:49. [PMID: 38184711 PMCID: PMC10771469 DOI: 10.1038/s42003-023-05718-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2023] [Accepted: 12/18/2023] [Indexed: 01/08/2024] Open
Abstract
The formation of a stable complex between proteins lies at the core of a wide variety of biological processes and has been the focus of countless experiments. The huge amount of information contained in the protein structural interactome in the Protein Data Bank can now be used to characterise and classify the existing biological interfaces. We here introduce ARCTIC-3D, a fast and user-friendly data mining and clustering software to retrieve data and rationalise the interface information associated with the protein input data. We demonstrate its use by various examples ranging from showing the increased interaction complexity of eukaryotic proteins, 20% of which on average have more than 3 different interfaces compared to only 10% for prokaryotes, to associating different functions to different interfaces. In the context of modelling biomolecular assemblies, we introduce the concept of "recognition entropy", related to the number of possible interfaces of the components of a protein-protein complex, which we demonstrate to correlate with the modelling difficulty in classical docking approaches. The identified interface clusters can also be used to generate various combinations of interface-specific restraints for integrative modelling. The ARCTIC-3D software is freely available at github.com/haddocking/arctic3d and can be accessed as a web-service at wenmr.science.uu.nl/arctic3d.
Collapse
|
2
|
DeepRank-GNN-esm: a graph neural network for scoring protein-protein models using protein language model. BIOINFORMATICS ADVANCES 2024; 4:vbad191. [PMID: 38213822 PMCID: PMC10782804 DOI: 10.1093/bioadv/vbad191] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/04/2023] [Revised: 12/19/2023] [Indexed: 01/13/2024]
Abstract
Motivation Protein-Protein interactions (PPIs) play critical roles in numerous cellular processes. By modelling the 3D structures of the correspond protein complexes valuable insights can be obtained, providing, e.g. starting points for drug and protein design. One challenge in the modelling process is however the identification of near-native models from the large pool of generated models. To this end we have previously developed DeepRank-GNN, a graph neural network that integrates structural and sequence information to enable effective pattern learning at PPI interfaces. Its main features are related to the Position Specific Scoring Matrices (PSSMs), which are computationally expensive to generate, significantly limits the algorithm's usability. Results We introduce here DeepRank-GNN-esm that includes as additional features protein language model embeddings from the ESM-2 model. We show that the ESM-2 embeddings can actually replace the PSSM features at no cost in-, or even better performance on two PPI-related tasks: scoring docking poses and detecting crystal artifacts. This new DeepRank version bypasses thus the need of generating PSSM, greatly improving the usability of the software and opening new application opportunities for systems for which PSSM profiles cannot be obtained or are irrelevant (e.g. antibody-antigen complexes). Availability and implementation DeepRank-GNN-esm is freely available from https://github.com/DeepRank/DeepRank-GNN-esm.
Collapse
|
3
|
Impact of AlphaFold on structure prediction of protein complexes: The CASP15-CAPRI experiment. Proteins 2023; 91:1658-1683. [PMID: 37905971 PMCID: PMC10841881 DOI: 10.1002/prot.26609] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Revised: 09/22/2023] [Accepted: 09/28/2023] [Indexed: 11/02/2023]
Abstract
We present the results for CAPRI Round 54, the 5th joint CASP-CAPRI protein assembly prediction challenge. The Round offered 37 targets, including 14 homodimers, 3 homo-trimers, 13 heterodimers including 3 antibody-antigen complexes, and 7 large assemblies. On average ~70 CASP and CAPRI predictor groups, including more than 20 automatics servers, submitted models for each target. A total of 21 941 models submitted by these groups and by 15 CAPRI scorer groups were evaluated using the CAPRI model quality measures and the DockQ score consolidating these measures. The prediction performance was quantified by a weighted score based on the number of models of acceptable quality or higher submitted by each group among their five best models. Results show substantial progress achieved across a significant fraction of the 60+ participating groups. High-quality models were produced for about 40% of the targets compared to 8% two years earlier. This remarkable improvement is due to the wide use of the AlphaFold2 and AlphaFold2-Multimer software and the confidence metrics they provide. Notably, expanded sampling of candidate solutions by manipulating these deep learning inference engines, enriching multiple sequence alignments, or integration of advanced modeling tools, enabled top performing groups to exceed the performance of a standard AlphaFold2-Multimer version used as a yard stick. This notwithstanding, performance remained poor for complexes with antibodies and nanobodies, where evolutionary relationships between the binding partners are lacking, and for complexes featuring conformational flexibility, clearly indicating that the prediction of protein complexes remains a challenging problem.
Collapse
|
4
|
Improving the quality of co-evolution intermolecular contact prediction with DisVis. Proteins 2023; 91:1407-1416. [PMID: 37237441 DOI: 10.1002/prot.26514] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Revised: 03/29/2023] [Accepted: 04/19/2023] [Indexed: 05/28/2023]
Abstract
The steep rise in protein sequences and structures has paved the way for bioinformatics approaches to predict residue-residue interactions in protein complexes. Multiple sequence alignments are commonly used in contact predictions to identify co-evolving residues. These contacts, however, often include false positives (FPs), which may impair their use to predict three dimensional structures of biomolecular complexes and affect the accuracy of the generated models. Previously, we have developed DisVis to identify FP in mass spectrometry cross-linking data. DisVis allows to assess the accessible interaction space between two proteins consistent with a set of distance restraints. Here, we investigate if a similar approach could be applied to co-evolution predicted contacts in order to improve their precision prior to using them for modeling. We analyze co-evolution contact predictions with DisVis for a set of 26 protein-protein complexes. The DisVis-reranked and the original co-evolution contacts are then used to model the complexes with our integrative docking software HADDOCK using different filtering scenarios. Our results show that HADDOCK is robust with respect to the precision of the predicted contacts due to the 50% random contact removal during docking and can enhance the quality of docking predictions when combined with DisVis filtering for low precision contact data. DisVis can thus have a beneficial effect on low quality data, but overall HADDOCK can accommodate FP restraints without negatively impacting the quality of the resulting models. Other more precision-sensitive docking protocols might, however, benefit from the increased precision of the predicted contacts after DisVis filtering.
Collapse
|
5
|
An antibiotic from an uncultured bacterium binds to an immutable target. Cell 2023; 186:4059-4073.e27. [PMID: 37611581 DOI: 10.1016/j.cell.2023.07.038] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2022] [Revised: 06/01/2023] [Accepted: 07/28/2023] [Indexed: 08/25/2023]
Abstract
Antimicrobial resistance is a leading mortality factor worldwide. Here, we report the discovery of clovibactin, an antibiotic isolated from uncultured soil bacteria. Clovibactin efficiently kills drug-resistant Gram-positive bacterial pathogens without detectable resistance. Using biochemical assays, solid-state nuclear magnetic resonance, and atomic force microscopy, we dissect its mode of action. Clovibactin blocks cell wall synthesis by targeting pyrophosphate of multiple essential peptidoglycan precursors (C55PP, lipid II, and lipid IIIWTA). Clovibactin uses an unusual hydrophobic interface to tightly wrap around pyrophosphate but bypasses the variable structural elements of precursors, accounting for the lack of resistance. Selective and efficient target binding is achieved by the sequestration of precursors into supramolecular fibrils that only form on bacterial membranes that contain lipid-anchored pyrophosphate groups. This potent antibiotic holds the promise of enabling the design of improved therapeutics that kill bacterial pathogens without resistance development.
Collapse
|
6
|
MetaScore: A Novel Machine-Learning-Based Approach to Improve Traditional Scoring Functions for Scoring Protein-Protein Docking Conformations. Biomolecules 2023; 13:121. [PMID: 36671507 PMCID: PMC9855734 DOI: 10.3390/biom13010121] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2022] [Revised: 12/22/2022] [Accepted: 12/26/2022] [Indexed: 01/11/2023] Open
Abstract
Protein-protein interactions play a ubiquitous role in biological function. Knowledge of the three-dimensional (3D) structures of the complexes they form is essential for understanding the structural basis of those interactions and how they orchestrate key cellular processes. Computational docking has become an indispensable alternative to the expensive and time-consuming experimental approaches for determining the 3D structures of protein complexes. Despite recent progress, identifying near-native models from a large set of conformations sampled by docking-the so-called scoring problem-still has considerable room for improvement. We present MetaScore, a new machine-learning-based approach to improve the scoring of docked conformations. MetaScore utilizes a random forest (RF) classifier trained to distinguish near-native from non-native conformations using their protein-protein interfacial features. The features include physicochemical properties, energy terms, interaction-propensity-based features, geometric properties, interface topology features, evolutionary conservation, and also scores produced by traditional scoring functions (SFs). MetaScore scores docked conformations by simply averaging the score produced by the RF classifier with that produced by any traditional SF. We demonstrate that (i) MetaScore consistently outperforms each of the nine traditional SFs included in this work in terms of success rate and hit rate evaluated over conformations ranked among the top 10; (ii) an ensemble method, MetaScore-Ensemble, that combines 10 variants of MetaScore obtained by combining the RF score with each of the traditional SFs outperforms each of the MetaScore variants. We conclude that the performance of traditional SFs can be improved upon by using machine learning to judiciously leverage protein-protein interfacial features and by using ensemble methods to combine multiple scoring functions.
Collapse
|
7
|
Information-Driven Antibody-Antigen Modelling with HADDOCK. Methods Mol Biol 2023; 2552:267-282. [PMID: 36346597 DOI: 10.1007/978-1-0716-2609-2_14] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
In the recent years, therapeutic use of antibodies has seen a huge growth, "due to their inherent proprieties and technological advances in the methods used to study and characterize them. Effective design and engineering of antibodies for therapeutic purposes are heavily dependent on knowledge of the structural principles that regulate antibody-antigen interactions. Several experimental techniques such as X-ray crystallography, cryo-electron microscopy, NMR, or mutagenesis analysis can be applied, but these are usually expensive and time-consuming. Therefore computational approaches like molecular docking may offer a valuable alternative for the characterization of antibody-antigen complexes.Here we describe a protocol for the prediction of the 3D structure of antibody-antigen complexes using the integrative modelling platform HADDOCK. The protocol consists of (1) the identification of the antibody residues belonging to the hypervariable loops which are known to be crucial for the binding and can be used to guide the docking and (2) the detailed steps to perform docking with the HADDOCK 2.4 webserver following different strategies depending on the availability of information about epitope residues.
Collapse
|
8
|
DeepRank-GNN: a graph neural network framework to learn patterns in protein-protein interfaces. Bioinformatics 2022; 39:6845451. [PMID: 36420989 PMCID: PMC9805592 DOI: 10.1093/bioinformatics/btac759] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2021] [Revised: 10/19/2022] [Accepted: 11/23/2022] [Indexed: 11/25/2022] Open
Abstract
MOTIVATION Gaining structural insights into the protein-protein interactome is essential to understand biological phenomena and extract knowledge for rational drug design or protein engineering. We have previously developed DeepRank, a deep-learning framework to facilitate pattern learning from protein-protein interfaces using convolutional neural network (CNN) approaches. However, CNN is not rotation invariant and data augmentation is required to desensitize the network to the input data orientation which dramatically impairs the computation performance. Representing protein-protein complexes as atomic- or residue-scale rotation invariant graphs instead enables using graph neural networks (GNN) approaches, bypassing those limitations. RESULTS We have developed DeepRank-GNN, a framework that converts protein-protein interfaces from PDB 3D coordinates files into graphs that are further provided to a pre-defined or user-defined GNN architecture to learn problem-specific interaction patterns. DeepRank-GNN is designed to be highly modularizable, easily customized and is wrapped into a user-friendly python3 package. Here, we showcase DeepRank-GNN's performance on two applications using a dedicated graph interaction neural network: (i) the scoring of docking poses and (ii) the discriminating of biological and crystal interfaces. In addition to the highly competitive performance obtained in those tasks as compared to state-of-the-art methods, we show a significant improvement in speed and storage requirement using DeepRank-GNN as compared to DeepRank. AVAILABILITY AND IMPLEMENTATION DeepRank-GNN is freely available from https://github.com/DeepRank/DeepRank-GNN. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
|
9
|
Abstract
Many pathogens exploit host cell-surface glycans. However, precise analyses of glycan ligands binding with heavily modified pathogen proteins can be confounded by overlapping sugar signals and/or compounded with known experimental constraints. Universal saturation transfer analysis (uSTA) builds on existing nuclear magnetic resonance spectroscopy to provide an automated workflow for quantitating protein-ligand interactions. uSTA reveals that early-pandemic, B-origin-lineage severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) spike trimer binds sialoside sugars in an "end-on" manner. uSTA-guided modeling and a high-resolution cryo-electron microscopy structure implicate the spike N-terminal domain (NTD) and confirm end-on binding. This finding rationalizes the effect of NTD mutations that abolish sugar binding in SARS-CoV-2 variants of concern. Together with genetic variance analyses in early pandemic patient cohorts, this binding implicates a sialylated polylactosamine motif found on tetraantennary N-linked glycoproteins deep in the human lung as potentially relevant to virulence and/or zoonosis.
Collapse
|
10
|
Abstract
An emerging class of therapeutic molecules are cyclic peptides with over 40 cyclic peptide drugs currently in clinical use. Their mode of action is, however, not fully understood, impeding rational drug design. Computational techniques could positively impact their design, but modeling them and their interactions remains challenging due to their cyclic nature and their flexibility. This study presents a step-by-step protocol for generating cyclic peptide conformations and docking them to their protein target using HADDOCK2.4. A dataset of 30 cyclic peptide-protein complexes was used to optimize both cyclization and docking protocols. It supports peptides cyclized via an N- and C-terminus peptide bond and/or a disulfide bond. An ensemble of cyclic peptide conformations is then used in HADDOCK to dock them onto their target protein using knowledge of the binding site on the protein side to drive the modeling. The presented protocol predicts at least one acceptable model according to the critical assessment of prediction of interaction criteria for each complex of the dataset when the top 10 HADDOCK-ranked single structures are considered (100% success rate top 10) both in the bound and unbound docking scenarios. Moreover, its performance in both bound and fully unbound docking is similar to the state-of-the-art software in the field, Autodock CrankPep. The presented cyclization and docking protocol should make HADDOCK a valuable tool for rational cyclic peptide-based drug design and high-throughput screening.
Collapse
|
11
|
Interface refinement of low- to medium-resolution Cryo-EM complexes using HADDOCK2.4. Structure 2022; 30:476-484.e3. [PMID: 35216656 DOI: 10.1016/j.str.2022.02.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2021] [Revised: 11/25/2021] [Accepted: 01/28/2022] [Indexed: 10/19/2022]
Abstract
A wide range of cellular processes requires the formation of multimeric protein complexes. The rise of cryo-electron microscopy (cryo-EM) has enabled the structural characterization of these protein assemblies. The density maps produced can, however, still suffer from limited resolution, impeding the process of resolving structures at atomic resolution. In order to solve this issue, monomers can be fitted into low- to medium-resolution maps. Unfortunately, the models produced frequently contain atomic clashes at the protein-protein interfaces (PPIs), as intermolecular interactions are typically not considered during monomer fitting. Here, we present a refinement approach based on HADDOCK2.4 to remove intermolecular clashes and optimize PPIs. A dataset of 14 cryo-EM complexes was used to test eight protocols. The best-performing protocol, consisting of a semi-flexible simulated annealing refinement with centroid restraints on the monomers, was able to decrease intermolecular atomic clashes by 98% without significantly deteriorating the quality of the cryo-EM density fit.
Collapse
|
12
|
DeepRank: a deep learning framework for data mining 3D protein-protein interfaces. Nat Commun 2021; 12:7068. [PMID: 34862392 PMCID: PMC8642403 DOI: 10.1038/s41467-021-27396-0] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2021] [Accepted: 11/12/2021] [Indexed: 11/08/2022] Open
Abstract
Three-dimensional (3D) structures of protein complexes provide fundamental information to decipher biological processes at the molecular scale. The vast amount of experimentally and computationally resolved protein-protein interfaces (PPIs) offers the possibility of training deep learning models to aid the predictions of their biological relevance. We present here DeepRank, a general, configurable deep learning framework for data mining PPIs using 3D convolutional neural networks (CNNs). DeepRank maps features of PPIs onto 3D grids and trains a user-specified CNN on these 3D grids. DeepRank allows for efficient training of 3D CNNs with data sets containing millions of PPIs and supports both classification and regression. We demonstrate the performance of DeepRank on two distinct challenges: The classification of biological versus crystallographic PPIs, and the ranking of docking models. For both problems DeepRank is competitive with, or outperforms, state-of-the-art methods, demonstrating the versatility of the framework for research in structural biology.
Collapse
|
13
|
Prediction of protein assemblies, the next frontier: The CASP14-CAPRI experiment. Proteins 2021; 89:1800-1823. [PMID: 34453465 PMCID: PMC8616814 DOI: 10.1002/prot.26222] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2021] [Revised: 07/24/2021] [Accepted: 08/05/2021] [Indexed: 12/19/2022]
Abstract
We present the results for CAPRI Round 50, the fourth joint CASP-CAPRI protein assembly prediction challenge. The Round comprised a total of twelve targets, including six dimers, three trimers, and three higher-order oligomers. Four of these were easy targets, for which good structural templates were available either for the full assembly, or for the main interfaces (of the higher-order oligomers). Eight were difficult targets for which only distantly related templates were found for the individual subunits. Twenty-five CAPRI groups including eight automatic servers submitted ~1250 models per target. Twenty groups including six servers participated in the CAPRI scoring challenge submitted ~190 models per target. The accuracy of the predicted models was evaluated using the classical CAPRI criteria. The prediction performance was measured by a weighted scoring scheme that takes into account the number of models of acceptable quality or higher submitted by each group as part of their five top-ranking models. Compared to the previous CASP-CAPRI challenge, top performing groups submitted such models for a larger fraction (70-75%) of the targets in this Round, but fewer of these models were of high accuracy. Scorer groups achieved stronger performance with more groups submitting correct models for 70-80% of the targets or achieving high accuracy predictions. Servers performed less well in general, except for the MDOCKPP and LZERD servers, who performed on par with human groups. In addition to these results, major advances in methodology are discussed, providing an informative overview of where the prediction of protein assemblies currently stands.
Collapse
|
14
|
Emergence and spread of SARS-CoV-2 lineage B.1.620 with variant of concern-like mutations and deletions. Nat Commun 2021; 12:5769. [PMID: 34599175 PMCID: PMC8486757 DOI: 10.1038/s41467-021-26055-8] [Citation(s) in RCA: 36] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2021] [Accepted: 09/15/2021] [Indexed: 12/12/2022] Open
Abstract
Distinct SARS-CoV-2 lineages, discovered through various genomic surveillance initiatives, have emerged during the pandemic following unprecedented reductions in worldwide human mobility. We here describe a SARS-CoV-2 lineage - designated B.1.620 - discovered in Lithuania and carrying many mutations and deletions in the spike protein shared with widespread variants of concern (VOCs), including E484K, S477N and deletions HV69Δ, Y144Δ, and LLA241/243Δ. As well as documenting the suite of mutations this lineage carries, we also describe its potential to be resistant to neutralising antibodies, accompanying travel histories for a subset of European cases, evidence of local B.1.620 transmission in Europe with a focus on Lithuania, and significance of its prevalence in Central Africa owing to recent genome sequencing efforts there. We make a case for its likely Central African origin using advanced phylogeographic inference methodologies incorporating recorded travel histories of infected travellers.
Collapse
|
15
|
Shape-Restrained Modeling of Protein-Small-Molecule Complexes with High Ambiguity Driven DOCKing. J Chem Inf Model 2021; 61:4807-4818. [PMID: 34436890 PMCID: PMC8479858 DOI: 10.1021/acs.jcim.1c00796] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
Small-molecule docking remains one of the most valuable computational techniques for the structure prediction of protein-small-molecule complexes. It allows us to study the interactions between compounds and the protein receptors they target at atomic detail in a timely and efficient manner. Here, we present a new protocol in HADDOCK (High Ambiguity Driven DOCKing), our integrative modeling platform, which incorporates homology information for both receptor and compounds. It makes use of HADDOCK's unique ability to integrate information in the simulation to drive it toward conformations, which agree with the provided data. The focal point is the use of shape restraints derived from homologous compounds bound to the target receptors. We have developed two protocols: in the first, the shape is composed of dummy atom beads based on the position of the heavy atoms of the homologous template compound, whereas in the second, the shape is additionally annotated with pharmacophore data for some or all beads. For both protocols, ambiguous distance restraints are subsequently defined between those beads and the heavy atoms of the ligand to be docked. We have benchmarked the performance of these protocols with a fully unbound version of the widely used DUD-E (Database of Useful Decoys-Enhanced) dataset. In this unbound docking scenario, our template/shape-based docking protocol reaches an overall success rate of 81% when a reliable template can be identified (which was the case for 99 out of 102 complexes in the DUD-E dataset), which is close to the best results reported for bound docking on the DUD-E dataset.
Collapse
|
16
|
Native or Non-Native Protein-Protein Docking Models? Molecular Dynamics to the Rescue. J Chem Theory Comput 2021; 17:5944-5954. [PMID: 34342983 PMCID: PMC8444332 DOI: 10.1021/acs.jctc.1c00336] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2021] [Indexed: 11/29/2022]
Abstract
Molecular docking excels at creating a plethora of potential models of protein-protein complexes. To correctly distinguish the favorable, native-like models from the remaining ones remains, however, a challenge. We assessed here if a protocol based on molecular dynamics (MD) simulations would allow distinguishing native from non-native models to complement scoring functions used in docking. To this end, the first models for 25 protein-protein complexes were generated using HADDOCK. Next, MD simulations complemented with machine learning were used to discriminate between native and non-native complexes based on a combination of metrics reporting on the stability of the initial models. Native models showed higher stability in almost all measured properties, including the key ones used for scoring in the Critical Assessment of PRedicted Interaction (CAPRI) competition, namely the positional root mean square deviations and fraction of native contacts from the initial docked model. A random forest classifier was trained, reaching a 0.85 accuracy in correctly distinguishing native from non-native complexes. Reasonably modest simulation lengths of the order of 50-100 ns are sufficient to reach this accuracy, which makes this approach applicable in practice.
Collapse
|
17
|
Structural Biology in the Clouds: The WeNMR-EOSC Ecosystem. Front Mol Biosci 2021; 8:729513. [PMID: 34395534 PMCID: PMC8356364 DOI: 10.3389/fmolb.2021.729513] [Citation(s) in RCA: 252] [Impact Index Per Article: 84.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2021] [Accepted: 07/13/2021] [Indexed: 12/05/2022] Open
Abstract
Structural biology aims at characterizing the structural and dynamic properties of biological macromolecules at atomic details. Gaining insight into three dimensional structures of biomolecules and their interactions is critical for understanding the vast majority of cellular processes, with direct applications in health and food sciences. Since 2010, the WeNMR project (www.wenmr.eu) has implemented numerous web-based services to facilitate the use of advanced computational tools by researchers in the field, using the high throughput computing infrastructure provided by EGI. These services have been further developed in subsequent initiatives under H2020 projects and are now operating as Thematic Services in the European Open Science Cloud portal (www.eosc-portal.eu), sending >12 millions of jobs and using around 4,000 CPU-years per year. Here we review 10 years of successful e-infrastructure solutions serving a large worldwide community of over 23,000 users to date, providing them with user-friendly, web-based solutions that run complex workflows in structural biology. The current set of active WeNMR portals are described, together with the complex backend machinery that allows distributed computing resources to be harvested efficiently.
Collapse
|
18
|
|
19
|
Integrating quantitative proteomics with accurate genome profiling of transcription factors by greenCUT&RUN. Nucleic Acids Res 2021; 49:e49. [PMID: 33524153 PMCID: PMC8136828 DOI: 10.1093/nar/gkab038] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2020] [Revised: 01/08/2021] [Accepted: 01/16/2021] [Indexed: 11/14/2022] Open
Abstract
Genome-wide localization of chromatin and transcription regulators can be detected by a variety of techniques. Here, we describe a novel method ‘greenCUT&RUN’ for genome-wide profiling of transcription regulators, which has a very high sensitivity, resolution, accuracy and reproducibility, whilst assuring specificity. Our strategy begins with tagging of the protein of interest with GFP and utilizes a GFP-specific nanobody fused to MNase to profile genome-wide binding events. By using a GFP-nanobody the greenCUT&RUN approach eliminates antibody dependency and variability. Robust genomic profiles were obtained with greenCUT&RUN, which are accurate and unbiased towards open chromatin. By integrating greenCUT&RUN with nanobody-based affinity purification mass spectrometry, ‘piggy-back’ DNA binding events can be identified on a genomic scale. The unique design of greenCUT&RUN grants target protein flexibility and yields high resolution footprints. In addition, greenCUT&RUN allows rapid profiling of mutants of chromatin and transcription proteins. In conclusion, greenCUT&RUN is a widely applicable and versatile genome-mapping technique.
Collapse
|
20
|
Information-driven modeling of biomolecular complexes. Curr Opin Struct Biol 2021; 70:70-77. [PMID: 34139639 DOI: 10.1016/j.sbi.2021.05.003] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2021] [Accepted: 05/10/2021] [Indexed: 11/15/2022]
Abstract
Proteins play crucial roles in every cellular process by interacting with each other, nucleic acids, metabolites, and other molecules. The resulting assemblies can be very large and intricate and pose challenges to experimental methods. In the current era of integrative modeling, it is often only by a combination of various experimental techniques and computations that three-dimensional models of those molecular machines can be obtained. Among the various computational approaches available, molecular docking is often the method of choice when it comes to predicting three-dimensional structures of complexes. Docking can generate particularly accurate models when taking into account the available information on the complex of interest. We review here the use of experimental and bioinformatics data in protein-protein docking, describing recent software developments and highlighting applications for the modeling of antibody-antigen complexes and membrane protein complexes, and the use of evolutionary and shape information.
Collapse
|
21
|
MENSAdb: a thorough structural analysis of membrane protein dimers. Database (Oxford) 2021; 2021:baab013. [PMID: 33822911 PMCID: PMC8023553 DOI: 10.1093/database/baab013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2020] [Revised: 01/19/2021] [Accepted: 03/01/2021] [Indexed: 11/14/2022]
Abstract
Membrane proteins (MPs) are key players in a variety of different cellular processes and constitute the target of around 60% of all Food and Drug Administration-approved drugs. Despite their importance, there is still a massive lack of relevant structural, biochemical and mechanistic information mainly due to their localization within the lipid bilayer. To help fulfil this gap, we developed the MEmbrane protein dimer Novel Structure Analyser database (MENSAdb). This interactive web application summarizes the evolutionary and physicochemical properties of dimeric MPs to expand the available knowledge on the fundamental principles underlying their formation. Currently, MENSAdb contains features of 167 unique MPs (63% homo- and 37% heterodimers) and brings insights into the conservation of residues, accessible solvent area descriptors, average B-factors, intermolecular contacts at 2.5 Å and 4.0 Å distance cut-offs, hydrophobic contacts, hydrogen bonds, salt bridges, π-π stacking, T-stacking and cation-π interactions. The regular update and organization of all these data into a unique platform will allow a broad community of researchers to collect and analyse a large number of features efficiently, thus facilitating their use in the development of prediction models associated with MPs. Database URL: http://www.moreiralab.com/resources/mensadb.
Collapse
|
22
|
proABC-2: PRediction of AntiBody contacts v2 and its application to information-driven docking. Bioinformatics 2021; 36:5107-5108. [PMID: 32683441 PMCID: PMC7755408 DOI: 10.1093/bioinformatics/btaa644] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2020] [Revised: 06/10/2020] [Accepted: 07/13/2020] [Indexed: 11/26/2022] Open
Abstract
Motivation Monoclonal antibodies are essential tools in the contemporary therapeutic armory. Understanding how these recognize their antigen is a fundamental step in their rational design and engineering. The rising amount of publicly available data is catalyzing the development of computational approaches able to offer valuable, faster and cheaper alternatives to classical experimental methodologies used for the study of antibody–antigen complexes. Results Here, we present proABC-2, an update of the original random-forest antibody paratope predictor, based on a convolutional neural network algorithm. We also demonstrate how the predictions can be fruitfully used to drive the docking in HADDOCK. Availability and implementation The proABC-2 server is freely available at: https://wenmr.science.uu.nl/proabc2/. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
|
23
|
Characterization of nucleosome sediments for protein interaction studies by solid-state NMR spectroscopy. MAGNETIC RESONANCE (GOTTINGEN, GERMANY) 2021; 2:187-202. [PMID: 35647606 PMCID: PMC9135053 DOI: 10.5194/mr-2-187-2021] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/25/2023]
Abstract
Regulation of DNA-templated processes such as gene transcription and DNA repair depend on the interaction of a wide range of proteins with the nucleosome, the fundamental building block of chromatin. Both solution and solid-state NMR spectroscopy have become an attractive approach to study the dynamics and interactions of nucleosomes, despite their high molecular weight of ~ 200 kDa. For solid-state NMR (ssNMR) studies, dilute solutions of nucleosomes are converted to a dense phase by sedimentation or precipitation. Since nucleosomes are known to self-associate, these dense phases may induce extensive interactions between nucleosomes, which could interfere with protein-binding studies. Here, we characterized the packing of nucleosomes in the dense phase created by sedimentation using NMR and small-angle X-ray scattering (SAXS) experiments. We found that nucleosome sediments are gels with variable degrees of solidity, have nucleosome concentration close to that found in crystals, and are stable for weeks under high-speed magic angle spinning (MAS). Furthermore, SAXS data recorded on recovered sediments indicate that there is no pronounced long-range ordering of nucleosomes in the sediment. Finally, we show that the sedimentation approach can also be used to study low-affinity protein interactions with the nucleosome. Together, our results give new insights into the sample characteristics of nucleosome sediments for ssNMR studies and illustrate the broad applicability of sedimentation-based NMR studies.
Collapse
|
24
|
Control over the fibrillization yield by varying the oligomeric nucleation propensities of self-assembling peptides. Commun Chem 2020; 3:164. [PMID: 36703336 PMCID: PMC9814929 DOI: 10.1038/s42004-020-00417-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2020] [Accepted: 10/19/2020] [Indexed: 01/29/2023] Open
Abstract
Self-assembling peptides are an exemplary class of supramolecular biomaterials of broad biomedical utility. Mechanistic studies on the peptide self-assembly demonstrated the importance of the oligomeric intermediates towards the properties of the supramolecular biomaterials being formed. In this study, we demonstrate how the overall yield of the supramolecular assemblies are moderated through subtle molecular changes in the peptide monomers. This strategy is exemplified with a set of surfactant-like peptides (SLPs) with different β-sheet propensities and charged residues flanking the aggregation domains. By integrating different techniques, we show that these molecular changes can alter both the nucleation propensity of the oligomeric intermediates and the thermodynamic stability of the fibril structures. We demonstrate that the amount of assembled nanofibers are critically defined by the oligomeric nucleation propensities. Our findings offer guidance on designing self-assembling peptides for different biomedical applications, as well as insights into the role of protein gatekeeper sequences in preventing amyloidosis.
Collapse
|
25
|
PDB-tools web: A user-friendly interface for the manipulation of PDB files. Proteins 2020; 89:330-335. [PMID: 33111403 PMCID: PMC7855443 DOI: 10.1002/prot.26018] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2020] [Revised: 10/20/2020] [Accepted: 10/26/2020] [Indexed: 01/06/2023]
Abstract
The Protein Data Bank (PDB) file format remains a popular format used and supported by many software to represent coordinates of macromolecular structures. It however suffers from drawbacks such as error‐prone manual editing. Because of that, various software toolkits have been developed to facilitate its editing and manipulation, but, to date, there is no online tool available for this purpose. Here we present PDB‐Tools Web, a flexible online service for manipulating PDB files. It offers a rich and user‐friendly graphical user interface that allows users to mix‐and‐match more than 40 individual tools from the pdb‐tools suite. Those can be combined in a few clicks to perform complex pipelines, which can be saved and uploaded. The resulting processed PDB files can be visualized online and downloaded. The web server is freely available at https://wenmr.science.uu.nl/pdbtools.
Collapse
|
26
|
A click-flipped enzyme substrate boosts the performance of the diagnostic screening for Hunter syndrome. Chem Sci 2020; 11:12671-12676. [PMID: 34094461 PMCID: PMC8163285 DOI: 10.1039/d0sc04696e] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2020] [Accepted: 10/23/2020] [Indexed: 11/23/2022] Open
Abstract
We report on the unexpected finding that click modification of iduronyl azides results in a conformational flip of the pyranose ring, which led to the development of a new strategy for the design of superior enzyme substrates for the diagnostic assaying of iduronate-2-sulfatase (I2S), a lysosomal enzyme related to Hunter syndrome. Synthetic substrates are essential in testing newborns for metabolic disorders to enable early initiation of therapy. Our click-flipped iduronyl triazole showed a remarkably better performance with I2S than commonly used O-iduronates. We found that both O- and triazole-linked substrates are accepted by the enzyme, irrespective of their different conformations, but only the O-linked product inhibits the activity of I2S. Thus, in the long reaction times required for clinical assays, the triazole substrate substantially outperforms the O-iduronate. Applying our click-flipped substrate to assay I2S in dried blood spots sampled from affected patients and random newborns significantly increased the confidence in discriminating between these groups, clearly indicating the potential of the click-flip strategy to control the biomolecular function of carbohydrates.
Collapse
|
27
|
Understanding Docking Complexes of Macromolecules Using HADDOCK: The Synergy between Experimental Data and Computations. Bio Protoc 2020; 10:e3793. [PMID: 33659447 PMCID: PMC7842552 DOI: 10.21769/bioprotoc.3793] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2019] [Revised: 08/17/2020] [Accepted: 09/01/2020] [Indexed: 11/02/2022] Open
Abstract
This protocol illustrates the modelling of a protein-peptide complex using the synergic combination of in silico analysis and experimental results. To this end, we use the integrative modelling software HADDOCK, which possesses the powerful ability to incorporate experimental data, such as NMR Chemical Shift Perturbations and biochemical protein-peptide interaction data, as restraints to guide the docking process. Based on the modelling results, a rational mutagenesis approach is used to validate the generated models. The experimental results allow to select a final structural model best representing the bona fide protein-peptide complex. The described protocol can also be applied to model protein-protein complexes. There is no size limit for the macromolecular complexes that can be characterized by HADDOCK as long as the 3D structures of the individual components are available.
Collapse
|
28
|
Computational approaches to therapeutic antibody design: established methods and emerging trends. Brief Bioinform 2020; 21:1549-1567. [PMID: 31626279 PMCID: PMC7947987 DOI: 10.1093/bib/bbz095] [Citation(s) in RCA: 101] [Impact Index Per Article: 25.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2019] [Revised: 06/07/2019] [Accepted: 07/05/2019] [Indexed: 12/31/2022] Open
Abstract
Antibodies are proteins that recognize the molecular surfaces of potentially noxious molecules to mount an adaptive immune response or, in the case of autoimmune diseases, molecules that are part of healthy cells and tissues. Due to their binding versatility, antibodies are currently the largest class of biotherapeutics, with five monoclonal antibodies ranked in the top 10 blockbuster drugs. Computational advances in protein modelling and design can have a tangible impact on antibody-based therapeutic development. Antibody-specific computational protocols currently benefit from an increasing volume of data provided by next generation sequencing and application to related drug modalities based on traditional antibodies, such as nanobodies. Here we present a structured overview of available databases, methods and emerging trends in computational antibody analysis and contextualize them towards the engineering of candidate antibody therapeutics.
Collapse
|
29
|
LightDock goes information-driven. Bioinformatics 2020; 36:950-952. [PMID: 31418773 PMCID: PMC7005597 DOI: 10.1093/bioinformatics/btz642] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2019] [Revised: 08/09/2019] [Accepted: 08/14/2019] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION The use of experimental information has been demonstrated to increase the success rate of computational macromolecular docking. Many methods use information to post-filter the simulation output while others drive the simulation based on experimental restraints, which can become problematic for more complex scenarios such as multiple binding interfaces. RESULTS We present a novel method for including interface information into protein docking simulations within the LightDock framework. Prior to the simulation, irrelevant regions from the receptor are excluded for sampling (filter of initial swarms) and initial ligand poses are pre-oriented based on ligand input information. We demonstrate the applicability of this approach on the new 55 cases of the Protein-Protein Docking Benchmark 5, using different amounts of information. Even with incomplete or incorrect information, a significant improvement in performance is obtained compared to blind ab initio docking. AVAILABILITY AND IMPLEMENTATION The software is supported and freely available from https://github.com/brianjimenez/lightdock and analysis data from https://github.com/brianjimenez/lightdock_bm5. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
|
30
|
Editorial: Multiscale Modeling From Macromolecules to Cell: Opportunities and Challenges of Biomolecular Simulations. Front Mol Biosci 2020; 7:194. [PMID: 33005628 PMCID: PMC7484804 DOI: 10.3389/fmolb.2020.00194] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Key Words] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2020] [Accepted: 07/21/2020] [Indexed: 01/29/2023] Open
|
31
|
Inhibition of the integrated stress response by viral proteins that block p-eIF2-eIF2B association. Nat Microbiol 2020; 5:1361-1373. [PMID: 32690955 DOI: 10.1038/s41564-020-0759-0] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2019] [Accepted: 06/22/2020] [Indexed: 11/09/2022]
Abstract
Eukaryotic cells, when exposed to environmental or internal stress, activate the integrated stress response (ISR) to restore homeostasis and promote cell survival. Specific stress stimuli prompt dedicated stress kinases to phosphorylate eukaryotic initiation factor 2 (eIF2). Phosphorylated eIF2 (p-eIF2) in turn sequesters the eIF2-specific guanine exchange factor eIF2B to block eIF2 recycling, thereby halting translation initiation and reducing global protein synthesis. To circumvent stress-induced translational shutdown, viruses encode ISR antagonists. Those identified so far prevent or reverse eIF2 phosphorylation. We now describe two viral proteins-one from a coronavirus and the other from a picornavirus-that have independently acquired the ability to counteract the ISR at its very core by acting as a competitive inhibitor of p-eIF2-eIF2B interaction. This allows continued formation of the eIF2-GTP-Met-tRNAi ternary complex and unabated global translation at high p-eIF2 levels that would otherwise cause translational arrest. We conclude that eIF2 and p-eIF2 differ in their interaction with eIF2B to such effect that p-eIF2-eIF2B association can be selectively inhibited.
Collapse
|
32
|
Large-scale prediction of binding affinity in protein-small ligand complexes: the PRODIGY-LIG web server. Bioinformatics 2020; 35:1585-1587. [PMID: 31051038 DOI: 10.1093/bioinformatics/bty816] [Citation(s) in RCA: 105] [Impact Index Per Article: 26.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2018] [Revised: 08/15/2018] [Accepted: 09/19/2018] [Indexed: 11/14/2022] Open
Abstract
SUMMARY Recently we published PROtein binDIng enerGY (PRODIGY), a web-server for the prediction of binding affinity in protein-protein complexes. By using a combination of simple structural properties, such as the residue-contacts made at the interface, PRODIGY has demonstrated a top performance compared with other state-of-the-art predictors in the literature. Here we present an extension of it, named PRODIGY-LIG, aimed at the prediction of affinity in protein-small ligand complexes. The predictive method, properly readapted for small ligand by making use of atomic instead of residue contacts, has been successfully applied for the blind prediction of 102 protein-ligand complexes during the D3R Grand Challenge 2. PRODIGY-LIG has the advantage of being simple, generic and applicable to any kind of protein-ligand complex. It provides an automatic, fast and user-friendly tool ensuring broad accessibility. AVAILABILITY AND IMPLEMENTATION PRODIGY-LIG is freely available without registration requirements at http://milou.science.uu.nl/services/PRODIGY-LIG.
Collapse
|
33
|
Abstract
Structural bioinformatics provides the scientific methods and tools to analyse, archive, validate, and present the biomolecular structure data generated by the structural biology community. It also provides an important link with the genomics community, as structural bioinformaticians also use the extensive sequence data to predict protein structures and their functional sites. A very broad and active community of structural bioinformaticians exists across Europe, and 3D-Bioinfo will establish formal platforms to address their needs and better integrate their activities and initiatives. Our mission will be to strengthen the ties with the structural biology research communities in Europe covering life sciences, as well as chemistry and physics and to bridge the gap between these researchers in order to fully realize the potential of structural bioinformatics. Our Community will also undertake dedicated educational, training and outreach efforts to facilitate this, bringing new insights and thus facilitating the development of much needed innovative applications e.g. for human health, drug and protein design. Our combined efforts will be of critical importance to keep the European research efforts competitive in this respect. Here we highlight the major European contributions to the field of structural bioinformatics, the most pressing challenges remaining and how Europe-wide interactions, enabled by ELIXIR and its platforms, will help in addressing these challenges and in coordinating structural bioinformatics resources across Europe. In particular, we present recent activities and future plans to consolidate an ELIXIR 3D-Bioinfo Community in structural bioinformatics and propose means to develop better links across the community. These include building new consortia, organising workshops to establish data standards and seeking community agreement on benchmark data sets and strategies. We also highlight existing and planned collaborations with other ELIXIR Communities and other European infrastructures, such as the structural biology community supported by Instruct-ERIC, with whom we have synergies and overlapping common interests.
Collapse
|
34
|
iScore: a novel graph kernel-based function for scoring protein-protein docking models. Bioinformatics 2020; 36:112-121. [PMID: 31199455 PMCID: PMC6956772 DOI: 10.1093/bioinformatics/btz496] [Citation(s) in RCA: 42] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2018] [Revised: 05/08/2019] [Accepted: 06/11/2019] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION Protein complexes play critical roles in many aspects of biological functions. Three-dimensional (3D) structures of protein complexes are critical for gaining insights into structural bases of interactions and their roles in the biomolecular pathways that orchestrate key cellular processes. Because of the expense and effort associated with experimental determinations of 3D protein complex structures, computational docking has evolved as a valuable tool to predict 3D structures of biomolecular complexes. Despite recent progress, reliably distinguishing near-native docking conformations from a large number of candidate conformations, the so-called scoring problem, remains a major challenge. RESULTS Here we present iScore, a novel approach to scoring docked conformations that combines HADDOCK energy terms with a score obtained using a graph representation of the protein-protein interfaces and a measure of evolutionary conservation. It achieves a scoring performance competitive with, or superior to, that of state-of-the-art scoring functions on two independent datasets: (i) Docking software-specific models and (ii) the CAPRI score set generated by a wide variety of docking approaches (i.e. docking software-non-specific). iScore ranks among the top scoring approaches on the CAPRI score set (13 targets) when compared with the 37 scoring groups in CAPRI. The results demonstrate the utility of combining evolutionary, topological and energetic information for scoring docked conformations. This work represents the first successful demonstration of graph kernels to protein interfaces for effective discrimination of near-native and non-native conformations of protein complexes. AVAILABILITY AND IMPLEMENTATION The iScore code is freely available from Github: https://github.com/DeepRank/iScore (DOI: 10.5281/zenodo.2630567). And the docking models used are available from SBGrid: https://data.sbgrid.org/dataset/684). SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
|
35
|
Abstract
Recent improvements in cryo-electron microscopy (cryo-EM) in the past few years are now allowing to observe molecular complexes at atomic resolution. As a consequence, numerous structures derived from cryo-EM are now available in the Protein Data Bank. However, if for some complexes atomic resolution is reached, this is not true for all. This is also the case in cryo-electron tomography where the achievable resolution is still limited. Furthermore the resolution in a cryo-EM map is not a constant, with often outer regions being of lower resolution, possibly linked to conformational variability. Although those low- to medium-resolution EM maps (or regions thereof) cannot directly provide atomic structure of large molecular complexes, they provide valuable information to model the individual components and their assembly into them. Most approaches for this kind of modeling are performing rigid fitting of the individual components into the EM density map. While this would appear an obvious option, they ignore key aspects of molecular recognition, the energetics and flexibility of the interfaces. Moreover, this often restricts the modeling to a unique source of data, the EM density map.In this chapter, we describe a protocol where an EM map is used as restraint in HADDOCK to guide the modeling process. In the first step, rigid-body fitting is performed with PowerFit in order to identify the most likely locations of the molecules into the map. These are then used as centroids to which distance restraints are defined from the center of mass of the components of the complex for the initial rigid-body docking. The EM density is then directly used as an additional restraint energy term, which can be combined with all the other types of data supported by HADDOCK. This protocol relies on the new version 2.4 of both the HADDOCK webserver and software. Preparation steps consisting of cropping the EM map and rigid-body fitting of the atomic structure are explained. Then, the EM-driven docking protocol using HADDOCK is illustrated.
Collapse
|
36
|
An overview of data-driven HADDOCK strategies in CAPRI rounds 38-45. Proteins 2019; 88:1029-1036. [PMID: 31886559 DOI: 10.1002/prot.25869] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2019] [Revised: 12/17/2019] [Accepted: 12/26/2019] [Indexed: 01/18/2023]
Abstract
Our information-driven docking approach HADDOCK has demonstrated a sustained performance since the start of its participation to CAPRI. This is due, in part, to its ability to integrate data into the modeling process, and to the robustness of its scoring function. We participated in CAPRI both as server and manual predictors. In CAPRI rounds 38-45, we have used various strategies depending on the available information. These ranged from imposing restraints to a few residues identified from literature as being important for the interaction, to binding pockets identified from homologous complexes or template-based refinement/CA-CA restraint-guided docking from identified templates. When relevant, symmetry restraints were used to limit the conformational sampling. We also tested for a large decamer target a new implementation of the MARTINI coarse-grained force field in HADDOCK. Overall, we obtained acceptable or better predictions for 13 and 11 server and manual submissions, respectively, out of the 22 interfaces. Our server performance (acceptable or higher-quality models when considering the top 10) was better (59%) than the manual (50%) one, in which we typically experiment with various combinations of protocols and data sources. Again, our simple scoring function based on a linear combination of intermolecular van der Waals and electrostatic energies and an empirical desolvation term demonstrated a good performance in the scoring experiment with a 63% success rate across all 22 interfaces. An analysis of model quality indicates that, while we are consistently performing well in generating acceptable models, there is room for improvement for generating/identifying higher quality models.
Collapse
|
37
|
Blind prediction of homo- and hetero-protein complexes: The CASP13-CAPRI experiment. Proteins 2019; 87:1200-1221. [PMID: 31612567 PMCID: PMC7274794 DOI: 10.1002/prot.25838] [Citation(s) in RCA: 79] [Impact Index Per Article: 15.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2019] [Revised: 09/26/2019] [Accepted: 09/27/2019] [Indexed: 12/28/2022]
Abstract
We present the results for CAPRI Round 46, the third joint CASP-CAPRI protein assembly prediction challenge. The Round comprised a total of 20 targets including 14 homo-oligomers and 6 heterocomplexes. Eight of the homo-oligomer targets and one heterodimer comprised proteins that could be readily modeled using templates from the Protein Data Bank, often available for the full assembly. The remaining 11 targets comprised 5 homodimers, 3 heterodimers, and two higher-order assemblies. These were more difficult to model, as their prediction mainly involved "ab-initio" docking of subunit models derived from distantly related templates. A total of ~30 CAPRI groups, including 9 automatic servers, submitted on average ~2000 models per target. About 17 groups participated in the CAPRI scoring rounds, offered for most targets, submitting ~170 models per target. The prediction performance, measured by the fraction of models of acceptable quality or higher submitted across all predictors groups, was very good to excellent for the nine easy targets. Poorer performance was achieved by predictors for the 11 difficult targets, with medium and high quality models submitted for only 3 of these targets. A similar performance "gap" was displayed by scorer groups, highlighting yet again the unmet challenge of modeling the conformational changes of the protein components that occur upon binding or that must be accounted for in template-based modeling. Our analysis also indicates that residues in binding interfaces were less well predicted in this set of targets than in previous Rounds, providing useful insights for directions of future improvements.
Collapse
|
38
|
Modeling Antibody-Antigen Complexes by Information-Driven Docking. Structure 2019; 28:119-129.e2. [PMID: 31727476 DOI: 10.1016/j.str.2019.10.011] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2019] [Revised: 07/03/2019] [Accepted: 10/18/2019] [Indexed: 10/25/2022]
Abstract
Antibodies are Y-shaped proteins essential for immune response. Their capability to recognize antigens with high specificity makes them excellent therapeutic targets. Understanding the structural basis of antibody-antigen interactions is therefore crucial for improving our ability to design efficient biological drugs. Computational approaches such as molecular docking are providing a valuable and fast alternative to experimental structural characterization for these complexes. We investigate here how information about complementarity-determining regions and binding epitopes can be used to drive the modeling process, and present a comparative study of four different docking software suites (ClusPro, LightDock, ZDOCK, and HADDOCK) providing specific options for antibody-antigen modeling. Their performance on a dataset of 16 complexes is reported. HADDOCK, which includes information to drive the docking, is shown to perform best in terms of both success rate and quality of the generated models in both the presence and absence of information about the epitope on the antigen.
Collapse
|
39
|
Abstract
Given the need for modern researchers to produce open, reproducible scientific output, the lack of standards and best practices for sharing data and workflows used to produce and analyze molecular dynamics (MD) simulations has become an important issue in the field. There are now multiple well-established packages to perform molecular dynamics simulations, often highly tuned for exploiting specific classes of hardware, each with strong communities surrounding them, but with very limited interoperability/transferability options. Thus, the choice of the software package often dictates the workflow for both simulation production and analysis. The level of detail in documenting the workflows and analysis code varies greatly in published work, hindering reproducibility of the reported results and the ability for other researchers to build on these studies. An increasing number of researchers are motivated to make their data available, but many challenges remain in order to effectively share and reuse simulation data. To discuss these and other issues related to best practices in the field in general, we organized a workshop in November 2018 ( https://bioexcel.eu/events/workshop-on-sharing-data-from-molecular-simulations/ ). Here, we present a brief overview of this workshop and topics discussed. We hope this effort will spark further conversation in the MD community to pave the way toward more open, interoperable, and reproducible outputs coming from research studies using MD simulations.
Collapse
|
40
|
Less Is More: Coarse-Grained Integrative Modeling of Large Biomolecular Assemblies with HADDOCK. J Chem Theory Comput 2019; 15:6358-6367. [PMID: 31539250 PMCID: PMC6854652 DOI: 10.1021/acs.jctc.9b00310] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Predicting the 3D structure of protein interactions remains a challenge in the field of computational structural biology. This is in part due to difficulties in sampling the complex energy landscape of multiple interacting flexible polypeptide chains. Coarse-graining approaches, which reduce the number of degrees of freedom of the system, help address this limitation by smoothing the energy landscape, allowing an easier identification of the global energy minimum. They also accelerate the calculations, allowing for modeling larger assemblies. Here, we present the implementation of the MARTINI coarse-grained force field for proteins into HADDOCK, our integrative modeling platform. Docking and refinement are performed at the coarse-grained level, and the resulting models are then converted back to atomistic resolution through a distance restraints-guided morphing procedure. Our protocol, tested on the largest complexes of the protein docking benchmark 5, shows an overall ∼7-fold speed increase compared to standard all-atom calculations, while maintaining a similar accuracy and yielding substantially more near-native solutions. To showcase the potential of our method, we performed simultaneous 7 body docking to model the 1:6 KaiC-KaiB complex, integrating mutagenesis and hydrogen/deuterium exchange data from mass spectrometry with symmetry restraints, and validated the resulting models against a recently published cryo-EM structure.
Collapse
|
41
|
Abstract
Modeling biomolecular assemblies is an important field in computational structural biology. The inherent complexity of their energy landscape and the computational cost associated with modeling large and complex assemblies are major drawbacks for integrative modeling approaches. The so-called coarse-graining approaches, which reduce the degrees of freedom of the system by grouping several atoms into larger “pseudo-atoms,” have been shown to alleviate some of those limitations, facilitating the identification of the global energy minima assumed to correspond to the native state of the complex, while making the calculations more efficient. Here, we describe and assess the implementation of the MARTINI force field for DNA into HADDOCK, our integrative modeling platform. We combine it with our previous implementation for protein-protein coarse-grained docking, enabling coarse-grained modeling of protein-nucleic acid complexes. The system is modeled using MARTINI topologies and interaction parameters during the rigid body docking and semi-flexible refinement stages of HADDOCK, and the resulting models are then converted back to atomistic resolution by an atom-to-bead distance restraints-guided protocol. We first demonstrate the performance of this protocol using 44 complexes from the protein-DNA docking benchmark, which shows an overall ~6-fold speed increase and maintains similar accuracy as compared to standard atomistic calculations. As a proof of concept, we then model the interaction between the PRC1 and the nucleosome (a former CAPRI target in round 31), using the same information available at the time the target was offered, and compare all-atom and coarse-grained models.
Collapse
|
42
|
Pre- and post-docking sampling of conformational changes using ClustENM and HADDOCK for protein-protein and protein-DNA systems. Proteins 2019; 88:292-306. [PMID: 31441121 PMCID: PMC6973081 DOI: 10.1002/prot.25802] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2019] [Revised: 08/15/2019] [Accepted: 08/19/2019] [Indexed: 02/01/2023]
Abstract
Incorporating the dynamic nature of biomolecules in the modeling of their complexes is a challenge, especially when the extent and direction of the conformational changes taking place upon binding is unknown. Estimating whether the binding of a biomolecule to its partner(s) occurs in a conformational state accessible to its unbound form (“conformational selection”) and/or the binding process induces conformational changes (“induced‐fit”) is another challenge. We propose here a method combining conformational sampling using ClustENM—an elastic network‐based modeling procedure—with docking using HADDOCK, in a framework that incorporates conformational selection and induced‐fit effects upon binding. The extent of the applied deformation is estimated from its energetical costs, inspired from mechanical tensile testing on materials. We applied our pre‐ and post‐docking sampling of conformational changes to the flexible multidomain protein‐protein docking benchmark and a subset of the protein‐DNA docking benchmark. Our ClustENM‐HADDOCK approach produced acceptable to medium quality models in 7/11 and 5/6 cases for the protein‐protein and protein‐DNA complexes, respectively. The conformational selection (sampling prior to docking) has the highest impact on the quality of the docked models for the protein‐protein complexes. The induced‐fit stage of the pipeline (post‐sampling), however, improved the quality of the final models for the protein‐DNA complexes. Compared to previously described strategies to handle conformational changes, ClustENM‐HADDOCK performs better than two‐body docking in protein‐protein cases but worse than a flexible multidomain docking approach. However, it does show a better or similar performance compared to previous protein‐DNA docking approaches, which makes it a suitable alternative.
Collapse
|
43
|
The structural details of the interaction of single-stranded DNA binding protein hSSB2 (NABP1/OBFC2A) with UV-damaged DNA. Proteins 2019; 88:319-326. [PMID: 31443132 DOI: 10.1002/prot.25806] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2019] [Revised: 08/02/2019] [Accepted: 08/19/2019] [Indexed: 12/17/2022]
Abstract
Single-stranded DNA-binding proteins (SSBs) are required for all known DNA metabolic events such as DNA replication, recombination and repair. While a wealth of structural and functional data is available on the essential human SSB, hSSB1 (NABP2/OBFC2B), the close homolog hSSB2 (NABP1/OBFC2A) remains relatively uncharacterized. Both SSBs possess a well-structured OB (oligonucleotide/oligosaccharide-binding) domain that is able to recognize single-stranded DNA (ssDNA) followed by a flexible carboxyl-tail implicated in the interaction with other proteins. Despite the high sequence similarity of the OB domain, several recent studies have revealed distinct functional differences between hSSB1 and hSSB2. In this study, we show that hSSB2 is able to recognize cyclobutane pyrimidine dimers (CPD) that form in cellular DNA as a consequence of UV damage. Using a combination of biolayer interferometry and NMR, we determine the molecular details of the binding of the OB domain of hSSB2 to CPD-containing ssDNA, confirming the role of four key aromatic residues in hSSB2 (W59, Y78, W82, and Y89) that are also conserved in hSSB1. Our structural data thus demonstrate that ssDNA recognition by the OB fold of hSSB2 is highly similar to hSSB1, indicating that one SSB may be able to replace the other in any initial ssDNA binding event. However, any subsequent recruitment of other repair proteins most likely depends on the divergent carboxyl-tail and as such is likely to be different between hSSB1 and hSSB2.
Collapse
|
44
|
Folding Then Binding vs Folding Through Binding in Macrocyclic Peptide Inhibitors of Human Pancreatic α-Amylase. ACS Chem Biol 2019; 14:1751-1759. [PMID: 31241898 PMCID: PMC6700688 DOI: 10.1021/acschembio.9b00290] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2019] [Accepted: 06/26/2019] [Indexed: 12/14/2022]
Abstract
De novo macrocyclic peptides, derived using selection technologies such as phage and mRNA display, present unique and unexpected solutions to challenging biological problems. This is due in part to their unusual folds, which are able to present side chains in ways not available to canonical structures such as α-helices and β-sheets. Despite much recent interest in these molecules, their folding and binding behavior remains poorly characterized. In this work, we present cocrystallization, docking, and solution NMR structures of three de novo macrocyclic peptides that all bind as competitive inhibitors with single-digit nanomolar Ki to the active site of human pancreatic α-amylase. We show that a short stably folded motif in one of these is nucleated by internal hydrophobic interactions in an otherwise dynamic conformation in solution. Comparison of the solution structures with a target-bound structure from docking indicates that stabilization of the bound conformation is provided through interactions with the target protein after binding. These three structures also reveal a surprising functional convergence to present a motif of a single arginine sandwiched between two aromatic residues in the interactions of the peptide with the key catalytic residues of the enzyme, despite little to no other structural homology. Our results suggest that intramolecular hydrophobic interactions are important for priming binding of small macrocyclic peptides to their target and that high rigidity is not necessary for high affinity.
Collapse
|
45
|
Holo-like and Druggable Protein Conformations from Enhanced Sampling of Binding Pocket Volume and Shape. J Chem Inf Model 2019; 59:1515-1528. [PMID: 30883122 DOI: 10.1021/acs.jcim.8b00730] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Understanding molecular recognition of small molecules by proteins in atomistic detail is key for drug design. Molecular docking is a widely used computational method to mimic ligand-protein association in silico. However, predicting conformational changes occurring in proteins upon ligand binding is still a major challenge. Ensemble docking approaches address this issue by considering a set of different conformations of the protein obtained either experimentally or from computer simulations, e.g., molecular dynamics. However, holo structures prone to host (the correct) ligands are generally poorly sampled by standard molecular dynamics simulations of the apo protein. In order to address this limitation, we introduce a computational approach based on metadynamics simulations called ensemble docking with enhanced sampling of pocket shape (EDES) that allows holo-like conformations of proteins to be generated by exploiting only their apo structures. This is achieved by defining a set of collective variables that effectively sample different shapes of the binding site, ultimately mimicking the steric effect due to the ligand. We assessed the method on three challenging proteins undergoing different extents of conformational changes upon ligand binding. In all cases our protocol generates a significant fraction of structures featuring a low RMSD from the experimental holo geometry. Moreover, ensemble docking calculations using those conformations yielded in all cases native-like poses among the top-ranked ones.
Collapse
|
46
|
Abstract
Data processing and data management services for structural biology. Enhancements to existing web services for structure solution and analysis. New pipelines to link these services into more complex higher-level workflows. New data management facilities. Making the benefits of European e-Infrastructures more accessible to structural biologists.
The West-Life project (https://about.west-life.eu/) is a Horizon 2020 project funded by the European Commission to provide data processing and data management services for the international community of structural biologists, and in particular to support integrative experimental approaches within the field of structural biology. It has developed enhancements to existing web services for structure solution and analysis, created new pipelines to link these services into more complex higher-level workflows, and added new data management facilities. Through this work it has striven to make the benefits of European e-Infrastructures more accessible to life-science researchers in general and structural biologists in particular.
Collapse
|
47
|
Natural helix 9 mutants of PPARγ differently affect its transcriptional activity. Mol Metab 2019; 20:115-127. [PMID: 30595551 PMCID: PMC6358588 DOI: 10.1016/j.molmet.2018.12.005] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/22/2018] [Revised: 12/05/2018] [Accepted: 12/11/2018] [Indexed: 12/17/2022] Open
Abstract
OBJECTIVE The nuclear receptor PPARγ is the master regulator of adipocyte differentiation, distribution, and function. In addition, PPARγ induces terminal differentiation of several epithelial cell lineages, including colon epithelia. Loss-of-function mutations in PPARG result in familial partial lipodystrophy subtype 3 (FPDL3), a rare condition characterized by aberrant adipose tissue distribution and severe metabolic complications, including diabetes. Mutations in PPARG have also been reported in sporadic colorectal cancers, but the significance of these mutations is unclear. Studying these natural PPARG mutations provides valuable insights into structure-function relationships in the PPARγ protein. We functionally characterized a novel FPLD3-associated PPARγ L451P mutation in helix 9 of the ligand binding domain (LBD). Interestingly, substitution of the adjacent amino acid K450 was previously reported in a human colon carcinoma cell line. METHODS We performed a detailed side-by-side functional comparison of these two PPARγ mutants. RESULTS PPARγ L451P shows multiple intermolecular defects, including impaired cofactor binding and reduced RXRα heterodimerisation and subsequent DNA binding, but not in DBD-LBD interdomain communication. The K450Q mutant displays none of these functional defects. Other colon cancer-associated PPARγ mutants displayed diverse phenotypes, ranging from complete loss of activity to wildtype activity. CONCLUSIONS Amino acid changes in helix 9 can differently affect LBD integrity and function. In addition, FPLD3-associated PPARγ mutations consistently cause intra- and/or intermolecular defects; colon cancer-associated PPARγ mutations on the other hand may play a role in colon cancer onset and progression, but this is not due to their effects on the most well-studied functional characteristics of PPARγ.
Collapse
|
48
|
Finding the ΔΔ
G
spot: Are predictors of binding affinity changes upon mutations in protein–protein interactions ready for it? WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2019. [DOI: 10.1002/wcms.1410] [Citation(s) in RCA: 43] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
|
49
|
Protein-ligand pose and affinity prediction: Lessons from D3R Grand Challenge 3. J Comput Aided Mol Des 2019; 33:83-91. [PMID: 30128928 PMCID: PMC6373529 DOI: 10.1007/s10822-018-0148-4] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2018] [Accepted: 08/09/2018] [Indexed: 12/30/2022]
Abstract
We report the performance of HADDOCK in the 2018 iteration of the Grand Challenge organised by the D3R consortium. Building on the findings of our participation in last year's challenge, we significantly improved our pose prediction protocol which resulted in a mean RMSD for the top scoring pose of 3.04 and 2.67 Å for the cross-docking and self-docking experiments respectively, which corresponds to an overall success rate of 63% and 71% when considering the top1 and top5 models respectively. This performance ranks HADDOCK as the 6th and 3rd best performing group (excluding multiple submissions from a same group) out of a total of 44 and 47 submissions respectively. Our ligand-based binding affinity predictor is the 3rd best predictor overall, behind only the two leading structure-based implementations, and the best ligand-based one with a Kendall's Tau correlation of 0.36 for the Cathepsin challenge. It also performed well in the classification part of the Kinase challenges, with Matthews Correlation Coefficients of 0.49 (ranked 1st), 0.39 (ranked 4th) and 0.21 (ranked 4th) for the JAK2, vEGFR2 and p38a targets respectively. Through our participation in last year's competition we came to the conclusion that template selection is of critical importance for the successful outcome of the docking. This year we have made improvements in two additional areas of importance: ligand conformer selection and initial positioning, which have been key to our excellent pose prediction performance this year.
Collapse
|
50
|
Abstract
The pdb-tools are a collection of Python scripts for working with molecular structure data in the Protein Data Bank (PDB) format. They allow users to edit, convert, and validate PDB files, from the command-line, in a simple but efficient manner. The pdb-tools are implemented in Python, without any external dependencies, and are freely available under the open-source Apache License at https://github.com/haddocking/pdb-tools/ and on
PyPI.
Collapse
|