1
|
Ghanbarpour A, Jiang M, Foster D, Chai Q. Structure-free antibody paratope similarity prediction for in silico epitope binning via protein language models. iScience 2023; 26:106036. [PMID: 36824280 PMCID: PMC9941125 DOI: 10.1016/j.isci.2023.106036] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2022] [Revised: 11/28/2022] [Accepted: 01/17/2023] [Indexed: 01/26/2023] Open
Abstract
Antibodies are an important group of biological molecules that are used as therapeutics and diagnostic tools. Although millions of antibody sequences are available, identifying their structural and functional similarity and their antigen binding sites remains a challenge at large scale. Here, we present a fast, sequence-based computational method for antibody paratope prediction based on protein language models. The paratope information is then used to measure similarity among antibodies via protein language models. Our computational method enables binning of antibody discovery hits into groups as the function of epitope engagement. We further demonstrate the utility of the method by identifying antibodies targeting highly similar epitopes of the same antigens from a large pool of antibody sequences, using two case studies: SARS CoV2 Receptor Binding Domain (RBD) and Epidermal Growth Factor Receptor (EGFR). Our approach highlights the potential in accelerating antibody discovery by enhancing hit prioritization and diversity selection.
Collapse
Affiliation(s)
- Ahmadreza Ghanbarpour
- Biotechnology Discovery Research, Lilly Biotechnology Center, 10300 Campus Point Drive, San Diego, CA 92121, USA
| | - Min Jiang
- Advanced Analytics and Data Sciences, Lilly Corporate Center, Indianapolis, IN 46225, USA
| | - Denisa Foster
- Biotechnology Discovery Research, Lilly Biotechnology Center, 10300 Campus Point Drive, San Diego, CA 92121, USA
| | - Qing Chai
- Biotechnology Discovery Research, Lilly Biotechnology Center, 10300 Campus Point Drive, San Diego, CA 92121, USA
| |
Collapse
|
2
|
De Lauro A, Di Rienzo L, Miotto M, Olimpieri PP, Milanetti E, Ruocco G. Shape Complementarity Optimization of Antibody–Antigen Interfaces: The Application to SARS-CoV-2 Spike Protein. Front Mol Biosci 2022; 9:874296. [PMID: 35669567 PMCID: PMC9163568 DOI: 10.3389/fmolb.2022.874296] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2022] [Accepted: 04/07/2022] [Indexed: 11/17/2022] Open
Abstract
Many factors influence biomolecule binding, and its assessment constitutes an elusive challenge in computational structural biology. In this aspect, the evaluation of shape complementarity at molecular interfaces is one of the main factors to be considered. We focus on the particular case of antibody–antigen complexes to quantify the complementarities occurring at molecular interfaces. We relied on a method we recently developed, which employs the 2D Zernike descriptors, to characterize the investigated regions with an ordered set of numbers summarizing the local shape properties. Collecting a structural dataset of antibody–antigen complexes, we applied this method and we statistically distinguished, in terms of shape complementarity, pairs of the interacting regions from the non-interacting ones. Thus, we set up a novel computational strategy based on in silico mutagenesis of antibody-binding site residues. We developed a Monte Carlo procedure to increase the shape complementarity between the antibody paratope and a given epitope on a target protein surface. We applied our protocol against several molecular targets in SARS-CoV-2 spike protein, known to be indispensable for viral cell invasion. We, therefore, optimized the shape of template antibodies for the interaction with such regions. As the last step of our procedure, we performed an independent molecular docking validation of the results of our Monte Carlo simulations.
Collapse
Affiliation(s)
| | - Lorenzo Di Rienzo
- Center for Life Nano & Neuro-Science, Istituto Italiano di Tecnologia, Rome, Italy
- *Correspondence: Lorenzo Di Rienzo,
| | - Mattia Miotto
- Center for Life Nano & Neuro-Science, Istituto Italiano di Tecnologia, Rome, Italy
| | | | - Edoardo Milanetti
- Center for Life Nano & Neuro-Science, Istituto Italiano di Tecnologia, Rome, Italy
- Department of Physics, Sapienza University, Rome, Italy
| | - Giancarlo Ruocco
- Center for Life Nano & Neuro-Science, Istituto Italiano di Tecnologia, Rome, Italy
- Department of Physics, Sapienza University, Rome, Italy
| |
Collapse
|
3
|
Grassmann G, Miotto M, Di Rienzo L, Gosti G, Ruocco G, Milanetti E. A novel computational strategy for defining the minimal protein molecular surface representation. PLoS One 2022; 17:e0266004. [PMID: 35421111 PMCID: PMC9009619 DOI: 10.1371/journal.pone.0266004] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2021] [Accepted: 03/12/2022] [Indexed: 11/18/2022] Open
Abstract
Most proteins perform their biological function by interacting with one or more molecular partners. In this respect, characterizing local features of the molecular surface, that can potentially be involved in the interaction with other molecules, represents a step forward in the investigation of the mechanisms of recognition and binding between molecules. Predictive methods often rely on extensive samplings of molecular patches with the aim to identify hot spots on the surface. In this framework, analysis of large proteins and/or many molecular dynamics frames is often unfeasible due to the high computational cost. Thus, finding optimal ways to reduce the number of points to be sampled maintaining the biological information (including the surface shape) carried by the molecular surface is pivotal. In this perspective, we here present a new theoretical and computational algorithm with the aim of defining a set of molecular surfaces composed of points not uniformly distributed in space, in such a way as to maximize the information of the overall shape of the molecule by minimizing the number of total points. We test our procedure’s ability in recognizing hot-spots by describing the local shape properties of portions of molecular surfaces through a recently developed method based on the formalism of 2D Zernike polynomials. The results of this work show the ability of the proposed algorithm to preserve the key information of the molecular surface using a reduced number of points compared to the complete surface, where all points of the surface are used for the description. In fact, the methodology shows a significant gain of the information stored in the sampling procedure compared to uniform random sampling.
Collapse
Affiliation(s)
| | - Mattia Miotto
- Center for Life Nano & Neuroscience, Italian Institute of Technology, Rome, Italy
| | - Lorenzo Di Rienzo
- Center for Life Nano & Neuroscience, Italian Institute of Technology, Rome, Italy
| | - Giorgio Gosti
- Center for Life Nano & Neuroscience, Italian Institute of Technology, Rome, Italy
| | - Giancarlo Ruocco
- Center for Life Nano & Neuroscience, Italian Institute of Technology, Rome, Italy
- Department of Physics, Sapienza University, Rome, Italy
| | - Edoardo Milanetti
- Center for Life Nano & Neuroscience, Italian Institute of Technology, Rome, Italy
- Department of Physics, Sapienza University, Rome, Italy
- * E-mail:
| |
Collapse
|
4
|
Assessment of Therapeutic Antibody Developability by Combinations of In Vitro and In Silico Methods. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2022; 2313:57-113. [PMID: 34478132 DOI: 10.1007/978-1-0716-1450-1_4] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Although antibodies have become the fastest-growing class of therapeutics on the market, it is still challenging to develop them for therapeutic applications, which often require these molecules to withstand stresses that are not present in vivo. We define developability as the likelihood of an antibody candidate with suitable functionality to be developed into a manufacturable, stable, safe, and effective drug that can be formulated to high concentrations while retaining a long shelf life. The implementation of reliable developability assessments from the early stages of antibody discovery enables flagging and deselection of potentially problematic candidates, while focussing available resources on the development of the most promising ones. Currently, however, thorough developability assessment requires multiple in vitro assays, which makes it labor intensive and time consuming to implement at early stages. Furthermore, accurate in vitro analysis at the early stage is compromised by the high number of potential candidates that are often prepared at low quantities and purity. Recent improvements in the performance of computational predictors of developability potential are beginning to change this scenario. Many computational methods only require the knowledge of the amino acid sequences and can be used to identify possible developability issues or to rank available candidates according to a range of biophysical properties. Here, we describe how the implementation of in silico tools into antibody discovery pipelines is increasingly offering time- and cost-effective alternatives to in vitro experimental screening, thus streamlining the drug development process. We discuss in particular the biophysical and biochemical properties that underpin developability potential and their trade-offs, review various in vitro assays to measure such properties or parameters that are predictive of developability, and give an overview of the growing number of in silico tools available to predict properties important for antibody development, including the CamSol method developed in our laboratory.
Collapse
|
5
|
Cueno ME, Imai K. Structural Insights on the SARS-CoV-2 Variants of Concern Spike Glycoprotein: A Computational Study With Possible Clinical Implications. Front Genet 2021; 12:773726. [PMID: 34745235 PMCID: PMC8568765 DOI: 10.3389/fgene.2021.773726] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2021] [Accepted: 10/07/2021] [Indexed: 12/31/2022] Open
Abstract
Coronavirus disease 2019 (COVID-19) pandemic has been attributed to SARS-CoV-2 (SARS2) and, consequently, SARS2 has evolved into multiple SARS2 variants driving subsequent waves of infections. In particular, variants of concern (VOC) were identified to have both increased transmissibility and virulence ascribable to mutational changes occurring within the spike protein resulting to modifications in the protein structural orientation which in-turn may affect viral pathogenesis. However, this was never fully elucidated. Here, we generated spike models of endemic HCoVs (HCoV 229E, HCoV OC43, HCoV NL63, HCoV HKU1, SARS CoV, MERS CoV), original SARS2, and VOC (alpha, beta, gamma, delta). Model quality check, structural superimposition, and structural comparison based on RMSD values, TM scores, and contact mapping were all performed. We found that: 1) structural comparison between the original SARS2 and VOC whole spike protein model have minor structural differences (TM > 0.98); 2) the whole VOC spike models putatively have higher structural similarity (TM > 0.70) to spike models from endemic HCoVs coming from the same phylogenetic cluster; 3) original SARS2 S1-CTD and S1-NTD models are structurally comparable to VOC S1-CTD (TM = 1.0) and S1-NTD (TM > 0.96); and 4) endemic HCoV S1-CTD and S1-NTD models are structurally comparable to VOC S1-CTD (TM > 0.70) and S1-NTD (TM > 0.70) models belonging to the same phylogenetic cluster. Overall, we propose that structural similarities (possibly ascribable to similar conformational epitopes) may help determine immune cross-reactivity, whereas, structural differences (possibly associated with varying conformational epitopes) may lead to viral infection (either reinfection or breakthrough infection).
Collapse
Affiliation(s)
- Marni E Cueno
- Department of Microbiology, Nihon University School of Dentistry, Tokyo, Japan
| | - Kenichi Imai
- Department of Microbiology, Nihon University School of Dentistry, Tokyo, Japan
| |
Collapse
|
6
|
Di Rienzo L, Milanetti E, Ruocco G, Lepore R. Quantitative Description of Surface Complementarity of Antibody-Antigen Interfaces. Front Mol Biosci 2021; 8:749784. [PMID: 34660699 PMCID: PMC8514621 DOI: 10.3389/fmolb.2021.749784] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2021] [Accepted: 09/14/2021] [Indexed: 11/29/2022] Open
Abstract
Antibodies have the remarkable ability to recognise their cognate antigens with extraordinary affinity and specificity. Discerning the rules that define antibody-antigen recognition is a fundamental step in the rational design and engineering of functional antibodies with desired properties. In this study we apply the 3D Zernike formalism to the analysis of the surface properties of the antibody complementary determining regions (CDRs). Our results show that shape and electrostatic 3DZD descriptors of the surface of the CDRs are predictive of antigen specificity, with classification accuracy of 81% and area under the receiver operating characteristic curve (AUC) of 0.85. Additionally, while in terms of surface size, solvent accessibility and amino acid composition, antibody epitopes are typically not distinguishable from non-epitope, solvent-exposed regions of the antigen, the 3DZD descriptors detect significantly higher surface complementarity to the paratope, and are able to predict correct paratope-epitope interaction with an AUC = 0.75.
Collapse
Affiliation(s)
- Lorenzo Di Rienzo
- Center for Life Nano and Neuro-Science, Istituto Italiano di Tecnologia, Rome, Italy
| | - Edoardo Milanetti
- Center for Life Nano and Neuro-Science, Istituto Italiano di Tecnologia, Rome, Italy
- Department of Physics, Sapienza University, Rome, Italy
| | - Giancarlo Ruocco
- Center for Life Nano and Neuro-Science, Istituto Italiano di Tecnologia, Rome, Italy
- Department of Physics, Sapienza University, Rome, Italy
| | - Rosalba Lepore
- Department of Biomedicine, Basel University Hospital and University of Basel, Basel, Switzerland
| |
Collapse
|
7
|
Milanetti E, Miotto M, Di Rienzo L, Nagaraj M, Monti M, Golbek TW, Gosti G, Roeters SJ, Weidner T, Otzen DE, Ruocco G. In-Silico Evidence for a Two Receptor Based Strategy of SARS-CoV-2. Front Mol Biosci 2021; 8:690655. [PMID: 34179095 PMCID: PMC8219949 DOI: 10.3389/fmolb.2021.690655] [Citation(s) in RCA: 46] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2021] [Accepted: 05/19/2021] [Indexed: 01/04/2023] Open
Abstract
We propose a computational investigation on the interaction mechanisms between SARS-CoV-2 spike protein and possible human cell receptors. In particular, we make use of our newly developed numerical method able to determine efficiently and effectively the relationship of complementarity between portions of protein surfaces. This innovative and general procedure, based on the representation of the molecular isoelectronic density surface in terms of 2D Zernike polynomials, allows the rapid and quantitative assessment of the geometrical shape complementarity between interacting proteins, which was unfeasible with previous methods. Our results indicate that SARS-CoV-2 uses a dual strategy: in addition to the known interaction with angiotensin-converting enzyme 2, the viral spike protein can also interact with sialic-acid receptors of the cells in the upper airways.
Collapse
Affiliation(s)
- Edoardo Milanetti
- Department of Physics, Sapienza University, Rome, Italy
- Center for Life Nano and Neuro Science, Italian Institute of Technology, Rome, Italy
| | - Mattia Miotto
- Department of Physics, Sapienza University, Rome, Italy
- Center for Life Nano and Neuro Science, Italian Institute of Technology, Rome, Italy
| | - Lorenzo Di Rienzo
- Center for Life Nano and Neuro Science, Italian Institute of Technology, Rome, Italy
| | - Madhu Nagaraj
- Interdisciplinary Nanoscience Center (iNANO), Aarhus University, Aarhus, Denmark
| | - Michele Monti
- Centre for Genomic Regulation (CRG), the Barcelona Institute for Science and Technology, Barcelona, Spain
- RNA System Biology Lab, Department of Neuroscience and Brain Technologies, Istituto Italiano di Tecnologia, Genoa, Italy
| | | | - Giorgio Gosti
- Center for Life Nano and Neuro Science, Italian Institute of Technology, Rome, Italy
| | | | - Tobias Weidner
- Department of Chemistry, Aarhus University, Aarhus, Denmark
| | - Daniel E. Otzen
- Interdisciplinary Nanoscience Center (iNANO), Aarhus University, Aarhus, Denmark
| | - Giancarlo Ruocco
- Department of Physics, Sapienza University, Rome, Italy
- Center for Life Nano and Neuro Science, Italian Institute of Technology, Rome, Italy
| |
Collapse
|
8
|
Miotto M, Di Rienzo L, Bò L, Boffi A, Ruocco G, Milanetti E. Molecular Mechanisms Behind Anti SARS-CoV-2 Action of Lactoferrin. Front Mol Biosci 2021; 8:607443. [PMID: 33659275 PMCID: PMC7917183 DOI: 10.3389/fmolb.2021.607443] [Citation(s) in RCA: 42] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2020] [Accepted: 01/11/2021] [Indexed: 12/15/2022] Open
Abstract
Despite the huge effort to contain the infection, the novel SARS-CoV-2 coronavirus has rapidly become pandemic, mainly due to its extremely high human-to-human transmission capability, and a surprisingly high viral charge of symptom-less people. While the seek for a vaccine is still ongoing, promising results have been obtained with antiviral compounds. In particular, lactoferrin is regarded to have beneficial effects both in preventing and soothing the infection. Here, we explore the possible molecular mechanisms with which lactoferrin interferes with SARS-CoV-2 cell invasion, preventing attachment and/or entry of the virus. To this aim, we search for possible interactions lactoferrin may have with virus structural proteins and host receptors. Representing the molecular iso-electron surface of proteins in terms of 2D-Zernike descriptors, we 1) identified putative regions on the lactoferrin surface able to bind sialic acid present on the host cell membrane, sheltering the cell from the virus attachment; 2) showed that no significant shape complementarity is present between lactoferrin and the ACE2 receptor, while 3) two high complementarity regions are found on the N- and C-terminal domains of the SARS-CoV-2 spike protein, hinting at a possible competition between lactoferrin and ACE2 for the binding to the spike protein.
Collapse
Affiliation(s)
- Mattia Miotto
- Department of Physics, University of Rome `La Sapienza', Rome, Italy
- Istituto Italiano di Tecnologia (IIT), Center for Life Nano Science, Rome, Italy
| | - Lorenzo Di Rienzo
- Istituto Italiano di Tecnologia (IIT), Center for Life Nano Science, Rome, Italy
| | - Leonardo Bò
- Istituto Italiano di Tecnologia (IIT), Center for Life Nano Science, Rome, Italy
| | - Alberto Boffi
- Istituto Italiano di Tecnologia (IIT), Center for Life Nano Science, Rome, Italy
- Department of Biochemical Sciences “A. Rossi Fanelli” Sapienza University, Rome, Italy
| | - Giancarlo Ruocco
- Department of Physics, University of Rome `La Sapienza', Rome, Italy
- Istituto Italiano di Tecnologia (IIT), Center for Life Nano Science, Rome, Italy
| | - Edoardo Milanetti
- Department of Physics, University of Rome `La Sapienza', Rome, Italy
- Istituto Italiano di Tecnologia (IIT), Center for Life Nano Science, Rome, Italy
| |
Collapse
|
9
|
Richardson E, Galson JD, Kellam P, Kelly DF, Smith SE, Palser A, Watson S, Deane CM. A computational method for immune repertoire mining that identifies novel binders from different clonotypes, demonstrated by identifying anti-pertussis toxoid antibodies. MAbs 2021; 13:1869406. [PMID: 33427589 PMCID: PMC7808390 DOI: 10.1080/19420862.2020.1869406] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Due to their shared genetic history, antibodies from the same clonotype often bind to the same epitope. This knowledge is used in immune repertoire mining, where known binders are used to search bulk sequencing repertoires to identify new binders. However, current computational methods cannot identify epitope convergence between antibodies from different clonotypes, limiting the sequence diversity of antigen-specific antibodies that can be identified. We describe how the antibody binding site, the paratope, can be used to cluster antibodies with common antigen reactivity from different clonotypes. Our method, paratyping, uses the predicted paratope to identify these novel cross clonotype matches. We experimentally validated our predictions on a pertussis toxoid dataset. Our results show that even the simplest abstraction of the antibody binding site, using only the length of the loops involved and predicted binding residues, is sufficient to group antigen-specific antibodies and provide additional information to conventional clonotype analysis. Abbreviations: BCR: B-cell receptor; CDR: complementarity-determining region; PTx: pertussis toxoid
Collapse
Affiliation(s)
- Eve Richardson
- Department of Statistics, University of Oxford , Oxford, UK
| | - Jacob D Galson
- Alchemab Therapeutics Ltd , London, UK.,Division of Immunology, University Children's Hospital, University of Zurich, Zurich , Switzerland
| | - Paul Kellam
- Kymab Ltd , Cambridge, UK.,Department of Infectious Diseases, Faculty of Medicine, Imperial College London , London, UK
| | - Dominic F Kelly
- Department of Paediatrics, University of Oxford , Oxford, UK.,Oxford University Hospitals NHS Foundation Trust , Oxford, UK
| | | | | | | | | |
Collapse
|
10
|
Insights into the Interaction Mechanism of DTP3 with MKK7 by Using STD-NMR and Computational Approaches. Biomedicines 2020; 9:biomedicines9010020. [PMID: 33396582 PMCID: PMC7824710 DOI: 10.3390/biomedicines9010020] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2020] [Revised: 12/21/2020] [Accepted: 12/23/2020] [Indexed: 01/18/2023] Open
Abstract
GADD45β/MKK7 complex is a non-redundant, cancer cell-restricted survival module downstream of the NF-kB survival pathway, and it has a pathogenically critical role in multiple myeloma, an incurable malignancy of plasma cells. The first-in-class GADD45β/MKK7 inhibitor DTP3 effectively kills MM cells expressing its molecular target, both in vitro and in vivo, by inducing MKK7/JNK-dependent apoptosis with no apparent toxicity to normal cells. DTP3 combines favorable drug-like properties, with on-target-specific pharmacology, resulting in a safe and cancer-selective therapeutic effect; however, its mode of action is only partially understood. In this work, we have investigated the molecular determinants underlying the MKK7 interaction with DTP3 by combining computational, NMR, and spectroscopic methods. Data gathered by fluorescence quenching and computational approaches consistently indicate that the N-terminal region of MKK7 is the optimal binding site explored by DTP3. These findings further the understanding of the selective mode of action of GADD45β/MKK7 inhibitors and inform potential mechanisms of drug resistance. Notably, upon validation of the safety and efficacy of DTP3 in human trials, our results could also facilitate the development of novel DTP3-like therapeutics with improved bioavailability or the capacity to bypass drug resistance.
Collapse
|
11
|
Milanetti E, Miotto M, Di Rienzo L, Monti M, Gosti G, Ruocco G. 2D Zernike polynomial expansion: Finding the protein-protein binding regions. Comput Struct Biotechnol J 2020; 19:29-36. [PMID: 33363707 PMCID: PMC7750141 DOI: 10.1016/j.csbj.2020.11.051] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2020] [Revised: 11/26/2020] [Accepted: 11/28/2020] [Indexed: 01/26/2023] Open
Abstract
We present a method for efficiently and effectively assessing whether and where two proteins can interact with each other to form a complex. This is still largely an open problem, even for those relatively few cases where the 3D structure of both proteins is known. In fact, even if much of the information about the interaction is encoded in the chemical and geometric features of the structures, the set of possible contact patches and of their relative orientations are too large to be computationally affordable in a reasonable time, thus preventing the compilation of reliable interactome. Our method is able to rapidly and quantitatively measure the geometrical shape complementarity between interacting proteins, comparing their molecular iso-electron density surfaces expanding the surface patches in term of 2D Zernike polynomials. We first test the method against the real binding region of a large dataset of known protein complexes, reaching a success rate of 0.72. We then apply the method for the blind recognition of binding sites, identifying the real region of interaction in about 60% of the analyzed cases. Finally, we investigate how the efficiency in finding the right binding region depends on the surface roughness as a function of the expansion order.
Collapse
Affiliation(s)
- Edoardo Milanetti
- Department of Physics, Sapienza University, Piazzale Aldo Moro 5, 00185 Rome, Italy.,Center for Life Nanoscience, Istituto Italiano di Tecnologia, Viale Regina Elena 291, 00161 Rome, Italy
| | - Mattia Miotto
- Department of Physics, Sapienza University, Piazzale Aldo Moro 5, 00185 Rome, Italy.,Center for Life Nanoscience, Istituto Italiano di Tecnologia, Viale Regina Elena 291, 00161 Rome, Italy
| | - Lorenzo Di Rienzo
- Center for Life Nanoscience, Istituto Italiano di Tecnologia, Viale Regina Elena 291, 00161 Rome, Italy
| | - Michele Monti
- Centre for Genomic Regulation (CRG), The Barcelona Institute for Science and Technology, Dr. Aiguader 88, 08003 Barcelona, Spain.,RNA System Biology Lab, Department of Neuroscience and Brain Technologies, Istituto Italiano di Tecnologia, Via Morego 30, 16163 Genoa, Italy
| | - Giorgio Gosti
- Center for Life Nanoscience, Istituto Italiano di Tecnologia, Viale Regina Elena 291, 00161 Rome, Italy
| | - Giancarlo Ruocco
- Department of Physics, Sapienza University, Piazzale Aldo Moro 5, 00185 Rome, Italy.,Center for Life Nanoscience, Istituto Italiano di Tecnologia, Viale Regina Elena 291, 00161 Rome, Italy
| |
Collapse
|
12
|
Di Rienzo L, Milanetti E, Testi C, Montemiglio LC, Baiocco P, Boffi A, Ruocco G. A novel strategy for molecular interfaces optimization: The case of Ferritin-Transferrin receptor interaction. Comput Struct Biotechnol J 2020; 18:2678-2686. [PMID: 33101606 PMCID: PMC7548301 DOI: 10.1016/j.csbj.2020.09.020] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2020] [Revised: 09/10/2020] [Accepted: 09/11/2020] [Indexed: 11/24/2022] Open
Abstract
Protein-protein interactions regulate almost all cellular functions and rely on a fine tune of surface amino acids properties involved on both molecular partners. The disruption of a molecular association can be caused even by a single residue mutation, often leading to a pathological modification of a biochemical pathway. Therefore the evaluation of the effects of amino acid substitutions on binding, and the ad hoc design of protein-protein interfaces, is one of the biggest challenges in computational biology. Here, we present a novel strategy for computational mutation and optimization of protein-protein interfaces. Modeling the interaction surface properties using the Zernike polynomials, we describe the shape and electrostatics of binding sites with an ordered set of descriptors, making possible the evaluation of complementarity between interacting surfaces. With a Monte Carlo approach, we obtain protein mutants with controlled molecular complementarities. Applying this strategy to the relevant case of the interaction between Ferritin and Transferrin Receptor, we obtain a set of Ferritin mutants with increased or decreased complementarity. The extensive molecular dynamics validation of the method results confirms its efficacy, showing that this strategy represents a very promising approach in designing correct molecular interfaces.
Collapse
Affiliation(s)
- Lorenzo Di Rienzo
- Center for Life Nanoscience, Istituto Italiano di Tecnologia, Viale Regina Elena 291, 00161 Rome, Italy
| | - Edoardo Milanetti
- Center for Life Nanoscience, Istituto Italiano di Tecnologia, Viale Regina Elena 291, 00161 Rome, Italy
- Department of Physics, Sapienza University, Piazzale Aldo Moro 5, 00185 Rome, Italy
| | - Claudia Testi
- Center for Life Nanoscience, Istituto Italiano di Tecnologia, Viale Regina Elena 291, 00161 Rome, Italy
| | | | - Paola Baiocco
- Center for Life Nanoscience, Istituto Italiano di Tecnologia, Viale Regina Elena 291, 00161 Rome, Italy
- Department of Biochemical Sciences ‘A. Rossi Fanelli’ Sapienza University, Piazzale Aldo Moro 5, 00185 Rome, Italy
| | - Alberto Boffi
- Department of Biochemical Sciences ‘A. Rossi Fanelli’ Sapienza University, Piazzale Aldo Moro 5, 00185 Rome, Italy
| | - Giancarlo Ruocco
- Center for Life Nanoscience, Istituto Italiano di Tecnologia, Viale Regina Elena 291, 00161 Rome, Italy
- Department of Physics, Sapienza University, Piazzale Aldo Moro 5, 00185 Rome, Italy
| |
Collapse
|
13
|
Pomés A, Mueller GA, Chruszcz M. Structural Aspects of the Allergen-Antibody Interaction. Front Immunol 2020; 11:2067. [PMID: 32983155 PMCID: PMC7492603 DOI: 10.3389/fimmu.2020.02067] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2020] [Accepted: 07/29/2020] [Indexed: 12/23/2022] Open
Abstract
The development of allergic disease involves the production of IgE antibodies upon allergen exposure in a process called sensitization. IgE binds to receptors on the surface of mast cells and basophils, and subsequent allergen exposure leads to cross-linking of IgE antibodies and release of cell mediators that cause allergy symptoms. Although this process is quite well-understood, very little is known about the epitopes on the allergen recognized by IgE, despite the importance of the allergen-antibody interaction for the allergic response to occur. This review discusses efforts to analyze allergen-antibody interactions, from the original epitope mapping studies using linear peptides or recombinant allergen fragments, to more sophisticated technologies, such as X-ray crystallography and nuclear magnetic resonance. These state-of-the-art approaches, combined with site-directed mutagenesis, have led to the identification of conformational IgE epitopes. The first structures of an allergen (egg lysozyme) in complex with Fab fragments from IgG antibodies were determined in the 1980s. Since then, IgG has been used as surrogate for IgE, due to the difficulty of obtaining monoclonal IgE antibodies. Technical developments including phage display libraries have contributed to progress in epitope mapping thanks to the isolation of IgE antibody constructs from combinatorial libraries made from peripheral blood mononuclear cells of allergic donors. Most recently, single B cell antibody sequencing and human hybridomas are new breakthrough technologies for finally obtaining human IgE monoclonal antibodies, ideal for epitope mapping. The information on antigenic determinants will facilitate the design of hypoallergens for immunotherapy and the investigation of the fundamental mechanisms of the IgE response.
Collapse
Affiliation(s)
- Anna Pomés
- Indoor Biotechnologies, Inc., Charlottesville, VA, United States
| | - Geoffrey A Mueller
- National Institute of Environmental Health Sciences, Durham, NC, United States
| | - Maksymilian Chruszcz
- Department of Chemistry and Biochemistry, University of South Carolina, Columbia, SC, United States
| |
Collapse
|
14
|
Real time structural search of the Protein Data Bank. PLoS Comput Biol 2020; 16:e1007970. [PMID: 32639954 PMCID: PMC7371193 DOI: 10.1371/journal.pcbi.1007970] [Citation(s) in RCA: 39] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2019] [Revised: 07/20/2020] [Accepted: 05/20/2020] [Indexed: 11/19/2022] Open
Abstract
Detection of protein structure similarity is a central challenge in structural bioinformatics. Comparisons are usually performed at the polypeptide chain level, however the functional form of a protein within the cell is often an oligomer. This fact, together with recent growth of oligomeric structures in the Protein Data Bank (PDB), demands more efficient approaches to oligomeric assembly alignment/retrieval. Traditional methods use atom level information, which can be complicated by the presence of topological permutations within a polypeptide chain and/or subunit rearrangements. These challenges can be overcome by comparing electron density volumes directly. But, brute force alignment of 3D data is a compute intensive search problem. We developed a 3D Zernike moment normalization procedure to orient electron density volumes and assess similarity with unprecedented speed. Similarity searching with this approach enables real-time retrieval of proteins/protein assemblies resembling a target, from PDB or user input, together with resulting alignments (http://shape.rcsb.org).
Collapse
|
15
|
Alba J, Di Rienzo L, Milanetti E, Acuto O, D’Abramo M. Molecular Dynamics Simulations Reveal Canonical Conformations in Different pMHC/TCR Interactions. Cells 2020; 9:E942. [PMID: 32290289 PMCID: PMC7226950 DOI: 10.3390/cells9040942] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2020] [Revised: 04/07/2020] [Accepted: 04/08/2020] [Indexed: 01/14/2023] Open
Abstract
The major defense system against microbial pathogens in vertebrates is the adaptive immune response and represents an effective mechanism in cancer surveillance. T cells represent an essential component of this complex system. They can recognize myriads of antigens as short peptides (p) originated from the intracellular degradation of foreign proteins presented by major histocompatibility complex (MHC) proteins. The clonotypic T-cell antigen receptor (TCR) is specialized in recognizing pMHC and triggering T cells immune response. It is still unclear how TCR engagement to pMHC is translated into the intracellular signal that initiates T-cell immune response. Some work has suggested the possibility that pMHC binding induces in the TCR conformational changes transmitted to its companion CD3 subunits that govern signaling. The conformational changes would promote phosphorylation of the CD3 complex ζ chain that initiates signal propagation intracellularly. Here, we used all-atom molecular dynamics simulations (MDs) of 500 ns to analyze the conformational behavior of three TCRs (1G4, ILA1 and ILA1α1β1) interacting with the same MHC class I (HLA-A*02:01) bound to different peptides, and modelled in the presence of a lipid bilayer. Our data suggest a correlation between the conformations explored by the β-chain constant regions and the T-cell response experimentally determined. In particular, independently by the TCR type involved in the interaction, the TCR activation seems to be linked to a specific zone of the conformational space explored by the β-chain constant region. Moreover, TCR ligation restricts the conformational space the MHC class I groove.
Collapse
Affiliation(s)
- Josephine Alba
- Department of Chemistry, University of Rome Sapienza, P.le A.Moro 5-00185 Rome, Italy
| | - Lorenzo Di Rienzo
- Department of Physics, University of Rome Sapienza, 5-00185 Rome Italy; (L.D.R.); (E.M.)
- Center for Life Nano Science@Sapienza, Istituto Italiano di Tecnologia, 000161 Rome, Italy
| | - Edoardo Milanetti
- Department of Physics, University of Rome Sapienza, 5-00185 Rome Italy; (L.D.R.); (E.M.)
- Center for Life Nano Science@Sapienza, Istituto Italiano di Tecnologia, 000161 Rome, Italy
| | - Oreste Acuto
- Sir William Dunn School of Pathology, University of Oxford, Oxford OX1 3RE, UK;
| | - Marco D’Abramo
- Department of Chemistry, University of Rome Sapienza, P.le A.Moro 5-00185 Rome, Italy
| |
Collapse
|
16
|
Di Rienzo L, Milanetti E, Alba J, D'Abramo M. Quantitative Characterization of Binding Pockets and Binding Complementarity by Means of Zernike Descriptors. J Chem Inf Model 2020; 60:1390-1398. [PMID: 32050068 PMCID: PMC7997106 DOI: 10.1021/acs.jcim.9b01066] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
In this work, we describe the application of the Zernike formalism to quantitatively characterize the binding pockets of two sets of biologically relevant systems. Such an approach, when applied to molecular dynamics trajectories, is able to pinpoint the subtle differences between very similar molecular regions and their impact on the local propensity to ligand binding, allowing us to quantify such differences. The statistical robustness of our procedure suggests that it is very suitable to describe protein binding sites and protein-ligand interactions within a rigorous and well-defined framework.
Collapse
Affiliation(s)
- Lorenzo Di Rienzo
- Department of Physics, Sapienza University of Rome, Piazzale Aldo Moro, 5, 00185 Rome, Italy
| | - Edoardo Milanetti
- Department of Physics, Sapienza University of Rome, Piazzale Aldo Moro, 5, 00185 Rome, Italy.,Center for Life Nano Science@Sapienza, Italian Institute of Technology, Viale Regina Elena 291, 00161 Rome, Italy
| | - Josephine Alba
- Department of Chemistry, Sapienza University of Rome, Piazzale Aldo Moro, 5, 00185 Rome, Italy
| | - Marco D'Abramo
- Department of Chemistry, Sapienza University of Rome, Piazzale Aldo Moro, 5, 00185 Rome, Italy
| |
Collapse
|
17
|
Papageorgiou L, Maroulis D, Chrousos GP, Eliopoulos E, Vlachakis D. Antibody Clustering Using a Machine Learning Pipeline that Fuses Genetic, Structural, and Physicochemical Properties. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2020; 1194:41-58. [PMID: 32468522 DOI: 10.1007/978-3-030-32622-7_4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Antibody V domain clustering is of paramount importance to a repertoire of immunology-related areas. Although several approaches have been proposed for antibody clustering, still no consensus has been reached. Numerous attempts use information from genes, protein sequences, 3D structures, and 3D surfaces in an effort to elucidate unknown action mechanisms directly related to their function and to either link them directly to diseases or drive the discovery of new medicines, such as antibody drug conjugates (ADC). Herein, we describe a new V domain antibody clustering method based on the comparison of the interaction sites between each antibody and its antigen. A more specific clustering analysis of the antibody's V domain was provided using deep learning and data mining techniques. The multidimensional information was extracted from the structural resolved antibodies when they were captured to interact with other proteins. The available 3D structures of protein antigen-antibody (Ag-Ab) interfaces contain information about how antibody V domains recognize antigens as well as about which amino acids are involved in the recognition. As such, the antibody surface holds information about antigens' folding that reside with the Ab-Ag interface residues and how they interact. In order to gain insight into the nature of such interactions, we propose a new simple philosophy to transform the conserved framework (fragment regions, complementarity-determining regions) of antibody V domain in a binary form using structural features of antibody-antigen interactions, toward identifying new antibody signatures in V domain binding activity. Finally, an advanced three-level hybrid classification scheme has been set for clustering antibodies in subgroups, which can combine the information from the protein sequences, the three-dimensional structures, and specific "key patterns" of recognized interactions. The clusters provide multilevel information about antibodies and antibody-antigen complexes.
Collapse
Affiliation(s)
- Louis Papageorgiou
- Department of Informatics and Telecommunications, National and Kapodistrian University of Athens, Athens, Greece.,Genetics and Computational Biology Group, Laboratory of Genetics, Department of Biotechnology, Agricultural University of Athens, Athens, Greece
| | - Dimitris Maroulis
- Department of Informatics and Telecommunications, National and Kapodistrian University of Athens, Athens, Greece
| | - George P Chrousos
- Division of Endocrinology, Metabolism and Diabetes, First Department of Pediatrics, National and Kapodistrian University of Athens Medical School, "Aghia Sophia" Children's Hospital, Athens, Greece.,Division of Endocrinology and Metabolism, Center of Clinical, Experimental Surgery and Translational Research, Biomedical Research Foundation of the Academy of Athens, Athens, Greece
| | - Elias Eliopoulos
- Genetics and Computational Biology Group, Laboratory of Genetics, Department of Biotechnology, Agricultural University of Athens, Athens, Greece
| | - Dimitrios Vlachakis
- Genetics and Computational Biology Group, Laboratory of Genetics, Department of Biotechnology, Agricultural University of Athens, Athens, Greece. .,Division of Endocrinology and Metabolism, Center of Clinical, Experimental Surgery and Translational Research, Biomedical Research Foundation of the Academy of Athens, Athens, Greece.
| |
Collapse
|
18
|
Jespersen MC, Mahajan S, Peters B, Nielsen M, Marcatili P. Antibody Specific B-Cell Epitope Predictions: Leveraging Information From Antibody-Antigen Protein Complexes. Front Immunol 2019; 10:298. [PMID: 30863406 PMCID: PMC6399414 DOI: 10.3389/fimmu.2019.00298] [Citation(s) in RCA: 86] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2018] [Accepted: 02/05/2019] [Indexed: 11/13/2022] Open
Abstract
B-cells can neutralize pathogenic molecules by targeting them with extreme specificity using receptors secreted or expressed on their surface (antibodies). This is achieved via molecular interactions between the paratope (i.e., the antibody residues involved in the binding) and the interacting region (epitope) of its target molecule (antigen). Discerning the rules that define this specificity would have profound implications for our understanding of humoral immunogenicity and its applications. The aim of this work is to produce improved, antibody-specific epitope predictions by exploiting features derived from the antigens and their cognate antibodies structures, and combining them using statistical and machine learning algorithms. We have identified several geometric and physicochemical features that are correlated in interacting paratopes and epitopes, used them to develop a Monte Carlo algorithm to generate putative epitopes-paratope pairs, and train a machine-learning model to score them. We show that, by including the structural and physicochemical properties of the paratope, we improve the prediction of the target of a given B-cell receptor. Moreover, we demonstrate a gain in predictive power both in terms of identifying the cognate antigen target for a given antibody and the antibody target for a given antigen, exceeding the results of other available tools.
Collapse
Affiliation(s)
- Martin Closter Jespersen
- Department of Bio and Health Informatics, Technical University of Denmark, Kongens Lyngby, Denmark
| | - Swapnil Mahajan
- La Jolla Institute for Allergy and Immunology, Center for Infectious Disease, Allergy and Asthma Research, La Jolla, CA, United States
| | - Bjoern Peters
- La Jolla Institute for Allergy and Immunology, Center for Infectious Disease, Allergy and Asthma Research, La Jolla, CA, United States
| | - Morten Nielsen
- Department of Bio and Health Informatics, Technical University of Denmark, Kongens Lyngby, Denmark.,Instituto de Investigaciones Biotecnológicas, Universidad Nacional de San Martín, Buenos Aires, Argentina
| | - Paolo Marcatili
- Department of Bio and Health Informatics, Technical University of Denmark, Kongens Lyngby, Denmark
| |
Collapse
|
19
|
Sormanni P, Aprile FA, Vendruscolo M. Third generation antibody discovery methods: in silico rational design. Chem Soc Rev 2018; 47:9137-9157. [PMID: 30298157 DOI: 10.1039/c8cs00523k] [Citation(s) in RCA: 81] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Owing to their outstanding performances in molecular recognition, antibodies are extensively used in research and applications in molecular biology, biotechnology and medicine. Recent advances in experimental and computational methods are making it possible to complement well-established in vivo (first generation) and in vitro (second generation) methods of antibody discovery with novel in silico (third generation) approaches. Here we describe the principles of computational antibody design and review the state of the art in this field. We then present Modular, a method that implements the rational design of antibodies in a modular manner, and describe the opportunities offered by this approach.
Collapse
Affiliation(s)
- Pietro Sormanni
- Centre for Misfolding Diseases, Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, UK.
| | | | | |
Collapse
|
20
|
Johnson DE. Biotherapeutics: Challenges and Opportunities for Predictive Toxicology of Monoclonal Antibodies. Int J Mol Sci 2018; 19:E3685. [PMID: 30469350 PMCID: PMC6274697 DOI: 10.3390/ijms19113685] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2018] [Revised: 11/18/2018] [Accepted: 11/19/2018] [Indexed: 12/19/2022] Open
Abstract
Biotherapeutics are a rapidly growing portion of the total pharmaceutical market accounting for almost one-half of recent new drug approvals. A major portion of these approvals each year are monoclonal antibodies (mAbs). During development, non-clinical pharmacology and toxicology testing of mAbs differs from that done with chemical entities since these biotherapeutics are derived from a biological source and therefore the animal models must share the same epitopes (targets) as humans to elicit a pharmacological response. Mechanisms of toxicity of mAbs are both pharmacological and non-pharmacological in nature; however, standard in silico predictive toxicological methods used in research and development of chemical entities currently do not apply to these biotherapeutics. Challenges and potential opportunities exist for new methodologies to provide a more predictive program to assess and monitor potential adverse drug reactions of mAbs for specific patients before and during clinical trials and after market approval.
Collapse
Affiliation(s)
- Dale E Johnson
- Morgan Hall, University of California, Berkeley, Berkeley, CA 94720, USA.
| |
Collapse
|
21
|
Rapid and accurate in silico solubility screening of a monoclonal antibody library. Sci Rep 2017; 7:8200. [PMID: 28811609 PMCID: PMC5558012 DOI: 10.1038/s41598-017-07800-w] [Citation(s) in RCA: 90] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2017] [Accepted: 06/29/2017] [Indexed: 01/20/2023] Open
Abstract
Antibodies represent essential tools in research and diagnostics and are rapidly growing in importance as therapeutics. Commonly used methods to obtain novel antibodies typically yield several candidates capable of engaging a given target. The development steps that follow, however, are usually performed with only one or few candidates since they can be resource demanding, thereby increasing the risk of failure of the overall antibody discovery program. In particular, insufficient solubility, which may lead to aggregation under typical storage conditions, often hinders the ability of a candidate antibody to be developed and manufactured. Here we show that the selection of soluble lead antibodies from an initial library screening can be greatly facilitated by a fast computational prediction of solubility that requires only the amino acid sequence as input. We quantitatively validate this approach on a panel of nine distinct monoclonal antibodies targeting nerve growth factor (NGF), for which we compare the predicted and measured solubilities finding a very close match, and we further benchmark our predictions with published experimental data on aggregation hotspots and solubility of mutational variants of one of these antibodies.
Collapse
|