1
|
Zhang Q, Wang B, Jessica, Ghalandari B, Chen Y, Xu Z, Zhou Q, Ding X. PISAD: De novo peptide design for target protein with iterative stochastic searching algorithm and docking assessment. Biosens Bioelectron 2025; 278:117338. [PMID: 40054153 DOI: 10.1016/j.bios.2025.117338] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2024] [Revised: 02/15/2025] [Accepted: 03/03/2025] [Indexed: 03/30/2025]
Abstract
Rapid identification of peptides that bind specifically to a target protein is essential for disease diagnostics and drug development. However, de novo design of peptides without any prior structural knowledge remains a long-standing challenge. Herein, we present Peptide Iterative design with Stochastic Algorithm and Docking (PISAD), a de novo peptide design method, which combines an iterative stochastic searching algorithm with docking assessment. The searching algorithm simulates the evolution of peptide sequences by introducing mutations and crossovers iteratively. After every round of evolution, the peptide sequences undergo docking assessments with the target protein based on the structural prediction of AlphaFold2. We demonstrated PISAD's efficacy by designing peptides targeting four proteins, namely ARF6, ARF1, TGF-β1, and IL-6. For each target, PISAD managed to output peptides with ideal binding affinity within only four iterations of evolutions, and no more than 1250 sequences were assessed. Particularly, the best-performing peptide achieved a KD value of 3.4 nM with ARF6, which has been further experimentally validated. These results demonstrate the efficiency and accuracy of PISAD, which may serve as a universal tool for rapid de novo design of peptides targeting specific proteins.
Collapse
Affiliation(s)
- Qiang Zhang
- Shanghai 6th People's Hospital, School of Medicine and School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai, China; State Key Laboratory of Systems Medicine for Cancer, Institute for Personalized Medicine, Shanghai Jiao Tong University, Shanghai, China
| | - Boqian Wang
- Shanghai 6th People's Hospital, School of Medicine and School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai, China; State Key Laboratory of Systems Medicine for Cancer, Institute for Personalized Medicine, Shanghai Jiao Tong University, Shanghai, China
| | - Jessica
- Shanghai 6th People's Hospital, School of Medicine and School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai, China; State Key Laboratory of Systems Medicine for Cancer, Institute for Personalized Medicine, Shanghai Jiao Tong University, Shanghai, China
| | - Behafarid Ghalandari
- Shanghai 6th People's Hospital, School of Medicine and School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai, China; State Key Laboratory of Systems Medicine for Cancer, Institute for Personalized Medicine, Shanghai Jiao Tong University, Shanghai, China
| | - Youming Chen
- Shanghai 6th People's Hospital, School of Medicine and School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai, China; State Key Laboratory of Systems Medicine for Cancer, Institute for Personalized Medicine, Shanghai Jiao Tong University, Shanghai, China
| | - Zhixiao Xu
- Shanghai 6th People's Hospital, School of Medicine and School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai, China; State Key Laboratory of Systems Medicine for Cancer, Institute for Personalized Medicine, Shanghai Jiao Tong University, Shanghai, China
| | - Quanhong Zhou
- Shanghai 6th People's Hospital, School of Medicine and School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai, China
| | - Xianting Ding
- Shanghai 6th People's Hospital, School of Medicine and School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai, China; State Key Laboratory of Systems Medicine for Cancer, Institute for Personalized Medicine, Shanghai Jiao Tong University, Shanghai, China.
| |
Collapse
|
2
|
Gaudreault F, Sulea T, Corbeil CR. AI-augmented physics-based docking for antibody-antigen complex prediction. Bioinformatics 2025; 41:btaf129. [PMID: 40135432 PMCID: PMC11978387 DOI: 10.1093/bioinformatics/btaf129] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2024] [Revised: 03/13/2025] [Accepted: 03/21/2025] [Indexed: 03/27/2025] Open
Abstract
MOTIVATION Predicting the structure of antibody-antigen complexes is a challenging task with significant implications for the design of better antibody therapeutics. However, the levels of success have remained dauntingly low, particularly when high standards for model quality are required, a necessity for efficient antibody design. Artificial intelligence (AI) has significantly impacted the landscape of structure prediction for antibodies, both alone and in complex with their antigens. METHODS We utilized AI-guided antibody modeling tools to generate ensembles displaying diversity in the complementarity-determining region (CDR) and integrated those into our previously published AlphaFold2-rescored docking pipeline, a strategy called AI-augmented physics-based docking. In this study, we also compare docking performance with AlphaFold and Boltz-1, the new state-of-the-art. We distinguish between two types of success tailored to specific downstream applications: (i) criteria sufficient for epitope mapping, where gross quality is adequate and can complement experimental techniques, and (ii) criteria for producing higher-quality models suitable for engineering purposes. RESULTS We highlight that the quality of the ensemble is crucial for docking performance, that including too many models can be detrimental, and that prioritization of models is essential for achieving good performance. In a scenario analogous to docking using a crystallized antigen, our results robustly demonstrate the advantages of AI-augmented docking over AlphaFold2, further accentuated when higher standards in quality are imposed. Docking also shows improvements over Boltz-1, but those are less pronounced. Docking performance is still noticeably lower than AlphaFold3 in both epitope mapping and antibody design use cases. We observe a strong dependence on CDR-H3 loop length for physics-based tools on their ability to successfully predict. This helps define an applicability range where physics-based docking can be competitive to the newer generation of AI tools. AVAILABILITY AND IMPLEMENTATION The AF2 rescoring scripts are available at github.com/gaudreaultfnrc/AF2-Rescoring.
Collapse
Affiliation(s)
- Francis Gaudreault
- Human Health Therapeutics Research Centre, National Research Council Canada, Montreal, Quebec H4P 2R2, Canada
| | - Traian Sulea
- Human Health Therapeutics Research Centre, National Research Council Canada, Montreal, Quebec H4P 2R2, Canada
- Institute of Parasitology, McGill University, Sainte-Anne-de-Bellevue, Quebec H9X 3V9, Canada
| | - Christopher R Corbeil
- Human Health Therapeutics Research Centre, National Research Council Canada, Montreal, Quebec H4P 2R2, Canada
- Department of Biochemistry, McGill University, Montreal, Quebec H3A 1A3, Canada
| |
Collapse
|
3
|
Ströbaek J, Tang D, Gueto-Tettay C, Gomez Toledo A, Olofsson B, Hartman E, Heusel M, Malmström J, Malmström L. Epitope Mapping with Sidewinder: An XL-MS and Structural Modeling Approach. Int J Mol Sci 2025; 26:1488. [PMID: 40003954 PMCID: PMC11855800 DOI: 10.3390/ijms26041488] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2025] [Revised: 02/06/2025] [Accepted: 02/09/2025] [Indexed: 02/27/2025] Open
Abstract
Antibodies are critical to the host's immune defense against bacterial pathogens. Understanding the mechanisms of antibody-antigen interactions is essential for developing new targeted immunotherapies. Building computational workflows that can identify where an antibody binds its cognate antigen and deconvoluting the interaction interface in a high-throughput manner are critical for advancing this field. Cross-linking mass spectrometry (XL-MS) integrated with structural modeling offers a flexible and high-resolution strategy to map protein-protein interactions from low sample amounts. However, cross-linking and in silico modeling have limitations that require robust analytical workflows to make accurate inferences. In this study, we introduce Sidewinder, a modular high-throughput pipeline combining state-of-the-art computational structural prediction and molecular docking with rapid XL-MS analysis, enabling comprehensive interrogation of antibody-antigen systems. We validated this pipeline on antibodies targeting two Streptococcus pyogenes virulence factors. Using recently published data, we identified a well-defined monoclonal antibody epitope on Streptolysin O by generating and querying a large ensemble of interaction models probabilistically. We also showcased the utility of the Sidewinder pipeline by analyzing a more complex system, involving monoclonal antibodies that target the cell wall-anchored M1 protein. The flexibility and robustness of the Sidewinder pipeline provide a powerful framework for future studies of complex antibody-antigen systems, potentially leading to new therapeutic strategies.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | - Lars Malmström
- Department of Clinical Sciences Lund, Infection Medicine, Faculty of Medicine, Lund University, 221 84 Lund, Sweden; (J.S.); (D.T.); (J.M.)
| |
Collapse
|
4
|
Shirali A, Stebliankin V, Karki U, Shi J, Chapagain P, Narasimhan G. A comprehensive survey of scoring functions for protein docking models. BMC Bioinformatics 2025; 26:25. [PMID: 39844036 PMCID: PMC11755896 DOI: 10.1186/s12859-024-05991-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2024] [Accepted: 11/18/2024] [Indexed: 01/24/2025] Open
Abstract
BACKGROUND While protein-protein docking is fundamental to our understanding of how proteins interact, scoring protein-protein complex conformations is a critical component of successful docking programs. Without accurate and efficient scoring functions to differentiate between native and non-native binding complexes, the accuracy of current docking tools cannot be guaranteed. Although many innovative scoring functions have been proposed, a good scoring function for docking remains elusive. Deep learning models offer alternatives to using explicit empirical or mathematical functions for scoring protein-protein complexes. RESULTS In this study, we perform a comprehensive survey of the state-of-the-art scoring functions by considering the most popular and highly performant approaches, both classical and deep learning-based, for scoring protein-protein complexes. The methods were also compared based on their runtime as it directly impacts their use in large-scale docking applications. CONCLUSIONS We evaluate the strengths and weaknesses of classical and deep learning-based approaches across seven public and popular datasets to aid researchers in understanding the progress made in this field.
Collapse
Affiliation(s)
- Azam Shirali
- Bioinformatics Research Group (BioRG), Knight Foundation School of Computing and Information Sciences, Florida International University, 11200 SW 8th 10 St, Miami, 33199, USA
| | - Vitalii Stebliankin
- Bioinformatics Research Group (BioRG), Knight Foundation School of Computing and Information Sciences, Florida International University, 11200 SW 8th 10 St, Miami, 33199, USA
| | - Ukesh Karki
- Department of Physics, Florida International University, 11200 SW 8th 10 St, Miami, 33199, USA
| | - Jimeng Shi
- Bioinformatics Research Group (BioRG), Knight Foundation School of Computing and Information Sciences, Florida International University, 11200 SW 8th 10 St, Miami, 33199, USA
| | - Prem Chapagain
- Department of Physics, Florida International University, 11200 SW 8th 10 St, Miami, 33199, USA
- Biomolecular Sciences Institute, Florida International University, 11200 SW 8th St, Miami, 33199, USA
| | - Giri Narasimhan
- Bioinformatics Research Group (BioRG), Knight Foundation School of Computing and Information Sciences, Florida International University, 11200 SW 8th 10 St, Miami, 33199, USA.
- Biomolecular Sciences Institute, Florida International University, 11200 SW 8th St, Miami, 33199, USA.
| |
Collapse
|
5
|
Nguyen QT, Mai TT, Tuong LT, Nguyen TTN, Vo TP, Nguyen DN, Phan-Van CT, Trinh DTT, Tran VT, Thai KM. Binding ability of Delta and Omicron towards the angiotensin-converting enzyme 2 receptor and antibodies: a computational study. J Biomol Struct Dyn 2025:1-13. [PMID: 39755961 DOI: 10.1080/07391102.2024.2446659] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2023] [Accepted: 08/31/2024] [Indexed: 01/07/2025]
Abstract
The COVID-19 pandemic posed a threat to global society. Delta and Omicron are concerning variants due to the risk of increasing human-to-human transmissibility and immune evasion. This study aims to evaluate the binding ability of these variants toward the angiotensin-converting enzyme 2 receptor and antibodies using a computational approach. The receptor-binding domain (RBD) of the two variants was created by CHARMM-GUI and then docked to the hACE2 receptor and two antibodies (REGN10933 and REGN10987). These complexes were also subjected to molecular dynamics simulation within 100 ns. As a result, the two variants, Omicron and Delta, exhibited stronger interaction with the hACE2 receptor than the wild type. The mutations in the RBD region also facilitated the virus's escape from antibody neutralization.
Collapse
Affiliation(s)
- Quoc-Thai Nguyen
- Faculty of Pharmacy, University of Medicine and Pharmacy at Ho Chi Minh City, Ho Chi Minh City, Vietnam
| | - Tan Thanh Mai
- Faculty of Pharmacy, University of Medicine and Pharmacy at Ho Chi Minh City, Ho Chi Minh City, Vietnam
| | - Lam-Truong Tuong
- Faculty of Pharmacy, University of Medicine and Pharmacy at Ho Chi Minh City, Ho Chi Minh City, Vietnam
| | - Thi-Thao-Nhung Nguyen
- University of Health Sciences, Vietnam National University Ho Chi Minh City, Ho Chi Minh City, Vietnam
| | - Thanh-Phuong Vo
- University of Health Sciences, Vietnam National University Ho Chi Minh City, Ho Chi Minh City, Vietnam
| | - Dac-Nhan Nguyen
- University of Health Sciences, Vietnam National University Ho Chi Minh City, Ho Chi Minh City, Vietnam
| | - Cong-Thanh Phan-Van
- Faculty of Pharmacy, University of Medicine and Pharmacy at Ho Chi Minh City, Ho Chi Minh City, Vietnam
| | | | - Van-Thanh Tran
- Faculty of Pharmacy, University of Medicine and Pharmacy at Ho Chi Minh City, Ho Chi Minh City, Vietnam
| | - Khac-Minh Thai
- University of Health Sciences, Vietnam National University Ho Chi Minh City, Ho Chi Minh City, Vietnam
| |
Collapse
|
6
|
Bhadra-Lobo S, Derevyanko G, Lamoureux G. Dock2D: Synthetic Data for the Molecular Recognition Problem. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2024; 21:2580-2586. [PMID: 38814763 DOI: 10.1109/tcbb.2024.3407477] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/01/2024]
Abstract
Predicting the physical interaction of proteins is a cornerstone problem in computational biology. New classes of learning-based algorithms are actively being developed, and are typically trained end-to-end on protein complex structures extracted from the Protein Data Bank. These training datasets tend to be large and difficult to use for prototyping and, unlike image or natural language datasets, they are not easily interpretable by non-experts. We present Dock2D-IP and Dock2D-IF, two "toy" datasets that can be used to select algorithms predicting protein-protein interactions-or any other type of molecular interactions. Using two-dimensional shapes as input, each example from Dock2D-IP ("interaction pose") describes the interaction pose of two shapes known to interact and each example from Dock2D-IF ("interaction fact") describes whether two shapes form a stable complex or not, regardless of how they bind. We propose a number of baseline solutions to the problem and show that the same underlying energy function can be learned either by solving the interaction pose task (formulated as an energy-minimization "docking" problem) or the fact-of-interaction task (formulated as a binding free energy estimation problem).
Collapse
|
7
|
Chu LS, Sarma S, Gray JJ. Unified Sampling and Ranking for Protein Docking with DFMDock. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.09.27.615401. [PMID: 39386449 PMCID: PMC11463455 DOI: 10.1101/2024.09.27.615401] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/12/2024]
Abstract
Diffusion models have shown promise in addressing the protein docking problem. Traditionally, these models are used solely for sampling docked poses, with a separate confidence model for ranking. We introduce DFMDock (Denoising Force Matching Dock), a diffusion model that unifies sampling and ranking within a single framework. DFMDock features two output heads: one for predicting forces and the other for predicting energies. The forces are trained using a denoising force matching objective, while the energy gradients are trained to align with the forces. This design enables our model to sample using the predicted forces and rank poses using the predicted energies, thereby eliminating the need for an additional confidence model. Our approach outperforms the previous diffusion model for protein docking, DiffDock-PP, with a sampling success rate of 44% compared to its 8%, and a Top- 1 ranking success rate of 16% compared to 0% on the Docking Benchmark 5.5 test set. In successful decoy cases, the DFMDock Energy forms a binding funnel similar to the physics-based Rosetta Energy, suggesting that DFMDock can capture the underlying energy landscape.
Collapse
Affiliation(s)
- Lee-Shin Chu
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Sudeep Sarma
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Jeffrey J Gray
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| |
Collapse
|
8
|
Aktaş E, Özdemir Özgentürk N. A comprehensive examination of ACE2 receptor and prediction of spike glycoprotein and ACE2 interaction based on in silico analysis of ACE2 receptor. J Biomol Struct Dyn 2024; 42:4412-4428. [PMID: 37349943 DOI: 10.1080/07391102.2023.2220814] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2023] [Accepted: 05/28/2023] [Indexed: 06/24/2023]
Abstract
The ACE2 receptor plays a vital role not only in the SARS-CoV-induced epidemic but also in various other diseases, including cardiovascular diseases and ARDS. While studies have explored the interactions between ACE2 and SARS-CoV proteins, comprehensive research utilizing bioinformatic tools on the ACE2 protein has been lacking. The one aim of present study was to extensively analyze the regions of the ACE2 protein. After utilizing all bioinformatics tools especially G104 and L108 regions on ACE2 were come forward. The results of our analysis revealed that possible mutations or deletions in the G104 and L108 regions play a critical role in both the biological functioning and the determination of the chemical-physical properties of ACE2. Additionally, these regions were found to be more susceptible to mutations or deletions compared to other regions of the ACE2 protein. Notably, the randomly selected peptide, LQQNGSSVLS (100-109), which includes G104 and L108, exhibited a crucial role in binding the RBD of the spike protein, as supported by docking scores. Furthermore, both MDs and iMODs results provided evidence that G104 and L108 influence the dynamics of ACE2-spike complexes. This study is expected to offer a new perspective on the ACE2-SARS-CoV interaction and other research areas where ACE2 plays a significant role, such as biotechnology (protein engineering, enzyme optimization), medicine (RAS, pulmonary and cardiac diseases), and basic research (structural motifs, stabilizing protein folds, or facilitating important inter molecular contacts, protein's proper structure and function).Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Emre Aktaş
- Faculty of Art and Science, Molecular Biology and Genetics, Yıldız Technical University, Istanbul, Turkey
| | - Nehir Özdemir Özgentürk
- Faculty of Art and Science, Molecular Biology and Genetics, Yıldız Technical University, Istanbul, Turkey
| |
Collapse
|
9
|
Bayarsaikhan B, Zsidó BZ, Börzsei R, Hetényi C. Efficient Refinement of Complex Structures of Flexible Histone Peptides Using Post-Docking Molecular Dynamics Protocols. Int J Mol Sci 2024; 25:5945. [PMID: 38892133 PMCID: PMC11172440 DOI: 10.3390/ijms25115945] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2024] [Revised: 05/26/2024] [Accepted: 05/27/2024] [Indexed: 06/21/2024] Open
Abstract
Histones are keys to many epigenetic events and their complexes have therapeutic and diagnostic importance. The determination of the structures of histone complexes is fundamental in the design of new drugs. Computational molecular docking is widely used for the prediction of target-ligand complexes. Large, linear peptides like the tail regions of histones are challenging ligands for docking due to their large conformational flexibility, extensive hydration, and weak interactions with the shallow binding pockets of their reader proteins. Thus, fast docking methods often fail to produce complex structures of such peptide ligands at a level appropriate for drug design. To address this challenge, and improve the structural quality of the docked complexes, post-docking refinement has been applied using various molecular dynamics (MD) approaches. However, a final consensus has not been reached on the desired MD refinement protocol. In this present study, MD refinement strategies were systematically explored on a set of problematic complexes of histone peptide ligands with relatively large errors in their docked geometries. Six protocols were compared that differ in their MD simulation parameters. In all cases, pre-MD hydration of the complex interface regions was applied to avoid the unwanted presence of empty cavities. The best-performing protocol achieved a median of 32% improvement over the docked structures in terms of the change in root mean squared deviations from the experimental references. The influence of structural factors and explicit hydration on the performance of post-docking MD refinements are also discussed to help with their implementation in future methods and applications.
Collapse
Affiliation(s)
- Bayartsetseg Bayarsaikhan
- Pharmacoinformatics Unit, Department of Pharmacology and Pharmacotherapy, Medical School, University of Pécs, Szigeti út 12, H-7624 Pécs, Hungary; (B.B.); (B.Z.Z.); (R.B.)
| | - Balázs Zoltán Zsidó
- Pharmacoinformatics Unit, Department of Pharmacology and Pharmacotherapy, Medical School, University of Pécs, Szigeti út 12, H-7624 Pécs, Hungary; (B.B.); (B.Z.Z.); (R.B.)
| | - Rita Börzsei
- Pharmacoinformatics Unit, Department of Pharmacology and Pharmacotherapy, Medical School, University of Pécs, Szigeti út 12, H-7624 Pécs, Hungary; (B.B.); (B.Z.Z.); (R.B.)
| | - Csaba Hetényi
- Pharmacoinformatics Unit, Department of Pharmacology and Pharmacotherapy, Medical School, University of Pécs, Szigeti út 12, H-7624 Pécs, Hungary; (B.B.); (B.Z.Z.); (R.B.)
- National Laboratory for Drug Research and Development, Magyar tudósok krt. 2, H-1117 Budapest, Hungary
| |
Collapse
|
10
|
Mslati H, Gentile F, Pandey M, Ban F, Cherkasov A. PROTACable Is an Integrative Computational Pipeline of 3-D Modeling and Deep Learning To Automate the De Novo Design of PROTACs. J Chem Inf Model 2024; 64:3034-3046. [PMID: 38504115 DOI: 10.1021/acs.jcim.3c01878] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/21/2024]
Abstract
Proteolysis-targeting chimeras (PROTACs) that engage two biological targets at once are a promising technology in degrading clinically relevant protein targets. Since factors that influence the biological activities of PROTACs are more complex than those of a small molecule drug, we explored a combination of computational chemistry and deep learning strategies to forecast PROTAC activity and enable automated design. A new method named PROTACable was developed for the de novo design of PROTACs, which includes a robust 3-D modeling workflow to model PROTAC ternary complexes using a library of E3 ligase and linker and an SE(3)-equivariant graph transformer network to predict the activity of newly designed PROTACs. PROTACable is available at https://github.com/giaguaro/PROTACable/.
Collapse
Affiliation(s)
- Hazem Mslati
- Vancouver Prostate Centre, The University of British Columbia, Vancouver, British Columbia V6H 3Z6, Canada
| | - Francesco Gentile
- Department of Chemistry and Biomolecular Sciences, University of Ottawa, Ottawa, Ontario K1N 6N5, Canada
- Ottawa Institute of Systems Biology, Ottawa, Ontario K1N 6N5, Canada
| | - Mohit Pandey
- Vancouver Prostate Centre, The University of British Columbia, Vancouver, British Columbia V6H 3Z6, Canada
| | - Fuqiang Ban
- Vancouver Prostate Centre, The University of British Columbia, Vancouver, British Columbia V6H 3Z6, Canada
| | - Artem Cherkasov
- Vancouver Prostate Centre, The University of British Columbia, Vancouver, British Columbia V6H 3Z6, Canada
| |
Collapse
|
11
|
Cabello AL, Wells K, Peng W, Feng HQ, Wang J, Meyer DF, Noroy C, Zhao ES, Zhang H, Li X, Chang H, Gomez G, Mao Y, Patrick KL, Watson RO, Russell WK, Yu A, Zhong J, Guo F, Li M, Zhou M, Qian X, Kobayashi KS, Song J, Panthee S, Mechref Y, Ficht TA, Qin QM, de Figueiredo P. Brucella-driven host N-glycome remodeling controls infection. Cell Host Microbe 2024; 32:588-605.e9. [PMID: 38531364 DOI: 10.1016/j.chom.2024.03.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2022] [Revised: 08/28/2023] [Accepted: 03/06/2024] [Indexed: 03/28/2024]
Abstract
Many powerful methods have been employed to elucidate the global transcriptomic, proteomic, or metabolic responses to pathogen-infected host cells. However, the host glycome responses to bacterial infection remain largely unexplored, and hence, our understanding of the molecular mechanisms by which bacterial pathogens manipulate the host glycome to favor infection remains incomplete. Here, we address this gap by performing a systematic analysis of the host glycome during infection by the bacterial pathogen Brucella spp. that cause brucellosis. We discover, surprisingly, that a Brucella effector protein (EP) Rhg1 induces global reprogramming of the host cell N-glycome by interacting with components of the oligosaccharide transferase complex that controls N-linked protein glycosylation, and Rhg1 regulates Brucella replication and tissue colonization in a mouse model of brucellosis, demonstrating that Brucella exploits the EP Rhg1 to reprogram the host N-glycome and promote bacterial intracellular parasitism, thereby providing a paradigm for bacterial control of host cell infection.
Collapse
Affiliation(s)
- Ana-Lucia Cabello
- Department of Veterinary Pathobiology, Texas A&M University, College Station, TX 77843, USA; Department of Microbial Pathogenesis and Immunology, Texas A&M Health Science Center, Bryan, TX 77807, USA
| | - Kelsey Wells
- Christopher S. Bond Life Sciences Center, Department of Molecular Microbiology and Immunology, School of Medicine, The University of Missouri, Columbia, MO 65211, USA
| | - Wenjing Peng
- Department of Chemistry and Biochemistry, Texas Tech University, Lubbock, TX 79409, USA
| | - Hui-Qiang Feng
- Department of Microbial Pathogenesis and Immunology, Texas A&M Health Science Center, Bryan, TX 77807, USA
| | - Junyao Wang
- Department of Chemistry and Biochemistry, Texas Tech University, Lubbock, TX 79409, USA
| | - Damien F Meyer
- CIRAD, UMR ASTRE, 97170 Petit-Bourg, Guadeloupe, France; ASTRE, University Montpellier, CIRAD, INRAE, Montpellier, France
| | - Christophe Noroy
- CIRAD, UMR ASTRE, 97170 Petit-Bourg, Guadeloupe, France; ASTRE, University Montpellier, CIRAD, INRAE, Montpellier, France
| | - En-Shuang Zhao
- College of Computer Science and Technology, Jilin University, Changchun 130012, China
| | - Hao Zhang
- College of Computer Science and Technology, Jilin University, Changchun 130012, China
| | - Xueqing Li
- College of Computer Science and Technology, Jilin University, Changchun 130012, China
| | - Haowu Chang
- College of Computer Science and Technology, Jilin University, Changchun 130012, China
| | - Gabriel Gomez
- Texas A&M Veterinary Medical Diagnostic Laboratory (TVMDL), Texas A&M University, College Station, TX 77843, USA
| | - Yuxin Mao
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853-2703, USA
| | - Kristin L Patrick
- Department of Microbial Pathogenesis and Immunology, Texas A&M Health Science Center, Bryan, TX 77807, USA
| | - Robert O Watson
- Department of Microbial Pathogenesis and Immunology, Texas A&M Health Science Center, Bryan, TX 77807, USA
| | - William K Russell
- Department of Biochemistry & Molecular Biology, University of Texas Medical Branch, Galveston, TX 77555-0635, USA
| | - Aiying Yu
- Department of Chemistry and Biochemistry, Texas Tech University, Lubbock, TX 79409, USA
| | - Jieqiang Zhong
- Department of Chemistry and Biochemistry, Texas Tech University, Lubbock, TX 79409, USA
| | - Fengguang Guo
- Department of Microbial Pathogenesis and Immunology, Texas A&M Health Science Center, Bryan, TX 77807, USA
| | - Mingqian Li
- Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX 78843, USA
| | - Mingyuan Zhou
- Department of Information, Risk, and Operations Management, Department of Statistics and Data Sciences, The University of Texas at Austin, Austin, TX 78712, USA
| | - Xiaoning Qian
- Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX 78843, USA; TEES-AgriLife Center for Bioinformatics & Genomic Systems Engineering, Texas A&M University, College Station, TX 77843, USA
| | - Koichi S Kobayashi
- Department of Microbial Pathogenesis and Immunology, Texas A&M Health Science Center, Bryan, TX 77807, USA; Department of Immunology, Hokkaido University Graduate School of Medicine, Sapporo 060-8638, Japan; Hokkaido University, Institute for Vaccine Research and Development (HU-IVReD), Sapporo 060-8638, Japan
| | - Jianxun Song
- Department of Microbial Pathogenesis and Immunology, Texas A&M Health Science Center, Bryan, TX 77807, USA
| | - Suresh Panthee
- Department of Microbial Pathogenesis and Immunology, Texas A&M Health Science Center, Bryan, TX 77807, USA
| | - Yehia Mechref
- Department of Chemistry and Biochemistry, Texas Tech University, Lubbock, TX 79409, USA.
| | - Thomas A Ficht
- Department of Veterinary Pathobiology, Texas A&M University, College Station, TX 77843, USA.
| | - Qing-Ming Qin
- Christopher S. Bond Life Sciences Center, Department of Molecular Microbiology and Immunology, School of Medicine, The University of Missouri, Columbia, MO 65211, USA.
| | - Paul de Figueiredo
- Christopher S. Bond Life Sciences Center, Department of Molecular Microbiology and Immunology, School of Medicine, The University of Missouri, Columbia, MO 65211, USA; Department of Veterinary Pathobiology, The University of Missouri, Columbia, MO 65211, USA.
| |
Collapse
|
12
|
Lin P, Li H, Huang SY. Deep learning in modeling protein complex structures: From contact prediction to end-to-end approaches. Curr Opin Struct Biol 2024; 85:102789. [PMID: 38402744 DOI: 10.1016/j.sbi.2024.102789] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2023] [Revised: 01/16/2024] [Accepted: 02/06/2024] [Indexed: 02/27/2024]
Abstract
Protein-protein interactions play crucial roles in many biological processes. Traditionally, protein complex structures are normally built by protein-protein docking. With the rapid development of artificial intelligence and its great success in monomer protein structure prediction, deep learning has widely been applied to modeling protein-protein complex structures through inter-protein contact prediction and end-to-end approaches in the past few years. This article reviews the recent advances of deep-learning-based approaches in modeling protein-protein complex structures as well as their advantages and limitations. Challenges and possible future directions are also briefly discussed in applying deep learning for the prediction of protein complex structures.
Collapse
Affiliation(s)
- Peicong Lin
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, PR China
| | - Hao Li
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, PR China
| | - Sheng-You Huang
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, PR China.
| |
Collapse
|
13
|
Chu L, Ruffolo JA, Harmalkar A, Gray JJ. Flexible protein-protein docking with a multitrack iterative transformer. Protein Sci 2024; 33:e4862. [PMID: 38148272 PMCID: PMC10804679 DOI: 10.1002/pro.4862] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2023] [Revised: 11/17/2023] [Accepted: 12/06/2023] [Indexed: 12/28/2023]
Abstract
Conventional protein-protein docking algorithms usually rely on heavy candidate sampling and reranking, but these steps are time-consuming and hinder applications that require high-throughput complex structure prediction, for example, structure-based virtual screening. Existing deep learning methods for protein-protein docking, despite being much faster, suffer from low docking success rates. In addition, they simplify the problem to assume no conformational changes within any protein upon binding (rigid docking). This assumption precludes applications when binding-induced conformational changes play a role, such as allosteric inhibition or docking from uncertain unbound model structures. To address these limitations, we present GeoDock, a multitrack iterative transformer network to predict a docked structure from separate docking partners. Unlike deep learning models for protein structure prediction that input multiple sequence alignments, GeoDock inputs just the sequences and structures of the docking partners, which suits the tasks when the individual structures are given. GeoDock is flexible at the protein residue level, allowing the prediction of conformational changes upon binding. On the Database of Interacting Protein Structures (DIPS) test set, GeoDock achieves a 43% top-1 success rate, outperforming all other tested methods. However, in the standard DIPS train/test splits, we discovered contamination of close homologs in the training set. After decontaminating the training set, the success rate is 31%. On the DB5.5 test set and a benchmark dataset of antibody-antigen complexes, GeoDock outperforms the deep learning models trained using the same dataset but falls behind most of the conventional methods and AlphaFold-Multimer. GeoDock attains an average inference speed of under 1 s on a single GPU, enabling its application in large-scale structure screening. Although binding-induced conformational changes are still a challenge owing to limited training and evaluation data, our architecture sets up the foundation to capture this backbone flexibility. Code and a demonstration Jupyter notebook are available at https://github.com/Graylab/GeoDock.
Collapse
Affiliation(s)
- Lee‐Shin Chu
- Department of Chemical and Biomolecular EngineeringJohns Hopkins UniversityBaltimoreMarylandUSA
| | - Jeffrey A. Ruffolo
- Program in Molecular BiophysicsJohns Hopkins UniversityBaltimoreMarylandUSA
| | - Ameya Harmalkar
- Department of Chemical and Biomolecular EngineeringJohns Hopkins UniversityBaltimoreMarylandUSA
| | - Jeffrey J. Gray
- Department of Chemical and Biomolecular EngineeringJohns Hopkins UniversityBaltimoreMarylandUSA
- Program in Molecular BiophysicsJohns Hopkins UniversityBaltimoreMarylandUSA
| |
Collapse
|
14
|
Kuder KJ. Docking Foundations: From Rigid to Flexible Docking. Methods Mol Biol 2024; 2780:3-14. [PMID: 38987460 DOI: 10.1007/978-1-0716-3985-6_1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/12/2024]
Abstract
Despite the development of methods for the experimental determination of protein structures, the dissonance between the number of known sequences and their solved structures is still enormous. This is particularly evident in protein-protein complexes. To fill this gap, diverse technologies have been developed to study protein-protein interactions (PPIs) in a cellular context including a range of biological and computational methods. The latter derive from techniques originally published and applied almost half a century ago and are based on interdisciplinary knowledge from the nexus of the fields of biology, chemistry, and physics about protein sequences, structures, and their folding. Protein-protein docking, the main protagonist of this chapter, is routinely treated as an integral part of protein research. Herein, we describe the basic foundations of the whole process in general terms, but step by step from protein representations through docking methods and evaluation of complexes to their final validation.
Collapse
Affiliation(s)
- Kamil J Kuder
- Department of Technology and Biotechnology of Drugs, Faculty of Pharmacy, Jagiellonian University Medical College, Kraków, Poland.
| |
Collapse
|
15
|
Asim A. Approaches to Backbone Flexibility in Protein-Protein Docking. Methods Mol Biol 2024; 2780:45-68. [PMID: 38987463 DOI: 10.1007/978-1-0716-3985-6_4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/12/2024]
Abstract
Proteins are the fundamental organic macromolecules in living systems that play a key role in a variety of biological functions including immunological detection, intracellular trafficking, and signal transduction. The docking of proteins has greatly advanced during recent decades and has become a crucial complement to experimental methods. Protein-protein docking is a helpful method for simulating protein complexes whose structures have not yet been solved experimentally. This chapter focuses on major search tactics along with various docking programs used in protein-protein docking algorithms, which include: direct search, exhaustive global search, local shape feature matching, randomized search, and broad category of post-docking approaches. As backbone flexibility predictions and interactions in high-resolution protein-protein docking remain important issues in the overall optimization context, we have put forward several methods and solutions used to handle backbone flexibility. In addition, various docking methods that are utilized for flexible backbone docking, including ATTRACT, FlexDock, FLIPDock, HADDOCK, RosettaDock, FiberDock, etc., along with their scoring functions, algorithms, advantages, and limitations are discussed. Moreover, what progress in search technology is expected, including not only the creation of new search algorithms but also the enhancement of existing ones, has been debated. As conformational flexibility is one of the most crucial factors affecting docking success, more work should be put into evaluating the conformational flexibility upon binding for a particular case in addition to developing new algorithms to replace the rigid body docking and scoring approach.
Collapse
Affiliation(s)
- Ayesha Asim
- Department of Synthesis and Chemical Technology of Pharmaceutical Substances with Computer Modeling Laboratory, Faculty of Pharmacy, Medical University of Lublin, Lublin, Poland
| |
Collapse
|
16
|
Zeng C, Jian Y, Zhuo C, Li A, Zeng C, Zhao Y. Evaluation of DNA-protein complex structures using the deep learning method. Phys Chem Chem Phys 2023; 26:130-143. [PMID: 38063012 DOI: 10.1039/d3cp04980a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2023]
Abstract
Biological processes such as transcription, repair, and regulation require interactions between DNA and proteins. To unravel their functions, it is imperative to determine the high-resolution structures of DNA-protein complexes. However, experimental methods for this purpose are costly and technically demanding. Consequently, there is an urgent need for computational techniques to identify the structures of DNA-protein complexes. Despite technological advancements, accurately identifying DNA-protein complexes through computational methods still poses a challenge. Our team has developed a cutting-edge deep-learning approach called DDPScore that assesses DNA-protein complex structures. DDPScore utilizes a 4D convolutional neural network to overcome limited training data. This approach effectively captures local and global features while comprehensively considering the conformational changes arising from the flexibility during the DNA-protein docking process. DDPScore consistently outperformed the available methods in comprehensive DNA-protein complex docking evaluations, even for the flexible docking challenges. DDPScore has a wide range of applications in predicting and designing structures of DNA-protein complexes.
Collapse
Affiliation(s)
- Chengwei Zeng
- Institute of Biophysics and Department of Physics, Central China Normal University, Wuhan, 430079, China.
| | - Yiren Jian
- Department of Computer Science, Dartmouth College, Hanover, NH 03755, USA
| | - Chen Zhuo
- Institute of Biophysics and Department of Physics, Central China Normal University, Wuhan, 430079, China.
| | - Anbang Li
- Institute of Biophysics and Department of Physics, Central China Normal University, Wuhan, 430079, China.
| | - Chen Zeng
- Department of Physics, The George Washington University, Washington, DC 20052, USA
| | - Yunjie Zhao
- Institute of Biophysics and Department of Physics, Central China Normal University, Wuhan, 430079, China.
| |
Collapse
|
17
|
Meng Q, Guo F, Wang E, Tang J. ComDock: A novel approach for protein-protein docking with an efficient fusing strategy. Comput Biol Med 2023; 167:107660. [PMID: 37944303 DOI: 10.1016/j.compbiomed.2023.107660] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Revised: 10/08/2023] [Accepted: 10/31/2023] [Indexed: 11/12/2023]
Abstract
Protein-protein interaction plays an important role in studying the mechanism of protein functions from the structural perspective. Molecular docking is a powerful approach to detect protein-protein complexes using computational tools, due to the high cost and time-consuming of the traditional experimental methods. Among existing technologies, the template-based method utilizes the structural information of known homologous 3D complexes as available and reliable templates to achieve high accuracy and low computational complexity. However, the performance of the template-based method depends on the quality and quantity of templates. When insufficient or even no templates, the ab initio docking method is necessary and largely enriches the docking conformations. Therefore, it's a feasible strategy to fuse the effectivity of the template-based model and the universality of ab initio model to improve the docking performance. In this study, we construct a new, diverse, comprehensive template library derived from PDB, containing 77,685 complexes. We propose a template-based method (named TemDock), which retrieves the evolutionary relationship between the target sequence and samples in the template library and transfers similar structural information. Then, the target structure is built by superposing on the homologous template complex with TM-align. Moreover, we develop a consensus-based method (named ComDock) to integrate our TemDock and an existing ab initio method (ZDOCK). On 105 targets with templates from Benchmark 5.0, the TemDock and ComDock achieve a success rate of 68.57 % and 71.43 % in the top 10 conformations, respectively. Compared with the HDOCK, ComDock obtains better I-RMSD of hit configurations on 9 targets and more hit models in the top 100 conformations. As an efficient method for protein-protein docking, the ComDock is expected to study protein-protein recognition and reveal the various biological passways that are critical for developing drug discovery. The final results are stored at https://github.com/guofei-tju/mqz_ComDock_docking.
Collapse
Affiliation(s)
- Qiaozhen Meng
- College of Intelligence and Computing, Tianjin University, Tianjin, China
| | - Fei Guo
- School of Computer Science and Engineering, Central South University, Changsha, China.
| | - Ercheng Wang
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang, China; Zhejiang Laboratory, Hangzhou, Zhejiang, China.
| | - Jijun Tang
- Shenzhen Institute of Advanced Technology of Chinese Academy of Sciences, Shenzhen, China.
| |
Collapse
|
18
|
Gaudreault F, Corbeil CR, Sulea T. Enhanced antibody-antigen structure prediction from molecular docking using AlphaFold2. Sci Rep 2023; 13:15107. [PMID: 37704686 PMCID: PMC10499836 DOI: 10.1038/s41598-023-42090-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2023] [Accepted: 09/05/2023] [Indexed: 09/15/2023] Open
Abstract
Predicting the structure of antibody-antigen complexes has tremendous value in biomedical research but unfortunately suffers from a poor performance in real-life applications. AlphaFold2 (AF2) has provided renewed hope for improvements in the field of protein-protein docking but has shown limited success against antibody-antigen complexes due to the lack of co-evolutionary constraints. In this study, we used physics-based protein docking methods for building decoy sets consisting of low-energy docking solutions that were either geometrically close to the native structure (positives) or not (negatives). The docking models were then fed into AF2 to assess their confidence with a novel composite score based on normalized pLDDT and pTMscore metrics after AF2 structural refinement. We show benefits of the AF2 composite score for rescoring docking poses both in terms of (1) classification of positives/negatives and of (2) success rates with particular emphasis on early enrichment. Docking models of at least medium quality present in the decoy set, but not necessarily highly ranked by docking methods, benefitted most from AF2 rescoring by experiencing large advances towards the top of the reranked list of models. These improvements, obtained without any calibration or novel methodologies, led to a notable level of performance in antibody-antigen unbound docking that was never achieved previously.
Collapse
Affiliation(s)
- Francis Gaudreault
- Human Health Therapeutics Research Centre, National Research Council Canada, 6100 Royalmount Avenue, Montreal, QC, H4P 2R2, Canada
| | - Christopher R Corbeil
- Human Health Therapeutics Research Centre, National Research Council Canada, 6100 Royalmount Avenue, Montreal, QC, H4P 2R2, Canada
| | - Traian Sulea
- Human Health Therapeutics Research Centre, National Research Council Canada, 6100 Royalmount Avenue, Montreal, QC, H4P 2R2, Canada.
- Institute of Parasitology, McGill University, 21111 Lakeshore Road, Sainte-Anne-de-Bellevue, QC, H9X 3V9, Canada.
| |
Collapse
|
19
|
Chu LS, Ruffolo JA, Harmalkar A, Gray JJ. Flexible Protein-Protein Docking with a Multi-Track Iterative Transformer. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.06.29.547134. [PMID: 37425754 PMCID: PMC10327054 DOI: 10.1101/2023.06.29.547134] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/11/2023]
Abstract
Conventional protein-protein docking algorithms usually rely on heavy candidate sampling and re-ranking, but these steps are time-consuming and hinder applications that require high-throughput complex structure prediction, e.g., structure-based virtual screening. Existing deep learning methods for protein-protein docking, despite being much faster, suffer from low docking success rates. In addition, they simplify the problem to assume no conformational changes within any protein upon binding (rigid docking). This assumption precludes applications when binding-induced conformational changes play a role, such as allosteric inhibition or docking from uncertain unbound model structures. To address these limitations, we present GeoDock, a multi-track iterative transformer network to predict a docked structure from separate docking partners. Unlike deep learning models for protein structure prediction that input multiple sequence alignments (MSAs), GeoDock inputs just the sequences and structures of the docking partners, which suits the tasks when the individual structures are given. GeoDock is flexible at the protein residue level, allowing the prediction of conformational changes upon binding. For a benchmark set of rigid targets, GeoDock obtains a 41% success rate, outperforming all the other tested methods. For a more challenging benchmark set of flexible targets, GeoDock achieves a similar number of top-model successes as the traditional method ClusPro [1], but fewer than ReplicaDock2 [2]. GeoDock attains an average inference speed of under one second on a single GPU, enabling its application in large-scale structure screening. Although binding-induced conformational changes are still a challenge owing to limited training and evaluation data, our architecture sets up the foundation to capture this backbone flexibility. Code and a demonstration Jupyter notebook are available at https://github.com/Graylab/GeoDock.
Collapse
Affiliation(s)
- Lee-Shin Chu
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Jeffrey A Ruffolo
- Program in Molecular Biophysics, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Ameya Harmalkar
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Jeffrey J Gray
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
- Program in Molecular Biophysics, Johns Hopkins University, Baltimore, MD 21218, USA
| |
Collapse
|
20
|
McFee M, Kim PM. GDockScore: a graph-based protein-protein docking scoring function. BIOINFORMATICS ADVANCES 2023; 3:vbad072. [PMID: 37359726 PMCID: PMC10290236 DOI: 10.1093/bioadv/vbad072] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/14/2023] [Revised: 05/30/2023] [Accepted: 06/10/2023] [Indexed: 06/28/2023]
Abstract
Summary Protein complexes play vital roles in a variety of biological processes, such as mediating biochemical reactions, the immune response and cell signalling, with 3D structure specifying function. Computational docking methods provide a means to determine the interface between two complexed polypeptide chains without using time-consuming experimental techniques. The docking process requires the optimal solution to be selected with a scoring function. Here, we propose a novel graph-based deep learning model that utilizes mathematical graph representations of proteins to learn a scoring function (GDockScore). GDockScore was pre-trained on docking outputs generated with the Protein Data Bank biounits and the RosettaDock protocol, and then fine-tuned on HADDOCK decoys generated on the ZDOCK Protein Docking Benchmark. GDockScore performs similarly to the Rosetta scoring function on docking decoys generated using the RosettaDock protocol. Furthermore, state-of-the-art is achieved on the CAPRI score set, a challenging dataset for developing docking scoring functions. Availability and implementation The model implementation is available at https://gitlab.com/mcfeemat/gdockscore. Supplementary information Supplementary data are available at Bioinformatics Advances online.
Collapse
Affiliation(s)
- Matthew McFee
- Department of Molecular Genetics, The University of Toronto, Toronto, ON M5S 1A8, Canada
- Donnelly Centre for Cellular and Biomolecular Research, The University of Toronto, Toronto, ON M5S 3E1, Canada
| | | |
Collapse
|
21
|
Barradas-Bautista D, Almajed A, Oliva R, Kalnis P, Cavallo L. Improving classification of correct and incorrect protein-protein docking models by augmenting the training set. BIOINFORMATICS ADVANCES 2023; 3:vbad012. [PMID: 36789292 PMCID: PMC9923443 DOI: 10.1093/bioadv/vbad012] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/27/2022] [Revised: 01/20/2023] [Accepted: 02/01/2023] [Indexed: 02/04/2023]
Abstract
Motivation Protein-protein interactions drive many relevant biological events, such as infection, replication and recognition. To control or engineer such events, we need to access the molecular details of the interaction provided by experimental 3D structures. However, such experiments take time and are expensive; moreover, the current technology cannot keep up with the high discovery rate of new interactions. Computational modeling, like protein-protein docking, can help to fill this gap by generating docking poses. Protein-protein docking generally consists of two parts, sampling and scoring. The sampling is an exhaustive search of the tridimensional space. The caveat of the sampling is that it generates a large number of incorrect poses, producing a highly unbalanced dataset. This limits the utility of the data to train machine learning classifiers. Results Using weak supervision, we developed a data augmentation method that we named hAIkal. Using hAIkal, we increased the labeled training data to train several algorithms. We trained and obtained different classifiers; the best classifier has 81% accuracy and 0.51 Matthews' correlation coefficient on the test set, surpassing the state-of-the-art scoring functions. Availability and implementation Docking models from Benchmark 5 are available at https://doi.org/10.5281/zenodo.4012018. Processed tabular data are available at https://repository.kaust.edu.sa/handle/10754/666961. Google colab is available at https://colab.research.google.com/drive/1vbVrJcQSf6\_C3jOAmZzgQbTpuJ5zC1RP?usp=sharing. Supplementary information Supplementary data are available at Bioinformatics Advances online.
Collapse
Affiliation(s)
| | - Ali Almajed
- Computer, Electrical and Mathematical Science and Engineering Division, Kaust Extreme Computing Center, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia
| | - Romina Oliva
- Department of Sciences and Technologies, University of Naples “Parthenope”, I-80143 Naples, Italy
| | - Panos Kalnis
- Computer, Electrical and Mathematical Science and Engineering Division, Kaust Extreme Computing Center, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia
| | - Luigi Cavallo
- Physical Sciences and Engineering Division, Kaust Catalysis Center, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia
| |
Collapse
|
22
|
Lin P, Yan Y, Huang SY. DeepHomo2.0: improved protein-protein contact prediction of homodimers by transformer-enhanced deep learning. Brief Bioinform 2023; 24:6849483. [PMID: 36440949 DOI: 10.1093/bib/bbac499] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Revised: 10/08/2022] [Accepted: 10/21/2022] [Indexed: 11/30/2022] Open
Abstract
Protein-protein interactions play an important role in many biological processes. However, although structure prediction for monomer proteins has achieved great progress with the advent of advanced deep learning algorithms like AlphaFold, the structure prediction for protein-protein complexes remains an open question. Taking advantage of the Transformer model of ESM-MSA, we have developed a deep learning-based model, named DeepHomo2.0, to predict protein-protein interactions of homodimeric complexes by leveraging the direct-coupling analysis (DCA) and Transformer features of sequences and the structure features of monomers. DeepHomo2.0 was extensively evaluated on diverse test sets and compared with eight state-of-the-art methods including protein language model-based, DCA-based and machine learning-based methods. It was shown that DeepHomo2.0 achieved a high precision of >70% with experimental monomer structures and >60% with predicted monomer structures for the top 10 predicted contacts on the test sets and outperformed the other eight methods. Moreover, even the version without using structure information, named DeepHomoSeq, still achieved a good precision of >55% for the top 10 predicted contacts. Integrating the predicted contacts into protein docking significantly improved the structure prediction of realistic Critical Assessment of Protein Structure Prediction homodimeric complexes. DeepHomo2.0 and DeepHomoSeq are available at http://huanglab.phys.hust.edu.cn/DeepHomo2/.
Collapse
Affiliation(s)
- Peicong Lin
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| | - Yumeng Yan
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| | - Sheng-You Huang
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| |
Collapse
|
23
|
Abstract
Computer-aided drug design (CADD) approaches are playing an increasingly important role in understanding the fundamentals of ligand-receptor interactions and helping medicinal chemists design therapeutics. About 5 years ago, we presented a chapter devoted to an overview of CADD methods and covered typical CADD protocols including structure-based drug design (SBDD) and ligand-based drug design (LBDD) approaches that were frequently used in the antibiotic drug design process. Advances in computational hardware and algorithms and emerging CADD methods are enhancing the accuracy and ability of CADD in drug design and development. In this chapter, an update to our previous chapter is provided with a focus on new CADD approaches from our laboratory and other peers that can be employed to facilitate the development of antibiotic therapeutics.
Collapse
Affiliation(s)
- Wenbo Yu
- Department of Pharmaceutical Sciences, Computer-Aided Drug Design Center, School of Pharmacy, University of Maryland, Baltimore, MD, USA.
- Institute for Bioscience and Biotechnology Research (IBBR), Rockville, MD, USA.
- Center for Biomolecular Therapeutics (CBT), School of Medicine, University of Maryland, Baltimore, MD, USA.
| | - David J Weber
- Institute for Bioscience and Biotechnology Research (IBBR), Rockville, MD, USA
- Center for Biomolecular Therapeutics (CBT), School of Medicine, University of Maryland, Baltimore, MD, USA
| | - Alexander D MacKerell
- Department of Pharmaceutical Sciences, Computer-Aided Drug Design Center, School of Pharmacy, University of Maryland, Baltimore, MD, USA.
- Institute for Bioscience and Biotechnology Research (IBBR), Rockville, MD, USA.
- Center for Biomolecular Therapeutics (CBT), School of Medicine, University of Maryland, Baltimore, MD, USA.
| |
Collapse
|
24
|
Tiwari S, Pandey VP, Yadav K, Dwivedi UN. Modulation of interaction of BRCA1-RAD51 and BRCA1-AURKA protein complexes by natural metabolites using as possible therapeutic intervention toward cardiotoxic effects of cancer drugs: an in-silico approach. J Biomol Struct Dyn 2022; 40:12863-12879. [PMID: 34632941 DOI: 10.1080/07391102.2021.1976278] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
Breast cancer type 1 susceptibility protein (BRCA1) plays an important role in maintaining genome stability and is known to interact with several proteins involved in cellular pathways, gene transcription regulation and DNA damage response. More than 40% of inherited breast cancer cases are due to BRCA1 mutation. It is also a prognostic marker in non-small cell lung cancer patients as well as a gatekeeper of cardiac function. Interaction of mutant BRCA1 with other proteins is known to disrupt the tumor suppression mechanism. Two directly interacting proteins with BRCA1 namely, DNA repair protein RAD51 (RAD51) and Aurora kinase A (AURKA), known to regulate homologous recombination (HR) and G/M cell cycle transition, respectively, form protein complex with both wild and mutant BRCA1. To analyze the interactions, protein-protein complexes were generated for each pair of proteins. In order to combat the cardiotoxic effects of cancer drugs, pharmacokinetically screened natural metabolites derived from plant, marine and bacterial sources and along with FDA-approved cancer drugs as control, were subjected to molecular docking. Piperoleine B and dihydrocircumin were the best docked natural metabolites in both RAD51 and AURKA complexes, respectively. Molecular dynamics simulation (MDS) analysis and binding free energy calculations for the best docked natural metabolite and drug for both the mutant BRCA1 complexes suggested better stability for the natural metabolites piperolein B and dihydrocurcumin as compared to drug. Thus, both natural metabolites could be further analyzed for their role against the cardiotoxic effects of cancer drugs through wet lab experiments.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Sameeksha Tiwari
- Department of Biochemistry, University of Lucknow, Lucknow, India
| | - Veda P Pandey
- Department of Biochemistry, University of Lucknow, Lucknow, India
| | - Kusum Yadav
- Department of Biochemistry, University of Lucknow, Lucknow, India
| | - Upendra N Dwivedi
- Department of Biochemistry, University of Lucknow, Lucknow, India.,Institute for Development of Advanced Computing, ONGC Centre for Advanced Studies, University of Lucknow, Lucknow, India
| |
Collapse
|
25
|
Zhang W, Meng Q, Wang J, Guo F. HDIContact: a novel predictor of residue-residue contacts on hetero-dimer interfaces via sequential information and transfer learning strategy. Brief Bioinform 2022; 23:6599074. [PMID: 35653713 DOI: 10.1093/bib/bbac169] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2022] [Revised: 03/07/2022] [Accepted: 04/16/2022] [Indexed: 11/12/2022] Open
Abstract
Proteins maintain the functional order of cell in life by interacting with other proteins. Determination of protein complex structural information gives biological insights for the research of diseases and drugs. Recently, a breakthrough has been made in protein monomer structure prediction. However, due to the limited number of the known protein structure and homologous sequences of complexes, the prediction of residue-residue contacts on hetero-dimer interfaces is still a challenge. In this study, we have developed a deep learning framework for inferring inter-protein residue contacts from sequential information, called HDIContact. We utilized transfer learning strategy to produce Multiple Sequence Alignment (MSA) two-dimensional (2D) embedding based on patterns of concatenated MSA, which could reduce the influence of noise on MSA caused by mismatched sequences or less homology. For MSA 2D embedding, HDIContact took advantage of Bi-directional Long Short-Term Memory (BiLSTM) with two-channel to capture 2D context of residue pairs. Our comprehensive assessment on the Escherichia coli (E. coli) test dataset showed that HDIContact outperformed other state-of-the-art methods, with top precision of 65.96%, the Area Under the Receiver Operating Characteristic curve (AUROC) of 83.08% and the Area Under the Precision Recall curve (AUPR) of 25.02%. In addition, we analyzed the potential of HDIContact for human-virus protein-protein complexes, by achieving top five precision of 80% on O75475-P04584 related to Human Immunodeficiency Virus. All experiments indicated that our method was a valuable technical tool for predicting inter-protein residue contacts, which would be helpful for understanding protein-protein interaction mechanisms.
Collapse
Affiliation(s)
- Wei Zhang
- School of Computer Science and Technology, College of Intelligence and Computing, Tianjin University, Tianjin, China
| | - Qiaozhen Meng
- School of Computer Science and Technology, College of Intelligence and Computing, Tianjin University, Tianjin, China
| | - Jianxin Wang
- School of Computer Science and Engineering, Central South University, Changsha 410083, China
| | - Fei Guo
- School of Computer Science and Engineering, Central South University, Changsha 410083, China
| |
Collapse
|
26
|
Si Y, Yan C. Protein complex structure prediction powered by multiple sequence alignments of interologs from multiple taxonomic ranks and AlphaFold2. Brief Bioinform 2022; 23:6596987. [PMID: 35649388 DOI: 10.1093/bib/bbac208] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2022] [Revised: 04/17/2022] [Accepted: 05/05/2022] [Indexed: 12/19/2022] Open
Abstract
AlphaFold2 can predict protein complex structures as long as a multiple sequence alignment (MSA) of the interologs of the target protein-protein interaction (PPI) can be provided. In this study, a simplified phylogeny-based approach was applied to generate the MSA of interologs, which was then used as the input to AlphaFold2 for protein complex structure prediction. In this extensively benchmarked protocol on nonredundant PPI dataset, including 107 bacterial PPIs and 442 eukaryotic PPIs, we show complex structures of 79.5% of the bacterial PPIs and 49.8% of the eukaryotic PPIs can be successfully predicted, which yielded significantly better performance than the application of MSA of interologs prepared by two existing approaches. Considering PPIs may not be conserved in species with long evolutionary distances, we further restricted interologs in the MSA to different taxonomic ranks of the species of the target PPI in protein complex structure prediction. We found that the success rates can be increased to 87.9% for the bacterial PPIs and 56.3% for the eukaryotic PPIs if interologs in the MSA are restricted to a specific taxonomic rank of the species of each target PPI. Finally, we show that the optimal taxonomic ranks for protein complex structure prediction can be selected with the application of the predicted template modeling (TM) scores of the output models.
Collapse
Affiliation(s)
- Yunda Si
- School of Physics, Huazhong University of Science and Technology, China
| | - Chengfei Yan
- School of Physics, Huazhong University of Science and Technology, China
| |
Collapse
|
27
|
Yurina V, Adianingsih OR. Predicting epitopes for vaccine development using bioinformatics tools. Ther Adv Vaccines Immunother 2022; 10:25151355221100218. [PMID: 35647486 PMCID: PMC9130818 DOI: 10.1177/25151355221100218] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2022] [Accepted: 04/14/2022] [Indexed: 11/20/2022] Open
Abstract
Epitope-based DNA vaccine development is one application of bioinformatics or
in silico studies, that is, computational methods,
including mathematical, chemical, and biological approaches, which are widely
used in drug development. Many in silico studies have been
conducted to analyze the efficacy, safety, toxicity effects, and interactions of
drugs. In the vaccine design process, in silico studies are
performed to predict epitopes that could trigger T-cell and B-cell reactions
that would produce both cellular and humoral immune responses. Immunoinformatics
is the branch of bioinformatics used to study the relationship between immune
responses and predicted epitopes. Progress in immunoinformatics has been rapid
and has led to the development of a variety of tools that are used for the
prediction of epitopes recognized by B cells or T cells as well as the antigenic
responses. However, the in silico approach to vaccine design is
still relatively new; thus, this review is aimed at increasing understanding of
the importance of in silico studies in the design of vaccines
and thereby facilitating future research in this field.
Collapse
Affiliation(s)
- Valentina Yurina
- Department of Pharmacy, Medical Faculty, Universitas Brawijaya, Jalan Veteran, Malang 65145, East Java, Indonesia
| | | |
Collapse
|
28
|
Guo L, He J, Lin P, Huang SY, Wang J. TRScore: a three-dimensional RepVGG-based scoring method for ranking protein docking models. Bioinformatics 2022; 38:2444-2451. [PMID: 35199137 DOI: 10.1093/bioinformatics/btac120] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2021] [Revised: 01/19/2022] [Accepted: 02/21/2022] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Protein-protein interactions (PPI) play important roles in cellular activities. Due to the technical difficulty and high cost of experimental methods, there are considerable interests towards the development of computational approaches, such as protein docking, to decipher PPI patterns. One of the important and difficult aspects in protein docking is recognizing near-native conformations from a set of decoys, but unfortunately traditional scoring functions still suffer from limited accuracy. Therefore, new scoring methods are pressingly needed in methodological and/or practical implications. RESULTS We present a new deep learning-based scoring method for ranking protein-protein docking models based on a three-dimensional (3D) RepVGG network, named TRScore. To recognize near-native conformations from a set of decoys, TRScore voxelizes the protein-protein interface into a 3D grid labeled by the number of atoms in different physicochemical classes. Benefiting from the deep convolutional RepVGG architecture, TRScore can effectively capture the subtle differences between energetically favorable near-native models and unfavorable non-native decoys without needing extra information. TRScore was extensively evaluated on diverse test sets including protein-protein docking benchmark 5.0 update set, DockGround decoy set, as well as realistic CAPRI decoy set, and overall obtained a significant improvement over existing methods in cross validation and independent evaluations. AVAILABILITY Codes available at: https://github.com/BioinformaticsCSU/TRScore.
Collapse
Affiliation(s)
- Linyuan Guo
- School of Computer Science, Central South University, Changsha, Hunan 410083, China
| | - Jiahua He
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, China
| | - Peicong Lin
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, China
| | - Sheng-You Huang
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, China
| | - Jianxin Wang
- School of Computer Science, Central South University, Changsha, Hunan 410083, China
| |
Collapse
|
29
|
Durojaye OA, Sedzro DM, Idris MO, Yekeen AA, Fadahunsi AA, Alakanse OS. Identification of a Potential mRNA-based Vaccine Candidate against the SARS-CoV-2 Spike Glycoprotein: A Reverse Vaccinology Approach. ChemistrySelect 2022; 7:e202103903. [PMID: 35601809 PMCID: PMC9111088 DOI: 10.1002/slct.202103903] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2021] [Accepted: 02/04/2022] [Indexed: 12/11/2022]
Abstract
The emergence of the novel coronavirus (SARS-CoV-2) in December 2019 has generated a devastating global consequence which makes the development of a rapidly deployable, effective and safe vaccine candidate an imminent global health priority. The design of most vaccine candidates has been directed at the induction of antibody responses against the trimeric spike glycoprotein of SARS-CoV-2, a class I fusion protein that aids ACE2 (angiotensin-converting enzyme 2) receptor binding. A variety of formulations and vaccinology approaches are being pursued for targeting the spike glycoprotein, including simian and human replication-defective adenoviral vaccines, subunit protein vaccines, nucleic acid vaccines and whole-inactivated SARS-CoV-2. Here, we directed a reverse vaccinology approach towards the design of a nucleic acid (mRNA-based) vaccine candidate. The "YLQPRTFLL" peptide sequence (position 269-277) which was predicted to be a B cell epitope and likewise a strong binder of the HLA*A-0201 was selected for the design of the vaccine candidate, having satisfied series of antigenicity assessments. Through the codon optimization protocol, the nucleotide sequence for the vaccine candidate design was generated and targeted at the human toll-like receptor 7 (TLR7). Bioinformatics analyses showed that the sequence "UACCUGCAGCCGCGUACCUUCCUGCUG" exhibited a strong affinity and likewise was bound to a stable cavity in the TLR7 pocket. This study is therefore expected to contribute to the research efforts directed at securing definitive preventive measures against the SARS-CoV-2 infection.
Collapse
Affiliation(s)
- Olanrewaju Ayodeji Durojaye
- MOE Key Laboratory of Membraneless Organelle and Cellular DynamicsHefei National Laboratory for Physical Sciences at the MicroscaleUniversity of Science and Technology of ChinaHefeiAnhui230027China
- School of Life SciencesUniversity of Science and Technology of ChinaHefeiAnhui230027China
- Department of Chemical SciencesCoal City University, EmeneEnugu StateNigeria
| | - Divine Mensah Sedzro
- MOE Key Laboratory of Membraneless Organelle and Cellular DynamicsHefei National Laboratory for Physical Sciences at the MicroscaleUniversity of Science and Technology of ChinaHefeiAnhui230027China
- School of Life SciencesUniversity of Science and Technology of ChinaHefeiAnhui230027China
| | | | - Abeeb Abiodun Yekeen
- School of Life SciencesUniversity of Science and Technology of ChinaHefeiAnhui230027China
| | - Adeola Abraham Fadahunsi
- Department of Biomedical EngineeringUniversity of Science and Technology of ChinaHefeiAnhui230027China
| | - Oluwaseun Suleiman Alakanse
- School of Life SciencesUniversity of Science and Technology of ChinaHefeiAnhui230027China
- Department of BiochemistryFaculty of Life SciencesUniversity of IlorinIlorinKwara StateNigeria
| |
Collapse
|
30
|
Elhabashy H, Merino F, Alva V, Kohlbacher O, Lupas AN. Exploring protein-protein interactions at the proteome level. Structure 2022; 30:462-475. [DOI: 10.1016/j.str.2022.02.004] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2021] [Revised: 10/26/2021] [Accepted: 02/02/2022] [Indexed: 02/08/2023]
|
31
|
Li H, Yan Y, Zhao X, Huang SY. Inclusion of Desolvation Energy into Protein–Protein Docking through Atomic Contact Potentials. J Chem Inf Model 2022; 62:740-750. [DOI: 10.1021/acs.jcim.1c01483] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Hao Li
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| | - Yumeng Yan
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| | - Xuejun Zhao
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| | - Sheng-You Huang
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| |
Collapse
|
32
|
Goel H, Hazel A, Yu W, Jo S, MacKerell AD. Application of Site-Identification by Ligand Competitive Saturation in Computer-Aided Drug Design. NEW J CHEM 2022; 46:919-932. [PMID: 35210743 PMCID: PMC8863107 DOI: 10.1039/d1nj04028f] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
Abstract
Site Identification by Ligand Competitive Saturation (SILCS) is a molecular simulation approach that uses diverse small solutes in aqueous solution to obtain functional group affinity patterns of a protein or other macromolecule. This involves employing a combined Grand Canonical Monte Carlo (GCMC)-molecular dynamics (MD) method to sample the full 3D space of the protein, including deep binding pockets and interior cavities from which functional group free energy maps (FragMaps) are obtained. The information content in the maps, which include contributions from protein flexibilty and both protein and functional group desolvation contributions, can be used in many aspects of the drug discovery process. These include identification of novel ligand binding pockets, including allosteric sites, pharmacophore modeling, prediction of relative protein-ligand binding affinities for database screening and lead optimization efforts, evaluation of protein-protein interactions as well as in the formulation of biologics-based drugs including monoclonal antibodies. The present article summarizes the various tools developed in the context of the SILCS methodology and their utility in computer-aided drug design (CADD) applications, showing how the SILCS toolset can improve the drug-development process on a number of fronts with respect to both accuracy and throughput representing a new avenue of CADD applications.
Collapse
Affiliation(s)
- Himanshu Goel
- Computer Aided Drug Design Center, Department of Pharmaceutical Sciences, University of Maryland School of Pharmacy, 20, Penn St. Baltimore, Maryland 21201, United States
| | - Anthony Hazel
- Computer Aided Drug Design Center, Department of Pharmaceutical Sciences, University of Maryland School of Pharmacy, 20, Penn St. Baltimore, Maryland 21201, United States
| | - Wenbo Yu
- Computer Aided Drug Design Center, Department of Pharmaceutical Sciences, University of Maryland School of Pharmacy, 20, Penn St. Baltimore, Maryland 21201, United States
| | - Sunhwan Jo
- SilcsBio LLC, 1100 Wicomico St. Suite 323, Baltimore, MD, 21230, United States
| | - Alexander D. MacKerell
- Computer Aided Drug Design Center, Department of Pharmaceutical Sciences, University of Maryland School of Pharmacy, 20, Penn St. Baltimore, Maryland 21201, United States., SilcsBio LLC, 1100 Wicomico St. Suite 323, Baltimore, MD, 21230, United States.,, Tel: 410-706-7442, Fax: 410-706-5017
| |
Collapse
|
33
|
Barradas-Bautista D, Cao Z, Vangone A, Oliva R, Cavallo L. A random forest classifier for protein-protein docking models. BIOINFORMATICS ADVANCES 2021; 2:vbab042. [PMID: 36699405 PMCID: PMC9710594 DOI: 10.1093/bioadv/vbab042] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/05/2021] [Revised: 11/11/2021] [Accepted: 12/06/2021] [Indexed: 01/28/2023]
Abstract
Herein, we present the results of a machine learning approach we developed to single out correct 3D docking models of protein-protein complexes obtained by popular docking software. To this aim, we generated 3 × 10 4 docking models for each of the 230 complexes in the protein-protein benchmark, version 5, using three different docking programs (HADDOCK, FTDock and ZDOCK), for a cumulative set of ≈ 7 × 10 6 docking models. Three different machine learning approaches (Random Forest, Supported Vector Machine and Perceptron) were used to train classifiers with 158 different scoring functions (features). The Random Forest algorithm outperformed the other two algorithms and was selected for further optimization. Using a features selection algorithm, and optimizing the random forest hyperparameters, allowed us to train and validate a random forest classifier, named COnservation Driven Expert System (CoDES). Testing of CoDES on independent datasets, as well as results of its comparative performance with machine learning methods recently developed in the field for the scoring of docking decoys, confirm its state-of-the-art ability to discriminate correct from incorrect decoys both in terms of global parameters and in terms of decoys ranked at the top positions. Supplementary information Supplementary data are available at Bioinformatics Advances online. Software and data availability statement The docking models are available at https://doi.org/10.5281/zenodo.4012018. The programs underlying this article will be shared on request to the corresponding authors.
Collapse
Affiliation(s)
- Didier Barradas-Bautista
- Kaust Catalysis Center, Physical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), 23955-6900 Thuwal, Saudi Arabia,To whom correspondence should be addressed. or or
| | - Zhen Cao
- Kaust Catalysis Center, Physical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), 23955-6900 Thuwal, Saudi Arabia
| | - Anna Vangone
- Pharma Research and Early Development, Therapeutic Modalities, Roche Innovation Center Munich Large Molecule Research, 82377 Penzberg, Germany
| | - Romina Oliva
- Department of Sciences and Technologies, University Parthenope of Naples, Centro Direzionale Isola C4, I-80143 Naples, Italy,To whom correspondence should be addressed. or or
| | - Luigi Cavallo
- Kaust Catalysis Center, Physical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), 23955-6900 Thuwal, Saudi Arabia,To whom correspondence should be addressed. or or
| |
Collapse
|
34
|
Feng Y, Yan Y, He J, Tao H, Wu Q, Huang SY. Docking and scoring for nucleic acid-ligand interactions: Principles and current status. Drug Discov Today 2021; 27:838-847. [PMID: 34718205 DOI: 10.1016/j.drudis.2021.10.013] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2021] [Revised: 09/06/2021] [Accepted: 10/20/2021] [Indexed: 12/24/2022]
Abstract
Nucleic acid (NA)-ligand interactions have crucial roles in many cellular processes and, thus, are increasingly attracting therapeutic interest in drug discovery. Molecular docking is a valuable tool for studying molecular interactions. However, because NAs differ significantly from proteins in both their physical and chemical properties, traditional docking algorithms and scoring functions for protein-ligand interactions might not be applicable to NA-ligand docking. Therefore, various sampling strategies and scoring functions for NA-ligand interactions have been developed. Here, we review the basic principles and current status of docking algorithms and scoring functions for DNA/RNA-ligand interactions. We also discuss challenges and limitations of current docking and scoring approaches.
Collapse
Affiliation(s)
- Yuyu Feng
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, PR China
| | - Yumeng Yan
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, PR China
| | - Jiahua He
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, PR China
| | - Huanyu Tao
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, PR China
| | - Qilong Wu
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, PR China
| | - Sheng-You Huang
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, PR China.
| |
Collapse
|
35
|
Feng Y, Zhang K, Wu Q, Huang SY. NLDock: a Fast Nucleic Acid-Ligand Docking Algorithm for Modeling RNA/DNA-Ligand Complexes. J Chem Inf Model 2021; 61:4771-4782. [PMID: 34468128 DOI: 10.1021/acs.jcim.1c00341] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Nucleic acid-ligand interactions play an important role in numerous cellular processes such as gene function expression and regulation. Therefore, nucleic acids such as RNAs have become more and more important drug targets, where the structural determination of nucleic acid-ligand complexes is pivotal for understanding their functions and thus developing therapeutic interventions. Molecular docking has been a useful computational tool in predicting the complex structure between molecules. However, although a number of docking algorithms have been developed for protein-ligand interactions, only a few docking programs were presented for nucleic acid-ligand interactions. Here, we have developed a fast nucleic acid-ligand docking algorithm, named NLDock, by implementing our intrinsic scoring function ITScoreNL for nucleic acid-ligand interactions into a modified version of the MDock program. NLDock was extensively evaluated on four test sets and compared with five other state-of-the-art docking algorithms including AutoDock, DOCK 6, rDock, GOLD, and Glide. It was shown that our NLDock algorithm obtained a significantly better performance than the other docking programs in binding mode predictions and achieved the success rates of 73%, 36%, and 32% on the largest test set of 77 complexes for local rigid-, local flexible-, and global flexible-ligand docking, respectively. In addition, our NLDock approach is also computationally efficient and consumed an average of as short as 0.97 and 2.08 min for a local flexible-ligand docking job and a global flexible-ligand docking job, respectively. These results suggest the good performance of our NLDock in both docking accuracy and computational efficiency.
Collapse
Affiliation(s)
- Yuyu Feng
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| | - Keqiong Zhang
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| | - Qilong Wu
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| | - Sheng-You Huang
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| |
Collapse
|
36
|
Yan Y, Huang SY. Accurate prediction of inter-protein residue-residue contacts for homo-oligomeric protein complexes. Brief Bioinform 2021; 22:bbab038. [PMID: 33693482 PMCID: PMC8425427 DOI: 10.1093/bib/bbab038] [Citation(s) in RCA: 41] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2020] [Revised: 01/09/2021] [Indexed: 12/14/2022] Open
Abstract
Protein-protein interactions play a fundamental role in all cellular processes. Therefore, determining the structure of protein-protein complexes is crucial to understand their molecular mechanisms and develop drugs targeting the protein-protein interactions. Recently, deep learning has led to a breakthrough in intra-protein contact prediction, achieving an unusual high accuracy in recent Critical Assessment of protein Structure Prediction (CASP) structure prediction challenges. However, due to the limited number of known homologous protein-protein interactions and the challenge to generate joint multiple sequence alignments of two interacting proteins, the advances in inter-protein contact prediction remain limited. Here, we have proposed a deep learning model to predict inter-protein residue-residue contacts across homo-oligomeric protein interfaces, named as DeepHomo. Unlike previous deep learning approaches, we integrated intra-protein distance map and inter-protein docking pattern, in addition to evolutionary coupling, sequence conservation, and physico-chemical information of monomers. DeepHomo was extensively tested on both experimentally determined structures and realistic CASP-Critical Assessment of Predicted Interaction (CAPRI) targets. It was shown that DeepHomo achieved a high precision of >60% for the top predicted contact and outperformed state-of-the-art direct-coupling analysis and machine learning-based approaches. Integrating predicted inter-chain contacts into protein-protein docking significantly improved the docking accuracy on the benchmark dataset of realistic homo-dimeric targets from CASP-CAPRI experiments. DeepHomo is available at http://huanglab.phys.hust.edu.cn/DeepHomo/.
Collapse
Affiliation(s)
- Yumeng Yan
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, PR China
| | - Sheng-You Huang
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, PR China
| |
Collapse
|
37
|
Agamennone M, Nicoli A, Bayer S, Weber V, Borro L, Gupta S, Fantacuzzi M, Di Pizio A. Protein-protein interactions at a glance: Protocols for the visualization of biomolecular interactions. Methods Cell Biol 2021; 166:271-307. [PMID: 34752337 DOI: 10.1016/bs.mcb.2021.06.012] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
Protein-protein interactions (PPIs) play a key role in many biological processes and are intriguing targets for drug discovery campaigns. Advancements in experimental and computational techniques are leading to a growth of data accessibility, and, with it, an increased need for the analysis of PPIs. In this respect, visualization tools are essential instruments to represent and analyze biomolecular interactions. In this chapter, we reviewed some of the available tools, highlighting their features, and describing their functions with practical information on their usage.
Collapse
Affiliation(s)
| | - Alessandro Nicoli
- Leibniz-Institute for Food Systems Biology at the Technical University of Munich, Freising, Germany
| | - Sebastian Bayer
- Leibniz-Institute for Food Systems Biology at the Technical University of Munich, Freising, Germany
| | - Verena Weber
- Leibniz-Institute for Food Systems Biology at the Technical University of Munich, Freising, Germany
| | - Luca Borro
- Department of Imaging, Advanced Cardiovascular Imaging Unit, Bambino Gesù Children's Hospital, IRCCS, Rome, Italy
| | - Shailendra Gupta
- Department of Systems Biology and Bioinformatics, University of Rostock, Rostock, Germany
| | | | - Antonella Di Pizio
- Leibniz-Institute for Food Systems Biology at the Technical University of Munich, Freising, Germany.
| |
Collapse
|
38
|
Stanzione F, Giangreco I, Cole JC. Use of molecular docking computational tools in drug discovery. PROGRESS IN MEDICINAL CHEMISTRY 2021; 60:273-343. [PMID: 34147204 DOI: 10.1016/bs.pmch.2021.01.004] [Citation(s) in RCA: 202] [Impact Index Per Article: 50.5] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Abstract
Molecular docking has become an important component of the drug discovery process. Since first being developed in the 1980s, advancements in the power of computer hardware and the increasing number of and ease of access to small molecule and protein structures have contributed to the development of improved methods, making docking more popular in both industrial and academic settings. Over the years, the modalities by which docking is used to assist the different tasks of drug discovery have changed. Although initially developed and used as a standalone method, docking is now mostly employed in combination with other computational approaches within integrated workflows. Despite its invaluable contribution to the drug discovery process, molecular docking is still far from perfect. In this chapter we will provide an introduction to molecular docking and to the different docking procedures with a focus on several considerations and protocols, including protonation states, active site waters and consensus, that can greatly improve the docking results.
Collapse
Affiliation(s)
| | - Ilenia Giangreco
- Cambridge Crystallographic Data Centre, Cambridge, United Kingdom
| | - Jason C Cole
- Cambridge Crystallographic Data Centre, Cambridge, United Kingdom
| |
Collapse
|
39
|
Parate S, Rampogu S, Lee G, Hong JC, Lee KW. Exploring the Binding Interaction of Raf Kinase Inhibitory Protein With the N-Terminal of C-Raf Through Molecular Docking and Molecular Dynamics Simulation. Front Mol Biosci 2021; 8:655035. [PMID: 34124147 PMCID: PMC8194344 DOI: 10.3389/fmolb.2021.655035] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2021] [Accepted: 05/14/2021] [Indexed: 12/12/2022] Open
Abstract
Protein-protein interactions are indispensable physiological processes regulating several biological functions. Despite the availability of structural information on protein-protein complexes, deciphering their complex topology remains an outstanding challenge. Raf kinase inhibitory protein (RKIP) has gained substantial attention as a favorable molecular target for numerous pathologies including cancer and Alzheimer’s disease. RKIP interferes with the RAF/MEK/ERK signaling cascade by endogenously binding with C-Raf (Raf-1 kinase) and preventing its activation. In the current investigation, the binding of RKIP with C-Raf was explored by knowledge-based protein-protein docking web-servers including HADDOCK and ZDOCK and a consensus binding mode of C-Raf/RKIP structural complex was obtained. Molecular dynamics (MD) simulations were further performed in an explicit solvent to sample the conformations for when RKIP binds to C-Raf. Some of the conserved interface residues were mutated to alanine, phenylalanine and leucine and the impact of mutations was estimated by additional MD simulations and MM/PBSA analysis for the wild-type (WT) and constructed mutant complexes. Substantial decrease in binding free energy was observed for the mutant complexes as compared to the binding free energy of WT C-Raf/RKIP structural complex. Furthermore, a considerable increase in average backbone root mean square deviation and fluctuation was perceived for the mutant complexes. Moreover, per-residue energy contribution analysis of the equilibrated simulation trajectory by HawkDock and ANCHOR web-servers was conducted to characterize the key residues for the complex formation. One residue each from C-Raf (Arg398) and RKIP (Lys80) were identified as the druggable “hot spots” constituting the core of the binding interface and corroborated by additional long-time scale (300 ns) MD simulation of Arg398Ala mutant complex. A notable conformational change in Arg398Ala mutant occurred near the mutation site as compared to the equilibrated C-Raf/RKIP native state conformation and an essential hydrogen bonding interaction was lost. The thirteen binding sites assimilated from the overall analysis were mapped onto the complex as surface and divided into active and allosteric binding sites, depending on their location at the interface. The acquired information on the predicted 3D structural complex and the detected sites aid as promising targets in designing novel inhibitors to block the C-Raf/RKIP interaction.
Collapse
Affiliation(s)
- Shraddha Parate
- Division of Life Sciences, Division of Applied Life Science (BK21 Plus), Plant Molecular Biology and Biotechnology Research Center (PMBBRC), Research Institute of Natural Science (RINS), Gyeongsang National University (GNU), Jinju, Korea
| | - Shailima Rampogu
- Division of Life Sciences, Plant Molecular Biology and Biotechnology Research Center (PMBBRC), Department of Bio and Medical Big Data (BK21 Four Program), Research Institute of Natural Science (RINS), Gyeongsang National University (GNU), Jinju, Korea
| | - Gihwan Lee
- Division of Life Sciences, Division of Applied Life Science (BK21 Plus), Plant Molecular Biology and Biotechnology Research Center (PMBBRC), Research Institute of Natural Science (RINS), Gyeongsang National University (GNU), Jinju, Korea
| | - Jong Chan Hong
- Division of Life Sciences, Division of Applied Life Science (BK21 Plus), Plant Molecular Biology and Biotechnology Research Center (PMBBRC), Research Institute of Natural Science (RINS), Gyeongsang National University (GNU), Jinju, Korea
| | - Keun Woo Lee
- Division of Life Sciences, Plant Molecular Biology and Biotechnology Research Center (PMBBRC), Department of Bio and Medical Big Data (BK21 Four Program), Research Institute of Natural Science (RINS), Gyeongsang National University (GNU), Jinju, Korea
| |
Collapse
|
40
|
Zhang W, Meng Q, Tang J, Guo F. Exploring effectiveness of ab-initio protein-protein docking methods on a novel antibacterial protein complex dataset. Brief Bioinform 2021; 22:6265196. [PMID: 33959764 DOI: 10.1093/bib/bbab150] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2020] [Revised: 03/12/2021] [Accepted: 03/27/2021] [Indexed: 12/27/2022] Open
Abstract
Diseases caused by bacterial infections become a critical problem in public heath. Antibiotic, the traditional treatment, gradually loses their effectiveness due to the resistance. Meanwhile, antibacterial proteins attract more attention because of broad spectrum and little harm to host cells. Therefore, exploring new effective antibacterial proteins is urgent and necessary. In this paper, we are committed to evaluating the effectiveness of ab-initio docking methods in antibacterial protein-protein docking. For this purpose, we constructed a three-dimensional (3D) structure dataset of antibacterial protein complex, called APCset, which contained $19$ protein complexes whose receptors or ligands are homologous to antibacterial peptides from Antimicrobial Peptide Database. Then we selected five representative ab-initio protein-protein docking tools including ZDOCK3.0.2, FRODOCK3.0, ATTRACT, PatchDock and Rosetta to identify these complexes' structure, whose performance differences were obtained by analyzing from five aspects, including top/best pose, first hit, success rate, average hit count and running time. Finally, according to different requirements, we assessed and recommended relatively efficient protein-protein docking tools. In terms of computational efficiency and performance, ZDOCK was more suitable as preferred computational tool, with average running time of $6.144$ minutes, average Fnat of best pose of $0.953$ and average rank of best pose of $4.158$. Meanwhile, ZDOCK still yielded better performance on Benchmark 5.0, which proved ZDOCK was effective in performing docking on large-scale dataset. Our survey can offer insights into the research on the treatment of bacterial infections by utilizing the appropriate docking methods.
Collapse
Affiliation(s)
- Wei Zhang
- School of Computer Science and Technology, College of Intelligence and Computing, Tianjin University, Tianjin, China
| | - Qiaozhen Meng
- School of Computer Science and Technology, College of Intelligence and Computing, Tianjin University, Tianjin, China
| | - Jijun Tang
- School of Computer Science and Technology, College of Intelligence and Computing, Tianjin University, Tianjin, China.,School of Computational Science and Engineering, University of South Carolina, Columbia, U.S.,Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin, China
| | - Fei Guo
- School of Computer Science and Technology, College of Intelligence and Computing, Tianjin University, Tianjin, China
| |
Collapse
|
41
|
Quignot C, Granger P, Chacón P, Guerois R, Andreani J. Atomic-level evolutionary information improves protein-protein interface scoring. Bioinformatics 2021; 37:3175-3181. [PMID: 33901284 DOI: 10.1093/bioinformatics/btab254] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2020] [Revised: 03/20/2021] [Accepted: 04/19/2021] [Indexed: 12/11/2022] Open
Abstract
MOTIVATION The crucial role of protein interactions and the difficulty in characterising them experimentally strongly motivates the development of computational approaches for structural prediction. Even when protein-protein docking samples correct models, current scoring functions struggle to discriminate them from incorrect decoys. The previous incorporation of conservation and coevolution information has shown promise for improving protein-protein scoring. Here, we present a novel strategy to integrate atomic-level evolutionary information into different types of scoring functions to improve their docking discrimination. RESULTS : We applied this general strategy to our residue-level statistical potential from InterEvScore and to two atomic-level scores, SOAP-PP and Rosetta interface score (ISC). Including evolutionary information from as few as ten homologous sequences improves the top 10 success rates of individual atomic-level scores SOAP-PP and Rosetta ISC by respectively 6 and 13.5 percentage points, on a large benchmark of 752 docking cases. The best individual homology-enriched score reaches a top 10 success rate of 34.4%. A consensus approach based on the complementarity between different homology-enriched scores further increases the top 10 success rate to 40%. AVAILABILITY All data used for benchmarking and scoring results, as well as a Singularity container of the pipeline, are available at http://biodev.cea.fr/interevol/interevdata/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Chloé Quignot
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198, Gif-sur-Yvette, France
| | - Pierre Granger
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198, Gif-sur-Yvette, France
| | - Pablo Chacón
- Department of Biological Chemical Physics, Rocasolano Institute of Physical Chemistry C.S.I.C, Madrid, Spain
| | - Raphael Guerois
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198, Gif-sur-Yvette, France
| | - Jessica Andreani
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198, Gif-sur-Yvette, France
| |
Collapse
|
42
|
Ramanto KN, Nurdiansyah R. Structural and immunogenicity analysis of reconstructed ancestral and consensus P48/45 for cross-species anti malaria transmission-blocking vaccine. Comput Biol Chem 2021; 92:107495. [PMID: 33940529 DOI: 10.1016/j.compbiolchem.2021.107495] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2021] [Accepted: 04/21/2021] [Indexed: 11/25/2022]
Abstract
The development of the anti-malaria vaccine holds a promising future in malaria control. One of the anti-malaria vaccine strategies known as the transmission-blocking vaccine (TBV) is to inhibit the parasite transmission between humans and mosquitoes by targeting the parasite gametocyte. Previously, we found that P48/45 included in the 6-Cysteine protein family shared by Plasmodium sp. We also detected vaccine properties possessed by all human-infecting Plasmodium and could be used as a cross-species anti-malaria vaccine. In this study, we investigated the efficacy of P48/45 through the ancestral and consensus reconstruction approach. P48/45 phylogenetic and time tree analysis was done by RAXML and BEAST2. GRASP server and Ugene software were used to reconstruct ancestral and consensus sequences, respectively. The protein structural prediction was made by using a psipred and Rosetta program. Each protein characteristic of P48/45 was analyzed by assessing hydrophobicity and Post-Translational Modification sites. Meanwhile, the Epitope sequence for B-cell, T-cell, and HLA was determined using an immunoinformatics approach. Lastly, molecular docking simulation was done to determine native binding interactions of P48/45-P230. The result showed a distinct protein characteristic of ancestral and consensus sequences. The immunogenicity analysis revealed the number of epitopes in the ancestral sequence is greater than the consensus sequence. The study also found a conserved epitope located in the binding site and consists of specific Post-Translational Modification sites. Hence, our research provides detailed insight into ancestral and consensus P48/45 efficacy for the cross-species anti-malaria vaccine.
Collapse
Affiliation(s)
- Kevin Nathanael Ramanto
- Department of Bioinformatics, School of Life Sciences, Indonesia International Institute for Life Sciences, Jakarta, Indonesia
| | - Rizky Nurdiansyah
- Department of Bioinformatics, School of Life Sciences, Indonesia International Institute for Life Sciences, Jakarta, Indonesia.
| |
Collapse
|
43
|
Ramanto KN, Nurdiansyah R. Immunogenicity and structural efficacy of P41 of Plasmodium sp. as potential cross-species blood-stage malaria vaccine. Comput Biol Chem 2021; 92:107493. [PMID: 33962170 DOI: 10.1016/j.compbiolchem.2021.107493] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2021] [Accepted: 04/21/2021] [Indexed: 10/21/2022]
Abstract
Vaccine based strategies offer a promising future in malaria control by generating protective immunity against natural infection. However, vaccine development is hindered by the Plasmodium sp. genetic diversity. Previously, we have shown P41 protein from 6-Cysteine shared by Plasmodium sp. and could be used for cross-species anti-malaria vaccines. Two different approaches, ancestral, and consensus sequence, could produce a single target for all human-infecting Plasmodium. In this study, we investigated the efficacy of ancestral and consensus of P41 protein. Phylogenetic and time tree reconstruction was conducted by RAXML and BEAST2 package to determine the relationship of known P41 sequences. Ancestral and consensus sequences were reconstructed by the GRASP server and Unipro Ugene software, respectively. The structural prediction was made using the Psipred and Rosetta program. The protein characteristic was analyzed by assessing hydrophobicity and Post-Translational Modification sites. Meanwhile, the immunogenicity score for B-cell, T-cell, and MHC was determined using an immunoinformatic approach. The result suggests that ancestral and consensus have a distinct protein characteristic with high immunogenicity scores for all immune cells. We found one shared conserved epitope with phosphorylation modification from the ancestral sequence to target the cross-species vaccine. Thus, this study provides detailed insight into P41 efficacy for the cross-species anti-malaria blood-stage vaccine.
Collapse
Affiliation(s)
- Kevin Nathanael Ramanto
- Department of Bioinformatics, School of Life Sciences, Indonesia International Institute for Life Sciences, Jakarta, Indonesia
| | - Rizky Nurdiansyah
- Department of Bioinformatics, School of Life Sciences, Indonesia International Institute for Life Sciences, Jakarta, Indonesia.
| |
Collapse
|
44
|
Sulimov VB, Kutov DC, Taschilova AS, Ilin IS, Tyrtyshnikov EE, Sulimov AV. Docking Paradigm in Drug Design. Curr Top Med Chem 2021; 21:507-546. [PMID: 33292135 DOI: 10.2174/1568026620666201207095626] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2020] [Revised: 09/28/2020] [Accepted: 10/16/2020] [Indexed: 11/22/2022]
Abstract
Docking is in demand for the rational computer aided structure based drug design. A review of docking methods and programs is presented. Different types of docking programs are described. They include docking of non-covalent small ligands, protein-protein docking, supercomputer docking, quantum docking, the new generation of docking programs and the application of docking for covalent inhibitors discovery. Taking into account the threat of COVID-19, we present here a short review of docking applications to the discovery of inhibitors of SARS-CoV and SARS-CoV-2 target proteins, including our own result of the search for inhibitors of SARS-CoV-2 main protease using docking and quantum chemical post-processing. The conclusion is made that docking is extremely important in the fight against COVID-19 during the process of development of antivirus drugs having a direct action on SARS-CoV-2 target proteins.
Collapse
Affiliation(s)
- Vladimir B Sulimov
- Research Computer Center of Lomonosov Moscow State University, Moscow, Russian Federation
| | - Danil C Kutov
- Research Computer Center of Lomonosov Moscow State University, Moscow, Russian Federation
| | - Anna S Taschilova
- Research Computer Center of Lomonosov Moscow State University, Moscow, Russian Federation
| | - Ivan S Ilin
- Research Computer Center of Lomonosov Moscow State University, Moscow, Russian Federation
| | - Eugene E Tyrtyshnikov
- Institute of Numerical Mathematics of Russian Academy of Sciences, Moscow, Russian Federation
| | - Alexey V Sulimov
- Research Computer Center of Lomonosov Moscow State University, Moscow, Russian Federation
| |
Collapse
|
45
|
Rosell M, Rodríguez-Lumbreras LA, Fernández-Recio J. Modeling of Protein Complexes and Molecular Assemblies with pyDock. Methods Mol Biol 2021; 2165:175-198. [PMID: 32621225 DOI: 10.1007/978-1-0716-0708-4_10] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
The study of the 3D structural details of protein interactions is essential to understand biomolecular functions at the molecular level. In this context, the limited availability of experimental structures of protein-protein complexes at atomic resolution is propelling the development of computational docking methods that aim to complement the current structural coverage of protein interactions. One of these docking approaches is pyDock, which uses van der Waals, electrostatics, and desolvation energy to score docking poses generated by a variety of sampling methods, typically FTDock or ZDOCK. The method has shown a consistently good prediction performance in community-wide assessment experiments like CAPRI or CASP, and has provided biological insights and insightful interpretation of experiments by modeling many biomolecular interactions of biomedical and biotechnological interest. Here, we describe in detail how to perform structural modeling of protein assemblies with pyDock, and the application of its modules to different biomolecular recognition phenomena, such as modeling of binding mode, interface, and hot-spot prediction, use of restraints based on experimental data, inclusion of low-resolution structural data, binding affinity estimation, or modeling of homo- and hetero-oligomeric assemblies.
Collapse
Affiliation(s)
- Mireia Rosell
- Barcelona Supercomputing Center (BSC), Barcelona, Spain.,Instituto de Ciencias de la Vid y del Vino (ICVV), Consejo Superior de Investigaciones Científicas (CSIC) - Universidad de La Rioja - Gobierno de La Rioja, Logroño, Spain
| | - Luis Angel Rodríguez-Lumbreras
- Barcelona Supercomputing Center (BSC), Barcelona, Spain.,Instituto de Ciencias de la Vid y del Vino (ICVV), Consejo Superior de Investigaciones Científicas (CSIC) - Universidad de La Rioja - Gobierno de La Rioja, Logroño, Spain
| | - Juan Fernández-Recio
- Barcelona Supercomputing Center (BSC), Barcelona, Spain. .,Instituto de Ciencias de la Vid y del Vino (ICVV), Consejo Superior de Investigaciones Científicas (CSIC) - Universidad de La Rioja - Gobierno de La Rioja, Logroño, Spain. .,Institut de Biologia Molecular de Barcelona (IBMB), Consejo Superior de Investigaciones Científicas (CSIC), Barcelona, Spain.
| |
Collapse
|
46
|
Mathematical Modelling of Biosensing Platforms Applied for Environmental Monitoring. CHEMOSENSORS 2021. [DOI: 10.3390/chemosensors9030050] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
In recent years, mathematical modelling has known an overwhelming integration in different scientific fields. In general, modelling is used to obtain new insights and achieve more quantitative and qualitative information about systems by programming language, manipulating matrices, creating algorithms and tracing functions and data. Researchers have been inspired by these techniques to explore several methods to solve many problems with high precision. In this direction, simulation and modelling have been employed for the development of sensitive and selective detection tools in different fields including environmental control. Emerging pollutants such as pesticides, heavy metals and pharmaceuticals are contaminating water resources, thus threatening wildlife. As a consequence, various biosensors using modelling have been reported in the literature for efficient environmental monitoring. In this review paper, the recent biosensors inspired by modelling and applied for environmental monitoring will be overviewed. Moreover, the level of success and the analytical performances of each modelling-biosensor will be discussed. Finally, current challenges in this field will be highlighted.
Collapse
|
47
|
Albuquerque ADO, da Silva Junior HC, Sartori GR, Martins da Silva JH. Computationally-obtained structural insights into the molecular interactions between Pidilizumab and binding partners DLL1 and PD-1. J Biomol Struct Dyn 2021; 40:6450-6462. [PMID: 33559526 DOI: 10.1080/07391102.2021.1885492] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
Abstract
Pidilizumab is a monoclonal antibody tested against several types of malignancies, such as lymphoma and metastatic melanoma, showing promising results. In 2016, the FDA put Pidilizumab's clinical studies on partial hold due to emerging evidence pointing to the antibody target uncertainty. Although initial studies indicated an interaction with the PD-1 checkpoint receptor, recent updates assert that Pidilizumab binds primarily to Notch ligand DLL1. However, a detailed description of which interactions coordinate antibody-antigen complex formation is lacking. Therefore, this study uses computational tools to identify molecular interactions between Pidilizumab and its reported targets PD-1 and DLL1. A docking methodology was validated and applied to determine the binding modes between modeled Pidilizumab scFvs and the two antigens. We used Molecular Dynamics (MD) simulations to verify the complexes' stability and submitted the resulting trajectory files to MM/PBSA and Principal Component Analysis. A set of different prediction tools determined scFv interface hot-spots. Whereas docking and MD simulations revealed that the antibody fragments do not interact straightforwardly with PD-1, ten scFv hot-spots, including Met93 and Leu112, mediated the interaction with the DLL1 C2 domain. The interaction triggered a conformational selection-like effect on DLL1, allowing new hydrogen bonds on the β3-β4 interface loop. The unprecedented structural data on Pidilizumab's interactions provided novel evidence that its legitimate target is the DLL1 protein and offered structural insight on how these molecules interact, shedding light on the pathways that could be affected by the use of this essential immunobiological.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
| | | | - Geraldo Rodrigues Sartori
- Grupo para Modelagem, Simulação e Evolução, in sílico, de Biomoléculas, Fiocruz-Ceará, Eusébio, Brazil
| | | |
Collapse
|
48
|
Guest JD, Vreven T, Zhou J, Moal I, Jeliazkov JR, Gray JJ, Weng Z, Pierce BG. An expanded benchmark for antibody-antigen docking and affinity prediction reveals insights into antibody recognition determinants. Structure 2021; 29:606-621.e5. [PMID: 33539768 DOI: 10.1016/j.str.2021.01.005] [Citation(s) in RCA: 64] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2020] [Revised: 11/15/2020] [Accepted: 01/11/2021] [Indexed: 01/04/2023]
Abstract
Accurate predictive modeling of antibody-antigen complex structures and structure-based antibody design remain major challenges in computational biology, with implications for biotherapeutics, immunity, and vaccines. Through a systematic search for high-resolution structures of antibody-antigen complexes and unbound antibody and antigen structures, in conjunction with identification of experimentally determined binding affinities, we have assembled a non-redundant set of test cases for antibody-antigen docking and affinity prediction. This benchmark more than doubles the number of antibody-antigen complexes and corresponding affinities available in our previous benchmarks, providing an unprecedented view of the determinants of antibody recognition and insights into molecular flexibility. Initial assessments of docking and affinity prediction tools highlight the challenges posed by this diverse set of cases, which includes camelid nanobodies, therapeutic monoclonal antibodies, and broadly neutralizing antibodies targeting viral glycoproteins. This dataset will enable development of advanced predictive modeling and design methods for this therapeutically relevant class of protein-protein interactions.
Collapse
Affiliation(s)
- Johnathan D Guest
- University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, MD 20850, USA; Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD 20742, USA
| | - Thom Vreven
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, MA 01605, USA
| | - Jing Zhou
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Iain Moal
- Computational Sciences, GlaxoSmithKline Research and Development, Stevenage SG1 2NY, UK
| | - Jeliazko R Jeliazkov
- Program in Molecular Biophysics, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Jeffrey J Gray
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD 21218, USA; Program in Molecular Biophysics, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Zhiping Weng
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, MA 01605, USA.
| | - Brian G Pierce
- University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, MD 20850, USA; Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD 20742, USA.
| |
Collapse
|
49
|
Feng Y, Huang SY. ITScore-NL: An Iterative Knowledge-Based Scoring Function for Nucleic Acid-Ligand Interactions. J Chem Inf Model 2020; 60:6698-6708. [PMID: 33291885 DOI: 10.1021/acs.jcim.0c00974] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Nucleic acid-ligand complexes underlie numerous cellular processes, such as gene function expression and regulation, in which their three-dimensional structures are important to understand their functions and thus to develop therapeutic interventions. Given the high cost and technical difficulties in experimental methods, computational methods such as molecular docking have been actively used to investigate nucleic acid-ligand interactions in which an accurate scoring function is crucial. However, because of the limited number of experimental nucleic acid-ligand binding data and structures, the scoring function development for nucleic acid-ligand interactions falls far behind that for protein-protein and protein-ligand interactions. Here, based on our statistical mechanics-based iterative approach, we have developed an iterative knowledge-based scoring function for nucleic acid-ligand interactions, named as ITScore-NL, by explicitly including stacking and electrostatic potentials. Our ITScore-NL scoring function was extensively evaluated for its ability in the binding mode and binding affinity predictions on three diverse test sets and compared with state-of-the-art scoring functions. Overall, ITScore-NL obtained significantly better performance than the other 12 scoring functions and predicted near-native poses with rmsd ≤ 1.5 Å for 71.43% of the cases when the top three binding modes were considered and a good correlation of R = 0.64 in binding affinity prediction on the large test set of 77 nucleic acid-ligand complexes. These results suggested the accuracy of ITScore-NL and the necessity of explicitly including stacking and electrostatic potentials.
Collapse
Affiliation(s)
- Yuyu Feng
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| | - Sheng-You Huang
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| |
Collapse
|
50
|
Pitard I, Monet D, Goossens PL, Blondel A, Malliavin TE. Analyzing In Silico the Relationship Between the Activation of the Edema Factor and Its Interaction With Calmodulin. Front Mol Biosci 2020; 7:586544. [PMID: 33344505 PMCID: PMC7746812 DOI: 10.3389/fmolb.2020.586544] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2020] [Accepted: 11/02/2020] [Indexed: 11/25/2022] Open
Abstract
Molecular dynamics (MD) simulations have been recorded on the complex between the edema factor (EF) of Bacilllus anthracis and calmodulin (CaM), starting from a structure with the orthosteric inhibitor adefovir bound in the EF catalytic site. The starting structure has been destabilized by alternately suppressing different co-factors, such as adefovir ligand or ions, revealing several long-distance correlations between the conformation of CaM, the geometry of the CaM/EF interface, the enzymatic site and the overall organization of the complex. An allosteric communication between CaM/EF interface and the EF catalytic site, highlighted by these correlations, was confirmed by several bioinformatics approaches from the literature. A network of hydrogen bonds and stacking interactions extending from the helix V of of CaM, and the residues of the switches A, B and C, and connecting to catalytic site residues, is a plausible candidate for the mediation of allosteric communication. The greatest variability in volume between the different MD conditions was also found for cavities present at the EF/CaM interface and in the EF catalytic site. The similarity between the predictions from literature and the volume variability might introduce the volume variability as new descriptor of allostery.
Collapse
Affiliation(s)
- Irène Pitard
- Unité de Bioinformatique Structurale, Institut Pasteur and CNRS UMR 3528, Paris, France.,Center de Bioinformatique, Biostatistique et Biologie Intégrative, Institut Pasteur and CNRS USR 3756, Paris, France.,Ecole Doctorale Université Paris Sorbonne, Paris, France
| | - Damien Monet
- Unité de Bioinformatique Structurale, Institut Pasteur and CNRS UMR 3528, Paris, France.,Center de Bioinformatique, Biostatistique et Biologie Intégrative, Institut Pasteur and CNRS USR 3756, Paris, France.,Ecole Doctorale Université Paris Sorbonne, Paris, France
| | | | - Arnaud Blondel
- Unité de Bioinformatique Structurale, Institut Pasteur and CNRS UMR 3528, Paris, France.,Center de Bioinformatique, Biostatistique et Biologie Intégrative, Institut Pasteur and CNRS USR 3756, Paris, France
| | - Thérèse E Malliavin
- Unité de Bioinformatique Structurale, Institut Pasteur and CNRS UMR 3528, Paris, France.,Center de Bioinformatique, Biostatistique et Biologie Intégrative, Institut Pasteur and CNRS USR 3756, Paris, France
| |
Collapse
|