1
|
Ge J, Wang J, Ye Q, Pan L, Kang Y, Shen C, Deng Y, Hsieh CY, Hou T. TRAP: a contrastive learning-enhanced framework for robust TCR-pMHC binding prediction with improved generalizability. Chem Sci 2025:d4sc08141b. [PMID: 40321182 PMCID: PMC12046420 DOI: 10.1039/d4sc08141b] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2024] [Accepted: 04/21/2025] [Indexed: 05/08/2025] Open
Abstract
The binding of T cell receptors (TCRs) to peptide-MHC I (pMHC) complexes is critical for triggering adaptive immune responses to potential health threats. Developing highly accurate machine learning (ML) models to predict TCR-pMHC binding could significantly accelerate immunotherapy advancements. However, existing ML models for TCR-pMHC binding prediction often underperform with unseen epitopes, severely limiting their applicability. We introduce TRAP, which leverages contrastive learning to enhance model performance by aligning structural and sequence features of pMHC with TCR sequences. TRAP outperforms previous state-of-the-art models in both random and unseen epitope scenarios, achieving an AUPR of 0.84 (a 22% improvement over the second-best model) and an AUC of 0.92 in the random scenario, and an AUC of 0.75 (almost 11% higher than the second-best model) in the unseen epitope scenario. Furthermore, TRAP demonstrates a noteworthy capability to diagnose potential issues of cross-reactivity between TCRs and similar epitopes. This highly robust performance makes it a suitable tool for large-scale predictions in real-world settings. A specific case study confirmed that TRAP can discover hit TCRs with binding free energies comparable to referenced experimental results. These findings highlight TRAP's potential for practical applications and its role as a powerful tool in developing TCR-based immunotherapies.
Collapse
Affiliation(s)
- Jingxuan Ge
- College of Pharmaceutical Sciences, Zhejiang University Hangzhou 310058 Zhejiang China
- CarbonSilicon AI Technology Company, Ltd Hangzhou 310018 Zhejiang China
| | - Jike Wang
- College of Pharmaceutical Sciences, Zhejiang University Hangzhou 310058 Zhejiang China
- CarbonSilicon AI Technology Company, Ltd Hangzhou 310018 Zhejiang China
| | - Qing Ye
- College of Pharmaceutical Sciences, Zhejiang University Hangzhou 310058 Zhejiang China
| | - Liqiang Pan
- College of Pharmaceutical Sciences, Zhejiang University Hangzhou 310058 Zhejiang China
| | - Yu Kang
- College of Pharmaceutical Sciences, Zhejiang University Hangzhou 310058 Zhejiang China
| | - Chao Shen
- College of Pharmaceutical Sciences, Zhejiang University Hangzhou 310058 Zhejiang China
| | - Yafeng Deng
- CarbonSilicon AI Technology Company, Ltd Hangzhou 310018 Zhejiang China
| | - Chang-Yu Hsieh
- College of Pharmaceutical Sciences, Zhejiang University Hangzhou 310058 Zhejiang China
| | - Tingjun Hou
- College of Pharmaceutical Sciences, Zhejiang University Hangzhou 310058 Zhejiang China
| |
Collapse
|
2
|
De A, Subramanian S, Nayak P, Pal K. In silico drug repurposing of potential antiviral inhibitors targeting methyltransferase (2'-O-MTase) domain of Marburg virus. In Silico Pharmacol 2025; 13:70. [PMID: 40291443 PMCID: PMC12018677 DOI: 10.1007/s40203-025-00355-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2024] [Accepted: 04/04/2025] [Indexed: 04/30/2025] Open
Abstract
Marburg Virus (MARV) presents a significant threat to human health, highlighting the urgent need for effective therapeutics. The MARV genome encodes a multifunctional 'large' L protein that plays a crucial role in polymerase, capping, and methyltransferase activities. Within this protein, the 2'-O-methyltransferase (2'-O-MTase) domain is essential for viral replication and immune evasion, making it a promising therapeutic target. However, the lack of structural data on this domain limits drug discovery efforts. To address this challenge, we utilized AlphaFold2 to predict a 3D structure of the MARV 2'-O-MTase domain. Molecular docking with its natural ligand, S-adenosyl methionine (SAM), allowed us to identify key active-site residues involved in ligand binding. We then screened 62 known inhibitors against this domain and identified four promising candidates: Lifirafenib (- 9.5 kcal/mol), Dolutegravir (- 8.5 kcal/mol), BRD3969 (- 8.3 kcal/mol), and JFD00244 (- 8.2 kcal/mol). Further, we assessed the pharmacokinetic and pharmacodynamic properties of these compounds to evaluate their drug-likeness. Molecular dynamics simulations, along with MM/GBSA free energy calculations, confirmed stable interactions between the selected inhibitors and the target domain. While these findings highlight promising candidates for MARV, experimental validation through in vitro and in vivo assays is essential to assess their safety and efficacy. Graphical abstract Supplementary Information The online version contains supplementary material available at 10.1007/s40203-025-00355-z.
Collapse
Affiliation(s)
- Arkajit De
- Department of Biotechnology, School of Life Science and Biotechnology, Adamas University, Barasat, Kolkata, West Bengal 700126 India
- Present Address: Department of Structural Biology, Van Andel Institute, Grand Rapids, MI 49503 USA
| | - Swagath Subramanian
- Department of Chemistry, School of Advanced Sciences (SAS), Vellore Institute of Technology, Vellore, Tamil Nadu 632014 India
| | - Prateek Nayak
- Department of Biosciences, School of Biosciences and Technology (SBST), Vellore Institute of Technology, Vellore, Tamil Nadu 632014 India
| | - Kuntal Pal
- Department of Biosciences, School of Biosciences and Technology (SBST), Vellore Institute of Technology, Vellore, Tamil Nadu 632014 India
| |
Collapse
|
3
|
Qian H, Wang Y, Zhou X, Gu T, Wang H, Lyu H, Li Z, Li X, Zhou H, Guo C, Yuan F, Wang Y. ESM-Ezy: a deep learning strategy for the mining of novel multicopper oxidases with superior properties. Nat Commun 2025; 16:3274. [PMID: 40188191 PMCID: PMC11972304 DOI: 10.1038/s41467-025-58521-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2024] [Accepted: 03/21/2025] [Indexed: 04/07/2025] Open
Abstract
The UniProt database is a valuable resource for biocatalyst discovery, yet predicting enzymatic functions remains challenging, especially for low-similarity sequences. Identifying superior enzymes with enhanced catalytic properties is even harder. To overcome these challenges, we develop ESM-Ezy, an enzyme mining strategy leveraging the ESM-1b protein language model and similarity calculations in semantic space. Using ESM-Ezy, we identify novel multicopper oxidases (MCOs) with superior catalytic properties, achieving a 44% success rate in outperforming query enzymes (QEs) in at least one property, including catalytic efficiency, heat and organic solvent tolerance, and pH stability. Notably, 51% of the MCOs excel in environmental remediation applications, and some exhibited unique structural motifs and unique active centers enhancing their functions. Beyond MCOs, 40% of L-asparaginases identified show higher specific activity and catalytic efficiency than QEs. ESM-Ezy thus provides a promising approach for discovering high-performance biocatalysts with low sequence similarity, accelerating enzyme discovery for industrial applications.
Collapse
Affiliation(s)
- Hui Qian
- School of Engineering, Westlake University, Hangzhou, 310014, Zhejiang, China
- The Center for Synthetic Biology and Integrated Bioengineering, Westlake University, Hangzhou, 310014, Zhejiang, China
| | - Yuxuan Wang
- School of Engineering, Westlake University, Hangzhou, 310014, Zhejiang, China
- The Center for Synthetic Biology and Integrated Bioengineering, Westlake University, Hangzhou, 310014, Zhejiang, China
| | - Xibin Zhou
- School of Engineering, Westlake University, Hangzhou, 310014, Zhejiang, China
| | - Tao Gu
- School of Engineering, Westlake University, Hangzhou, 310014, Zhejiang, China
- The Center for Synthetic Biology and Integrated Bioengineering, Westlake University, Hangzhou, 310014, Zhejiang, China
| | - Hui Wang
- Beijing Academy of Artificial Intelligence, Beijing, China
| | - Hao Lyu
- School of Engineering, Westlake University, Hangzhou, 310014, Zhejiang, China
| | - Zhikai Li
- School of Engineering, Westlake University, Hangzhou, 310014, Zhejiang, China
| | - Xiuxu Li
- School of Engineering, Westlake University, Hangzhou, 310014, Zhejiang, China
- The Center for Synthetic Biology and Integrated Bioengineering, Westlake University, Hangzhou, 310014, Zhejiang, China
| | - Huan Zhou
- Westlake Laboratory of Life Sciences and Biomedicine, Xihu District, Hangzhou, 310024, Zhejiang Province, China
| | - Chengchen Guo
- School of Engineering, Westlake University, Hangzhou, 310014, Zhejiang, China
| | - Fajie Yuan
- School of Engineering, Westlake University, Hangzhou, 310014, Zhejiang, China.
- The Center for Synthetic Biology and Integrated Bioengineering, Westlake University, Hangzhou, 310014, Zhejiang, China.
| | - Yajie Wang
- School of Engineering, Westlake University, Hangzhou, 310014, Zhejiang, China.
- The Center for Synthetic Biology and Integrated Bioengineering, Westlake University, Hangzhou, 310014, Zhejiang, China.
- Westlake Laboratory of Life Sciences and Biomedicine, Xihu District, Hangzhou, 310024, Zhejiang Province, China.
- School of Life Science, Westlake University, Hangzhou, 310014, Zhejiang, China.
- Muyuan laboratory, Zhengzhou, Henan, China.
| |
Collapse
|
4
|
Swenson CS, Mandava G, Thomas DM, Moellering RE. Tackling Undruggable Targets with Designer Peptidomimetics and Synthetic Biologics. Chem Rev 2024; 124:13020-13093. [PMID: 39540650 PMCID: PMC12036645 DOI: 10.1021/acs.chemrev.4c00423] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2024]
Abstract
The development of potent, specific, and pharmacologically viable chemical probes and therapeutics is a central focus of chemical biology and therapeutic development. However, a significant portion of predicted disease-causal proteins have proven resistant to targeting by traditional small molecule and biologic modalities. Many of these so-called "undruggable" targets feature extended, dynamic protein-protein and protein-nucleic acid interfaces that are central to their roles in normal and diseased signaling pathways. Here, we discuss the development of synthetically stabilized peptide and protein mimetics as an ever-expanding and powerful region of chemical space to tackle undruggable targets. These molecules aim to combine the synthetic tunability and pharmacologic properties typically associated with small molecules with the binding footprints, affinities and specificities of biologics. In this review, we discuss the historical and emerging platforms and approaches to design, screen, select and optimize synthetic "designer" peptidomimetics and synthetic biologics. We examine the inspiration and design of different classes of designer peptidomimetics: (i) macrocyclic peptides, (ii) side chain stabilized peptides, (iii) non-natural peptidomimetics, and (iv) synthetic proteomimetics, and notable examples of their application to challenging biomolecules. Finally, we summarize key learnings and remaining challenges for these molecules to become useful chemical probes and therapeutics for historically undruggable targets.
Collapse
Affiliation(s)
- Colin S Swenson
- Department of Chemistry, The University of Chicago, Chicago, Illinois 60637, United States
| | - Gunasheil Mandava
- Department of Chemistry, The University of Chicago, Chicago, Illinois 60637, United States
| | - Deborah M Thomas
- Department of Chemistry, The University of Chicago, Chicago, Illinois 60637, United States
| | - Raymond E Moellering
- Department of Chemistry, The University of Chicago, Chicago, Illinois 60637, United States
| |
Collapse
|
5
|
Wang T, Zhang X, Zhang O, Chen G, Pan P, Wang E, Wang J, Wu J, Zhou D, Wang L, Jin R, Chen S, Shen C, Kang Y, Hsieh CY, Hou T. Highly Accurate and Efficient Deep Learning Paradigm for Full-Atom Protein Loop Modeling with KarmaLoop. RESEARCH (WASHINGTON, D.C.) 2024; 7:0408. [PMID: 39055686 PMCID: PMC11268956 DOI: 10.34133/research.0408] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/25/2024] [Accepted: 05/22/2024] [Indexed: 07/27/2024]
Abstract
Protein loop modeling is a challenging yet highly nontrivial task in protein structure prediction. Despite recent progress, existing methods including knowledge-based, ab initio, hybrid, and deep learning (DL) methods fall substantially short of either atomic accuracy or computational efficiency. To overcome these limitations, we present KarmaLoop, a novel paradigm that distinguishes itself as the first DL method centered on full-atom (encompassing both backbone and side-chain heavy atoms) protein loop modeling. Our results demonstrate that KarmaLoop considerably outperforms conventional and DL-based methods of loop modeling in terms of both accuracy and efficiency, with the average RMSDs of 1.77 and 1.95 Å for the CASP13+14 and CASP15 benchmark datasets, respectively, and manifests at least 2 orders of magnitude speedup in general compared with other methods. Consequently, our comprehensive evaluations indicate that KarmaLoop provides a state-of-the-art DL solution for protein loop modeling, with the potential to hasten the advancement of protein engineering, antibody-antigen recognition, and drug design.
Collapse
Affiliation(s)
- Tianyue Wang
- Innovation Institute for Artificial Intelligence in Medicine ofZhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China
| | - Xujun Zhang
- Innovation Institute for Artificial Intelligence in Medicine ofZhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China
| | - Odin Zhang
- Innovation Institute for Artificial Intelligence in Medicine ofZhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China
| | | | - Peichen Pan
- Innovation Institute for Artificial Intelligence in Medicine ofZhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China
| | - Ercheng Wang
- Innovation Institute for Artificial Intelligence in Medicine ofZhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China
- Zhejiang Laboratory, Hangzhou 311100, Zhejiang, China
| | - Jike Wang
- Innovation Institute for Artificial Intelligence in Medicine ofZhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China
| | - Jialu Wu
- Innovation Institute for Artificial Intelligence in Medicine ofZhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China
| | - Donghao Zhou
- Shenzhen Institute of Advanced Technology,
Chinese Academy of Sciences, Shenzhen 518055, Guangdong, China
| | - Langcheng Wang
- Department of Pathology,
New York University Medical Center, New York, NY 10016, USA
| | - Ruofan Jin
- Innovation Institute for Artificial Intelligence in Medicine ofZhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China
- College of Life Sciences,
Zhejiang University, Hangzhou 310058, Zhejiang, China
| | - Shicheng Chen
- Innovation Institute for Artificial Intelligence in Medicine ofZhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China
| | - Chao Shen
- Innovation Institute for Artificial Intelligence in Medicine ofZhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China
| | - Yu Kang
- Innovation Institute for Artificial Intelligence in Medicine ofZhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China
| | - Chang-Yu Hsieh
- Innovation Institute for Artificial Intelligence in Medicine ofZhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China
| | - Tingjun Hou
- Innovation Institute for Artificial Intelligence in Medicine ofZhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China
| |
Collapse
|
6
|
Jobe A, Vijayan R. Orphan G protein-coupled receptors: the ongoing search for a home. Front Pharmacol 2024; 15:1349097. [PMID: 38495099 PMCID: PMC10941346 DOI: 10.3389/fphar.2024.1349097] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2023] [Accepted: 02/15/2024] [Indexed: 03/19/2024] Open
Abstract
G protein-coupled receptors (GPCRs) make up the largest receptor superfamily, accounting for 4% of protein-coding genes. Despite the prevalence of such transmembrane receptors, a significant number remain orphans, lacking identified endogenous ligands. Since their conception, the reverse pharmacology approach has been used to characterize such receptors. However, the multifaceted and nuanced nature of GPCR signaling poses a great challenge to their pharmacological elucidation. Considering their therapeutic relevance, the search for native orphan GPCR ligands continues. Despite limited structural input in terms of 3D crystallized structures, with advances in machine-learning approaches, there has been great progress with respect to accurate ligand prediction. Though such an approach proves valuable given that ligand scarcity is the greatest hurdle to orphan GPCR deorphanization, the future pairings of the remaining orphan GPCRs may not necessarily take a one-size-fits-all approach but should be more comprehensive in accounting for numerous nuanced possibilities to cover the full spectrum of GPCR signaling.
Collapse
Affiliation(s)
- Amie Jobe
- Department of Biology, College of Science, United Arab Emirates University, Al Ain, United Arab Emirates
| | - Ranjit Vijayan
- Department of Biology, College of Science, United Arab Emirates University, Al Ain, United Arab Emirates
- The Big Data Analytics Center, United Arab Emirates University, Al Ain, United Arab Emirates
- Zayed Bin Sultan Center for Health Sciences, United Arab Emirates University, Al Ain, United Arab Emirates
| |
Collapse
|