Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Grazioli F, Machart P, Mösch A, Li K, Castorina LV, Pfeifer N, Min MR. Attentive Variational Information Bottleneck for TCR-peptide interaction prediction. Bioinformatics 2022;39:6960920. [PMID: 36571499 PMCID: PMC9825246 DOI: 10.1093/bioinformatics/btac820] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2022] [Revised: 11/18/2022] [Accepted: 12/23/2022] [Indexed: 12/27/2022] Open

For:	Grazioli F, Machart P, Mösch A, Li K, Castorina LV, Pfeifer N, Min MR. Attentive Variational Information Bottleneck for TCR-peptide interaction prediction. Bioinformatics 2022;39:6960920. [PMID: 36571499 PMCID: PMC9825246 DOI: 10.1093/bioinformatics/btac820] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2022] [Revised: 11/18/2022] [Accepted: 12/23/2022] [Indexed: 12/27/2022] Open

Number

Cited by Other Article(s)

Castorina LV, Grazioli F, Machart P, Mösch A, Errica F. Assessing the generalization capabilities of TCR binding predictors via peptide distance analysis. PLoS One 2025;20:e0324011. [PMID: 40392871 PMCID: PMC12091837 DOI: 10.1371/journal.pone.0324011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2024] [Accepted: 04/19/2025] [Indexed: 05/22/2025] Open

Abstract

Understanding the interaction between T Cell Receptors (TCRs) and peptide-bound Major Histocompatibility Complexes (pMHCs) is crucial for comprehending immune responses and developing targeted immunotherapies. While recent machine learning (ML) models show remarkable success in predicting TCR-pMHC binding within training data, these models often fail to generalize to peptides outside their training distributions, raising concerns about their applicability in therapeutic settings. Understanding and improving the generalization of these models is therefore critical to ensure real-world applications. To address this issue, we evaluate the effect of the distance between training and testing peptide distributions on ML model empirical risk assessments, using sequence-based and 3D structure-based distance metrics. In our analysis we use several state-of-the-art models for TCR-peptide binding prediction: Attentive Variational Information Bottleneck (AVIB), NetTCR-2.0 and -2.2, and ERGO II (pre-trained autoencoder) and ERGO II (LSTM). In this work, we introduce a novel approach for assessing the generalization capabilities of TCR binding predictors: the Distance Split (DS) algorithm. The DS algorithm controls the distance between training and testing peptides based on both sequence and structure, allowing for a more nuanced evaluation of model performance. We show that lower 3D shape similarity between training and test peptides is associated with a harder out-of-distribution task definition, which is more interesting when measuring the ability to generalize to unseen peptides. However, we observe the opposite effect when splitting using sequence-based similarity. These findings highlight the importance of using a distance-based splitting approach to benchmark models. This could then be used to estimate a confidence score on predictions on novel and unseen peptides, based on how different they are from the training ones. Additionally, our results may hint that employing 3D shape to complement sequence information could improve the accuracy of TCR-pMHC binding predictors.

Collapse

Ullanat V, Jing B, Sledzieski S, Berger B. Learning the language of protein-protein interactions. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.03.09.642188. [PMID: 40166198 PMCID: PMC11956943 DOI: 10.1101/2025.03.09.642188] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/02/2025]

Pham MDN, Su CTT, Nguyen TN, Nguyen HN, Nguyen DDA, Giang H, Nguyen DT, Phan MD, Nguyen V. epiTCR-KDA: knowledge distillation model on dihedral angles for TCR-peptide prediction. BIOINFORMATICS ADVANCES 2024;4:vbae190. [PMID: 39678207 PMCID: PMC11646569 DOI: 10.1093/bioadv/vbae190] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/19/2024] [Revised: 11/03/2024] [Accepted: 11/27/2024] [Indexed: 12/17/2024]

Velez-Arce A, Li MM, Gao W, Lin X, Huang K, Fu T, Pentelute BL, Kellis M, Zitnik M. Signals in the Cells: Multimodal and Contextualized Machine Learning Foundations for Therapeutics. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.12.598655. [PMID: 38948789 PMCID: PMC11212894 DOI: 10.1101/2024.06.12.598655] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 07/02/2024]

Li F, Qian X, Zhu X, Lai X, Zhang X, Wang J. TCRcost: a deep learning model utilizing TCR 3D structure for enhanced of TCR-peptide binding. Front Genet 2024;15:1346784. [PMID: 39415981 PMCID: PMC11479912 DOI: 10.3389/fgene.2024.1346784] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Accepted: 09/05/2024] [Indexed: 10/19/2024] Open

Abstract

Introduction

Predicting TCR-peptide binding is a complex and significant computational problem in systems immunology. During the past decade, a series of computational methods have been developed for better predicting TCR-peptide binding from amino acid sequences. However, the performance of sequence-based methods appears to have hit a bottleneck. Considering the 3D structures of TCR-peptide complexes, which provide much more information, could potentially lead to better prediction outcomes.

Methods

In this study, we developed TCRcost, a deep learning method, to predict TCR-peptide binding by incorporating 3D structures. TCRcost overcomes two significant challenges: acquiring a sufficient number of high-quality TCR-peptide structures and effectively extracting information from these structures for binding prediction. TCRcost corrects TCR 3D structures generated by protein structure tools, significantly extending the available datasets. The main and side chains of a TCR structure are separately corrected using a long short-term memory (LSTM) model. This approach prevents interference between the chains and accurately extracts interactions among both adjacent and global atoms. A 3D convolutional neural network (CNN) is designed to extract the atomic features relevant to TCR-peptide binding. The spatial features extracted by the 3DCNN are then processed through a fully connected layer to estimate the probability of TCR-peptide binding.

Results

Test results demonstrated that predicting TCR-peptide binding from 3D TCR structures is both efficient and highly accurate with an average accuracy of 0.974 on precise structures. Furthermore, the average accuracy on corrected structures was 0.762, significantly higher than the average accuracy of 0.375 on uncorrected original structures. Additionally, the average root mean square distance (RMSD) to precise structures was significantly reduced from 12.753 Å for predicted structures to 8.785 Å for corrected structures.

Discussion

Thus, utilizing structural information of TCR-peptide complexes is a promising approach to improve the accuracy of binding predictions.

Collapse

Deng Q, Wang Z, Xiang S, Wang Q, Liu Y, Hou T, Sun H. RLpMIEC: High-Affinity Peptide Generation Targeting Major Histocompatibility Complex-I Guided and Interpreted by Interaction Spectrum-Navigated Reinforcement Learning. J Chem Inf Model 2024;64:6432-6449. [PMID: 39118363 DOI: 10.1021/acs.jcim.4c01153] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/10/2024]

Machaca V, Goyzueta V, Cruz MG, Sejje E, Pilco LM, López J, Túpac Y. Transformers meets neoantigen detection: a systematic literature review. J Integr Bioinform 2024;21:jib-2023-0043. [PMID: 38960869 PMCID: PMC11377031 DOI: 10.1515/jib-2023-0043] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2023] [Accepted: 03/20/2024] [Indexed: 07/05/2024] Open

Mösch A, Grazioli F, Machart P, Malone B. NeoAgDT: optimization of personal neoantigen vaccine composition by digital twin simulation of a cancer cell population. Bioinformatics 2024;40:btae205. [PMID: 38614133 PMCID: PMC11076149 DOI: 10.1093/bioinformatics/btae205] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Revised: 03/28/2024] [Accepted: 04/11/2024] [Indexed: 04/15/2024] Open

Bravi B. Development and use of machine learning algorithms in vaccine target selection. NPJ Vaccines 2024;9:15. [PMID: 38242890 PMCID: PMC10798987 DOI: 10.1038/s41541-023-00795-8] [Citation(s) in RCA: 22] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2023] [Accepted: 12/07/2023] [Indexed: 01/21/2024] Open