1
|
Quast NP, Abanades B, Guloglu B, Karuppiah V, Harper S, Raybould MIJ, Deane CM. T-cell receptor structures and predictive models reveal comparable alpha and beta chain structural diversity despite differing genetic complexity. Commun Biol 2025; 8:362. [PMID: 40038394 PMCID: PMC11880327 DOI: 10.1038/s42003-025-07708-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2024] [Accepted: 02/09/2025] [Indexed: 03/06/2025] Open
Abstract
T-cell receptor (TCR) structures are currently under-utilised in early-stage drug discovery and repertoire-scale informatics. Here, we leverage a large dataset of solved TCR structures from Immunocore to evaluate the current state-of-the-art for TCR structure prediction, and identify which regions of the TCR remain challenging to model. Through clustering analyses and the training of a TCR-specific model capable of large-scale structure prediction, we find that the alpha chain VJ-recombined loop (CDR3α) is as structurally diverse and correspondingly difficult to predict as the beta chain VDJ-recombined loop (CDR3β). This differentiates TCR variable domain loops from the genetically analogous antibody loops and supports the conjecture that both TCR alpha and beta chains are deterministic of antigen specificity. We hypothesise that the larger number of alpha chain joining genes compared to beta chain joining genes compensates for the lack of a diversity gene segment. We also provide over 1.5M predicted TCR structures to enable repertoire structural analysis and elucidate strategies towards improving the accuracy of future TCR structure predictors. Our observations reinforce the importance of paired TCR sequence information and capture the current state-of-the-art for TCR structure prediction, while our model and 1.5M structure predictions enable the use of structural TCR information at an unprecedented scale.
Collapse
MESH Headings
- Receptors, Antigen, T-Cell, alpha-beta/genetics
- Receptors, Antigen, T-Cell, alpha-beta/chemistry
- Humans
- Models, Molecular
- Genetic Variation
- Receptors, Antigen, T-Cell/chemistry
- Receptors, Antigen, T-Cell/genetics
- Protein Conformation
- Complementarity Determining Regions/genetics
- Complementarity Determining Regions/chemistry
Collapse
Affiliation(s)
- Nele P Quast
- Department of Statistics, University of Oxford, Oxford, UK
| | | | - Bora Guloglu
- Department of Statistics, University of Oxford, Oxford, UK
| | | | | | | | | |
Collapse
|
2
|
Nagano Y, Pyo AGT, Milighetti M, Henderson J, Shawe-Taylor J, Chain B, Tiffeau-Mayer A. Contrastive learning of T cell receptor representations. Cell Syst 2025; 16:101165. [PMID: 39778580 DOI: 10.1016/j.cels.2024.12.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2024] [Revised: 10/09/2024] [Accepted: 12/06/2024] [Indexed: 01/11/2025]
Abstract
Computational prediction of the interaction of T cell receptors (TCRs) and their ligands is a grand challenge in immunology. Despite advances in high-throughput assays, specificity-labeled TCR data remain sparse. In other domains, the pre-training of language models on unlabeled data has been successfully used to address data bottlenecks. However, it is unclear how to best pre-train protein language models for TCR specificity prediction. Here, we introduce a TCR language model called SCEPTR (simple contrastive embedding of the primary sequence of T cell receptors), which is capable of data-efficient transfer learning. Through our model, we introduce a pre-training strategy combining autocontrastive learning and masked-language modeling, which enables SCEPTR to achieve its state-of-the-art performance. In contrast, existing protein language models and a variant of SCEPTR pre-trained without autocontrastive learning are outperformed by sequence alignment-based methods. We anticipate that contrastive learning will be a useful paradigm to decode the rules of TCR specificity. A record of this paper's transparent peer review process is included in the supplemental information.
Collapse
Affiliation(s)
- Yuta Nagano
- Division of Infection and Immunity, University College London, London WC1E 6BT, UK; Division of Medicine, University College London, London WC1E 6BT, UK
| | - Andrew G T Pyo
- Center for the Physics of Biological Function, Princeton University, Princeton, NJ 08544, USA
| | - Martina Milighetti
- Division of Infection and Immunity, University College London, London WC1E 6BT, UK; Cancer Institute, University College London, London WC1E 6DD, UK
| | - James Henderson
- Division of Infection and Immunity, University College London, London WC1E 6BT, UK; Institute for the Physics of Living Systems, University College London, London WC1E 6BT, UK
| | - John Shawe-Taylor
- Department of Computer Science, University College London, London WC1E 6BT, UK
| | - Benny Chain
- Division of Infection and Immunity, University College London, London WC1E 6BT, UK; Department of Computer Science, University College London, London WC1E 6BT, UK
| | - Andreas Tiffeau-Mayer
- Division of Infection and Immunity, University College London, London WC1E 6BT, UK; Institute for the Physics of Living Systems, University College London, London WC1E 6BT, UK.
| |
Collapse
|
3
|
Zhou Z, Yin Y, Han H, Jia Y, Koh JH, Kong AWK, Mu Y. ProAffinity-GNN: A Novel Approach to Structure-Based Protein-Protein Binding Affinity Prediction via a Curated Data Set and Graph Neural Networks. J Chem Inf Model 2024; 64:8796-8808. [PMID: 39558674 DOI: 10.1021/acs.jcim.4c01850] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2024]
Abstract
Protein-protein interactions (PPIs) are crucial for understanding biological processes and disease mechanisms, contributing significantly to advances in protein engineering and drug discovery. The accurate determination of binding affinities, essential for decoding PPIs, faces challenges due to the substantial time and financial costs involved in experimental and theoretical methods. This situation underscores the urgent need for more effective and precise methodologies for predicting binding affinity. Despite the abundance of research on PPI modeling, the field of quantitative binding affinity prediction remains underexplored, mainly due to a lack of comprehensive data. This study seeks to address these needs by manually curating pairwise interaction labels on available 3D structures of protein complexes, with experimentally determined binding affinities, creating the largest data set for structure-based pairwise protein interaction with binding affinity to date. Subsequently, we introduce ProAffinity-GNN, a novel deep learning framework using protein language model and graph neural network (GNN) to improve the accuracy of prediction of structure-based protein-protein binding affinities. The evaluation results across several benchmark test sets and an additional case study demonstrate that ProAffinity-GNN not only outperforms existing models in terms of accuracy but also shows strong generalization capabilities.
Collapse
Affiliation(s)
- Zhiyuan Zhou
- School of Biological Sciences, Nanyang Technological University, 637551, Singapore
| | - Yueming Yin
- Institute for Digital Molecular Analytics and Science (IDMxS), Nanyang Technological University, 636921, Singapore
| | - Hao Han
- School of Biological Sciences, Nanyang Technological University, 637551, Singapore
| | - Yiping Jia
- School of Pharmacy, Shanghai Jiao Tong University, 200240, Shanghai, China
| | - Jun Hong Koh
- School of Biological Sciences, Nanyang Technological University, 637551, Singapore
| | - Adams Wai-Kin Kong
- College of Computing and Data Science, Nanyang Technological University, 639798, Singapore
| | - Yuguang Mu
- School of Biological Sciences, Nanyang Technological University, 637551, Singapore
| |
Collapse
|
4
|
Chi WY, Hu Y, Huang HC, Kuo HH, Lin SH, Kuo CTJ, Tao J, Fan D, Huang YM, Wu AA, Hung CF, Wu TC. Molecular targets and strategies in the development of nucleic acid cancer vaccines: from shared to personalized antigens. J Biomed Sci 2024; 31:94. [PMID: 39379923 PMCID: PMC11463125 DOI: 10.1186/s12929-024-01082-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2024] [Accepted: 09/01/2024] [Indexed: 10/10/2024] Open
Abstract
Recent breakthroughs in cancer immunotherapies have emphasized the importance of harnessing the immune system for treating cancer. Vaccines, which have traditionally been used to promote protective immunity against pathogens, are now being explored as a method to target cancer neoantigens. Over the past few years, extensive preclinical research and more than a hundred clinical trials have been dedicated to investigating various approaches to neoantigen discovery and vaccine formulations, encouraging development of personalized medicine. Nucleic acids (DNA and mRNA) have become particularly promising platform for the development of these cancer immunotherapies. This shift towards nucleic acid-based personalized vaccines has been facilitated by advancements in molecular techniques for identifying neoantigens, antigen prediction methodologies, and the development of new vaccine platforms. Generating these personalized vaccines involves a comprehensive pipeline that includes sequencing of patient tumor samples, data analysis for antigen prediction, and tailored vaccine manufacturing. In this review, we will discuss the various shared and personalized antigens used for cancer vaccine development and introduce strategies for identifying neoantigens through the characterization of gene mutation, transcription, translation and post translational modifications associated with oncogenesis. In addition, we will focus on the most up-to-date nucleic acid vaccine platforms, discuss the limitations of cancer vaccines as well as provide potential solutions, and raise key clinical and technical considerations in vaccine development.
Collapse
Affiliation(s)
- Wei-Yu Chi
- Physiology, Biophysics and Systems Biology Graduate Program, Weill Cornell Medicine, New York, NY, USA
| | - Yingying Hu
- Tri-Institutional PhD Program in Chemical Biology, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Hsin-Che Huang
- Tri-Institutional PhD Program in Chemical Biology, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Hui-Hsuan Kuo
- Pharmacology PhD Program, Weill Cornell Medicine, New York, NY, USA
| | - Shu-Hong Lin
- Department of Epidemiology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
- The University of Texas Graduate School of Biomedical Sciences at Houston and MD Anderson Cancer Center, Houston, TX, USA
| | - Chun-Tien Jimmy Kuo
- Division of Pharmaceutics and Pharmacology, College of Pharmacy, The Ohio State University, Columbus, OH, USA
| | - Julia Tao
- Department of Pathology, Johns Hopkins School of Medicine, 1550 Orleans St, CRB II Room 309, Baltimore, MD, 21287, USA
| | - Darrell Fan
- Department of Pathology, Johns Hopkins School of Medicine, 1550 Orleans St, CRB II Room 309, Baltimore, MD, 21287, USA
| | - Yi-Min Huang
- Department of Pathology, Johns Hopkins School of Medicine, 1550 Orleans St, CRB II Room 309, Baltimore, MD, 21287, USA
| | - Annie A Wu
- Department of Pathology, Johns Hopkins School of Medicine, 1550 Orleans St, CRB II Room 309, Baltimore, MD, 21287, USA
| | - Chien-Fu Hung
- Department of Pathology, Johns Hopkins School of Medicine, 1550 Orleans St, CRB II Room 309, Baltimore, MD, 21287, USA
- Department of Oncology, Johns Hopkins School of Medicine, Baltimore, MD, USA
- Department of Obstetrics and Gynecology, Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - T-C Wu
- Department of Pathology, Johns Hopkins School of Medicine, 1550 Orleans St, CRB II Room 309, Baltimore, MD, 21287, USA.
- Department of Oncology, Johns Hopkins School of Medicine, Baltimore, MD, USA.
- Department of Obstetrics and Gynecology, Johns Hopkins School of Medicine, Baltimore, MD, USA.
- Department of Molecular Microbiology and Immunology, Bloomberg School of Public Health, Johns Hopkins School of Medicine, Baltimore, MD, USA.
| |
Collapse
|
5
|
Golikova EA, Alshevskaya AA, Alrhmoun S, Sivitskaya NA, Sennikov SV. TCR-T cell therapy: current development approaches, preclinical evaluation, and perspectives on regulatory challenges. J Transl Med 2024; 22:897. [PMID: 39367419 PMCID: PMC11451006 DOI: 10.1186/s12967-024-05703-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2024] [Accepted: 09/24/2024] [Indexed: 10/06/2024] Open
Abstract
TCR-T cell therapy represents a promising advancement in adoptive immunotherapy for cancer treatment. Despite its potential, the development and preclinical testing of TCR-T cells face significant challenges. This review provides a structured overview of the key stages in preclinical testing, including in silico, in vitro, and in vivo methods, within the context of the sequential development of novel therapies. This review aimed to systematically outline the processes for evaluating TCR-T cells at each stage: from in silico approaches used to predict target antigens, assess cross-reactivity, and minimize off-target effects, to in vitro assays designed to measure cell functionality, cytotoxicity, and activation. Additionally, the review discusses the limitations of in vivo testing in animal models, particularly in accurately reflecting the human tumor microenvironment and immune responses. Performed analysis emphasizes the importance of these preclinical stages in the safe and effective development of TCR-T cell therapies. While current models provide valuable insights, we identify critical gaps, particularly in in vivo biodistribution and toxicity assessments, and propose the need for enhanced standardization and the development of more representative models. This structured approach aims to improve the predictability and safety of TCR-T cell therapy as it advances towards clinical application.
Collapse
Affiliation(s)
- Elena A Golikova
- Federal State Autonomous Educational Institution of Higher Education, I.M. Sechenov First Moscow State Medical University of the Ministry of Health of the Russian Federation (Sechenov University), 119435, Moscow, Russia
| | - Alina A Alshevskaya
- Federal State Autonomous Educational Institution of Higher Education, I.M. Sechenov First Moscow State Medical University of the Ministry of Health of the Russian Federation (Sechenov University), 119435, Moscow, Russia.
| | - Saleh Alrhmoun
- Federal State Autonomous Educational Institution of Higher Education, I.M. Sechenov First Moscow State Medical University of the Ministry of Health of the Russian Federation (Sechenov University), 119435, Moscow, Russia
- Federal State Budgetary Scientific Institution, "Research Institute of Fundamental and Clinical Immunology" (RIFCI), 630099, Novosibirsk, Russia
| | - Natalia A Sivitskaya
- Federal State Autonomous Educational Institution of Higher Education, I.M. Sechenov First Moscow State Medical University of the Ministry of Health of the Russian Federation (Sechenov University), 119435, Moscow, Russia
| | - Sergey V Sennikov
- Federal State Autonomous Educational Institution of Higher Education, I.M. Sechenov First Moscow State Medical University of the Ministry of Health of the Russian Federation (Sechenov University), 119435, Moscow, Russia
- Federal State Budgetary Scientific Institution, "Research Institute of Fundamental and Clinical Immunology" (RIFCI), 630099, Novosibirsk, Russia
| |
Collapse
|
6
|
Scheffer L, Reber EE, Mehta BB, Pavlović M, Chernigovskaya M, Richardson E, Akbar R, Lund-Johansen F, Greiff V, Haff IH, Sandve GK. Predictability of antigen binding based on short motifs in the antibody CDRH3. Brief Bioinform 2024; 25:bbae537. [PMID: 39438077 PMCID: PMC11495870 DOI: 10.1093/bib/bbae537] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2024] [Revised: 09/30/2024] [Accepted: 10/16/2024] [Indexed: 10/25/2024] Open
Abstract
Adaptive immune receptors, such as antibodies and T-cell receptors, recognize foreign threats with exquisite specificity. A major challenge in adaptive immunology is discovering the rules governing immune receptor-antigen binding in order to predict the antigen binding status of previously unseen immune receptors. Many studies assume that the antigen binding status of an immune receptor may be determined by the presence of a short motif in the complementarity determining region 3 (CDR3), disregarding other amino acids. To test this assumption, we present a method to discover short motifs which show high precision in predicting antigen binding and generalize well to unseen simulated and experimental data. Our analysis of a mutagenesis-based antibody dataset reveals 11 336 position-specific, mostly gapped motifs of 3-5 amino acids that retain high precision on independently generated experimental data. Using a subset of only 178 motifs, a simple classifier was made that on the independently generated dataset outperformed a deep learning model proposed specifically for such datasets. In conclusion, our findings support the notion that for some antibodies, antigen binding may be largely determined by a short CDR3 motif. As more experimental data emerge, our methodology could serve as a foundation for in-depth investigations into antigen binding signals.
Collapse
Affiliation(s)
- Lonneke Scheffer
- Department of Informatics, University of Oslo, Gaustadalléen 23B, 0373 Oslo, Norway
| | - Eric Emanuel Reber
- Department of Informatics, University of Oslo, Gaustadalléen 23B, 0373 Oslo, Norway
| | - Brij Bhushan Mehta
- Department of Immunology, University of Oslo, Sognsvannsveien 20, Rikshospitalet, 0372 Oslo, Norway
| | - Milena Pavlović
- Department of Informatics, University of Oslo, Gaustadalléen 23B, 0373 Oslo, Norway
| | - Maria Chernigovskaya
- Department of Immunology, University of Oslo, Sognsvannsveien 20, Rikshospitalet, 0372 Oslo, Norway
| | - Eve Richardson
- La Jolla Institute for Immunology, 9420 Athena Cir, La Jolla, CA, United States
| | - Rahmad Akbar
- Department of Immunology, University of Oslo, Sognsvannsveien 20, Rikshospitalet, 0372 Oslo, Norway
| | - Fridtjof Lund-Johansen
- Department of Immunology, University of Oslo, Sognsvannsveien 20, Rikshospitalet, 0372 Oslo, Norway
| | - Victor Greiff
- Department of Immunology, University of Oslo, Sognsvannsveien 20, Rikshospitalet, 0372 Oslo, Norway
| | - Ingrid Hobæk Haff
- Department of Mathematics, University of Oslo, Niels Henrik Abels hus, Moltke Moes vei 35, 0851 Oslo, Norway
| | - Geir Kjetil Sandve
- Department of Informatics, University of Oslo, Gaustadalléen 23B, 0373 Oslo, Norway
| |
Collapse
|
7
|
Muraoka D, Moi ML, Muto O, Nakatsukasa T, Deng S, Takashima C, Yamaguchi R, Sawada SI, Hayakawa H, Nguyen TTN, Haseda Y, Soga T, Matsushita H, Ikeda H, Akiyoshi K, Harada N. Low-frequency CD8 + T cells induced by SIGN-R1 + macrophage-targeted vaccine confer SARS-CoV-2 clearance in mice. NPJ Vaccines 2024; 9:173. [PMID: 39294173 PMCID: PMC11411095 DOI: 10.1038/s41541-024-00961-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2023] [Accepted: 09/01/2024] [Indexed: 09/20/2024] Open
Abstract
Vaccine-induced T cells and neutralizing antibodies are essential for protection against SARS-CoV-2. Previously, we demonstrated that an antigen delivery system, pullulan nanogel (PNG), delivers vaccine antigen to lymph node medullary macrophages and thereby enhances the induction of specific CD8+ T cells. In this study, we revealed that medullary macrophage-selective delivery by PNG depends on its binding to a C-type lectin SIGN-R1. In a K18-hACE2 mouse model of SARS-CoV-2 infection, vaccination with a PNG-encapsulated receptor-binding domain of spike protein decreased the viral load and prolonged the survival in the CD8+ T cell- and B cell-dependent manners. T cell receptor repertoire analysis revealed that although the vaccine induced T cells at various frequencies, low-frequency specific T cells mainly promoted virus clearance. Thus, the induction of specific CD8+ T cells that respond quickly to viral infection, even at low frequencies, is important for vaccine efficacy and can be achieved by SIGN-R1+ medullary macrophage-targeted antigen delivery.
Collapse
Affiliation(s)
- Daisuke Muraoka
- Department of Oncology, Nagasaki University Graduate School of Biomedical Sciences, Nagasaki, Japan.
- Division of Translational Oncoimmunology, Aichi Cancer Center Research Institute, Nagoya, Japan.
| | - Meng Ling Moi
- School of International Health, Graduate School of Medicine, the University of Tokyo, Tokyo, Japan.
- Institute of Tropical Medicine, Nagasaki University, Nagasaki, Japan.
| | - Osamu Muto
- Division of Cancer Systems Biology, Aichi Cancer Center Research Institute, Nagoya, Japan
- Division of Cancer Informatics, Nagoya University Graduate School of Medicine, Nagoya, Japan
| | - Takaaki Nakatsukasa
- Department of Oncology, Nagasaki University Graduate School of Biomedical Sciences, Nagasaki, Japan
- Department of Surgical Oncology, Nagasaki University Graduate School of Biomedical Sciences, Nagasaki, Japan
| | - Situo Deng
- Department of Oncology, Nagasaki University Graduate School of Biomedical Sciences, Nagasaki, Japan
| | - Chieko Takashima
- Division of Translational Oncoimmunology, Aichi Cancer Center Research Institute, Nagoya, Japan
| | - Rui Yamaguchi
- Division of Cancer Systems Biology, Aichi Cancer Center Research Institute, Nagoya, Japan
- Division of Cancer Informatics, Nagoya University Graduate School of Medicine, Nagoya, Japan
| | - Shin-Ichi Sawada
- Synergy Institute for Futuristic Mucosal Vaccine Research and Development (cSIMVa), Chiba University, Chiba, Japan
| | - Haruka Hayakawa
- School of International Health, Graduate School of Medicine, the University of Tokyo, Tokyo, Japan
| | | | | | | | - Hirokazu Matsushita
- Division of Translational Oncoimmunology, Aichi Cancer Center Research Institute, Nagoya, Japan
| | - Hiroaki Ikeda
- Department of Oncology, Nagasaki University Graduate School of Biomedical Sciences, Nagasaki, Japan
| | - Kazunari Akiyoshi
- Department of Immunology, Graduate School of Medicine, Kyoto University, Kyoto, Japan
| | | |
Collapse
|
8
|
Ehrlich R, Glynn E, Singh M, Ghersi D. Computational Methods for Predicting Key Interactions in T Cell-Mediated Adaptive Immunity. Annu Rev Biomed Data Sci 2024; 7:295-316. [PMID: 38748864 DOI: 10.1146/annurev-biodatasci-102423-122741] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/25/2024]
Abstract
The adaptive immune system recognizes pathogen- and cancer-specific features and is endowed with memory, enabling it to respond quickly and efficiently to repeated encounters with the same antigens. T cells play a central role in the adaptive immune system by directly targeting intracellular pathogens and helping to activate B cells to secrete antibodies. Several fundamental protein interactions-including those between major histocompatibility complex (MHC) proteins and antigen-derived peptides as well as between T cell receptors and peptide-MHC complexes-underlie the ability of T cells to recognize antigens with great precision. Computational approaches to predict these interactions are increasingly being used for medically relevant applications, including vaccine design and prediction of patient response to cancer immunotherapies. We provide computational researchers with an accessible introduction to the adaptive immune system, review computational approaches to predict the key protein interactions underlying T cell-mediated adaptive immunity, and highlight remaining challenges.
Collapse
Affiliation(s)
- Ryan Ehrlich
- School of Interdisciplinary Informatics, University of Nebraska, Omaha, Nebraska, USA;
| | - Eric Glynn
- Lewis-Sigler Institute, Princeton University, Princeton, New Jersey, USA
| | - Mona Singh
- Department of Computer Science, Princeton University, Princeton, New Jersey, USA;
- Lewis-Sigler Institute, Princeton University, Princeton, New Jersey, USA
| | - Dario Ghersi
- School of Interdisciplinary Informatics, University of Nebraska, Omaha, Nebraska, USA;
| |
Collapse
|
9
|
Drost F, An Y, Bonafonte-Pardàs I, Dratva LM, Lindeboom RGH, Haniffa M, Teichmann SA, Theis F, Lotfollahi M, Schubert B. Multi-modal generative modeling for joint analysis of single-cell T cell receptor and gene expression data. Nat Commun 2024; 15:5577. [PMID: 38956082 PMCID: PMC11220149 DOI: 10.1038/s41467-024-49806-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Accepted: 05/23/2024] [Indexed: 07/04/2024] Open
Abstract
Recent advances in single-cell immune profiling have enabled the simultaneous measurement of transcriptome and T cell receptor (TCR) sequences, offering great potential for studying immune responses at the cellular level. However, integrating these diverse modalities across datasets is challenging due to their unique data characteristics and technical variations. Here, to address this, we develop the multimodal generative model mvTCR to fuse modality-specific information across transcriptome and TCR into a shared representation. Our analysis demonstrates the added value of multimodal over unimodal approaches to capture antigen specificity. Notably, we use mvTCR to distinguish T cell subpopulations binding to SARS-CoV-2 antigens from bystander cells. Furthermore, when combined with reference mapping approaches, mvTCR can map newly generated datasets to extensive T cell references, facilitating knowledge transfer. In summary, we envision mvTCR to enable a scalable analysis of multimodal immune profiling data and advance our understanding of immune responses.
Collapse
Affiliation(s)
- Felix Drost
- Computational Health Center, Helmholtz Munich, Ingolstädter Landstraße 1, 85764, Neuherberg, Germany
- School of Life Sciences Weihenstephan, Technical University of Munich, Alte Akademie 8, 85354, Freising, Germany
| | - Yang An
- Computational Health Center, Helmholtz Munich, Ingolstädter Landstraße 1, 85764, Neuherberg, Germany
- School of Computation, Information and Technology, Technical University of Munich, Boltzmannstraße 3, 85748, Garching bei München, Germany
| | - Irene Bonafonte-Pardàs
- Computational Health Center, Helmholtz Munich, Ingolstädter Landstraße 1, 85764, Neuherberg, Germany
| | - Lisa M Dratva
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Rik G H Lindeboom
- The Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX, Amsterdam, The Netherlands
| | - Muzlifah Haniffa
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
- Biosciences Institute, Newcastle University, Newcastle upon Tyne, NE2 4HH, UK
| | - Sarah A Teichmann
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
- Department of Physics, Cavendish Laboratory, University of Cambridge, 19 JJ Thomson Avenue, Cambridge, UK
| | - Fabian Theis
- Computational Health Center, Helmholtz Munich, Ingolstädter Landstraße 1, 85764, Neuherberg, Germany
- School of Life Sciences Weihenstephan, Technical University of Munich, Alte Akademie 8, 85354, Freising, Germany
- School of Computation, Information and Technology, Technical University of Munich, Boltzmannstraße 3, 85748, Garching bei München, Germany
| | - Mohammad Lotfollahi
- Computational Health Center, Helmholtz Munich, Ingolstädter Landstraße 1, 85764, Neuherberg, Germany.
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK.
| | - Benjamin Schubert
- Computational Health Center, Helmholtz Munich, Ingolstädter Landstraße 1, 85764, Neuherberg, Germany.
- School of Computation, Information and Technology, Technical University of Munich, Boltzmannstraße 3, 85748, Garching bei München, Germany.
| |
Collapse
|
10
|
Yu Z, Jiang M, Lan X. HeteroTCR: A heterogeneous graph neural network-based method for predicting peptide-TCR interaction. Commun Biol 2024; 7:684. [PMID: 38834836 PMCID: PMC11150398 DOI: 10.1038/s42003-024-06380-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2023] [Accepted: 05/23/2024] [Indexed: 06/06/2024] Open
Abstract
Identifying interactions between T-cell receptors (TCRs) and immunogenic peptides holds profound implications across diverse research domains and clinical scenarios. Unsupervised clustering models (UCMs) cannot predict peptide-TCR binding directly, while supervised predictive models (SPMs) often face challenges in identifying antigens previously unencountered by the immune system or possessing limited TCR binding repertoires. Therefore, we propose HeteroTCR, an SPM based on Heterogeneous Graph Neural Network (GNN), to accurately predict peptide-TCR binding probabilities. HeteroTCR captures within-type (TCR-TCR or peptide-peptide) similarity information and between-type (peptide-TCR) interaction insights for predictions on unseen peptides and TCRs, surpassing limitations of existing SPMs. Our evaluation shows HeteroTCR outperforms state-of-the-art models on independent datasets. Ablation studies and visual interpretation underscore the Heterogeneous GNN module's critical role in enhancing HeteroTCR's performance by capturing pivotal binding process features. We further demonstrate the robustness and reliability of HeteroTCR through validation using single-cell datasets, aligning with the expectation that pMHC-TCR complexes with higher predicted binding probabilities correspond to increased binding fractions.
Collapse
Affiliation(s)
- Zilan Yu
- School of Medicine, Tsinghua University, 100084, Beijing, China
- Centre for Life Sciences, Tsinghua University, 100084, Beijing, China
| | - Mengnan Jiang
- School of Medicine, Tsinghua University, 100084, Beijing, China
| | - Xun Lan
- School of Medicine, Tsinghua University, 100084, Beijing, China.
- Centre for Life Sciences, Tsinghua University, 100084, Beijing, China.
- Tsinghua-Peking Center for Life Sciences, MOE Key Laboratory of Tsinghua University, Beijing, China.
- MOE Key Laboratory of Bioinformatics, Tsinghua University, 100084, Beijing, China.
| |
Collapse
|
11
|
Machaca V, Goyzueta V, Cruz MG, Sejje E, Pilco LM, López J, Túpac Y. Transformers meets neoantigen detection: a systematic literature review. J Integr Bioinform 2024; 21:jib-2023-0043. [PMID: 38960869 PMCID: PMC11377031 DOI: 10.1515/jib-2023-0043] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2023] [Accepted: 03/20/2024] [Indexed: 07/05/2024] Open
Abstract
Cancer immunology offers a new alternative to traditional cancer treatments, such as radiotherapy and chemotherapy. One notable alternative is the development of personalized vaccines based on cancer neoantigens. Moreover, Transformers are considered a revolutionary development in artificial intelligence with a significant impact on natural language processing (NLP) tasks and have been utilized in proteomics studies in recent years. In this context, we conducted a systematic literature review to investigate how Transformers are applied in each stage of the neoantigen detection process. Additionally, we mapped current pipelines and examined the results of clinical trials involving cancer vaccines.
Collapse
Affiliation(s)
| | | | | | - Erika Sejje
- Universidad Nacional de San Agustín, Arequipa, Perú
| | | | | | - Yván Túpac
- 187038 Universidad Católica San Pablo , Arequipa, Perú
| |
Collapse
|
12
|
Croce G, Bobisse S, Moreno DL, Schmidt J, Guillame P, Harari A, Gfeller D. Deep learning predictions of TCR-epitope interactions reveal epitope-specific chains in dual alpha T cells. Nat Commun 2024; 15:3211. [PMID: 38615042 PMCID: PMC11016097 DOI: 10.1038/s41467-024-47461-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Accepted: 04/03/2024] [Indexed: 04/15/2024] Open
Abstract
T cells have the ability to eliminate infected and cancer cells and play an essential role in cancer immunotherapy. T cell activation is elicited by the binding of the T cell receptor (TCR) to epitopes displayed on MHC molecules, and the TCR specificity is determined by the sequence of its α and β chains. Here, we collect and curate a dataset of 17,715 αβTCRs interacting with dozens of class I and class II epitopes. We use this curated data to develop MixTCRpred, an epitope-specific TCR-epitope interaction predictor. MixTCRpred accurately predicts TCRs recognizing several viral and cancer epitopes. MixTCRpred further provides a useful quality control tool for multiplexed single-cell TCR sequencing assays of epitope-specific T cells and pinpoints a substantial fraction of putative contaminants in public databases. Analysis of epitope-specific dual α T cells demonstrates that MixTCRpred can identify α chains mediating epitope recognition. Applying MixTCRpred to TCR repertoires from COVID-19 patients reveals enrichment of clonotypes predicted to bind an immunodominant SARS-CoV-2 epitope. Overall, MixTCRpred provides a robust tool to predict TCRs interacting with specific epitopes and interpret TCR-sequencing data from both bulk and epitope-specific T cells.
Collapse
Affiliation(s)
- Giancarlo Croce
- Department of Oncology UNIL CHUV, Ludwig Institute for Cancer Research, University of Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland
- Agora Cancer Research Centre, Lausanne, Switzerland
- Swiss Cancer Center Leman (SCCL), Lausanne, Switzerland
| | - Sara Bobisse
- Agora Cancer Research Centre, Lausanne, Switzerland
- Swiss Cancer Center Leman (SCCL), Lausanne, Switzerland
- Department of Oncology UNIL CHUV, Ludwig Institute for Cancer Research, University Hospital of Lausanne, Lausanne, Switzerland
| | - Dana Léa Moreno
- Department of Oncology UNIL CHUV, Ludwig Institute for Cancer Research, University of Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland
- Agora Cancer Research Centre, Lausanne, Switzerland
- Swiss Cancer Center Leman (SCCL), Lausanne, Switzerland
| | - Julien Schmidt
- Department of Oncology UNIL CHUV, Ludwig Institute for Cancer Research, University of Lausanne, Lausanne, Switzerland
- Swiss Cancer Center Leman (SCCL), Lausanne, Switzerland
- Department of Oncology UNIL CHUV, Ludwig Institute for Cancer Research, University Hospital of Lausanne, Lausanne, Switzerland
| | - Philippe Guillame
- Swiss Cancer Center Leman (SCCL), Lausanne, Switzerland
- Department of Oncology UNIL CHUV, Ludwig Institute for Cancer Research, University Hospital of Lausanne, Lausanne, Switzerland
| | - Alexandre Harari
- Department of Oncology UNIL CHUV, Ludwig Institute for Cancer Research, University of Lausanne, Lausanne, Switzerland
- Agora Cancer Research Centre, Lausanne, Switzerland
- Swiss Cancer Center Leman (SCCL), Lausanne, Switzerland
- Department of Oncology UNIL CHUV, Ludwig Institute for Cancer Research, University Hospital of Lausanne, Lausanne, Switzerland
| | - David Gfeller
- Department of Oncology UNIL CHUV, Ludwig Institute for Cancer Research, University of Lausanne, Lausanne, Switzerland.
- Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland.
- Agora Cancer Research Centre, Lausanne, Switzerland.
- Swiss Cancer Center Leman (SCCL), Lausanne, Switzerland.
| |
Collapse
|
13
|
Aterido A, López-Lasanta M, Blanco F, Juan-Mas A, García-Vivar ML, Erra A, Pérez-García C, Sánchez-Fernández SÁ, Sanmartí R, Fernández-Nebro A, Alperi-López M, Tornero J, Ortiz AM, Fernández-Cid CM, Palau N, Pan W, Byrne-Steele M, Starenki D, Weber D, Rodriguez-Nunez I, Han J, Myers RM, Marsal S, Julià A. Seven-chain adaptive immune receptor repertoire analysis in rheumatoid arthritis reveals novel features associated with disease and clinically relevant phenotypes. Genome Biol 2024; 25:68. [PMID: 38468286 PMCID: PMC10926600 DOI: 10.1186/s13059-024-03210-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Accepted: 03/04/2024] [Indexed: 03/13/2024] Open
Abstract
BACKGROUND In rheumatoid arthritis (RA), the activation of T and B cell clones specific for self-antigens leads to the chronic inflammation of the synovium. Here, we perform an in-depth quantitative analysis of the seven chains that comprise the adaptive immune receptor repertoire (AIRR) in RA. RESULTS In comparison to controls, we show that RA patients have multiple and strong differences in the B cell receptor repertoire including reduced diversity as well as altered isotype, chain, and segment frequencies. We demonstrate that therapeutic tumor necrosis factor inhibition partially restores this alteration but find a profound difference in the underlying biochemical reactivities between responders and non-responders. Combining the AIRR with HLA typing, we identify the specific T cell receptor repertoire associated with disease risk variants. Integrating these features, we further develop a molecular classifier that shows the utility of the AIRR as a diagnostic tool. CONCLUSIONS Simultaneous sequencing of the seven chains of the human AIRR reveals novel features associated with the disease and clinically relevant phenotypes, including response to therapy. These findings show the unique potential of AIRR to address precision medicine in immune-related diseases.
Collapse
Affiliation(s)
- Adrià Aterido
- Rheumatology Research Group, Vall Hebron Research Institute, 08035, Barcelona, Spain
| | - María López-Lasanta
- Rheumatology Research Group, Vall Hebron Research Institute, 08035, Barcelona, Spain
| | - Francisco Blanco
- Rheumatology Department, Hospital Juan Canalejo, A Coruña, Spain
| | | | | | - Alba Erra
- Rheumatology Research Group, Vall Hebron Research Institute, 08035, Barcelona, Spain
- Rheumatology Department, Hospital Sant Rafael, Barcelona, Spain
| | | | | | - Raimon Sanmartí
- Rheumatology Department, Hospital Clínic de Barcelona and IDIBAPS, Barcelona, Spain
| | | | | | - Jesús Tornero
- Rheumatology Department, Hospital Universitario Guadalajara, Guadalajara, Spain
| | - Ana María Ortiz
- Rheumatology Department, Hospital Universitario La Princesa, IIS La Princesa, Madrid, Spain
| | | | - Núria Palau
- Rheumatology Research Group, Vall Hebron Research Institute, 08035, Barcelona, Spain
| | | | | | | | | | | | - Jian Han
- iRepertoire Inc, Huntsville, AL, USA
| | - Richard M Myers
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
| | - Sara Marsal
- Rheumatology Research Group, Vall Hebron Research Institute, 08035, Barcelona, Spain
| | - Antonio Julià
- Rheumatology Research Group, Vall Hebron Research Institute, 08035, Barcelona, Spain.
| |
Collapse
|
14
|
Xiong P, Liang A, Cai X, Xia T. APTAnet: an atom-level peptide-TCR interaction affinity prediction model. BIOPHYSICS REPORTS 2024; 10:1-14. [PMID: 38737473 PMCID: PMC11079603 DOI: 10.52601/bpr.2023.230037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Accepted: 01/26/2024] [Indexed: 05/14/2024] Open
Abstract
The prediction of affinity between TCRs and peptides is crucial for the further development of TIL (Tumor-Infiltrating Lymphocytes) immunotherapy. Inspired by the broader research of drug-protein interaction (DPI), we propose an atom-level peptide-TCR interaction (PTI) affinity prediction model APTAnet using natural language processing methods. APTAnet model achieved an average ROC-AUC and PR-AUC of 0.893 and 0.877, respectively, in ten-fold cross-validation on 25,675 pairs of PTI data. Furthermore, experimental results on an independent test set from the McPAS database showed that APTAnet outperformed the current mainstream models. Finally, through the validation on 11 cases of real tumor patient data, we found that the APTAnet model can effectively identify tumor peptides and screen tumor-specific TCRs.
Collapse
Affiliation(s)
- Peng Xiong
- School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Anyi Liang
- School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Xunhui Cai
- Institute of Pathology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430030, China
| | - Tian Xia
- School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430074, China
- Institute of Pathology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430030, China
| |
Collapse
|
15
|
Suo C, Polanski K, Dann E, Lindeboom RGH, Vilarrasa-Blasi R, Vento-Tormo R, Haniffa M, Meyer KB, Dratva LM, Tuong ZK, Clatworthy MR, Teichmann SA. Dandelion uses the single-cell adaptive immune receptor repertoire to explore lymphocyte developmental origins. Nat Biotechnol 2024; 42:40-51. [PMID: 37055623 PMCID: PMC10791579 DOI: 10.1038/s41587-023-01734-7] [Citation(s) in RCA: 23] [Impact Index Per Article: 23.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2022] [Accepted: 03/07/2023] [Indexed: 04/15/2023]
Abstract
Assessment of single-cell gene expression (single-cell RNA sequencing) and adaptive immune receptor (AIR) sequencing (scVDJ-seq) has been invaluable in studying lymphocyte biology. Here we introduce Dandelion, a computational pipeline for scVDJ-seq analysis. It enables the application of standard V(D)J analysis workflows to single-cell datasets, delivering improved V(D)J contig annotation and the identification of nonproductive and partially spliced contigs. We devised a strategy to create an AIR feature space that can be used for both differential V(D)J usage analysis and pseudotime trajectory inference. The application of Dandelion improved the alignment of human thymic development trajectories of double-positive T cells to mature single-positive CD4/CD8 T cells, generating predictions of factors regulating lineage commitment. Dandelion analysis of other cell compartments provided insights into the origins of human B1 cells and ILC/NK cell development, illustrating the power of our approach. Dandelion is available at https://www.github.com/zktuong/dandelion .
Collapse
Affiliation(s)
- Chenqu Suo
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
- Department of Paediatrics, Cambridge University Hospitals, Cambridge, UK
| | | | - Emma Dann
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
| | | | | | | | - Muzlifah Haniffa
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
- Biosciences Institute, Newcastle University, Newcastle upon Tyne, UK
- Department of Dermatology and NIHR Newcastle Biomedical Research Centre, Newcastle upon Tyne Hospitals NHS Foundation Trust, Newcastle upon Tyne, UK
| | - Kerstin B Meyer
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
| | - Lisa M Dratva
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
| | - Zewen Kelvin Tuong
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK.
- Molecular Immunity Unit, Department of Medicine, University of Cambridge, Cambridge, UK.
- Frazer Institute, Faculty of Medicine, The University of Queensland, Brisbane, Queensland, Australia.
- Ian Frazer Centre for Children's Immunotherapy Research, Child Health Research Centre, Faculty of Medicine, The University of Queensland, Brisbane, Queensland, Australia.
| | - Menna R Clatworthy
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK.
- Molecular Immunity Unit, Department of Medicine, University of Cambridge, Cambridge, UK.
| | - Sarah A Teichmann
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK.
- Theory of Condensed Matter, Cavendish Laboratory, Department of Physics, University of Cambridge, Cambridge, UK.
| |
Collapse
|
16
|
Montemurro A, Povlsen HR, Jessen LE, Nielsen M. Benchmarking data-driven filtering for denoising of TCRpMHC single-cell data. Sci Rep 2023; 13:16147. [PMID: 37752190 PMCID: PMC10522655 DOI: 10.1038/s41598-023-43048-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2023] [Accepted: 09/18/2023] [Indexed: 09/28/2023] Open
Abstract
Pairing of the T cell receptor (TCR) with its cognate peptide-MHC (pMHC) is a cornerstone in T cell-mediated immunity. Recently, single-cell sequencing coupled with DNA-barcoded MHC multimer staining has enabled high-throughput studies of T cell specificities. However, the immense variability of TCR-pMHC interactions combined with the relatively low signal-to-noise ratio in the data generated using current technologies are complicating these studies. Several approaches have been proposed for denoising single-cell TCR-pMHC specificity data. Here, we present a benchmark evaluating two such denoising methods, ICON and ITRAP. We applied and evaluated the methods on publicly available immune profiling data provided by 10x Genomics. We find that both methods identified approximately 75% of the raw data as noise. We analyzed both internal metrics developed for the purpose and performance on independent data using machine learning methods trained on the raw and denoised 10x data. We find an increased signal-to-noise ratio comparing the denoised to the raw data for both methods, and demonstrate an overall superior performance of the ITRAP method in terms of both data consistency and performance. In conclusion, this study demonstrates that Improving the data quality from high throughput studies of TCRpMHC-specificity by denoising is paramount in increasing our understanding of T cell-mediated immunity.
Collapse
Affiliation(s)
- Alessandro Montemurro
- Department of Health Technology, Section for Bioinformatics, Technical University of Denmark, DTU, 2800, Kgs. Lyngby, Denmark
| | - Helle Rus Povlsen
- Department of Health Technology, Section for Bioinformatics, Technical University of Denmark, DTU, 2800, Kgs. Lyngby, Denmark
| | - Leon Eyrich Jessen
- Department of Health Technology, Section for Bioinformatics, Technical University of Denmark, DTU, 2800, Kgs. Lyngby, Denmark
| | - Morten Nielsen
- Department of Health Technology, Section for Bioinformatics, Technical University of Denmark, DTU, 2800, Kgs. Lyngby, Denmark.
| |
Collapse
|
17
|
Zhao Y, He B, Xu F, Li C, Xu Z, Su X, He H, Huang Y, Rossjohn J, Song J, Yao J. DeepAIR: A deep learning framework for effective integration of sequence and 3D structure to enable adaptive immune receptor analysis. SCIENCE ADVANCES 2023; 9:eabo5128. [PMID: 37556545 PMCID: PMC10411891 DOI: 10.1126/sciadv.abo5128] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Accepted: 07/06/2023] [Indexed: 08/11/2023]
Abstract
Structural docking between the adaptive immune receptors (AIRs), including T cell receptors (TCRs) and B cell receptors (BCRs), and their cognate antigens are one of the most fundamental processes in adaptive immunity. However, current methods for predicting AIR-antigen binding largely rely on sequence-derived features of AIRs, omitting the structure features that are essential for binding affinity. In this study, we present a deep learning framework, termed DeepAIR, for the accurate prediction of AIR-antigen binding by integrating both sequence and structure features of AIRs. DeepAIR achieves a Pearson's correlation of 0.813 in predicting the binding affinity of TCR, and a median area under the receiver-operating characteristic curve (AUC) of 0.904 and 0.942 in predicting the binding reactivity of TCR and BCR, respectively. Meanwhile, using TCR and BCR repertoire, DeepAIR correctly identifies every patient with nasopharyngeal carcinoma and inflammatory bowel disease in test data. Thus, DeepAIR improves the AIR-antigen binding prediction that facilitates the study of adaptive immunity.
Collapse
Affiliation(s)
- Yu Zhao
- AI Lab, Tencent, Shenzhen, China
| | - Bing He
- AI Lab, Tencent, Shenzhen, China
| | - Fan Xu
- AI Lab, Tencent, Shenzhen, China
| | - Chen Li
- Biomedicine Discovery Institute and Monash Data Futures Institute, Monash University, Melbourne, VIC 3800, Australia
| | | | | | | | | | - Jamie Rossjohn
- Infection and Immunity Program and Department of Biochemistry and Molecular Biology, Biomedicine Discovery Institute, Monash University, Melbourne, VIC 3800, Australia
- Institute of Infection and Immunity, Cardiff University School of Medicine, Heath Park, Cardiff, UK
| | - Jiangning Song
- AI Lab, Tencent, Shenzhen, China
- Biomedicine Discovery Institute and Monash Data Futures Institute, Monash University, Melbourne, VIC 3800, Australia
| | | |
Collapse
|
18
|
Hudson D, Fernandes RA, Basham M, Ogg G, Koohy H. Can we predict T cell specificity with digital biology and machine learning? Nat Rev Immunol 2023; 23:511-521. [PMID: 36755161 PMCID: PMC9908307 DOI: 10.1038/s41577-023-00835-3] [Citation(s) in RCA: 67] [Impact Index Per Article: 33.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/07/2022] [Indexed: 02/10/2023]
Abstract
Recent advances in machine learning and experimental biology have offered breakthrough solutions to problems such as protein structure prediction that were long thought to be intractable. However, despite the pivotal role of the T cell receptor (TCR) in orchestrating cellular immunity in health and disease, computational reconstruction of a reliable map from a TCR to its cognate antigens remains a holy grail of systems immunology. Current data sets are limited to a negligible fraction of the universe of possible TCR-ligand pairs, and performance of state-of-the-art predictive models wanes when applied beyond these known binders. In this Perspective article, we make the case for renewed and coordinated interdisciplinary effort to tackle the problem of predicting TCR-antigen specificity. We set out the general requirements of predictive models of antigen binding, highlight critical challenges and discuss how recent advances in digital biology such as single-cell technology and machine learning may provide possible solutions. Finally, we describe how predicting TCR specificity might contribute to our understanding of the broader puzzle of antigen immunogenicity.
Collapse
Affiliation(s)
- Dan Hudson
- MRC Human Immunology Unit, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
- The Rosalind Franklin Institute, Didcot, UK
| | - Ricardo A Fernandes
- Chinese Academy of Medical Sciences Oxford Institute, University of Oxford, Oxford, UK
| | | | - Graham Ogg
- MRC Human Immunology Unit, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
- Chinese Academy of Medical Sciences Oxford Institute, University of Oxford, Oxford, UK
| | - Hashem Koohy
- MRC Human Immunology Unit, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK.
- Centre for Computational Biology, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK.
| |
Collapse
|
19
|
Wolmarans HJ, Barker VR, Chobrutskiy A, Chobrutskiy BI, Huda TI, Blanck G. Exploiting big data survival information to unify risk-stratification related, adaptive immune receptor parameters for multiple myeloma. Genes Immun 2023; 24:194-199. [PMID: 37443300 DOI: 10.1038/s41435-023-00212-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Revised: 06/27/2023] [Accepted: 06/29/2023] [Indexed: 07/15/2023]
Abstract
With the improvement of treatment options, multiple myeloma related life expectancy has been prolonged, but the disease remains largely incurable. Immunotherapy is a growing field that shows promise in advancements for treatment, and recent work has demonstrated an opportunity to use immune receptor, complementarity determining region-3 (CDR3)-candidate antigen chemical complementarity scores to identify survival distinctions among subgroups of patients. Here, we have applied the complementarity scoring algorithm to identify multiple myeloma related, CDR3-cancer testis antigen (CTA) relationships associated with survival distinctions. Furthermore, we have overlapped these immune receptor features with a previous study that showed a dramatic survival distinction based on T-cell receptor, V- and J-gene segment usage, HLA allele combinations, whereby 100% of the patients in certain combination groups had no mortality related to multiple myeloma, during the study period. This overlap evaluation was consistent with the idea that there are likely considerable constraints on productive TRB-antigen-HLA combinations but more flexibility, and unpredictability, for the TRA-antigen-HLA combinations. Also, the approaches in this reported indicated the potential importance of the CTA, IGSF11, as a multiple myeloma antigen, an antigen previously, independently considered as a vaccine candidate in other settings.
Collapse
Affiliation(s)
- Hope J Wolmarans
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL, 33612, USA
| | - Vayda R Barker
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL, 33612, USA
| | - Andrea Chobrutskiy
- Department of Pediatrics, Oregon Health and Science University Hospital, Portland, OR, 97239, USA
| | - Boris I Chobrutskiy
- Department of Internal Medicine, Oregon Health and Science University Hospital, Portland, OR, 97239, USA
| | - Taha I Huda
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL, 33612, USA
| | - George Blanck
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL, 33612, USA.
- Department of Immunology, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL, 33612, USA.
| |
Collapse
|
20
|
Xu AM, Chour W, DeLucia DC, Su Y, Pavlovitch-Bedzyk AJ, Ng R, Rasheed Y, Davis MM, Lee JK, Heath JR. Entropic analysis of antigen-specific CDR3 domains identifies essential binding motifs shared by CDR3s with different antigen specificities. Cell Syst 2023; 14:273-284.e5. [PMID: 37001518 PMCID: PMC10355346 DOI: 10.1016/j.cels.2023.03.001] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Revised: 09/01/2022] [Accepted: 03/01/2023] [Indexed: 04/22/2023]
Abstract
Antigen-specific T cell receptor (TCR) sequences can have prognostic, predictive, and therapeutic value, but decoding the specificity of TCR recognition remains challenging. Unlike DNA strands that base pair, TCRs bind to their targets with different orientations and different lengths, which complicates comparisons. We present scanning parametrized by normalized TCR length (SPAN-TCR) to analyze antigen-specific TCR CDR3 sequences and identify patterns driving TCR-pMHC specificity. Using entropic analysis, SPAN-TCR identifies 2-mer motifs that decrease the diversity (entropy) of CDR3s. These motifs are the most common patterns that can predict CDR3 composition, and we identify "essential" motifs that decrease entropy in the same CDR3 α or β chain containing the 2-mer, and "super-essential" motifs that decrease entropy in both chains. Molecular dynamics analysis further suggests that these motifs may play important roles in binding. We then employ SPAN-TCR to resolve similarities in TCR repertoires against different antigens using public databases of TCR sequences.
Collapse
Affiliation(s)
- Alexander M Xu
- Institute for Systems Biology, Seattle, WA 98109, USA; Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA 91125, USA; Department of Biomedical Sciences, Cedars-Sinai Medical Center, Los Angeles, CA 90048, USA; Samuel Oschin Comprehensive Cancer Institute, Cedars-Sinai Medical Center, Los Angeles, CA 90048, USA.
| | - William Chour
- Institute for Systems Biology, Seattle, WA 98109, USA; Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125, USA; Keck School of Medicine, University of Southern California, Los Angeles, CA 91125, USA
| | - Diana C DeLucia
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Yapeng Su
- Institute for Systems Biology, Seattle, WA 98109, USA; Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA 91125, USA
| | | | - Rachel Ng
- Institute for Systems Biology, Seattle, WA 98109, USA
| | - Yusuf Rasheed
- Institute for Systems Biology, Seattle, WA 98109, USA
| | - Mark M Davis
- Computational and Systems Immunology Program, Stanford University School of Medicine, Stanford, CA 94305, USA; Institute for Immunity, Transplantation and Infection, Stanford University School of Medicine, Stanford, CA 94305, USA; Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, CA 94305, USA; Howard Hughes Medical Institute, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - John K Lee
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA; Division of Medical Oncology, Department of Medicine, University of Washington, Seattle, WA 98195, USA
| | - James R Heath
- Institute for Systems Biology, Seattle, WA 98109, USA.
| |
Collapse
|
21
|
Tippalagama R, Chihab LY, Kearns K, Lewis S, Panda S, Willemsen L, Burel JG, Lindestam Arlehamn CS. Antigen-specificity measurements are the key to understanding T cell responses. Front Immunol 2023; 14:1127470. [PMID: 37122719 PMCID: PMC10140422 DOI: 10.3389/fimmu.2023.1127470] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2022] [Accepted: 03/30/2023] [Indexed: 05/02/2023] Open
Abstract
Antigen-specific T cells play a central role in the adaptive immune response and come in a wide range of phenotypes. T cell receptors (TCRs) mediate the antigen-specificities found in T cells. Importantly, high-throughput TCR sequencing provides a fingerprint which allows tracking of specific T cells and their clonal expansion in response to particular antigens. As a result, many studies have leveraged TCR sequencing in an attempt to elucidate the role of antigen-specific T cells in various contexts. Here, we discuss the published approaches to studying antigen-specific T cells and their specific TCR repertoire. Further, we discuss how these methods have been applied to study the TCR repertoire in various diseases in order to characterize the antigen-specific T cells involved in the immune control of disease.
Collapse
|
22
|
Mayer A, Callan CG. Measures of epitope binding degeneracy from T cell receptor repertoires. Proc Natl Acad Sci U S A 2023; 120:e2213264120. [PMID: 36649423 PMCID: PMC9942805 DOI: 10.1073/pnas.2213264120] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2022] [Accepted: 12/13/2022] [Indexed: 01/19/2023] Open
Abstract
Adaptive immunity is driven by specific binding of hypervariable receptors to diverse molecular targets. The sequence diversity of receptors and targets are both individually known but because multiple receptors can recognize the same target, a measure of the effective "functional" diversity of the human immune system has remained elusive. Here, we show that sequence near-coincidences within T cell receptors that bind specific epitopes provide a new window into this problem and allow the quantification of how binding probability covaries with sequence. We find that near-coincidence statistics within epitope-specific repertoires imply a measure of binding degeneracy to amino acid changes in receptor sequence that is consistent across disparate experiments. Paired data on both chains of the heterodimeric receptor are particularly revealing since simultaneous near-coincidences are rare and we show how they can be exploited to estimate the number of epitope responses that created the memory compartment. In addition, we find that paired-chain coincidences are strongly suppressed across donors with different human leukocyte antigens, evidence for a central role of antigen-driven selection in making paired chain receptors public. These results demonstrate the power of coincidence analysis to reveal the sequence determinants of epitope binding in receptor repertoires.
Collapse
Affiliation(s)
- Andreas Mayer
- Division of Infection and Immunity, University College London, LondonWC1E 6BT, UK
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton08544, NJ
- Institute for the Physics of Living Systems, University College London, LondonWC1E 6BT, UK
| | - Curtis G. Callan
- Department of Physics, Princeton University, Princeton08544, NJ
- Institute for Advanced Study, Princeton08540, NJ
| |
Collapse
|
23
|
Akerman O, Isakov H, Levi R, Psevkin V, Louzoun Y. Counting is almost all you need. Front Immunol 2023; 13:1031011. [PMID: 36741395 PMCID: PMC9896581 DOI: 10.3389/fimmu.2022.1031011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2022] [Accepted: 12/27/2022] [Indexed: 01/21/2023] Open
Abstract
The immune memory repertoire encodes the history of present and past infections and immunological attributes of the individual. As such, multiple methods were proposed to use T-cell receptor (TCR) repertoires to detect disease history. We here show that the counting method outperforms two leading algorithms. We then show that the counting can be further improved using a novel attention model to weigh the different TCRs. The attention model is based on the projection of TCRs using a Variational AutoEncoder (VAE). Both counting and attention algorithms predict better than current leading algorithms whether the host had CMV and its HLA alleles. As an intermediate solution between the complex attention model and the very simple counting model, we propose a new Graph Convolutional Network approach that obtains the accuracy of the attention model and the simplicity of the counting model. The code for the models used in the paper is provided at: https://github.com/louzounlab/CountingIsAlmostAllYouNeed.
Collapse
Affiliation(s)
- Ofek Akerman
- Department of Mathematics, Bar-Ilan University, Ramat Gan, Israel
- Department of Computer Science, Bar-Ilan University, Ramat Gan, Israel
| | - Haim Isakov
- Department of Mathematics, Bar-Ilan University, Ramat Gan, Israel
| | - Reut Levi
- Department of Mathematics, Bar-Ilan University, Ramat Gan, Israel
| | - Vladimir Psevkin
- Department of Mathematics, Bar-Ilan University, Ramat Gan, Israel
| | - Yoram Louzoun
- Department of Mathematics, Bar-Ilan University, Ramat Gan, Israel
| |
Collapse
|
24
|
Zhang L, Li H, Zhang Z, Wang J, Chen G, Chen D, Shi W, Jia G, Liu M. Hybrid gMLP model for interaction prediction of MHC-peptide and TCR. Front Genet 2023; 13:1092822. [PMID: 36685858 PMCID: PMC9845249 DOI: 10.3389/fgene.2022.1092822] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Accepted: 12/01/2022] [Indexed: 01/05/2023] Open
Abstract
Understanding the interaction of T-cell receptor (TCR) with major histocompatibility-peptide (MHC-peptide) complex is extremely important in human immunotherapy and vaccine development. However, due to the limited available data, the performance of existing models for predicting the interaction of T-cell receptors (TCR) with major histocompatibility-peptide complexes is still unsatisfactory. Deep learning models have been applied to prediction tasks in various fields and have achieved better results compared with other traditional models. In this study, we leverage the gMLP model combined with attention mechanism to predict the interaction of MHC-peptide and TCR. Experiments show that our model can predict TCR-peptide interactions accurately and can handle the problems caused by different TCR lengths. Moreover, we demonstrate that the models trained with paired CDR3β-chain and CDR3α-chain data are better than those trained with only CDR3β-chain or with CDR3α-chain data. We also demonstrate that the hybrid model has greater potential than the traditional convolutional neural network.
Collapse
Affiliation(s)
- Lichao Zhang
- School of Intelligent Manufacturing and Equipment, Shenzhen Institute of Information Technology, Shenzhen, China
| | - Haojin Li
- School of Software, Shandong University, Jinan, China
| | - Zhenjiu Zhang
- School of Intelligent Manufacturing and Equipment, Shenzhen Institute of Information Technology, Shenzhen, China
| | - Jinjin Wang
- School of Intelligent Manufacturing and Equipment, Shenzhen Institute of Information Technology, Shenzhen, China
| | | | | | - Wentao Shi
- School of Intelligent Manufacturing and Equipment, Shenzhen Institute of Information Technology, Shenzhen, China
| | - Gaozhi Jia
- School of Intelligent Manufacturing and Equipment, Shenzhen Institute of Information Technology, Shenzhen, China
| | - Mingjun Liu
- School of Intelligent Manufacturing and Equipment, Shenzhen Institute of Information Technology, Shenzhen, China
| |
Collapse
|
25
|
Hudock TR, Barker VR, Manley BJ, Chobrutskiy A, Chobrutskiy BI, Diaz MJ, Song JJ, Blanck G. TRB CDR3-cancer testis antigen chemical complementarity scoring for identifying productive immune responses in renal cell carcinoma. Cancer Biomark 2023; 38:103-110. [PMID: 37545223 DOI: 10.3233/cbm-230047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/08/2023]
Abstract
BACKGROUND Immunogenomics approaches to the characterization of renal cell carcinoma (RCC) have helped to better our understanding of the features of RCC immune dysfunction. However, much is still unknown with regard to specific immune interactions and their impact in the tumor microenvironment. OBJECTIVE This study applied chemical complementarity scoring for the TRB complementarity determining region-3 (CDR3) amino acid sequences and cancer testis antigens (CTAs) to determine whether such complementarity correlated with survival and the expression of immune marker genes. METHODS TRB recombination reads from RCC tumor samples from RNAseq files obtained from two separate databases, Moffitt Cancer Center and The Cancer Genome Atlas (TCGA), were evaluated. Chemical complementarity scores (CSs) were calculated for TRB CDR3-CTA pairs and survival assessments based on those CSs were performed. RESULTS Moffitt Cancer Center and TCGA cases representing the upper 50th percentile of chemical CSs for TRB CDR3 amino acid sequences and the CTA POTEA were found to be associated with a better overall survival (OS) Also, greater tumor RNA expression of multiple immune signature genes, including granzyme A, granzyme B, and interferon-gamma were correlated with the higher chemical CSs. CONCLUSIONS These results indicate that TRB CDR3-CTA chemical complementarity scoring may be useful in distinguishing RCC cases with a productive, anti-tumor immune response from cases where basic immune parameter assessments are inconsistent with a productive immune response.
Collapse
Affiliation(s)
- Tabitha R Hudock
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL, USA
| | - Vayda R Barker
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL, USA
| | - Brandon J Manley
- Department of Genitourinary Oncology, Moffitt Cancer Center and Research Institute, Tampa, FL, USA
| | - Andrea Chobrutskiy
- Department of Pediatrics, Oregon Health and Science University Hospital, Portland, OR, USA
| | - Boris I Chobrutskiy
- Department of Internal Medicine, Oregon Health and Science University Hospital, Portland, OR, USA
| | - Michael J Diaz
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL, USA
| | - Joanna J Song
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL, USA
| | - George Blanck
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL, USA
- Department of Immunology, Moffitt Cancer Center and Research Institute, Tampa, FL, USA
| |
Collapse
|
26
|
Grazioli F, Machart P, Mösch A, Li K, Castorina LV, Pfeifer N, Min MR. Attentive Variational Information Bottleneck for TCR-peptide interaction prediction. Bioinformatics 2022; 39:6960920. [PMID: 36571499 PMCID: PMC9825246 DOI: 10.1093/bioinformatics/btac820] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2022] [Revised: 11/18/2022] [Accepted: 12/23/2022] [Indexed: 12/27/2022] Open
Abstract
MOTIVATION We present a multi-sequence generalization of Variational Information Bottleneck and call the resulting model Attentive Variational Information Bottleneck (AVIB). Our AVIB model leverages multi-head self-attention to implicitly approximate a posterior distribution over latent encodings conditioned on multiple input sequences. We apply AVIB to a fundamental immuno-oncology problem: predicting the interactions between T-cell receptors (TCRs) and peptides. RESULTS Experimental results on various datasets show that AVIB significantly outperforms state-of-the-art methods for TCR-peptide interaction prediction. Additionally, we show that the latent posterior distribution learned by AVIB is particularly effective for the unsupervised detection of out-of-distribution amino acid sequences. AVAILABILITY AND IMPLEMENTATION The code and the data used for this study are publicly available at: https://github.com/nec-research/vibtcr. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | - Pierre Machart
- Biomedical AI Group, NEC Laboratories Europe, Heidelberg 69115, Germany
| | - Anja Mösch
- Biomedical AI Group, NEC Laboratories Europe, Heidelberg 69115, Germany
| | - Kai Li
- Machine Learning Department, NEC Laboratories America, Princeton, NJ 08540, USA
| | | | - Nico Pfeifer
- Methods in Medical Informatics, Department of Computer Science, University of Tübingen, Tübingen 72076, Germany
| | | |
Collapse
|
27
|
Guo Z, Yamaguchi R. Machine learning methods for protein-protein binding affinity prediction in protein design. FRONTIERS IN BIOINFORMATICS 2022; 2:1065703. [PMID: 36591334 PMCID: PMC9800603 DOI: 10.3389/fbinf.2022.1065703] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2022] [Accepted: 12/01/2022] [Indexed: 12/23/2022] Open
Abstract
Protein-protein interactions govern a wide range of biological activity. A proper estimation of the protein-protein binding affinity is vital to design proteins with high specificity and binding affinity toward a target protein, which has a variety of applications including antibody design in immunotherapy, enzyme engineering for reaction optimization, and construction of biosensors. However, experimental and theoretical modelling methods are time-consuming, hinder the exploration of the entire protein space, and deter the identification of optimal proteins that meet the requirements of practical applications. In recent years, the rapid development in machine learning methods for protein-protein binding affinity prediction has revealed the potential of a paradigm shift in protein design. Here, we review the prediction methods and associated datasets and discuss the requirements and construction methods of binding affinity prediction models for protein design.
Collapse
Affiliation(s)
- Zhongliang Guo
- Division of Cancer Systems Biology, Aichi Cancer Center Research Institute, Nagoya, Aichi, Japan
| | - Rui Yamaguchi
- Division of Cancer Systems Biology, Aichi Cancer Center Research Institute, Nagoya, Aichi, Japan,Division of Cancer Informatics, Nagoya University Graduate School of Medicine, Nagoya, Aichi, Japan,*Correspondence: Rui Yamaguchi,
| |
Collapse
|
28
|
Grazioli F, Mösch A, Machart P, Li K, Alqassem I, O’Donnell TJ, Min MR. On TCR binding predictors failing to generalize to unseen peptides. Front Immunol 2022; 13:1014256. [PMID: 36341448 PMCID: PMC9634250 DOI: 10.3389/fimmu.2022.1014256] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2022] [Accepted: 10/05/2022] [Indexed: 11/18/2022] Open
Abstract
Several recent studies investigate TCR-peptide/-pMHC binding prediction using machine learning or deep learning approaches. Many of these methods achieve impressive results on test sets, which include peptide sequences that are also included in the training set. In this work, we investigate how state-of-the-art deep learning models for TCR-peptide/-pMHC binding prediction generalize to unseen peptides. We create a dataset including positive samples from IEDB, VDJdb, McPAS-TCR, and the MIRA set, as well as negative samples from both randomization and 10X Genomics assays. We name this collection of samples TChard. We propose the hard split, a simple heuristic for training/test split, which ensures that test samples exclusively present peptides that do not belong to the training set. We investigate the effect of different training/test splitting techniques on the models’ test performance, as well as the effect of training and testing the models using mismatched negative samples generated randomly, in addition to the negative samples derived from assays. Our results show that modern deep learning methods fail to generalize to unseen peptides. We provide an explanation why this happens and verify our hypothesis on the TChard dataset. We then conclude that robust prediction of TCR recognition is still far for being solved.
Collapse
Affiliation(s)
- Filippo Grazioli
- Biomedical AI Group, NEC Laboratories Europe, Heidelberg, Germany
- *Correspondence: Filippo Grazioli, ; Martin Renqiang Min,
| | - Anja Mösch
- Biomedical AI Group, NEC Laboratories Europe, Heidelberg, Germany
| | - Pierre Machart
- Biomedical AI Group, NEC Laboratories Europe, Heidelberg, Germany
| | - Kai Li
- Machine Learning Department, NEC Laboratories America, Princeton, NJ, United States
| | - Israa Alqassem
- Biomedical AI Group, NEC Laboratories Europe, Heidelberg, Germany
| | - Timothy J. O’Donnell
- Division of Hematology and Medical Oncology, Icahn School of Medicine at Mount Sinai, New York, NY, United States
| | - Martin Renqiang Min
- Machine Learning Department, NEC Laboratories America, Princeton, NJ, United States
- *Correspondence: Filippo Grazioli, ; Martin Renqiang Min,
| |
Collapse
|
29
|
Mateyka LM, Strobl PM, Jarosch S, Scheu SJC, Busch DH, D'Ippolito E. Gene Signatures of T-Cell Activation Can Serve as Predictors of Functionality for SARS-CoV-2-Specific T-Cell Receptors. Vaccines (Basel) 2022; 10:vaccines10101617. [PMID: 36298482 PMCID: PMC9611811 DOI: 10.3390/vaccines10101617] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2022] [Revised: 09/15/2022] [Accepted: 09/23/2022] [Indexed: 11/16/2022] Open
Abstract
The importance of T cells in controlling SARS-CoV-2 infections has been demonstrated widely, but insights into the quality of these responses are still limited due to technical challenges. Indeed, understanding the functionality of the T-cell receptor (TCR) repertoire of a polyclonal antigen-specific population still requires the tedious work of T-cell cloning or TCR re-expression and subsequent characterization. In this work, we show that it is possible to discriminate highly functional and bystander TCRs based on gene signatures of T-cell activation induced by recent peptide stimulation. SARS-CoV-2-specific TCRs previously identified by cytokine release after peptide restimulation and subsequent single-cell RNA sequencing were re-expressed via CRISPR-Cas9-mediated gene editing into a Jurkat-based reporter cell line system suitable for high-throughput screening. We could observe differences in SARS-CoV-2 epitope recognition as well as a wide range of functional avidities. By correlating these in vitro TCR engineered functional data with the transcriptomic profiles of the corresponding TCR-expressing parental T cells, we could validate that gene signatures of recent T-cell activation accurately identify and predict truly SARS-CoV-2-specific TCRs. In summary, this work paves the way for alternative approaches useful for the functional analysis of global antigen-specific TCR repertoires with largely improved throughput.
Collapse
Affiliation(s)
- Laura M Mateyka
- Institute for Medical Microbiology, Immunology and Hygiene, Technical University of Munich (TUM), 81675 Munich, Germany
| | - Philipp M Strobl
- Institute for Medical Microbiology, Immunology and Hygiene, Technical University of Munich (TUM), 81675 Munich, Germany
| | - Sebastian Jarosch
- Institute for Medical Microbiology, Immunology and Hygiene, Technical University of Munich (TUM), 81675 Munich, Germany
| | - Sebastian J C Scheu
- Institute for Medical Microbiology, Immunology and Hygiene, Technical University of Munich (TUM), 81675 Munich, Germany
| | - Dirk H Busch
- Institute for Medical Microbiology, Immunology and Hygiene, Technical University of Munich (TUM), 81675 Munich, Germany
- German Center for Infection Research (DZIF), Partner Site Munich, 81675 Munich, Germany
| | - Elvira D'Ippolito
- Institute for Medical Microbiology, Immunology and Hygiene, Technical University of Munich (TUM), 81675 Munich, Germany
| |
Collapse
|
30
|
Sidhom JW, Oliveira G, Ross-MacDonald P, Wind-Rotolo M, Wu CJ, Pardoll DM, Baras AS. Deep learning reveals predictive sequence concepts within immune repertoires to immunotherapy. SCIENCE ADVANCES 2022; 8:eabq5089. [PMID: 36112691 PMCID: PMC9481116 DOI: 10.1126/sciadv.abq5089] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/12/2022] [Accepted: 07/29/2022] [Indexed: 06/09/2023]
Abstract
T cell receptor (TCR) sequencing has been used to characterize the immune response to cancer. However, most analyses have been restricted to quantitative measures such as clonality that do not leverage the complementarity-determining region 3 (CDR3) sequence. We use DeepTCR, a framework of deep learning algorithms, to reveal sequence concepts that are predictive of response to immunotherapy. We demonstrate that DeepTCR can predict response and use the model to infer the antigenic specificities of the predictive signature and their unique dynamics during therapy. The predictive signature of nonresponse is associated with high frequencies of TCRs predicted to recognize tumor-specific antigens, and these tumor-specific TCRs undergo a higher degree of dynamic changes on therapy in nonresponders versus responders. These results are consistent with a biological model where the hallmark of nonresponders is an accumulation of tumor-specific T cells that undergo turnover on therapy, possibly because of the dysfunctional state of these T cells in nonresponders.
Collapse
Affiliation(s)
- John-William Sidhom
- Bloomberg Kimmel Institute for Cancer Immunotherapy, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- The Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Biomedical Engineering, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Giacomo Oliveira
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
| | | | | | - Catherine J. Wu
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Medicine, Brigham and Women’s Hospital, Boston, MA, USA
| | - Drew M. Pardoll
- Bloomberg Kimmel Institute for Cancer Immunotherapy, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- The Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Alexander S. Baras
- Bloomberg Kimmel Institute for Cancer Immunotherapy, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- The Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| |
Collapse
|
31
|
Trofimov A, Brouillard P, Larouche JD, Séguin J, Laverdure JP, Brasey A, Ehx G, Roy DC, Busque L, Lachance S, Lemieux S, Perreault C. Two types of human TCR differentially regulate reactivity to self and non-self antigens. iScience 2022; 25:104968. [PMID: 36111255 PMCID: PMC9468382 DOI: 10.1016/j.isci.2022.104968] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2022] [Revised: 06/24/2022] [Accepted: 08/12/2022] [Indexed: 11/30/2022] Open
Abstract
Based on analyses of TCR sequences from over 1,000 individuals, we report that the TCR repertoire is composed of two ontogenically and functionally distinct types of TCRs. Their production is regulated by variations in thymic output and terminal deoxynucleotidyl transferase (TDT) activity. Neonatal TCRs derived from TDT-negative progenitors persist throughout life, are highly shared among subjects, and are reported as disease-associated. Thus, 10%–30% of most frequent cord blood TCRs are associated with common pathogens and autoantigens. TDT-dependent TCRs present distinct structural features and are less shared among subjects. TDT-dependent TCRs are produced in maximal numbers during infancy when thymic output and TDT activity reach a summit, are more abundant in subjects with AIRE mutations, and seem to play a dominant role in graft-versus-host disease. Factors decreasing thymic output (age, male sex) negatively impact TCR diversity. Males compensate for their lower repertoire diversity via hyperexpansion of selected TCR clonotypes. Over 108 TCR CDR3 sequences from ∼103 individuals and 7 cohorts were analyzed The TCR repertoire is composed of two layers: neonatal and TDT-dependent layer ∼70% of frequent cord blood TCRs are associated with common pathogens Acute graft-vs-host disease correlates with a high proportion of TDT-dependent TCRs
Collapse
Affiliation(s)
- Assya Trofimov
- Institute for Research in Immunology and Cancer (IRIC), Université de Montréal, Montreal, Quebec H3C 3J7, Canada
- Department of Computer Science and Research Operations, Université de Montréal, Montreal, Quebec H3C 3J7, Canada
- Quebec Institute for Learning Algorithms (Mila), Montreal, Quebec H2S 3H1, Canada
- Currently Fred Hutchinson Cancer Center, Seattle, WA 98109, USA
- Currently Department of Physics, University of Washington, Seattle, WA 98195-1560, USA
| | - Philippe Brouillard
- Department of Computer Science and Research Operations, Université de Montréal, Montreal, Quebec H3C 3J7, Canada
- Quebec Institute for Learning Algorithms (Mila), Montreal, Quebec H2S 3H1, Canada
| | - Jean-David Larouche
- Institute for Research in Immunology and Cancer (IRIC), Université de Montréal, Montreal, Quebec H3C 3J7, Canada
- Department of Medicine, Université de Montréal, Montreal, Quebec H3C 3J7, Canada
| | - Jonathan Séguin
- Institute for Research in Immunology and Cancer (IRIC), Université de Montréal, Montreal, Quebec H3C 3J7, Canada
| | - Jean-Philippe Laverdure
- Institute for Research in Immunology and Cancer (IRIC), Université de Montréal, Montreal, Quebec H3C 3J7, Canada
| | - Ann Brasey
- Maisonneuve-Rosemont Hospital, Montreal, Quebec H1T 2M4, Canada
| | - Gregory Ehx
- Institute for Research in Immunology and Cancer (IRIC), Université de Montréal, Montreal, Quebec H3C 3J7, Canada
- Currently Interdisciplinary Cluster for Applied Geno-Proteomics (GIGA-I3), University of Liege, Liege 4000, Belgium
| | | | - Lambert Busque
- Maisonneuve-Rosemont Hospital, Montreal, Quebec H1T 2M4, Canada
| | - Silvy Lachance
- Department of Medicine, Université de Montréal, Montreal, Quebec H3C 3J7, Canada
- Maisonneuve-Rosemont Hospital, Montreal, Quebec H1T 2M4, Canada
| | - Sébastien Lemieux
- Institute for Research in Immunology and Cancer (IRIC), Université de Montréal, Montreal, Quebec H3C 3J7, Canada
- Department of Computer Science and Research Operations, Université de Montréal, Montreal, Quebec H3C 3J7, Canada
- Department of Biochemistry at University of Montreal, Université de Montréal, Montreal, Quebec H3C 3J7, Canada
- Corresponding author
| | - Claude Perreault
- Institute for Research in Immunology and Cancer (IRIC), Université de Montréal, Montreal, Quebec H3C 3J7, Canada
- Department of Medicine, Université de Montréal, Montreal, Quebec H3C 3J7, Canada
- Maisonneuve-Rosemont Hospital, Montreal, Quebec H1T 2M4, Canada
- Corresponding author
| |
Collapse
|
32
|
Katayama Y, Yokota R, Akiyama T, Kobayashi TJ. Machine Learning Approaches to TCR Repertoire Analysis. Front Immunol 2022; 13:858057. [PMID: 35911778 PMCID: PMC9334875 DOI: 10.3389/fimmu.2022.858057] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2022] [Accepted: 06/07/2022] [Indexed: 11/13/2022] Open
Abstract
Sparked by the development of genome sequencing technology, the quantity and quality of data handled in immunological research have been changing dramatically. Various data and database platforms are now driving the rapid progress of machine learning for immunological data analysis. Of various topics in immunology, T cell receptor repertoire analysis is one of the most important targets of machine learning for assessing the state and abnormalities of immune systems. In this paper, we review recent repertoire analysis methods based on machine learning and deep learning and discuss their prospects.
Collapse
Affiliation(s)
- Yotaro Katayama
- Graduate School of Engineering, The University of Tokyo, Tokyo, Japan
| | - Ryo Yokota
- National Research Institute of Police Science, Kashiwa, Chiba, Japan
| | - Taishin Akiyama
- Laboratory for Immune Homeostasis, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
- Graduate School of Medical Life Science, Yokohama City University, Yokohama, Japan
| | - Tetsuya J. Kobayashi
- Graduate School of Engineering, The University of Tokyo, Tokyo, Japan
- Institute of Industrial Science, The University of Tokyo, Tokyo, Japan
| |
Collapse
|
33
|
Papadopoulou I, Nguyen AP, Weber A, Martínez MR. DECODE: a computational pipeline to discover T cell receptor binding rules. Bioinformatics 2022; 38:i246-i254. [PMID: 35758821 PMCID: PMC9235487 DOI: 10.1093/bioinformatics/btac257] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
Motivation Understanding the mechanisms underlying T cell receptor (TCR) binding is of fundamental importance to understanding adaptive immune responses. A better understanding of the biochemical rules governing TCR binding can be used, e.g. to guide the design of more powerful and safer T cell-based therapies. Advances in repertoire sequencing technologies have made available millions of TCR sequences. Data abundance has, in turn, fueled the development of many computational models to predict the binding properties of TCRs from their sequences. Unfortunately, while many of these works have made great strides toward predicting TCR specificity using machine learning, the black-box nature of these models has resulted in a limited understanding of the rules that govern the binding of a TCR and an epitope. Results We present an easy-to-use and customizable computational pipeline, DECODE, to extract the binding rules from any black-box model designed to predict the TCR-epitope binding. DECODE offers a range of analytical and visualization tools to guide the user in the extraction of such rules. We demonstrate our pipeline on a recently published TCR-binding prediction model, TITAN, and show how to use the provided metrics to assess the quality of the computed rules. In conclusion, DECODE can lead to a better understanding of the sequence motifs that underlie TCR binding. Our pipeline can facilitate the investigation of current immunotherapeutic challenges, such as cross-reactive events due to off-target TCR binding. Availability and implementation Code is available publicly at https://github.com/phineasng/DECODE. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Iliana Papadopoulou
- IBM Research Europe, 8803 Rüschlikon, Switzerland.,ETH Zurich, Department of Biosystems Science and Engineering (D-BSSE), 4058 Basel, Switzerland
| | - An-Phi Nguyen
- IBM Research Europe, 8803 Rüschlikon, Switzerland.,ETH Zurich, Department of Mathematics (D-Math), 8092 Zurich, Switzerland
| | - Anna Weber
- IBM Research Europe, 8803 Rüschlikon, Switzerland.,ETH Zurich, Department of Biosystems Science and Engineering (D-BSSE), 4058 Basel, Switzerland
| | | |
Collapse
|
34
|
Glazer N, Akerman O, Louzoun Y. Naive and memory T cells TCR-HLA-binding prediction. OXFORD OPEN IMMUNOLOGY 2022; 3:iqac001. [PMID: 36846560 PMCID: PMC9914496 DOI: 10.1093/oxfimm/iqac001] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2022] [Revised: 05/01/2022] [Accepted: 05/17/2022] [Indexed: 11/12/2022] Open
Abstract
T cells recognize antigens through the interaction of their T cell receptor (TCR) with a peptide-major histocompatibility complex (pMHC) molecule. Following thymic-positive selection, TCRs in peripheral naive T cells are expected to bind MHC alleles of the host. Peripheral clonal selection is expected to further increase the frequency of antigen-specific TCRs that bind to the host MHC alleles. To check for a systematic preference for MHC-binding T cells in TCR repertoires, we developed Natural Language Processing-based methods to predict TCR-MHC binding independently of the peptide presented for Class I MHC alleles. We trained a classifier on published TCR-pMHC binding pairs and obtained a high area under curve (AUC) of over 0.90 on the test set. However, when applied to TCR repertoires, the accuracy of the classifier dropped. We thus developed a two-stage prediction model, based on large-scale naive and memory TCR repertoires, denoted TCR HLA-binding predictor (CLAIRE). Since each host carries multiple human leukocyte antigen (HLA) alleles, we first computed whether a TCR on a CD8 T cell binds an MHC from any of the host Class-I HLA alleles. We then performed an iteration, where we predict the binding with the most probable allele from the first round. We show that this classifier is more precise for memory than for naïve cells. Moreover, it can be transferred between datasets. Finally, we developed a CD4-CD8 T cell classifier to apply CLAIRE to unsorted bulk sequencing datasets and showed a high AUC of 0.96 and 0.90 on large datasets. CLAIRE is available through a GitHub at: https://github.com/louzounlab/CLAIRE, and as a server at: https://claire.math.biu.ac.il/Home.
Collapse
Affiliation(s)
- Neta Glazer
- Department of Mathematics, Bar-Ilan University, Ramat Gan, Israel
| | - Ofek Akerman
- Department of Mathematics, Bar-Ilan University, Ramat Gan, Israel
| | - Yoram Louzoun
- Correspondence address. Department of Mathematics, Bar-Ilan University, Ramat Gan, Israel. E-mail:
| |
Collapse
|
35
|
Abondio P, De Intinis C, da Silva Gonçalves Vianez Júnior JL, Pace L. SINGLE CELL MULTIOMIC APPROACHES TO DISENTANGLE T CELL HETEROGENEITY. Immunol Lett 2022; 246:37-51. [DOI: 10.1016/j.imlet.2022.04.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2021] [Revised: 04/16/2022] [Accepted: 04/26/2022] [Indexed: 11/29/2022]
|
36
|
Lantz O, Teyton L. Identification of T cell antigens in the 21st century, as difficult as ever. Semin Immunol 2022; 60:101659. [PMID: 36183497 PMCID: PMC10332289 DOI: 10.1016/j.smim.2022.101659] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Abstract
Identifying antigens recognized by T cells is still challenging, particularly for innate like T cells that do not recognize peptides but small metabolites or lipids in the context of MHC-like molecules or see non-MHC restricted antigens. The fundamental reason for this situation is the low affinity of T cell receptors for their ligands coupled with a level of degeneracy that makes them bind to similar surfaces on antigen presenting cells. Herein we will describe non-exhaustively some of the methods that were used to identify peptide antigens and briefly mention the high throughput methods more recently proposed for that purpose. We will then present how the molecules recognized by innate like T cells (NKT, MAIT and γδ T cells) were discovered. We will show that serendipity was instrumental in many cases.
Collapse
Affiliation(s)
- Olivier Lantz
- INSERM U932, PSL University, Institut Curie, 75005 Paris, France; Laboratoire d'Immunologie Clinique, Institut Curie, Paris 75005, France; Centre d'investigation Clinique en Biothérapie Gustave-Roussy Institut Curie (CIC-BT1428) Institut Curie, Paris 75005, France
| | - Luc Teyton
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA 92037, USA.
| |
Collapse
|
37
|
Rickenbach C, Gericke C. Specificity of Adaptive Immune Responses in Central Nervous System Health, Aging and Diseases. Front Neurosci 2022; 15:806260. [PMID: 35126045 PMCID: PMC8812614 DOI: 10.3389/fnins.2021.806260] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2021] [Accepted: 12/29/2021] [Indexed: 12/25/2022] Open
Abstract
The field of neuroimmunology endorses the involvement of the adaptive immune system in central nervous system (CNS) health, disease, and aging. While immune cell trafficking into the CNS is highly regulated, small numbers of antigen-experienced lymphocytes can still enter the cerebrospinal fluid (CSF)-filled compartments for regular immune surveillance under homeostatic conditions. Meningeal lymphatics facilitate drainage of brain-derived antigens from the CSF to deep cervical lymph nodes to prime potential adaptive immune responses. During aging and CNS disorders, brain barriers and meningeal lymphatic functions are impaired, and immune cell trafficking and antigen efflux are altered. In this context, alterations in the immune cell repertoire of blood and CSF and T and B cells primed against CNS-derived autoantigens have been observed in various CNS disorders. However, for many diseases, a causal relationship between observed immune responses and neuropathological findings is lacking. Here, we review recent discoveries about the association between the adaptive immune system and CNS disorders such as autoimmune neuroinflammatory and neurodegenerative diseases. We focus on the current challenges in identifying specific T cell epitopes in CNS diseases and discuss the potential implications for future diagnostic and treatment options.
Collapse
Affiliation(s)
- Chiara Rickenbach
- Institute for Regenerative Medicine, University of Zurich, Schlieren, Switzerland
| | - Christoph Gericke
- Institute for Regenerative Medicine, University of Zurich, Schlieren, Switzerland
| |
Collapse
|
38
|
Integrating T cell receptor sequences and transcriptional profiles by clonotype neighbor graph analysis (CoNGA). Nat Biotechnol 2022; 40:54-63. [PMID: 34426704 PMCID: PMC8832949 DOI: 10.1038/s41587-021-00989-2] [Citation(s) in RCA: 70] [Impact Index Per Article: 23.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2020] [Accepted: 06/18/2021] [Indexed: 02/07/2023]
Abstract
Links between T cell clonotypes, as defined by T cell receptor (TCR) sequences, and phenotype, as reflected in gene expression (GEX) profiles, surface protein expression and peptide:major histocompatibility complex binding, can reveal functional relationships beyond the features shared by clonally related cells. Here we present clonotype neighbor graph analysis (CoNGA), a graph theoretic approach that identifies correlations between GEX profile and TCR sequence through statistical analysis of GEX and TCR similarity graphs. Using CoNGA, we uncovered associations between TCR sequence and GEX profiles that include a previously undescribed 'natural lymphocyte' population of human circulating CD8+ T cells and a set of TCR sequence determinants of differentiation in thymocytes. These examples show that CoNGA might help elucidate complex relationships between TCR sequence and T cell phenotype in large, heterogeneous, single-cell datasets.
Collapse
|
39
|
Wang Y, Mai G, Zou M, Long H, Chen YQ, Sun L, Tian D, Zhao Y, Jiang G, Cao Z, Du X. Heavy chain sequence-based classifier for the specificity of human antibodies. Brief Bioinform 2021; 23:6483065. [PMID: 34953464 DOI: 10.1093/bib/bbab516] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2021] [Revised: 10/07/2021] [Accepted: 11/12/2021] [Indexed: 11/13/2022] Open
Abstract
Antibodies specifically bind to antigens and are an essential part of the immune system. Hence, antibodies are powerful tools in research and diagnostics. High-throughput sequencing technologies have promoted comprehensive profiling of the immune repertoire, which has resulted in large amounts of antibody sequences that remain to be further analyzed. In this study, antibodies were downloaded from IMGT/LIGM-DB and Sequence Read Archive databases. Contributing features from antibody heavy chains were formulated as numerical inputs and fed into an ensemble machine learning classifier to classify the antigen specificity of six classes of antibodies, namely anti-HIV-1, anti-influenza virus, anti-pneumococcal polysaccharide, anti-citrullinated protein, anti-tetanus toxoid and anti-hepatitis B virus. The classifier was validated using cross-validation and a testing dataset. The ensemble classifier achieved a macro-average area under the receiver operating characteristic curve (AUC) of 0.9246 from the 10-fold cross-validation, and 0.9264 for the testing dataset. Among the contributing features, the contribution of the complementarity-determining regions was 53.1% and that of framework regions was 46.9%, and the amino acid mutation rates occupied the first and second ranks among the top five contributing features. The classifier and insights provided in this study could promote the mechanistic study, isolation and utilization of potential therapeutic antibodies.
Collapse
Affiliation(s)
- Yaqi Wang
- School of Public Health (Shenzhen), Shenzhen Campus of Sun Yat-sen University, Shenzhen 518107, P.R. China.,School of Public Health (Shenzhen), Sun Yat-sen University, Guangzhou 510275, P.R. China
| | - Guoqin Mai
- School of Public Health (Shenzhen), Shenzhen Campus of Sun Yat-sen University, Shenzhen 518107, P.R. China.,School of Public Health (Shenzhen), Sun Yat-sen University, Guangzhou 510275, P.R. China
| | - Min Zou
- School of Public Health (Shenzhen), Shenzhen Campus of Sun Yat-sen University, Shenzhen 518107, P.R. China.,School of Public Health (Shenzhen), Sun Yat-sen University, Guangzhou 510275, P.R. China
| | - Haoyu Long
- School of Public Health (Shenzhen), Shenzhen Campus of Sun Yat-sen University, Shenzhen 518107, P.R. China.,School of Public Health (Shenzhen), Sun Yat-sen University, Guangzhou 510275, P.R. China
| | - Yao-Qing Chen
- School of Public Health (Shenzhen), Shenzhen Campus of Sun Yat-sen University, Shenzhen 518107, P.R. China.,School of Public Health (Shenzhen), Sun Yat-sen University, Guangzhou 510275, P.R. China
| | - Litao Sun
- School of Public Health (Shenzhen), Shenzhen Campus of Sun Yat-sen University, Shenzhen 518107, P.R. China.,School of Public Health (Shenzhen), Sun Yat-sen University, Guangzhou 510275, P.R. China
| | - Dechao Tian
- School of Public Health (Shenzhen), Shenzhen Campus of Sun Yat-sen University, Shenzhen 518107, P.R. China.,School of Public Health (Shenzhen), Sun Yat-sen University, Guangzhou 510275, P.R. China
| | - Yang Zhao
- School of Public Health (Shenzhen), Shenzhen Campus of Sun Yat-sen University, Shenzhen 518107, P.R. China.,School of Public Health (Shenzhen), Sun Yat-sen University, Guangzhou 510275, P.R. China
| | - Guozhi Jiang
- School of Public Health (Shenzhen), Shenzhen Campus of Sun Yat-sen University, Shenzhen 518107, P.R. China.,School of Public Health (Shenzhen), Sun Yat-sen University, Guangzhou 510275, P.R. China
| | - Zicheng Cao
- School of Public Health (Shenzhen), Shenzhen Campus of Sun Yat-sen University, Shenzhen 518107, P.R. China.,School of Public Health (Shenzhen), Sun Yat-sen University, Guangzhou 510275, P.R. China
| | - Xiangjun Du
- School of Public Health (Shenzhen), Shenzhen Campus of Sun Yat-sen University, Shenzhen 518107, P.R. China.,School of Public Health (Shenzhen), Sun Yat-sen University, Guangzhou 510275, P.R. China.,Key Laboratory of Tropical Disease Control, Ministry of Education, Sun Yat-sen University, Guangzhou, 510030, P.R. China
| |
Collapse
|
40
|
Lood C, Boeckaerts D, Stock M, De Baets B, Lavigne R, van Noort V, Briers Y. Digital phagograms: predicting phage infectivity through a multilayer machine learning approach. Curr Opin Virol 2021; 52:174-181. [PMID: 34952265 DOI: 10.1016/j.coviro.2021.12.004] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2021] [Revised: 11/26/2021] [Accepted: 12/04/2021] [Indexed: 12/19/2022]
Abstract
Machine learning has been broadly implemented to investigate biological systems. In this regard, the field of phage biology has embraced machine learning to elucidate and predict phage-host interactions, based on receptor-binding proteins, (anti-)defense systems, prophage detection, and life cycle recognition. Here, we highlight the enormous potential of integrating information from omics data with insights from systems biology to better understand phage-host interactions. We conceptualize and discuss the potential of a multilayer model that mirrors the phage infection process, integrating adsorption, bacterial pan-immune components and hijacking of the bacterial metabolism to predict phage infectivity. In the future, this model can offer insights into the underlying mechanisms of the infection process, and digital phagograms can support phage cocktail design and phage engineering.
Collapse
Affiliation(s)
- Cédric Lood
- Laboratory of Gene Technology, Department of Biosystems, KU Leuven, Leuven, Belgium; Centre of Microbial and Plant Genetics, Department of Microbial and Molecular Systems, KU Leuven, Leuven, Belgium
| | - Dimitri Boeckaerts
- Laboratory of Applied Biotechnology, Department of Biotechnology, Ghent University, Ghent, Belgium; KERMIT, Department of Data Analysis and Mathematical Modelling, Ghent University, Ghent, Belgium
| | - Michiel Stock
- KERMIT, Department of Data Analysis and Mathematical Modelling, Ghent University, Ghent, Belgium; BIOBIX, Department of Data Analysis and Mathematical Modelling, Ghent University, Ghent, Belgium
| | - Bernard De Baets
- KERMIT, Department of Data Analysis and Mathematical Modelling, Ghent University, Ghent, Belgium
| | - Rob Lavigne
- Laboratory of Gene Technology, Department of Biosystems, KU Leuven, Leuven, Belgium.
| | - Vera van Noort
- Centre of Microbial and Plant Genetics, Department of Microbial and Molecular Systems, KU Leuven, Leuven, Belgium; Institute of Biology, Leiden University, Leiden, The Netherlands.
| | - Yves Briers
- Laboratory of Applied Biotechnology, Department of Biotechnology, Ghent University, Ghent, Belgium.
| |
Collapse
|
41
|
Ceglia V, Kelley EJ, Boyle AS, Zurawski S, Mead HL, Harms CE, Blanck JP, Flamar AL, Kirschman JH, Ogongo P, Ernst JD, Levy Y, Zurawski G, Altin JA. A Framework to Identify Antigen-Expanded T Cell Receptor Clusters Within Complex Repertoires. Front Immunol 2021; 12:735584. [PMID: 34917073 PMCID: PMC8670329 DOI: 10.3389/fimmu.2021.735584] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2021] [Accepted: 11/01/2021] [Indexed: 11/13/2022] Open
Abstract
Common approaches for monitoring T cell responses are limited in their multiplexity and sensitivity. In contrast, deep sequencing of the T Cell Receptor (TCR) repertoire provides a global view that is limited only in terms of theoretical sensitivity due to the depth of available sampling; however, the assignment of antigen specificities within TCR repertoires has become a bottleneck. This study combines antigen-driven expansion, deep TCR sequencing, and a novel analysis framework to show that homologous ‘Clusters of Expanded TCRs (CETs)’ can be confidently identified without cell isolation, and assigned to antigen against a background of non-specific clones. We show that clonotypes within each CET respond to the same epitope, and that protein antigens stimulate multiple CETs reactive to constituent peptides. Finally, we demonstrate the personalized assignment of antigen-specificity to rare clones within fully-diverse uncultured repertoires. The method presented here may be used to monitor T cell responses to vaccination and immunotherapy with high fidelity.
Collapse
Affiliation(s)
- Valentina Ceglia
- Baylor Institute for Immunology Research, Dallas, TX, United States.,Université Paris-Est Créteil, Sciences de la Vie et de la Santé, Créteil, France.,Vaccine Research Institute, INSERM, Unité U955, Institut Mondor de Recherche Biomédicale, Créteil, France
| | - Erin J Kelley
- Translational Genomics Research Institute, Flagstaff, AZ, United States
| | - Annalee S Boyle
- Translational Genomics Research Institute, Flagstaff, AZ, United States
| | - Sandra Zurawski
- Baylor Institute for Immunology Research, Dallas, TX, United States.,Vaccine Research Institute, INSERM, Unité U955, Institut Mondor de Recherche Biomédicale, Créteil, France
| | - Heather L Mead
- Translational Genomics Research Institute, Flagstaff, AZ, United States
| | - Caroline E Harms
- Translational Genomics Research Institute, Flagstaff, AZ, United States
| | | | - Anne-Laure Flamar
- Baylor Institute for Immunology Research, Dallas, TX, United States.,Université Paris-Est Créteil, Sciences de la Vie et de la Santé, Créteil, France.,Vaccine Research Institute, INSERM, Unité U955, Institut Mondor de Recherche Biomédicale, Créteil, France
| | | | - Paul Ogongo
- Department of Medicine, Division of Experimental Medicine, University of California, San Francisco, San Francisco, CA, United States
| | - Joel D Ernst
- Department of Medicine, Division of Experimental Medicine, University of California, San Francisco, San Francisco, CA, United States
| | - Yves Levy
- Université Paris-Est Créteil, Sciences de la Vie et de la Santé, Créteil, France.,Vaccine Research Institute, INSERM, Unité U955, Institut Mondor de Recherche Biomédicale, Créteil, France
| | - Gerard Zurawski
- Baylor Institute for Immunology Research, Dallas, TX, United States.,Vaccine Research Institute, INSERM, Unité U955, Institut Mondor de Recherche Biomédicale, Créteil, France
| | - John A Altin
- Translational Genomics Research Institute, Flagstaff, AZ, United States
| |
Collapse
|
42
|
The immuneML ecosystem for machine learning analysis of adaptive immune receptor repertoires. NAT MACH INTELL 2021. [DOI: 10.1038/s42256-021-00413-z] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
|
43
|
Dhusia K, Su Z, Wu Y. A structural-based machine learning method to classify binding affinities between TCR and peptide-MHC complexes. Mol Immunol 2021; 139:76-86. [PMID: 34455212 PMCID: PMC10811653 DOI: 10.1016/j.molimm.2021.07.020] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Revised: 07/13/2021] [Accepted: 07/25/2021] [Indexed: 11/27/2022]
Abstract
The activation of T cells is triggered by the interactions of T cell receptors (TCRs) with their epitopes, which are peptides presented by major histocompatibility complex (MHC) on the surfaces of antigen presenting cells (APC). While each TCR can only recognize a specific subset from a large repertoire of peptide-MHC (pMHC) complexes, it is very often that peptides in this subset share little sequence similarity. This is known as the specificity and cross-reactivity of T cells, respectively. The binding affinities between different types of TCRs and pMHC are the major driving force to shape this specificity and cross-reactivity in T cell recognition. The binding affinities, furthermore, are determined by the sequence and structural properties at the interfaces between TCRs and pMHC. Fortunately, a wealth of data on binding and structures of TCR-pMHC interactions becomes publicly accessible in online resources, which offers us the opportunity to develop a random forest classifier for predicting the binding affinities between TCR and pMHC based on the structure of their complexes. Specifically, the structure and sequence of a given complex were projected onto a high-dimensional feature space as the input of the classifier, which was then trained by a large-scale benchmark dataset. Based on the cross-validation results, we found that our machine learning model can predict if the binding affinity of a given TCR-pMHC complex is stronger or weaker than a predefined threshold with an overall accuracy approximately around 75 %. The significance of our prediction was estimated by statistical analysis. Moreover, more than 60 % of binding affinities in the ATLAS database can be successfully classified into groups within the range of 2 kcal/mol. Additionally, we show that TCR-pMHC complexes with strong binding affinity prefer hydrophobic interactions between amino acids with large aromatic rings instead of electrostatic interactions. Our results therefore provide insights to design engineered TCRs which enhance the specificity for their targeted epitopes. Taken together, this method can serve as a useful addition to a suite of existing approaches which study binding between TCR and pMHC.
Collapse
Affiliation(s)
- Kalyani Dhusia
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY 10461, United States
| | - Zhaoqian Su
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY 10461, United States
| | - Yinghao Wu
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY 10461, United States.
| |
Collapse
|
44
|
Montemurro A, Schuster V, Povlsen HR, Bentzen AK, Jurtz V, Chronister WD, Crinklaw A, Hadrup SR, Winther O, Peters B, Jessen LE, Nielsen M. NetTCR-2.0 enables accurate prediction of TCR-peptide binding by using paired TCRα and β sequence data. Commun Biol 2021; 4:1060. [PMID: 34508155 PMCID: PMC8433451 DOI: 10.1038/s42003-021-02610-3] [Citation(s) in RCA: 115] [Impact Index Per Article: 28.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2021] [Accepted: 08/27/2021] [Indexed: 12/17/2022] Open
Abstract
Prediction of T-cell receptor (TCR) interactions with MHC-peptide complexes remains highly challenging. This challenge is primarily due to three dominant factors: data accuracy, data scarceness, and problem complexity. Here, we showcase that "shallow" convolutional neural network (CNN) architectures are adequate to deal with the problem complexity imposed by the length variations of TCRs. We demonstrate that current public bulk CDR3β-pMHC binding data overall is of low quality and that the development of accurate prediction models is contingent on paired α/β TCR sequence data corresponding to at least 150 distinct pairs for each investigated pMHC. In comparison, models trained on CDR3α or CDR3β data alone demonstrated a variable and pMHC specific relative performance drop. Together these findings support that T-cell specificity is predictable given the availability of accurate and sufficient paired TCR sequence data. NetTCR-2.0 is publicly available at https://services.healthtech.dtu.dk/service.php?NetTCR-2.0 .
Collapse
Affiliation(s)
- Alessandro Montemurro
- Department of Health Technology, Section for Bioinformatics, Technical University of Denmark, DTU, 2800 Kgs, Lyngby, Denmark
| | - Viktoria Schuster
- Department of Health Technology, Section for Bioinformatics, Technical University of Denmark, DTU, 2800 Kgs, Lyngby, Denmark
| | - Helle Rus Povlsen
- Department of Health Technology, Section for Bioinformatics, Technical University of Denmark, DTU, 2800 Kgs, Lyngby, Denmark
| | - Amalie Kai Bentzen
- Department of Health Technology, Section for Experimental and Translational Immunology, Technical University of Denmark, DTU, 2800 Kgs, Lyngby, Denmark
| | - Vanessa Jurtz
- Department of Health Technology, Section for Bioinformatics, Technical University of Denmark, DTU, 2800 Kgs, Lyngby, Denmark
| | - William D Chronister
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, 92037, USA
| | - Austin Crinklaw
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, 92037, USA
| | - Sine R Hadrup
- Department of Health Technology, Section for Experimental and Translational Immunology, Technical University of Denmark, DTU, 2800 Kgs, Lyngby, Denmark
| | - Ole Winther
- Department of Biology, Bioinformatics Centre, University of Copenhagen, 2200, Copenhagen, Denmark
- Section for Cognitive Systems, Department of Applied Mathematics and Computer Science, Technical University of Denmark, Kgs., Lyngby, 2800, Denmark
- Centre for Genomic Medicine, Rigshospitalet, Copenhagen University Hospital, København, Ø 2100, Denmark
| | - Bjoern Peters
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, 92037, USA
- Department of Medicine, Division of Infectious Diseases and Global Public Health, University of California, San Diego, La Jolla, CA, 92037, USA
| | - Leon Eyrich Jessen
- Department of Health Technology, Section for Bioinformatics, Technical University of Denmark, DTU, 2800 Kgs, Lyngby, Denmark
| | - Morten Nielsen
- Department of Health Technology, Section for Bioinformatics, Technical University of Denmark, DTU, 2800 Kgs, Lyngby, Denmark.
- Instituto de Investigaciones Biotecnológicas, Universidad Nacional de San Martín, Buenos Aires, Argentina.
| |
Collapse
|
45
|
Milighetti M, Shawe-Taylor J, Chain B. Predicting T Cell Receptor Antigen Specificity From Structural Features Derived From Homology Models of Receptor-Peptide-Major Histocompatibility Complexes. Front Physiol 2021; 12:730908. [PMID: 34566692 PMCID: PMC8456106 DOI: 10.3389/fphys.2021.730908] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2021] [Accepted: 08/02/2021] [Indexed: 11/13/2022] Open
Abstract
The physical interaction between the T cell receptor (TCR) and its cognate antigen causes T cells to activate and participate in the immune response. Understanding this physical interaction is important in predicting TCR binding to a target epitope, as well as potential cross-reactivity. Here, we propose a way of collecting informative features of the binding interface from homology models of T cell receptor-peptide-major histocompatibility complex (TCR-pMHC) complexes. The information collected from these structures is sufficient to discriminate binding from non-binding TCR-pMHC pairs in multiple independent datasets. The classifier is limited by the number of crystal structures available for the homology modelling and by the size of the training set. However, the classifier shows comparable performance to sequence-based classifiers requiring much larger training sets.
Collapse
Affiliation(s)
- Martina Milighetti
- Division of Infection and Immunity, University College London, London, United Kingdom
- Cancer Institute, University College London, London, United Kingdom
| | - John Shawe-Taylor
- Department of Computer Science, University College London, London, United Kingdom
| | - Benny Chain
- Division of Infection and Immunity, University College London, London, United Kingdom
- Department of Computer Science, University College London, London, United Kingdom
| |
Collapse
|
46
|
Pai JA, Satpathy AT. High-throughput and single-cell T cell receptor sequencing technologies. Nat Methods 2021; 18:881-892. [PMID: 34282327 PMCID: PMC9345561 DOI: 10.1038/s41592-021-01201-8] [Citation(s) in RCA: 164] [Impact Index Per Article: 41.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2020] [Accepted: 06/07/2021] [Indexed: 02/06/2023]
Abstract
T cells express T cell receptors (TCRs) composed of somatically recombined TCRα and TCRβ chains, which mediate recognition of major histocompatibility complex (MHC)-antigen complexes and drive the antigen-specific adaptive immune response to pathogens and cancer. The TCR repertoire in each individual is highly diverse, which allows for recognition of a wide array of foreign antigens, but also presents a challenge in analyzing this response using conventional methods. Recent studies have developed high-throughput sequencing technologies to identify TCR sequences, analyze their antigen specificities using experimental and computational tools, and pair TCRs with transcriptional and epigenetic cell state phenotypes in single cells. In this Review, we highlight these technological advances and describe how they have been applied to discover fundamental insights into T cell-mediated immunity.
Collapse
Affiliation(s)
- Joy A Pai
- Program in Immunology, Stanford University School of Medicine, Stanford, CA, USA
| | - Ansuman T Satpathy
- Program in Immunology, Stanford University School of Medicine, Stanford, CA, USA.
- Department of Pathology, Stanford University School of Medicine, Stanford, CA, USA.
| |
Collapse
|
47
|
Serr I, Drost F, Schubert B, Daniel C. Antigen-Specific Treg Therapy in Type 1 Diabetes - Challenges and Opportunities. Front Immunol 2021; 12:712870. [PMID: 34367177 PMCID: PMC8341764 DOI: 10.3389/fimmu.2021.712870] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2021] [Accepted: 07/06/2021] [Indexed: 01/16/2023] Open
Abstract
Regulatory T cells (Tregs) are key mediators of peripheral self-tolerance and alterations in their frequencies, stability, and function have been linked to autoimmunity. The antigen-specific induction of Tregs is a long-envisioned goal for the treatment of autoimmune diseases given reduced side effects compared to general immunosuppressive therapies. However, the translation of antigen-specific Treg inducing therapies for the treatment or prevention of autoimmune diseases into the clinic remains challenging. In this mini review, we will discuss promising results for antigen-specific Treg therapies in allergy and specific challenges for such therapies in autoimmune diseases, with a focus on type 1 diabetes (T1D). We will furthermore discuss opportunities for antigen-specific Treg therapies in T1D, including combinatorial strategies and tissue-specific Treg targeting. Specifically, we will highlight recent advances in miRNA-targeting as a means to foster Tregs in autoimmunity. Additionally, we will discuss advances and perspectives of computational strategies for the detailed analysis of tissue-specific Tregs on the single-cell level.
Collapse
Affiliation(s)
- Isabelle Serr
- Group Immune Tolerance in Type 1 Diabetes, Helmholtz Diabetes Center at Helmholtz Zentrum München, Institute of Diabetes Research, Munich, Germany
- Deutsches Zentrum für Diabetesforschung (DZD), Neuherberg, Germany
| | - Felix Drost
- School of Life Sciences Weihenstephan, Technische Universität München, Garching bei München, Germany
| | - Benjamin Schubert
- Institute of Computational Biology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
- Department of Mathematics, Technische Universität München, Garching bei München, Germany
| | - Carolin Daniel
- Group Immune Tolerance in Type 1 Diabetes, Helmholtz Diabetes Center at Helmholtz Zentrum München, Institute of Diabetes Research, Munich, Germany
- Deutsches Zentrum für Diabetesforschung (DZD), Neuherberg, Germany
- Division of Clinical Pharmacology, Department of Medicine IV, Ludwig-Maximilians-Universität München, Munich, Germany
| |
Collapse
|
48
|
Moris P, De Pauw J, Postovskaya A, Gielis S, De Neuter N, Bittremieux W, Ogunjimi B, Laukens K, Meysman P. Current challenges for unseen-epitope TCR interaction prediction and a new perspective derived from image classification. Brief Bioinform 2021; 22:bbaa318. [PMID: 33346826 PMCID: PMC8294552 DOI: 10.1093/bib/bbaa318] [Citation(s) in RCA: 70] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Abstract
The prediction of epitope recognition by T-cell receptors (TCRs) has seen many advancements in recent years, with several methods now available that can predict recognition for a specific set of epitopes. However, the generic case of evaluating all possible TCR-epitope pairs remains challenging, mainly due to the high diversity of the interacting sequences and the limited amount of currently available training data. In this work, we provide an overview of the current state of this unsolved problem. First, we examine appropriate validation strategies to accurately assess the generalization performance of generic TCR-epitope recognition models when applied to both seen and unseen epitopes. In addition, we present a novel feature representation approach, which we call ImRex (interaction map recognition). This approach is based on the pairwise combination of physicochemical properties of the individual amino acids in the CDR3 and epitope sequences, which provides a convolutional neural network with the combined representation of both sequences. Lastly, we highlight various challenges that are specific to TCR-epitope data and that can adversely affect model performance. These include the issue of selecting negative data, the imbalanced epitope distribution of curated TCR-epitope datasets and the potential exchangeability of TCR alpha and beta chains. Our results indicate that while extrapolation to unseen epitopes remains a difficult challenge, ImRex makes this feasible for a subset of epitopes that are not too dissimilar from the training data. We show that appropriate feature engineering methods and rigorous benchmark standards are required to create and validate TCR-epitope predictive models.
Collapse
MESH Headings
- Animals
- Complementarity Determining Regions/genetics
- Complementarity Determining Regions/immunology
- Epitopes, T-Lymphocyte/genetics
- Epitopes, T-Lymphocyte/immunology
- Humans
- Macaca mulatta
- Mice
- Models, Genetic
- Models, Immunological
- Receptors, Antigen, T-Cell, alpha-beta/genetics
- Receptors, Antigen, T-Cell, alpha-beta/immunology
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | - Pieter Meysman
- Corresponding author: Pieter Meysman, Adrem Data Lab, Department of Computer Science, University of Antwerp, Antwerp, 2020, Belgium. E-mail:
| |
Collapse
|
49
|
Weber A, Born J, Rodriguez Martínez M. TITAN: T-cell receptor specificity prediction with bimodal attention networks. Bioinformatics 2021; 37:i237-i244. [PMID: 34252922 PMCID: PMC8275323 DOI: 10.1093/bioinformatics/btab294] [Citation(s) in RCA: 70] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/26/2021] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION The activity of the adaptive immune system is governed by T-cells and their specific T-cell receptors (TCR), which selectively recognize foreign antigens. Recent advances in experimental techniques have enabled sequencing of TCRs and their antigenic targets (epitopes), allowing to research the missing link between TCR sequence and epitope binding specificity. Scarcity of data and a large sequence space make this task challenging, and to date only models limited to a small set of epitopes have achieved good performance. Here, we establish a k-nearest-neighbor (K-NN) classifier as a strong baseline and then propose Tcr epITope bimodal Attention Networks (TITAN), a bimodal neural network that explicitly encodes both TCR sequences and epitopes to enable the independent study of generalization capabilities to unseen TCRs and/or epitopes. RESULTS By encoding epitopes at the atomic level with SMILES sequences, we leverage transfer learning and data augmentation to enrich the input data space and boost performance. TITAN achieves high performance in the prediction of specificity of unseen TCRs (ROC-AUC 0.87 in 10-fold CV) and surpasses the results of the current state-of-the-art (ImRex) by a large margin. Notably, our Levenshtein-based K-NN classifier also exhibits competitive performance on unseen TCRs. While the generalization to unseen epitopes remains challenging, we report two major breakthroughs. First, by dissecting the attention heatmaps, we demonstrate that the sparsity of available epitope data favors an implicit treatment of epitopes as classes. This may be a general problem that limits unseen epitope performance for sufficiently complex models. Second, we show that TITAN nevertheless exhibits significantly improved performance on unseen epitopes and is capable of focusing attention on chemically meaningful molecular structures. AVAILABILITY AND IMPLEMENTATION The code as well as the dataset used in this study is publicly available at https://github.com/PaccMann/TITAN. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Anna Weber
- IBM Research Europe, 8803 Rüschlikon, Switzerland.,ETH Zurich, Department of Biosystems Science and Engineering (D-BSSE), 4058 Basel, Switzerland
| | - Jannis Born
- IBM Research Europe, 8803 Rüschlikon, Switzerland.,ETH Zurich, Department of Biosystems Science and Engineering (D-BSSE), 4058 Basel, Switzerland
| | | |
Collapse
|
50
|
Gao Y, Dunlap G, Elahee M, Rao DA. Patterns of T-Cell Phenotypes in Rheumatic Diseases From Single-Cell Studies of Tissue. ACR Open Rheumatol 2021; 3:601-613. [PMID: 34255929 PMCID: PMC8449042 DOI: 10.1002/acr2.11296] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2021] [Accepted: 05/19/2021] [Indexed: 02/06/2023] Open
Abstract
High-dimensional analyses of tissue samples from patients with rheumatic diseases are providing increasingly detailed descriptions of the immune cell populations that infiltrate tissues in different rheumatic diseases. Here we review key observations emerging from high-dimensional analyses of T cells within tissues in different rheumatic diseases, highlighting common themes across diseases as well as distinguishing features. Single-cell RNA sequencing analyses capture several dimensions of T-cell states, yet surprisingly, these analyses generally have not demonstrated distinct clusters of paradigmatic T-cell effector subsets, such as T helper (Th) 1, Th2, and Th17 cells. Rather, global transcriptomics robustly identify both proliferating T cells and regulatory T cells and have also helped to reveal new effector subsets in inflamed tissues, including T peripheral helper cells and granzyme K+ T cells. Further characterization of the T-cell populations that accumulate within target tissues should enable more precise targeting of biologic therapies and accelerate development of more specific biomarkers to track activity of relevant immune pathways in patients with rheumatic diseases.
Collapse
Affiliation(s)
- Yidan Gao
- Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts
| | - Garrett Dunlap
- Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts
| | - Mehreen Elahee
- Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts
| | - Deepak A Rao
- Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts
| |
Collapse
|