1
|
Qian X, Yang G, Li F, Zhang X, Zhu X, Lai X, Xiao X, Wang T, Wang J. DeepLION2: deep multi-instance contrastive learning framework enhancing the prediction of cancer-associated T cell receptors by attention strategy on motifs. Front Immunol 2024; 15:1345586. [PMID: 38515756 PMCID: PMC10956474 DOI: 10.3389/fimmu.2024.1345586] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Accepted: 02/19/2024] [Indexed: 03/23/2024] Open
Abstract
Introduction T cell receptor (TCR) repertoires provide valuable insights into complex human diseases, including cancers. Recent advancements in immune sequencing technology have significantly improved our understanding of TCR repertoire. Some computational methods have been devised to identify cancer-associated TCRs and enable cancer detection using TCR sequencing data. However, the existing methods are often limited by their inadequate consideration of the correlations among TCRs within a repertoire, hindering the identification of crucial TCRs. Additionally, the sparsity of cancer-associated TCR distribution presents a challenge in accurate prediction. Methods To address these issues, we presented DeepLION2, an innovative deep multi-instance contrastive learning framework specifically designed to enhance cancer-associated TCR prediction. DeepLION2 leveraged content-based sparse self-attention, focusing on the top k related TCRs for each TCR, to effectively model inter-TCR correlations. Furthermore, it adopted a contrastive learning strategy for bootstrapping parameter updates of the attention matrix, preventing the model from fixating on non-cancer-associated TCRs. Results Extensive experimentation on diverse patient cohorts, encompassing over ten cancer types, demonstrated that DeepLION2 significantly outperformed current state-of-the-art methods in terms of accuracy, sensitivity, specificity, Matthews correlation coefficient, and area under the curve (AUC). Notably, DeepLION2 achieved impressive AUC values of 0.933, 0.880, and 0.763 on thyroid, lung, and gastrointestinal cancer cohorts, respectively. Furthermore, it effectively identified cancer-associated TCRs along with their key motifs, highlighting the amino acids that play a crucial role in TCR-peptide binding. Conclusion These compelling results underscore DeepLION2's potential for enhancing cancer detection and facilitating personalized cancer immunotherapy. DeepLION2 is publicly available on GitHub, at https://github.com/Bioinformatics7181/DeepLION2, for academic use only.
Collapse
Affiliation(s)
- Xinyang Qian
- School of Computer Science and Technology, Xi’an Jiaotong University, Xi’an, China
- Shaanxi Engineering Research Center of Medical and Health Big Data, Xi’an Jiaotong University, Xi’an, China
| | - Guang Yang
- Department of Clinical Oncology, The Second Affiliated Hospital of Air Force Medical University, Xi’an, China
| | - Fan Li
- School of Computer Science and Technology, Xi’an Jiaotong University, Xi’an, China
- Shaanxi Engineering Research Center of Medical and Health Big Data, Xi’an Jiaotong University, Xi’an, China
| | - Xuanping Zhang
- School of Computer Science and Technology, Xi’an Jiaotong University, Xi’an, China
- Shaanxi Engineering Research Center of Medical and Health Big Data, Xi’an Jiaotong University, Xi’an, China
| | - Xiaoyan Zhu
- School of Computer Science and Technology, Xi’an Jiaotong University, Xi’an, China
- Shaanxi Engineering Research Center of Medical and Health Big Data, Xi’an Jiaotong University, Xi’an, China
| | - Xin Lai
- School of Computer Science and Technology, Xi’an Jiaotong University, Xi’an, China
- Shaanxi Engineering Research Center of Medical and Health Big Data, Xi’an Jiaotong University, Xi’an, China
| | - Xiao Xiao
- Genomics Institute, Geneplus-Shenzhen, Shenzhen, China
| | - Tao Wang
- Department of Thoracic Surgery, The Second Affiliated Hospital of Air Force Medical University, Xi’an, China
| | - Jiayin Wang
- School of Computer Science and Technology, Xi’an Jiaotong University, Xi’an, China
- Shaanxi Engineering Research Center of Medical and Health Big Data, Xi’an Jiaotong University, Xi’an, China
| |
Collapse
|
2
|
Xu AM, Chour W, DeLucia DC, Su Y, Pavlovitch-Bedzyk AJ, Ng R, Rasheed Y, Davis MM, Lee JK, Heath JR. Entropic analysis of antigen-specific CDR3 domains identifies essential binding motifs shared by CDR3s with different antigen specificities. Cell Syst 2023; 14:273-284.e5. [PMID: 37001518 PMCID: PMC10355346 DOI: 10.1016/j.cels.2023.03.001] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Revised: 09/01/2022] [Accepted: 03/01/2023] [Indexed: 04/22/2023]
Abstract
Antigen-specific T cell receptor (TCR) sequences can have prognostic, predictive, and therapeutic value, but decoding the specificity of TCR recognition remains challenging. Unlike DNA strands that base pair, TCRs bind to their targets with different orientations and different lengths, which complicates comparisons. We present scanning parametrized by normalized TCR length (SPAN-TCR) to analyze antigen-specific TCR CDR3 sequences and identify patterns driving TCR-pMHC specificity. Using entropic analysis, SPAN-TCR identifies 2-mer motifs that decrease the diversity (entropy) of CDR3s. These motifs are the most common patterns that can predict CDR3 composition, and we identify "essential" motifs that decrease entropy in the same CDR3 α or β chain containing the 2-mer, and "super-essential" motifs that decrease entropy in both chains. Molecular dynamics analysis further suggests that these motifs may play important roles in binding. We then employ SPAN-TCR to resolve similarities in TCR repertoires against different antigens using public databases of TCR sequences.
Collapse
Affiliation(s)
- Alexander M Xu
- Institute for Systems Biology, Seattle, WA 98109, USA; Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA 91125, USA; Department of Biomedical Sciences, Cedars-Sinai Medical Center, Los Angeles, CA 90048, USA; Samuel Oschin Comprehensive Cancer Institute, Cedars-Sinai Medical Center, Los Angeles, CA 90048, USA.
| | - William Chour
- Institute for Systems Biology, Seattle, WA 98109, USA; Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125, USA; Keck School of Medicine, University of Southern California, Los Angeles, CA 91125, USA
| | - Diana C DeLucia
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Yapeng Su
- Institute for Systems Biology, Seattle, WA 98109, USA; Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA 91125, USA
| | | | - Rachel Ng
- Institute for Systems Biology, Seattle, WA 98109, USA
| | - Yusuf Rasheed
- Institute for Systems Biology, Seattle, WA 98109, USA
| | - Mark M Davis
- Computational and Systems Immunology Program, Stanford University School of Medicine, Stanford, CA 94305, USA; Institute for Immunity, Transplantation and Infection, Stanford University School of Medicine, Stanford, CA 94305, USA; Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, CA 94305, USA; Howard Hughes Medical Institute, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - John K Lee
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA; Division of Medical Oncology, Department of Medicine, University of Washington, Seattle, WA 98195, USA
| | - James R Heath
- Institute for Systems Biology, Seattle, WA 98109, USA.
| |
Collapse
|
3
|
Fu J, Rust D, Fang Z, Jiao W, Lagana S, Batal I, Chen B, Merl S, Jones R, Sykes M, Weiner J. T cell repertoire profiling in allografts and native tissues in recipients with COVID-19 after solid organ transplantation: Insight into T cell-mediated allograft protection from viral infection. Front Immunol 2022; 13:1056703. [PMID: 36591281 PMCID: PMC9795050 DOI: 10.3389/fimmu.2022.1056703] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2022] [Accepted: 11/18/2022] [Indexed: 12/15/2022] Open
Abstract
Introduction The effects of the SARS-CoV-2 virus on the body, and why the effects are more severe in certain patients, remain incompletely understood. One population of special interest is transplant recipients because of their immunosuppressed state. Understanding the pathophysiology of graft dysfunction in transplant patients with the COVID-19 viral syndrome is important for prognosticating the risk to the graft as well as understanding how best to prevent and, if necessary, treat graft injury in these patients. Methods We analyzed multiple types of solid organ transplant recipients (liver, kidney, heart or lung) at our institution who died from SARS-CoV-2 and underwent autopsy (n = 6) or whose grafts were biopsied during active SARS-CoV-2 infection (n = 8). Their serum inflammatory markers were examined together with the histological appearance, viral load, and TCR repertoire of their graft tissue and, for autopsy patients, several native tissues. Results Histology and clinical lab results revealed a systemic inflammatory pattern that included elevated inflammatory markers and diffuse tissue damage regardless of graft rejection. Virus was detected throughout all tissues, although most abundant in lungs. The TCR repertoire was broadly similar throughout the tissues of each individual, with greater sharing of dominant clones associated with more rapid disease course. There was no difference in viral load or clonal distribution of overall, COVID-associated, or putative SARS-CoV-2-specific TCRs between allograft and native tissue. We further demonstrated that SARSCoV-2-specific TCR sequences in transplant patients lack a donor HLArestricted pattern, regardless of distribution in allograft or native tissues,suggesting that recognition of viral antigens on infiltrating recipient cells can effectively trigger host T cell anti-viral responses in both the host and graft. Discussion Our findings suggest a systemic immune response to the SARS-CoV-2 virus in solid organ transplant patients that is not associated with rejection and consistent with a largely destructive effect of recipient HLA-restricted T cell clones that affects donor and native organs similarly.
Collapse
Affiliation(s)
- Jianing Fu
- Columbia Center for Translational Immunology, Department of Medicine, Columbia University, New York, NY, United States,*Correspondence: Jianing Fu, ; Joshua Weiner,
| | - Dylan Rust
- Columbia University Vagelos College of Physicians and Surgeons, New York, NY, United States
| | - Zhou Fang
- Columbia Center for Translational Immunology, Department of Medicine, Columbia University, New York, NY, United States
| | - Wenyu Jiao
- Columbia Center for Translational Immunology, Department of Medicine, Columbia University, New York, NY, United States
| | - Stephen Lagana
- Department of Pathology, Columbia University, New York, NY, United States
| | - Ibrahim Batal
- Department of Pathology, Columbia University, New York, NY, United States
| | - Bryan Chen
- Columbia Center for Translational Immunology, Department of Medicine, Columbia University, New York, NY, United States
| | - Sarah Merl
- Columbia Center for Translational Immunology, Department of Medicine, Columbia University, New York, NY, United States,Department of Pathology, Columbia University, New York, NY, United States
| | - Rebecca Jones
- Columbia Center for Translational Immunology, Department of Medicine, Columbia University, New York, NY, United States
| | - Megan Sykes
- Columbia Center for Translational Immunology, Department of Medicine, Columbia University, New York, NY, United States,Department of Microbiology and Immunology, Columbia University, New York, NY, United States,Department of Surgery, Columbia University, New York, NY, United States
| | - Joshua Weiner
- Columbia Center for Translational Immunology, Department of Medicine, Columbia University, New York, NY, United States,Department of Surgery, Columbia University, New York, NY, United States,*Correspondence: Jianing Fu, ; Joshua Weiner,
| |
Collapse
|
4
|
Olson BJ, Schattgen SA, Thomas PG, Bradley P, Matsen IV FA. Comparing T cell receptor repertoires using optimal transport. PLoS Comput Biol 2022; 18:e1010681. [PMID: 36476997 PMCID: PMC9728925 DOI: 10.1371/journal.pcbi.1010681] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Accepted: 10/24/2022] [Indexed: 12/12/2022] Open
Abstract
The complexity of entire T cell receptor (TCR) repertoires makes their comparison a difficult but important task. Current methods of TCR repertoire comparison can incur a high loss of distributional information by considering overly simplistic sequence- or repertoire-level characteristics. Optimal transport methods form a suitable approach for such comparison given some distance or metric between values in the sample space, with appealing theoretical and computational properties. In this paper we introduce a nonparametric approach to comparing empirical TCR repertoires that applies the Sinkhorn distance, a fast, contemporary optimal transport method, and a recently-created distance between TCRs called TCRdist. We show that our methods identify meaningful differences between samples from distinct TCR distributions for several case studies, and compete with more complicated methods despite minimal modeling assumptions and a simpler pipeline.
Collapse
Affiliation(s)
- Branden J. Olson
- Department of Computational Biology, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
- Department of Statistics, University of Washington, Seattle, Washington, United States of America
| | - Stefan A. Schattgen
- Department of Immunology, St. Jude Children’s Research Hospital, Memphis, Tennessee, United States of America
| | - Paul G. Thomas
- Department of Immunology, St. Jude Children’s Research Hospital, Memphis, Tennessee, United States of America
| | - Philip Bradley
- Department of Computational Biology, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
- Institute for Protein Design, Department of Biochemistry, University of Washington, Seattle, Washington, United States of America
- * E-mail: (PB); (FAM)
| | - Frederick A. Matsen IV
- Department of Computational Biology, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
- Department of Statistics, University of Washington, Seattle, Washington, United States of America
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
- Howard Hughes Medical Institute, Seattle, Washington, United States of America
- * E-mail: (PB); (FAM)
| |
Collapse
|
5
|
Katayama Y, Yokota R, Akiyama T, Kobayashi TJ. Machine Learning Approaches to TCR Repertoire Analysis. Front Immunol 2022; 13:858057. [PMID: 35911778 PMCID: PMC9334875 DOI: 10.3389/fimmu.2022.858057] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2022] [Accepted: 06/07/2022] [Indexed: 11/13/2022] Open
Abstract
Sparked by the development of genome sequencing technology, the quantity and quality of data handled in immunological research have been changing dramatically. Various data and database platforms are now driving the rapid progress of machine learning for immunological data analysis. Of various topics in immunology, T cell receptor repertoire analysis is one of the most important targets of machine learning for assessing the state and abnormalities of immune systems. In this paper, we review recent repertoire analysis methods based on machine learning and deep learning and discuss their prospects.
Collapse
Affiliation(s)
- Yotaro Katayama
- Graduate School of Engineering, The University of Tokyo, Tokyo, Japan
- *Correspondence: Yotaro Katayama,
| | - Ryo Yokota
- National Research Institute of Police Science, Kashiwa, Chiba, Japan
| | - Taishin Akiyama
- Laboratory for Immune Homeostasis, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
- Graduate School of Medical Life Science, Yokohama City University, Yokohama, Japan
| | - Tetsuya J. Kobayashi
- Graduate School of Engineering, The University of Tokyo, Tokyo, Japan
- Institute of Industrial Science, The University of Tokyo, Tokyo, Japan
| |
Collapse
|
6
|
Xu Y, Qian X, Zhang X, Lai X, Liu Y, Wang J. DeepLION: Deep Multi-Instance Learning Improves the Prediction of Cancer-Associated T Cell Receptors for Accurate Cancer Detection. Front Genet 2022; 13:860510. [PMID: 35601486 PMCID: PMC9121378 DOI: 10.3389/fgene.2022.860510] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2022] [Accepted: 02/23/2022] [Indexed: 01/21/2023] Open
Abstract
Recent studies highlight the potential of T cell receptor (TCR) repertoires in accurately detecting cancers via noninvasive sampling. Unfortunately, due to the complicated associations among cancer antigens and the possible induced T cell responses, currently, the practical strategy for identifying cancer-associated TCRs is the computational prediction based on TCR repertoire data. Several state-of-the-art methods were proposed in recent year or two; however, the prediction algorithms were still weakened by two major issues. To facilitate the computational processes, the algorithms prefer to decompose the original TCR sequences into length-fixed amino acid fragments, while the first dilemma comes as the lengths of cancer-associated motifs are suggested to be various. Moreover, the correlations among TCRs in the same repertoire should be further considered, which are often ignored by the existing methods. We here developed a deep multi-instance learning method, named DeepLION, to improve the prediction of cancer-associated TCRs by considering these issues. First, DeepLION introduced a deep learning framework with alternative convolution filters and 1-max pooling operations to handle the amino acid fragments with different lengths. Then, the multi-instance learning framework modeled the TCR correlations and assigned adjusted weights for each TCR sequence during the predicting process. To validate the performance of DeepLION, we conducted a series of experiments on several cohorts of patients from nine cancer types. Compared to the existing methods, DeepLION achieved, on most of the cohorts, higher prediction accuracies, sensitivities, specificities, and areas under the curve (AUCs), where the AUC reached notably 0.97 and 0.90 for thyroid and lung cancer cohorts, respectively. Thus, DeepLION may further support the detection of cancers from TCR repertoire data. DeepLION is publicly available on GitHub, at https://github.com/Bioinformatics7181/DeepLION, for academic usage only.
Collapse
Affiliation(s)
- Ying Xu
- Department of Computer Science and Technology, School of Electronic and Information Engineering, Xi’an Jiaotong University, Xi’an, China
- Institute of Data Science and Information Quality, Shaanxi Engineering Research Center of Medical and Health Big Data, Xi’an Jiaotong University, Xi’an, China
| | - Xinyang Qian
- Department of Computer Science and Technology, School of Electronic and Information Engineering, Xi’an Jiaotong University, Xi’an, China
- Institute of Data Science and Information Quality, Shaanxi Engineering Research Center of Medical and Health Big Data, Xi’an Jiaotong University, Xi’an, China
| | - Xuanping Zhang
- Department of Computer Science and Technology, School of Electronic and Information Engineering, Xi’an Jiaotong University, Xi’an, China
- Institute of Data Science and Information Quality, Shaanxi Engineering Research Center of Medical and Health Big Data, Xi’an Jiaotong University, Xi’an, China
| | - Xin Lai
- Department of Computer Science and Technology, School of Electronic and Information Engineering, Xi’an Jiaotong University, Xi’an, China
- Institute of Data Science and Information Quality, Shaanxi Engineering Research Center of Medical and Health Big Data, Xi’an Jiaotong University, Xi’an, China
| | - Yuqian Liu
- Department of Computer Science and Technology, School of Electronic and Information Engineering, Xi’an Jiaotong University, Xi’an, China
- Institute of Data Science and Information Quality, Shaanxi Engineering Research Center of Medical and Health Big Data, Xi’an Jiaotong University, Xi’an, China
| | - Jiayin Wang
- Department of Computer Science and Technology, School of Electronic and Information Engineering, Xi’an Jiaotong University, Xi’an, China
- Institute of Data Science and Information Quality, Shaanxi Engineering Research Center of Medical and Health Big Data, Xi’an Jiaotong University, Xi’an, China
- *Correspondence: Jiayin Wang,
| |
Collapse
|
7
|
Barennes P, Quiniou V, Shugay M, Egorov ES, Davydov AN, Chudakov DM, Uddin I, Ismail M, Oakes T, Chain B, Eugster A, Kashofer K, Rainer PP, Darko S, Ransier A, Douek DC, Klatzmann D, Mariotti-Ferrandiz E. Benchmarking of T cell receptor repertoire profiling methods reveals large systematic biases. Nat Biotechnol 2021; 39:236-245. [PMID: 32895550 DOI: 10.1038/s41587-020-0656-3] [Citation(s) in RCA: 53] [Impact Index Per Article: 17.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2019] [Accepted: 07/28/2020] [Indexed: 12/13/2022]
Abstract
Monitoring the T cell receptor (TCR) repertoire in health and disease can provide key insights into adaptive immune responses, but the accuracy of current TCR sequencing (TCRseq) methods is unclear. In this study, we systematically compared the results of nine commercial and academic TCRseq methods, including six rapid amplification of complementary DNA ends (RACE)-polymerase chain reaction (PCR) and three multiplex-PCR approaches, when applied to the same T cell sample. We found marked differences in accuracy and intra- and inter-method reproducibility for T cell receptor α (TRA) and T cell receptor β (TRB) TCR chains. Most methods showed a lower ability to capture TRA than TRB diversity. Low RNA input generated non-representative repertoires. Results from the 5' RACE-PCR methods were consistent among themselves but differed from the RNA-based multiplex-PCR results. Using an in silico meta-repertoire generated from 108 replicates, we found that one genomic DNA-based method and two non-unique molecular identifier (UMI) RNA-based methods were more sensitive than UMI methods in detecting rare clonotypes, despite the better clonotype quantification accuracy of the latter.
Collapse
Affiliation(s)
- Pierre Barennes
- Sorbonne Université, INSERM, Immunology-Immunopathology-Immunotherapy (i3), Paris, France
- AP-HP, Hôpital Pitié-Salpêtrière, Biotherapy (CIC-BTi) and Inflammation-Immunopathology-Biotherapy Department (i2B), Paris, France
| | - Valentin Quiniou
- Sorbonne Université, INSERM, Immunology-Immunopathology-Immunotherapy (i3), Paris, France
- AP-HP, Hôpital Pitié-Salpêtrière, Biotherapy (CIC-BTi) and Inflammation-Immunopathology-Biotherapy Department (i2B), Paris, France
| | - Mikhail Shugay
- Center of Life Sciences, Skoltech, Moscow, Russia
- Genomics of Adaptive Immunity Department, Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry, Moscow, Russia
- Center for Precision Genome Editing and Genetic Technologies for Biomedicine, Pirogov Russian National Research Medical University, Moscow, Russia
| | - Evgeniy S Egorov
- Genomics of Adaptive Immunity Department, Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry, Moscow, Russia
| | - Alexey N Davydov
- Adaptive Immunity Group, Central European Institute of Technology, Brno, Czechia
| | - Dmitriy M Chudakov
- Center of Life Sciences, Skoltech, Moscow, Russia
- Genomics of Adaptive Immunity Department, Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry, Moscow, Russia
- Center for Precision Genome Editing and Genetic Technologies for Biomedicine, Pirogov Russian National Research Medical University, Moscow, Russia
- Adaptive Immunity Group, Central European Institute of Technology, Brno, Czechia
| | - Imran Uddin
- Division of Infection and Immunity, University College London, London, UK
| | - Mazlina Ismail
- Division of Infection and Immunity, University College London, London, UK
| | - Theres Oakes
- Division of Infection and Immunity, University College London, London, UK
| | - Benny Chain
- Division of Infection and Immunity, University College London, London, UK
| | - Anne Eugster
- DFG-Centre for Regenerative Therapies Dresden, Faculty of Medicine Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany
| | - Karl Kashofer
- Diagnostic and Research Institute of Pathology, Medical University of Graz, Graz, Austria
| | - Peter P Rainer
- Division of Cardiology, Medical University of Graz, Graz, Austria
- BioTechMed-Graz, Graz, Austria
| | - Samuel Darko
- Vaccine Research Center, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, USA
| | - Amy Ransier
- Vaccine Research Center, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, USA
| | - Daniel C Douek
- Vaccine Research Center, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, USA
| | - David Klatzmann
- Sorbonne Université, INSERM, Immunology-Immunopathology-Immunotherapy (i3), Paris, France
- AP-HP, Hôpital Pitié-Salpêtrière, Biotherapy (CIC-BTi) and Inflammation-Immunopathology-Biotherapy Department (i2B), Paris, France
| | - Encarnita Mariotti-Ferrandiz
- Sorbonne Université, INSERM, Immunology-Immunopathology-Immunotherapy (i3), Paris, France.
- AP-HP, Hôpital Pitié-Salpêtrière, Biotherapy (CIC-BTi) and Inflammation-Immunopathology-Biotherapy Department (i2B), Paris, France.
| |
Collapse
|
8
|
Chen SY, Liu CJ, Zhang Q, Guo AY. An ultra-sensitive T-cell receptor detection method for TCR-Seq and RNA-Seq data. Bioinformatics 2021; 36:4255-4262. [PMID: 32399561 DOI: 10.1093/bioinformatics/btaa432] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2020] [Revised: 04/14/2020] [Accepted: 05/06/2020] [Indexed: 12/30/2022] Open
Abstract
MOTIVATION T-cell receptors (TCRs) function to recognize antigens and play vital roles in T-cell immunology. Surveying TCR repertoires by characterizing complementarity-determining region 3 (CDR3) is a key issue. Due to the high diversity of CDR3 and technological limitation, accurate characterization of CDR3 repertoires remains a great challenge. RESULTS We propose a computational method named CATT for ultra-sensitive and precise TCR CDR3 sequences detection. CATT can be applied on TCR sequencing, RNA-Seq and single-cell TCR(RNA)-Seq data to characterize CDR3 repertoires. CATT integrated de Bruijn graph-based micro-assembly algorithm, data-driven error correction model and Bayesian inference algorithm, to self-adaptively and ultra-sensitively characterize CDR3 repertoires with high performance. Benchmark results of datasets from in silico and experimental data demonstrated that CATT showed superior recall and precision compared with existing tools, especially for data with short read length and small size and single-cell sequencing data. Thus, CATT will be a useful tool for TCR analysis in researches of cancer and immunology. AVAILABILITY AND IMPLEMENTATION http://bioinfo.life.hust.edu.cn/CATT or https://github.com/GuoBioinfoLab/CATT. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Si-Yi Chen
- Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Chun-Jie Liu
- Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Qiong Zhang
- Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China.,Department of Biotechnology, College of Life Sciences, Anhui Normal University, Wuhu, China
| | - An-Yuan Guo
- Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China
| |
Collapse
|
9
|
Teraguchi S, Saputri DS, Llamas-Covarrubias MA, Davila A, Diez D, Nazlica SA, Rozewicki J, Ismanto HS, Wilamowski J, Xie J, Xu Z, Loza-Lopez MDJ, van Eerden FJ, Li S, Standley DM. Methods for sequence and structural analysis of B and T cell receptor repertoires. Comput Struct Biotechnol J 2020; 18:2000-2011. [PMID: 32802272 PMCID: PMC7366105 DOI: 10.1016/j.csbj.2020.07.008] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2020] [Revised: 07/08/2020] [Accepted: 07/08/2020] [Indexed: 02/07/2023] Open
Abstract
B cell receptors (BCRs) and T cell receptors (TCRs) make up an essential network of defense molecules that, collectively, can distinguish self from non-self and facilitate destruction of antigen-bearing cells such as pathogens or tumors. The analysis of BCR and TCR repertoires plays an important role in both basic immunology as well as in biotechnology. Because the repertoires are highly diverse, specialized software methods are needed to extract meaningful information from BCR and TCR sequence data. Here, we review recent developments in bioinformatics tools for analysis of BCR and TCR repertoires, with an emphasis on those that incorporate structural features. After describing the recent sequencing technologies for immune receptor repertoires, we survey structural modeling methods for BCR and TCRs, along with methods for clustering such models. We review downstream analyses, including BCR and TCR epitope prediction, antibody-antigen docking and TCR-peptide-MHC Modeling. We also briefly discuss molecular dynamics in this context.
Collapse
Affiliation(s)
- Shunsuke Teraguchi
- Immunology Frontier Research Center, Osaka University, 3-1 Yamadaoka, Suita, Japan
- Research Institute for Microbial Diseases, Osaka University, 3-1 Yamadaoka, Suita, Japan
| | - Dianita S. Saputri
- Research Institute for Microbial Diseases, Osaka University, 3-1 Yamadaoka, Suita, Japan
| | - Mara Anais Llamas-Covarrubias
- Research Institute for Microbial Diseases, Osaka University, 3-1 Yamadaoka, Suita, Japan
- Departamento de Biología Molecular y Genómica, Centro Universitario de Ciencias de la Salud, Universidad de Guadalajara, Mexico
| | - Ana Davila
- Research Institute for Microbial Diseases, Osaka University, 3-1 Yamadaoka, Suita, Japan
| | - Diego Diez
- Immunology Frontier Research Center, Osaka University, 3-1 Yamadaoka, Suita, Japan
| | - Sedat Aybars Nazlica
- Immunology Frontier Research Center, Osaka University, 3-1 Yamadaoka, Suita, Japan
| | - John Rozewicki
- Immunology Frontier Research Center, Osaka University, 3-1 Yamadaoka, Suita, Japan
- Research Institute for Microbial Diseases, Osaka University, 3-1 Yamadaoka, Suita, Japan
| | - Hendra S. Ismanto
- Research Institute for Microbial Diseases, Osaka University, 3-1 Yamadaoka, Suita, Japan
| | - Jan Wilamowski
- Research Institute for Microbial Diseases, Osaka University, 3-1 Yamadaoka, Suita, Japan
| | - Jiaqi Xie
- Research Institute for Microbial Diseases, Osaka University, 3-1 Yamadaoka, Suita, Japan
| | - Zichang Xu
- Research Institute for Microbial Diseases, Osaka University, 3-1 Yamadaoka, Suita, Japan
| | | | - Floris J. van Eerden
- Immunology Frontier Research Center, Osaka University, 3-1 Yamadaoka, Suita, Japan
| | - Songling Li
- Research Institute for Microbial Diseases, Osaka University, 3-1 Yamadaoka, Suita, Japan
| | - Daron M. Standley
- Immunology Frontier Research Center, Osaka University, 3-1 Yamadaoka, Suita, Japan
- Research Institute for Microbial Diseases, Osaka University, 3-1 Yamadaoka, Suita, Japan
| |
Collapse
|
10
|
Olson BJ, Moghimi P, Schramm CA, Obraztsova A, Ralph D, Vander Heiden JA, Shugay M, Shepherd AJ, Lees W, Matsen FA. sumrep: A Summary Statistic Framework for Immune Receptor Repertoire Comparison and Model Validation. Front Immunol 2019; 10:2533. [PMID: 31736960 PMCID: PMC6838214 DOI: 10.3389/fimmu.2019.02533] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2019] [Accepted: 10/11/2019] [Indexed: 12/28/2022] Open
Abstract
The adaptive immune system generates an incredible diversity of antigen receptors for B and T cells to keep dangerous pathogens at bay. The DNA sequences coding for these receptors arise by a complex recombination process followed by a series of productivity-based filters, as well as affinity maturation for B cells, giving considerable diversity to the circulating pool of receptor sequences. Although these datasets hold considerable promise for medical and public health applications, the complex structure of the resulting adaptive immune receptor repertoire sequencing (AIRR-seq) datasets makes analysis difficult. In this paper we introduce sumrep, an R package that efficiently performs a wide variety of repertoire summaries and comparisons, and show how sumrep can be used to perform model validation. We find that summaries vary in their ability to differentiate between datasets, although many are able to distinguish between covariates such as donor, timepoint, and cell type for BCR and TCR repertoires. We show that deletion and insertion lengths resulting from V(D)J recombination tend to be more discriminative characterizations of a repertoire than summaries that describe the amino acid composition of the CDR3 region. We also find that state-of-the-art generative models excel at recapitulating gene usage and recombination statistics in a given experimental repertoire, but struggle to capture many physiochemical properties of real repertoires.
Collapse
Affiliation(s)
- Branden J Olson
- Fred Hutchinson Cancer Research Center, Seattle, WA, United States.,Department of Statistics, University of Washington, Seattle, WA, United States
| | - Pejvak Moghimi
- Department of Biological Sciences, Institute of Structural and Molecular Biology, Birkbeck, University of London, London, United Kingdom
| | - Chaim A Schramm
- Vaccine Research Center, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, United States
| | - Anna Obraztsova
- Center of Life Sciences, Skolkovo Institute of Science and Technology, Moscow, Russia.,Genomics of Adaptive Immunity Department, Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry, Moscow, Russia
| | - Duncan Ralph
- Fred Hutchinson Cancer Research Center, Seattle, WA, United States
| | - Jason A Vander Heiden
- Department of Bioinformatics and Computational Biology, Genentech, Inc., South San Francisco, CA, United States
| | - Mikhail Shugay
- Center of Life Sciences, Skolkovo Institute of Science and Technology, Moscow, Russia.,Genomics of Adaptive Immunity Department, Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry, Moscow, Russia.,Department of Molecular Technologies, Pirogov Russian National Research Medical University, Moscow, Russia
| | - Adrian J Shepherd
- Department of Biological Sciences, Institute of Structural and Molecular Biology, Birkbeck, University of London, London, United Kingdom
| | - William Lees
- Department of Biological Sciences, Institute of Structural and Molecular Biology, Birkbeck, University of London, London, United Kingdom
| | | |
Collapse
|
11
|
Bradley P, Thomas PG. Using T Cell Receptor Repertoires to Understand the Principles of Adaptive Immune Recognition. Annu Rev Immunol 2019; 37:547-570. [PMID: 30699000 DOI: 10.1146/annurev-immunol-042718-041757] [Citation(s) in RCA: 83] [Impact Index Per Article: 16.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Adaptive immune recognition is mediated by antigen receptors on B and T cells generated by somatic recombination during lineage development. The high level of diversity resulting from this process posed technical limitations that previously limited the comprehensive analysis of adaptive immune recognition. Advances over the last ten years have produced data and approaches allowing insights into how T cells develop, evolutionary signatures of recombination and selection, and the features of T cell receptors that mediate epitope-specific binding and T cell activation. The size and complexity of these data have necessitated the generation of novel computational and analytical approaches, which are transforming how T cell immunology is conducted. Here we review the development and application of novel biological, theoretical, and computational methods for understanding T cell recognition and discuss the potential for improved models of receptor:antigen interactions.
Collapse
Affiliation(s)
- Philip Bradley
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA; .,Institute for Protein Design, University of Washington, Seattle, Washington 98195, USA
| | - Paul G Thomas
- Department of Immunology, St. Jude Children's Research Hospital, Memphis, Tennessee 38105, USA;
| |
Collapse
|
12
|
Aoki H, Ueha S, Shichino S, Ogiwara H, Hashimoto SI, Kakimi K, Ito S, Matsushima K. TCR Repertoire Analysis Reveals Mobilization of Novel CD8 + T Cell Clones Into the Cancer-Immunity Cycle Following Anti-CD4 Antibody Administration. Front Immunol 2019; 9:3185. [PMID: 30733724 PMCID: PMC6353793 DOI: 10.3389/fimmu.2018.03185] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2018] [Accepted: 12/31/2018] [Indexed: 01/05/2023] Open
Abstract
Depletion of CD4+ cells using an anti-CD4 monoclonal antibody (anti-CD4 mAb) induces the expansion of tumor-reactive CD8+ T cells and strong antitumor effects in several murine tumor models. However, it is not known whether the anti-CD4 mAb treatment activates a particular or a broad spectrum of tumor-reactive CD8+ T cell clones. To investigate the changes in the TCR repertoire induced by the anti-CD4 mAb treatment, we performed unbiased high-throughput TCR sequencing in a B16F10 mouse subcutaneous melanoma model. By Inter-Organ Clone Tracking analysis, we demonstrated that anti-CD4 mAb treatment increased the diversity and combined frequency of CD8+ T cell clones that overlapped among the tumor, draining lymph node (dLN), and peripheral blood repertoires. Interestingly, the anti-CD4 mAb treatment-induced expansion of overlapping clones occurred mainly in the dLN rather than in the tumor. Overall, the Inter-Organ Clone Tracking analysis revealed that anti-CD4 mAb treatment enhances the mobilization of a wide variety of tumor-reactive CD8+ T cell clones into the Cancer-Immunity Cycle and thus induces a robust antitumor immune response in mice.
Collapse
Affiliation(s)
- Hiroyasu Aoki
- Department of Molecular Preventive Medicine, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan.,Division of Molecular Regulation of Inflammatory and Immune Diseases, Research Institute for Biomedical Sciences, Tokyo University of Science, Noda, Japan
| | - Satoshi Ueha
- Department of Molecular Preventive Medicine, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan.,Division of Molecular Regulation of Inflammatory and Immune Diseases, Research Institute for Biomedical Sciences, Tokyo University of Science, Noda, Japan
| | - Shigeyuki Shichino
- Department of Molecular Preventive Medicine, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan.,Division of Molecular Regulation of Inflammatory and Immune Diseases, Research Institute for Biomedical Sciences, Tokyo University of Science, Noda, Japan
| | - Haru Ogiwara
- Department of Molecular Preventive Medicine, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan.,Division of Molecular Regulation of Inflammatory and Immune Diseases, Research Institute for Biomedical Sciences, Tokyo University of Science, Noda, Japan
| | - Shin-Ichi Hashimoto
- Department of Molecular Preventive Medicine, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan.,Division of Molecular Regulation of Inflammatory and Immune Diseases, Research Institute for Biomedical Sciences, Tokyo University of Science, Noda, Japan.,Division of Nephrology, Department of Laboratory Medicine, Kanazawa University, Kanazawa, Japan
| | - Kazuhiro Kakimi
- Department of Immunotherapeutics, The University of Tokyo Hospital, Tokyo, Japan
| | - Satoru Ito
- Department of Molecular Preventive Medicine, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan.,Division of Molecular Regulation of Inflammatory and Immune Diseases, Research Institute for Biomedical Sciences, Tokyo University of Science, Noda, Japan.,IDAC Theranostics, Inc., Tokyo, Japan
| | - Kouji Matsushima
- Department of Molecular Preventive Medicine, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan.,Division of Molecular Regulation of Inflammatory and Immune Diseases, Research Institute for Biomedical Sciences, Tokyo University of Science, Noda, Japan
| |
Collapse
|
13
|
Abstract
Probabilistic modeling is fundamental to the statistical analysis of complex data. In addition to forming a coherent description of the data-generating process, probabilistic models enable parameter inference about given datasets. This procedure is well developed in the Bayesian perspective, in which one infers probability distributions describing to what extent various possible parameters agree with the data. In this paper, we motivate and review probabilistic modeling for adaptive immune receptor repertoire data then describe progress and prospects for future work, from germline haplotyping to adaptive immune system deployment across tissues. The relevant quantities in immune sequence analysis include not only continuous parameters such as gene use frequency but also discrete objects such as B-cell clusters and lineages. Throughout this review, we unravel the many opportunities for probabilistic modeling in adaptive immune receptor analysis, including settings for which the Bayesian approach holds substantial promise (especially if one is optimistic about new computational methods). From our perspective, the greatest prospects for progress in probabilistic modeling for repertoires concern ancestral sequence estimation for B-cell receptor lineages, including uncertainty from germline genotype, rearrangement, and lineage development.
Collapse
Affiliation(s)
- Branden Olson
- Computational Biology Program Fred Hutchinson Cancer Research Center, 1100 Fairview Ave. N., Mail stop: M1-B514 Seattle, WA 98109-1024 phone: +1 206 667 7318
| | - Frederick A. Matsen
- Computational Biology Program Fred Hutchinson Cancer Research Center, 1100 Fairview Ave. N., Mail stop: M1-B514 Seattle, WA 98109-1024 phone: +1 206 667 7318
| |
Collapse
|
14
|
Miho E, Yermanos A, Weber CR, Berger CT, Reddy ST, Greiff V. Computational Strategies for Dissecting the High-Dimensional Complexity of Adaptive Immune Repertoires. Front Immunol 2018; 9:224. [PMID: 29515569 PMCID: PMC5826328 DOI: 10.3389/fimmu.2018.00224] [Citation(s) in RCA: 116] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2017] [Accepted: 01/26/2018] [Indexed: 12/21/2022] Open
Abstract
The adaptive immune system recognizes antigens via an immense array of antigen-binding antibodies and T-cell receptors, the immune repertoire. The interrogation of immune repertoires is of high relevance for understanding the adaptive immune response in disease and infection (e.g., autoimmunity, cancer, HIV). Adaptive immune receptor repertoire sequencing (AIRR-seq) has driven the quantitative and molecular-level profiling of immune repertoires, thereby revealing the high-dimensional complexity of the immune receptor sequence landscape. Several methods for the computational and statistical analysis of large-scale AIRR-seq data have been developed to resolve immune repertoire complexity and to understand the dynamics of adaptive immunity. Here, we review the current research on (i) diversity, (ii) clustering and network, (iii) phylogenetic, and (iv) machine learning methods applied to dissect, quantify, and compare the architecture, evolution, and specificity of immune repertoires. We summarize outstanding questions in computational immunology and propose future directions for systems immunology toward coupling AIRR-seq with the computational discovery of immunotherapeutics, vaccines, and immunodiagnostics.
Collapse
Affiliation(s)
- Enkelejda Miho
- Department for Biosystems Science and Engineering, ETH Zürich, Basel, Switzerland
- aiNET GmbH, ETH Zürich, Basel, Switzerland
| | - Alexander Yermanos
- Department for Biosystems Science and Engineering, ETH Zürich, Basel, Switzerland
| | - Cédric R. Weber
- Department for Biosystems Science and Engineering, ETH Zürich, Basel, Switzerland
| | - Christoph T. Berger
- Department of Biomedicine, University Hospital Basel, Basel, Switzerland
- Department of Internal Medicine, Clinical Immunology, University Hospital Basel, Basel, Switzerland
| | - Sai T. Reddy
- Department for Biosystems Science and Engineering, ETH Zürich, Basel, Switzerland
| | - Victor Greiff
- Department for Biosystems Science and Engineering, ETH Zürich, Basel, Switzerland
- Department of Immunology, University of Oslo, Oslo, Norway
| |
Collapse
|