51
|
Borden ES, Buetow KH, Wilson MA, Hastings KT. Cancer Neoantigens: Challenges and Future Directions for Prediction, Prioritization, and Validation. Front Oncol 2022; 12:836821. [PMID: 35311072 PMCID: PMC8929516 DOI: 10.3389/fonc.2022.836821] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Accepted: 02/07/2022] [Indexed: 12/16/2022] Open
Abstract
Prioritization of immunogenic neoantigens is key to enhancing cancer immunotherapy through the development of personalized vaccines, adoptive T cell therapy, and the prediction of response to immune checkpoint inhibition. Neoantigens are tumor-specific proteins that allow the immune system to recognize and destroy a tumor. Cancer immunotherapies, such as personalized cancer vaccines, adoptive T cell therapy, and immune checkpoint inhibition, rely on an understanding of the patient-specific neoantigen profile in order to guide personalized therapeutic strategies. Genomic approaches to predicting and prioritizing immunogenic neoantigens are rapidly expanding, raising new opportunities to advance these tools and enhance their clinical relevance. Predicting neoantigens requires acquisition of high-quality samples and sequencing data, followed by variant calling and variant annotation. Subsequently, prioritizing which of these neoantigens may elicit a tumor-specific immune response requires application and integration of tools to predict the expression, processing, binding, and recognition potentials of the neoantigen. Finally, improvement of the computational tools is held in constant tension with the availability of datasets with validated immunogenic neoantigens. The goal of this review article is to summarize the current knowledge and limitations in neoantigen prediction, prioritization, and validation and propose future directions that will improve personalized cancer treatment.
Collapse
Affiliation(s)
- Elizabeth S Borden
- Department of Basic Medical Sciences, College of Medicine-Phoenix, University of Arizona, Phoenix, AZ, United States.,Department of Research and Internal Medicine (Dermatology), Phoenix Veterans Affairs Health Care System, Phoenix, AZ, United States
| | - Kenneth H Buetow
- School of Life Sciences, Arizona State University, Tempe, AZ, United States.,Center for Evolution and Medicine, Arizona State University, Tempe, AZ, United States
| | - Melissa A Wilson
- School of Life Sciences, Arizona State University, Tempe, AZ, United States.,Center for Evolution and Medicine, Arizona State University, Tempe, AZ, United States
| | - Karen Taraszka Hastings
- Department of Basic Medical Sciences, College of Medicine-Phoenix, University of Arizona, Phoenix, AZ, United States.,Department of Research and Internal Medicine (Dermatology), Phoenix Veterans Affairs Health Care System, Phoenix, AZ, United States
| |
Collapse
|
52
|
Koşaloğlu-Yalçın Z, Lee J, Greenbaum J, Schoenberger SP, Miller A, Kim YJ, Sette A, Nielsen M, Peters B. Combined assessment of MHC binding and antigen abundance improves T cell epitope predictions. iScience 2022; 25:103850. [PMID: 35128348 PMCID: PMC8806398 DOI: 10.1016/j.isci.2022.103850] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2021] [Revised: 10/19/2021] [Accepted: 01/26/2022] [Indexed: 01/16/2023] Open
Abstract
Many steps of the MHC class I antigen processing pathway can be predicted using computational methods. Here we show that epitope predictions can be further improved by considering abundance levels of peptides' source proteins. We utilized biophysical principles and existing MHC binding prediction tools in concert with abundance estimates of source proteins to derive a function that estimates the likelihood of a peptide to be an MHC class I ligand. We found that this combination improved predictions for both naturally eluted ligands and cancer neoantigen epitopes. We compared the use of different measures of antigen abundance, including mRNA expression by RNA-Seq, gene translation by Ribo-Seq, and protein abundance by proteomics on a dataset of SARS-CoV-2 epitopes. Epitope predictions were improved above binding predictions alone in all cases and gave the highest performance when using proteomic data. Our results highlight the value of incorporating antigen abundance levels to improve epitope predictions.
Collapse
Affiliation(s)
- Zeynep Koşaloğlu-Yalçın
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA 92037, USA
| | - Jenny Lee
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA 92037, USA
| | - Jason Greenbaum
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA 92037, USA
| | - Stephen P. Schoenberger
- Division of Hematology and Oncology, Center for Personalized Cancer Therapy, San Diego Moore's Cancer Center, University of California, San Diego, San Diego, CA, USA
- Laboratory of Cellular Immunology, La Jolla Institute for Immunology, La Jolla, CA 92037, USA
| | - Aaron Miller
- Division of Hematology and Oncology, Center for Personalized Cancer Therapy, San Diego Moore's Cancer Center, University of California, San Diego, San Diego, CA, USA
- Laboratory of Cellular Immunology, La Jolla Institute for Immunology, La Jolla, CA 92037, USA
| | - Young J. Kim
- Department of Otolaryngology-Head & Neck Surgery, Vanderbilt Ingram Cancer Center, Vanderbilt University Medical Center, Nashville, TN 37232, USA
| | - Alessandro Sette
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA 92037, USA
- Department of Medicine, University of California, San Diego, La Jolla Institute for Immunology, 9420 Athena Circle, La Jolla, CA 92037, USA
| | - Morten Nielsen
- Department of Health Technology, Technical University of Denmark, DK Lyngby, 2800, Denmark
- Instituto de Investigaciones Biotecnológicas, Universidad Nacional de San Martín, CP San Martín, B1650, Argentina
| | - Bjoern Peters
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA 92037, USA
- Department of Medicine, University of California, San Diego, La Jolla Institute for Immunology, 9420 Athena Circle, La Jolla, CA 92037, USA
| |
Collapse
|
53
|
Kaabinejadian S, Barra C, Alvarez B, Yari H, Hildebrand WH, Nielsen M. Accurate MHC Motif Deconvolution of Immunopeptidomics Data Reveals a Significant Contribution of DRB3, 4 and 5 to the Total DR Immunopeptidome. Front Immunol 2022; 13:835454. [PMID: 35154160 PMCID: PMC8826445 DOI: 10.3389/fimmu.2022.835454] [Citation(s) in RCA: 36] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2021] [Accepted: 01/10/2022] [Indexed: 01/23/2023] Open
Abstract
Mass spectrometry (MS) based immunopeptidomics is used in several biomedical applications including neo-epitope discovery in oncology, next-generation vaccine development and protein-drug immunogenicity assessment. Immunopeptidome data are highly complex given the expression of multiple HLA alleles on the cell membrane and presence of co-immunoprecipitated contaminants. The absence of tools that deal with these challenges effectively and guide the analysis and interpretation of this complex type of data is currently a major bottleneck for the large-scale application of this technique. To resolve this, we here present the MHCMotifDecon that benefits from state-of-the-art HLA class-I and class-II predictions to accurately deconvolute immunopeptidome datasets and assign individual ligands to the most likely HLA molecule, allowing to identify and characterize HLA binding motifs while discarding co-purified contaminants. We have benchmarked the tool against other state-of-the-art methods and illustrated its application on experimental datasets for HLA-DR demonstrating a previously underappreciated role for HLA-DRB3/4/5 molecules in defining HLA class II immune repertoires. With its ease of use, MHCMotifDecon can efficiently guide interpretation of immunopeptidome datasets, serving the discovery of novel T cell targets. MHCMotifDecon is available at https://services.healthtech.dtu.dk/service.php?MHCMotifDecon-1.0.
Collapse
Affiliation(s)
- Saghar Kaabinejadian
- Pure MHC, LLC., Oklahoma City, OK, United States.,Department of Microbiology and Immunology, University of Oklahoma Health Sciences Center, Oklahoma City, OK, United States
| | - Carolina Barra
- Department of Health Technology, Technical University of Denmark, Lyngby, Denmark
| | - Bruno Alvarez
- Instituto de Investigaciones Biotecnológicas, Universidad Nacional de San Martín, Buenos Aires, Argentina
| | - Hooman Yari
- Department of Microbiology and Immunology, University of Oklahoma Health Sciences Center, Oklahoma City, OK, United States
| | - William H Hildebrand
- Department of Microbiology and Immunology, University of Oklahoma Health Sciences Center, Oklahoma City, OK, United States
| | - Morten Nielsen
- Department of Health Technology, Technical University of Denmark, Lyngby, Denmark.,Instituto de Investigaciones Biotecnológicas, Universidad Nacional de San Martín, Buenos Aires, Argentina
| |
Collapse
|
54
|
Akbar R, Bashour H, Rawat P, Robert PA, Smorodina E, Cotet TS, Flem-Karlsen K, Frank R, Mehta BB, Vu MH, Zengin T, Gutierrez-Marcos J, Lund-Johansen F, Andersen JT, Greiff V. Progress and challenges for the machine learning-based design of fit-for-purpose monoclonal antibodies. MAbs 2022; 14:2008790. [PMID: 35293269 PMCID: PMC8928824 DOI: 10.1080/19420862.2021.2008790] [Citation(s) in RCA: 55] [Impact Index Per Article: 18.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2021] [Revised: 11/04/2021] [Accepted: 11/17/2021] [Indexed: 12/15/2022] Open
Abstract
Although the therapeutic efficacy and commercial success of monoclonal antibodies (mAbs) are tremendous, the design and discovery of new candidates remain a time and cost-intensive endeavor. In this regard, progress in the generation of data describing antigen binding and developability, computational methodology, and artificial intelligence may pave the way for a new era of in silico on-demand immunotherapeutics design and discovery. Here, we argue that the main necessary machine learning (ML) components for an in silico mAb sequence generator are: understanding of the rules of mAb-antigen binding, capacity to modularly combine mAb design parameters, and algorithms for unconstrained parameter-driven in silico mAb sequence synthesis. We review the current progress toward the realization of these necessary components and discuss the challenges that must be overcome to allow the on-demand ML-based discovery and design of fit-for-purpose mAb therapeutic candidates.
Collapse
Affiliation(s)
- Rahmad Akbar
- Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
| | - Habib Bashour
- School of Life Sciences, University of Warwick, Coventry, UK
| | - Puneet Rawat
- Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai, India
| | - Philippe A. Robert
- Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
| | - Eva Smorodina
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Russia
| | | | - Karine Flem-Karlsen
- Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
- Institute of Clinical Medicine, Department of Pharmacology, University of Oslo and Oslo University Hospital, Norway
| | - Robert Frank
- Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
| | - Brij Bhushan Mehta
- Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
| | - Mai Ha Vu
- Department of Linguistics and Scandinavian Studies, University of Oslo, Norway
| | - Talip Zengin
- Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
- Department of Bioinformatics, Mugla Sitki Kocman University, Turkey
| | | | | | - Jan Terje Andersen
- Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
- Institute of Clinical Medicine, Department of Pharmacology, University of Oslo and Oslo University Hospital, Norway
| | - Victor Greiff
- Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
| |
Collapse
|
55
|
Gartner JJ, Parkhurst MR, Gros A, Tran E, Jafferji MS, Copeland A, Hanada KI, Zacharakis N, Lalani A, Krishna S, Sachs A, Prickett TD, Li YF, Florentin M, Kivitz S, Chatmon SC, Rosenberg SA, Robbins PF. A machine learning model for ranking candidate HLA class I neoantigens based on known neoepitopes from multiple human tumor types. NATURE CANCER 2021; 2:563-574. [PMID: 34927080 DOI: 10.1038/s43018-021-00197-6] [Citation(s) in RCA: 50] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
Tumor neoepitopes presented by major histocompatibility complex (MHC) class I are recognized by tumor-infiltrating lymphocytes (TIL) and are targeted by adoptive T-cell therapies. Identifying which mutant neoepitopes from tumor cells are capable of recognition by T cells can assist in the development of tumor-specific, cell-based therapies and can shed light on antitumor responses. Here, we generate a ranking algorithm for class I candidate neoepitopes by using next-generation sequencing data and a dataset of 185 neoepitopes that are recognized by HLA class I-restricted TIL from individuals with metastatic cancer. Random forest model analysis showed that the inclusion of multiple factors impacting epitope presentation and recognition increased output sensitivity and specificity compared to the use of predicted HLA binding alone. The ranking score output provides a set of class I candidate neoantigens that may serve as therapeutic targets and provides a tool to facilitate in vitro and in vivo studies aimed at the development of more effective immunotherapies.
Collapse
Affiliation(s)
- Jared J Gartner
- Surgery Branch, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Maria R Parkhurst
- Surgery Branch, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Alena Gros
- Vall d'Hebron Institute of Oncology (VHIO), Cellex Center, Barcelona, Spain
| | - Eric Tran
- Earle A. Chiles Research Institute, Providence Cancer Institute, Portland, OR, USA
| | | | - Amy Copeland
- Surgery Branch, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Ken-Ichi Hanada
- Surgery Branch, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Nikolaos Zacharakis
- Surgery Branch, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Almin Lalani
- Surgery Branch, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Sri Krishna
- Surgery Branch, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Abraham Sachs
- Surgery Branch, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Todd D Prickett
- Surgery Branch, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Yong F Li
- Surgery Branch, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Maria Florentin
- Surgery Branch, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Scott Kivitz
- Surgery Branch, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Samuel C Chatmon
- Surgery Branch, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Steven A Rosenberg
- Surgery Branch, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Paul F Robbins
- Surgery Branch, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| |
Collapse
|
56
|
Fotakis G, Trajanoski Z, Rieder D. Computational cancer neoantigen prediction: current status and recent advances. IMMUNO-ONCOLOGY TECHNOLOGY 2021; 12:100052. [PMID: 35755950 PMCID: PMC9216660 DOI: 10.1016/j.iotech.2021.100052] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Over the last few decades, immunotherapy has shown significant therapeutic efficacy in a broad range of cancer types. Antitumor immune responses are contingent on the recognition of tumor-specific antigens, which are termed neoantigens. Tumor neoantigens are ideal targets for immunotherapy since they can be recognized as non-self antigens by the host immune system and thus are able to elicit an antitumor T-cell response. There are an increasing number of studies that highlight the importance of tumor neoantigens in immunoediting and in the sensitivity to immune checkpoint blockade. Therefore, one of the most fundamental tasks in the field of immuno-oncology research is the identification of patient-specific neoantigens. To this end, a plethora of computational approaches have been developed in order to predict tumor-specific aberrant peptides and quantify their likelihood of binding to patients' human leukocyte antigen molecules in order to be recognized by T cells. In this review, we systematically summarize and present the most recent advances in computational neoantigen prediction, and discuss the challenges and novel methods that are being developed to resolve them. Tumors have the ability to acquire immune escape mechanisms. Tumor-specific aberrant peptides (neoantigens) can elicit an immune response by the host immune system. The identification of neoantigens is one of the most fundamental tasks in the field of immuno-oncology research. A plethora of computational approaches have been developed in order to predict patient-specificneoantigens.
Collapse
Affiliation(s)
- G Fotakis
- Institute of Bioinformatics, Biocenter, Medical University of Innsbruck, Innsbruck, Austria
| | - Z Trajanoski
- Institute of Bioinformatics, Biocenter, Medical University of Innsbruck, Innsbruck, Austria
| | - D Rieder
- Institute of Bioinformatics, Biocenter, Medical University of Innsbruck, Innsbruck, Austria
| |
Collapse
|
57
|
Yarmarkovich M, Marshall QF, Warrington JM, Premaratne R, Farrel A, Groff D, Li W, di Marco M, Runbeck E, Truong H, Toor JS, Tripathi S, Nguyen S, Shen H, Noel T, Church NL, Weiner A, Kendsersky N, Martinez D, Weisberg R, Christie M, Eisenlohr L, Bosse KR, Dimitrov DS, Stevanovic S, Sgourakis NG, Kiefel BR, Maris JM. Cross-HLA targeting of intracellular oncoproteins with peptide-centric CARs. Nature 2021; 599:477-484. [PMID: 34732890 PMCID: PMC8599005 DOI: 10.1038/s41586-021-04061-6] [Citation(s) in RCA: 81] [Impact Index Per Article: 20.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2021] [Accepted: 09/23/2021] [Indexed: 12/27/2022]
Abstract
The majority of oncogenic drivers are intracellular proteins, thus constraining their immunotherapeutic targeting to mutated peptides (neoantigens) presented by individual human leukocyte antigen (HLA) allotypes1. However, most cancers have a modest mutational burden that is insufficient to generate responses using neoantigen-based therapies2,3. Neuroblastoma is a paediatric cancer that harbours few mutations and is instead driven by epigenetically deregulated transcriptional networks4. Here we show that the neuroblastoma immunopeptidome is enriched with peptides derived from proteins that are essential for tumourigenesis and focus on targeting the unmutated peptide QYNPIRTTF, discovered on HLA-A*24:02, which is derived from the neuroblastoma dependency gene and master transcriptional regulator PHOX2B. To target QYNPIRTTF, we developed peptide-centric chimeric antigen receptors (CARs) using a counter-panning strategy with predicted potentially cross-reactive peptides. We further hypothesized that peptide-centric CARs could recognize peptides on additional HLA allotypes when presented in a similar manner. Informed by computational modelling, we showed that PHOX2B peptide-centric CARs also recognize QYNPIRTTF presented by HLA-A*23:01 and the highly divergent HLA-B*14:02. Finally, we demonstrated potent and specific killing of neuroblastoma cells expressing these HLAs in vitro and complete tumour regression in mice. These data suggest that peptide-centric CARs have the potential to vastly expand the pool of immunotherapeutic targets to include non-immunogenic intracellular oncoproteins and widen the population of patients who would benefit from such therapy by breaking conventional HLA restriction.
Collapse
Affiliation(s)
- Mark Yarmarkovich
- Division of Oncology and Center for Childhood Cancer Research, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Quinlen F Marshall
- Division of Oncology and Center for Childhood Cancer Research, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - John M Warrington
- Division of Oncology and Center for Childhood Cancer Research, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | | | - Alvin Farrel
- Division of Oncology and Center for Childhood Cancer Research, Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - David Groff
- Division of Oncology and Center for Childhood Cancer Research, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Wei Li
- University of Pittsburgh, Pittsburgh, PA, USA
| | | | - Erin Runbeck
- Division of Oncology and Center for Childhood Cancer Research, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Hau Truong
- Division of Oncology and Center for Childhood Cancer Research, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Jugmohit S Toor
- Department of Chemistry and Biochemistry, University of California Santa Cruz, Santa Cruz, CA, USA
| | - Sarvind Tripathi
- Department of Chemistry and Biochemistry, University of California Santa Cruz, Santa Cruz, CA, USA
| | - Son Nguyen
- Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Helena Shen
- Division of Oncology and Center for Childhood Cancer Research, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Tiffany Noel
- Division of Oncology and Center for Childhood Cancer Research, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | | | - Amber Weiner
- Division of Oncology and Center for Childhood Cancer Research, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Nathan Kendsersky
- Division of Oncology and Center for Childhood Cancer Research, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Dan Martinez
- Department of Pathology and Lab Medicine, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Rebecca Weisberg
- Division of Oncology and Center for Childhood Cancer Research, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Molly Christie
- Division of Oncology and Center for Childhood Cancer Research, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Laurence Eisenlohr
- Department of Pathology and Lab Medicine, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Kristopher R Bosse
- Division of Oncology and Center for Childhood Cancer Research, Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Perelman School of Medicine at the University of Pennsylvania, Philadelphia, PA, USA
| | | | | | - Nikolaos G Sgourakis
- Division of Oncology and Center for Childhood Cancer Research, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | | | - John M Maris
- Division of Oncology and Center for Childhood Cancer Research, Children's Hospital of Philadelphia, Philadelphia, PA, USA.
- Perelman School of Medicine at the University of Pennsylvania, Philadelphia, PA, USA.
| |
Collapse
|
58
|
Connecting MHC-I-binding motifs with HLA alleles via deep learning. Commun Biol 2021; 4:1194. [PMID: 34663927 PMCID: PMC8523706 DOI: 10.1038/s42003-021-02716-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2021] [Accepted: 09/24/2021] [Indexed: 12/17/2022] Open
Abstract
The selection of peptides presented by MHC molecules is crucial for antigen discovery. Previously, several predictors have shown impressive performance on binding affinity. However, the decisive MHC residues and their relation to the selection of binding peptides are still unrevealed. Here, we connected HLA alleles with binding motifs via our deep learning-based framework, MHCfovea. MHCfovea expanded the knowledge of MHC-I-binding motifs from 150 to 13,008 alleles. After clustering N-terminal and C-terminal sub-motifs on both observed and unobserved alleles, MHCfovea calculated the hyper-motifs and the corresponding allele signatures on the important positions to disclose the relation between binding motifs and MHC-I sequences. MHCfovea delivered 32 pairs of hyper-motifs and allele signatures (HLA-A: 13, HLA-B: 12, and HLA-C: 7). The paired hyper-motifs and allele signatures disclosed the critical polymorphic residues that determine the binding preference, which are believed to be valuable for antigen discovery and vaccine design when allele specificity is concerned. Ko-Han Lee et al. develop MHCfovea, a machine-learning method for predicting peptide-binding by MHC molecules and inferring peptide motifs and MHC allele signatures. They demonstrate that MHCfovea is capable of detecting meaningful hyper-motifs and allele signatures, making it a useful resource for the community.
Collapse
|
59
|
Junet V, Daura X. CNN-PepPred: An open-source tool to create convolutional NN models for the discovery of patterns in peptide sets. Application to peptide-MHC class II binding prediction. Bioinformatics 2021; 37:4567-4568. [PMID: 34601583 PMCID: PMC8652105 DOI: 10.1093/bioinformatics/btab687] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2021] [Revised: 08/02/2021] [Accepted: 09/28/2021] [Indexed: 11/12/2022] Open
Abstract
SUMMARY The ability to unveil binding patterns in peptide sets has important applications in several biomedical areas, including the development of vaccines. We present an open-source tool, CNN-PepPred, that uses convolutional neural networks to discover such patterns, along with its application to peptide-HLA class II binding prediction. The tool can be used locally on different operating systems, with CPUs or GPUs, to train, evaluate, apply and visualize models. AVAILABILITY AND IMPLEMENTATION CNN-PepPred is freely available as a Python tool with a detailed User's Guide at: https://github.com/ComputBiol-IBB/CNN-PepPred. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Valentin Junet
- Anaxomics Biotech SL, Barcelona, 08008, Spain.,Institute of Biotechnology and Biomedicine, Universitat Autònoma de Barcelona, 08193, Spain
| | - Xavier Daura
- Institute of Biotechnology and Biomedicine, Universitat Autònoma de Barcelona, 08193, Spain.,Catalan Institution for Research and Advanced Studies (ICREA), Barcelona, 08010, Spain.,Biomedical Research Networking Center in Bioengineering, Biomaterials and Nanomedicine (CIBER-BBN), Spain
| |
Collapse
|
60
|
Chen I, Chen MY, Goedegebuure SP, Gillanders WE. Challenges targeting cancer neoantigens in 2021: a systematic literature review. Expert Rev Vaccines 2021; 20:827-837. [PMID: 34047245 DOI: 10.1080/14760584.2021.1935248] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Introduction: Cancer neoantigens represent important targets of cancer immunotherapy. The goal of cancer neoantigen vaccines is to induce neoantigen-specific immune responses and antitumor immunity while minimizing the potential for autoimmune toxicity. Advances in sequencing technologies, neoantigen prediction algorithms, and other technologies have dramatically improved the ability to identify and prioritize cancer neoantigens. Unfortunately, results from preclinical studies and early phase clinical trials highlight important challenges to the successful clinical translation of neoantigen cancer vaccines.Areas covered: In this review, we provide an overview of current strategies for the identification and prioritization of cancer neoantigens with a particular emphasis on the two most common strategies used for neoantigen identification: (1) direct identification of peptide ligands eluted from peptide-MHC complexes, and (2) next-generation sequencing combined with neoantigen prediction algorithms. We highlight the limitations of current neoantigen prediction pipelines, and discuss broader challenges associated with cancer neoantigen vaccines including tumor purity/heterogeneity and the immunosuppressive tumor microenvironment.Expert opinion: Despite current limitations, neoantigen prediction is likely to improve rapidly based on advances in sequencing, machine learning, and information sharing. The successful development of robust cancer neoantigen prediction strategies is likely to have a significant impact, with the potential to facilitate cancer neoantigen vaccine design.
Collapse
Affiliation(s)
- Ina Chen
- Department of Surgery, Washington University and Siteman Cancer Center in St. Louis, St Louis, Missouri, USA
| | - Michael Y Chen
- Department of Surgery, Washington University and Siteman Cancer Center in St. Louis, St Louis, Missouri, USA
| | - S Peter Goedegebuure
- Department of Surgery, Washington University and Siteman Cancer Center in St. Louis, St Louis, Missouri, USA.,The Alvin J. Siteman Cancer Center at Barnes-Jewish Hospital and Washington University School of Medicine, St Louis, MO, USA
| | - William E Gillanders
- Department of Surgery, Washington University and Siteman Cancer Center in St. Louis, St Louis, Missouri, USA.,The Alvin J. Siteman Cancer Center at Barnes-Jewish Hospital and Washington University School of Medicine, St Louis, MO, USA
| |
Collapse
|
61
|
Cheng J, Bendjama K, Rittner K, Malone B. BERTMHC: Improved MHC-peptide class II interaction prediction with transformer and multiple instance learning. Bioinformatics 2021; 37:4172-4179. [PMID: 34096999 PMCID: PMC9502151 DOI: 10.1093/bioinformatics/btab422] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2020] [Revised: 05/17/2021] [Accepted: 06/04/2021] [Indexed: 11/12/2022] Open
Abstract
Motivation Increasingly comprehensive characterization of cancer-associated genetic alterations has paved the way for the development of highly specific therapeutic vaccines. Predicting precisely the binding and presentation of peptides to major histocompatibility complex (MHC) alleles is an important step toward such therapies. Recent data suggest that presentation of both class I and II epitopes are critical for the induction of a sustained effective immune response. However, the prediction performance for MHC class II has been limited compared to class I. Results We present a transformer neural network model which leverages self-supervised pretraining from a large corpus of protein sequences. We also propose a multiple instance learning (MIL) framework to deconvolve mass spectrometry data where multiple potential MHC alleles may have presented each peptide. We show that pretraining boosted the performance for these tasks. Combining pretraining and the novel MIL approach, our model outperforms state-of-the-art models based on peptide and MHC sequence only for both binding and cell surface presentation predictions. Availability and implementation Our source code is available at https://github.com/s6juncheng/BERTMHC under a noncommercial license. A webserver is available at https://bertmhc.privacy.nlehd.de/ Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Jun Cheng
- NEC Laboratories Europe GmbH Kurfuersten-Anlage 36, 69115 Heidelberg, Germany
| | - Kaïdre Bendjama
- Transgene, Boulevard Gonthier d'Andernach, 67400 Illkirch-Graffenstaden, France
| | - Karola Rittner
- Transgene, Boulevard Gonthier d'Andernach, 67400 Illkirch-Graffenstaden, France
| | - Brandon Malone
- NEC Laboratories Europe GmbH Kurfuersten-Anlage 36, 69115 Heidelberg, Germany
| |
Collapse
|
62
|
Hwang W, Lei W, Katritsis NM, MacMahon M, Chapman K, Han N. Current and prospective computational approaches and challenges for developing COVID-19 vaccines. Adv Drug Deliv Rev 2021; 172:249-274. [PMID: 33561453 PMCID: PMC7871111 DOI: 10.1016/j.addr.2021.02.004] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2020] [Revised: 02/01/2021] [Accepted: 02/03/2021] [Indexed: 12/23/2022]
Abstract
SARS-CoV-2, which causes COVID-19, was first identified in humans in late 2019 and is a coronavirus which is zoonotic in origin. As it spread around the world there has been an unprecedented effort in developing effective vaccines. Computational methods can be used to speed up the long and costly process of vaccine development. Antigen selection, epitope prediction, and toxicity and allergenicity prediction are areas in which computational tools have already been applied as part of reverse vaccinology for SARS-CoV-2 vaccine development. However, there is potential for computational methods to assist further. We review approaches which have been used and highlight additional bioinformatic approaches and PK modelling as in silico methods which may be useful for SARS-CoV-2 vaccine design but remain currently unexplored. As more novel viruses with pandemic potential are expected to arise in future, these techniques are not limited to application to SARS-CoV-2 but also useful to rapidly respond to novel emerging viruses.
Collapse
Affiliation(s)
- Woochang Hwang
- Milner Therapeutics Institute, University of Cambridge, Cambridge, UK
| | - Winnie Lei
- Milner Therapeutics Institute, University of Cambridge, Cambridge, UK; Department of Surgery, University of Cambridge, Cambridge, UK
| | - Nicholas M Katritsis
- Milner Therapeutics Institute, University of Cambridge, Cambridge, UK; Department of Chemical Engineering and Biotechnology, University of Cambridge, Cambridge, UK
| | - Méabh MacMahon
- Milner Therapeutics Institute, University of Cambridge, Cambridge, UK; Centre for Therapeutics Discovery, LifeArc, Stevenage, UK
| | - Kathryn Chapman
- Milner Therapeutics Institute, University of Cambridge, Cambridge, UK
| | - Namshik Han
- Milner Therapeutics Institute, University of Cambridge, Cambridge, UK.
| |
Collapse
|
63
|
Pertseva M, Gao B, Neumeier D, Yermanos A, Reddy ST. Applications of Machine and Deep Learning in Adaptive Immunity. Annu Rev Chem Biomol Eng 2021; 12:39-62. [PMID: 33852352 DOI: 10.1146/annurev-chembioeng-101420-125021] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Adaptive immunity is mediated by lymphocyte B and T cells, which respectively express a vast and diverse repertoire of B cell and T cell receptors and, in conjunction with peptide antigen presentation through major histocompatibility complexes (MHCs), can recognize and respond to pathogens and diseased cells. In recent years, advances in deep sequencing have led to a massive increase in the amount of adaptive immune receptor repertoire data; additionally, proteomics techniques have led to a wealth of data on peptide-MHC presentation. These large-scale data sets are now making it possible to train machine and deep learning models, which can be used to identify complex and high-dimensional patterns in immune repertoires. This article introduces adaptive immune repertoires and machine and deep learning related to biological sequence data and then summarizes the many applications in this field, which span from predicting the immunological status of a host to the antigen specificity of individual receptors and the engineering of immunotherapeutics.
Collapse
Affiliation(s)
- Margarita Pertseva
- Department of Biosystems Science and Engineering, ETH Zurich, 4058 Basel, Switzerland; .,Life Science Zurich Graduate School, ETH Zurich and University of Zurich, 8006 Zurich, Switzerland
| | - Beichen Gao
- Department of Biosystems Science and Engineering, ETH Zurich, 4058 Basel, Switzerland;
| | - Daniel Neumeier
- Department of Biosystems Science and Engineering, ETH Zurich, 4058 Basel, Switzerland;
| | - Alexander Yermanos
- Department of Biosystems Science and Engineering, ETH Zurich, 4058 Basel, Switzerland; .,Department of Pathology and Immunology, University of Geneva, 1205 Geneva, Switzerland.,Department of Biology, Institute of Microbiology and Immunology, ETH Zurich, 8093 Zurich, Switzerland
| | - Sai T Reddy
- Department of Biosystems Science and Engineering, ETH Zurich, 4058 Basel, Switzerland;
| |
Collapse
|
64
|
Fisch A, Reynisson B, Benedictus L, Nicastri A, Vasoya D, Morrison I, Buus S, Ferreira BR, Kinney Ferreira de Miranda Santos I, Ternette N, Connelley T, Nielsen M. Integral Use of Immunopeptidomics and Immunoinformatics for the Characterization of Antigen Presentation and Rational Identification of BoLA-DR-Presented Peptides and Epitopes. THE JOURNAL OF IMMUNOLOGY 2021; 206:2489-2497. [PMID: 33789985 DOI: 10.4049/jimmunol.2001409] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/16/2020] [Accepted: 03/01/2021] [Indexed: 02/04/2023]
Abstract
MHC peptide binding and presentation is the most selective event defining the landscape of T cell epitopes. Consequently, understanding the diversity of MHC alleles in a given population and the parameters that define the set of ligands that can be bound and presented by each of these alleles (the immunopeptidome) has an enormous impact on our capacity to predict and manipulate the potential of protein Ags to elicit functional T cell responses. Liquid chromatography-mass spectrometry analysis of MHC-eluted ligand data has proven to be a powerful technique for identifying such peptidomes, and methods integrating such data for prediction of Ag presentation have reached a high level of accuracy for both MHC class I and class II. In this study, we demonstrate how these techniques and prediction methods can be readily extended to the bovine leukocyte Ag class II DR locus (BoLA-DR). BoLA-DR binding motifs were characterized by eluted ligand data derived from bovine cell lines expressing a range of DRB3 alleles prevalent in Holstein-Friesian populations. The model generated (NetBoLAIIpan, available as a Web server at www.cbs.dtu.dk/services/NetBoLAIIpan) was shown to have unprecedented predictive power to identify known BoLA-DR-restricted CD4 epitopes. In summary, the results demonstrate the power of an integrated approach combining advanced mass spectrometry peptidomics with immunoinformatics for characterization of the BoLA-DR Ag presentation system and provide a prediction tool that can be used to assist in rational evaluation and selection of bovine CD4 T cell epitopes.
Collapse
Affiliation(s)
- Andressa Fisch
- Ribeirão Preto College of Nursing, University of São Paulo, Av Bandeirantes, Ribeirão Preto, Brazil
| | - Birkir Reynisson
- Department of Health Technology, Technical University of Denmark, Lyngby, Denmark
| | | | - Annalisa Nicastri
- The Jenner Institute, Nuffield Department of Medicine, Oxford, United Kingdom
| | - Deepali Vasoya
- The Roslin Institute, Edinburgh, Midlothian, United Kingdom
| | - Ivan Morrison
- The Roslin Institute, Edinburgh, Midlothian, United Kingdom
| | - Søren Buus
- Laboratory of Experimental Immunology, Department of Immunology and Microbiology, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | | | | | - Nicola Ternette
- The Jenner Institute, Nuffield Department of Medicine, Oxford, United Kingdom
| | - Tim Connelley
- The Roslin Institute, Edinburgh, Midlothian, United Kingdom
| | - Morten Nielsen
- Department of Health Technology, Technical University of Denmark, Lyngby, Denmark .,Instituto de Investigaciones Biotecnológicas, Universidad Nacional de San Martín, San Martín, Argentina
| |
Collapse
|
65
|
Roerden M, Märklin M, Salih HR, Bethge WA, Klein R, Rammensee HG, Nelde A, Walz JS. Expression levels of HLA-DR in acute myeloid leukemia: implications for antigenicity and clinical outcome. Leuk Lymphoma 2021; 62:1907-1919. [PMID: 33648413 DOI: 10.1080/10428194.2021.1885659] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
Abstract
Low human leukocyte antigen (HLA)-DR expression might compromise CD4+ T-cell-mediated anti-tumor immunity. Its immunological and clinical significance however remain undefined in non-promyelocytic acute myeloid leukemia (AML). Taking advantage of mass spectrometry-based immunopeptidome analysis of primary AML samples (n = 31), we studied the implications of low HLA-DR expression for antigen presentation and analyzed its association with disease characteristics and survival within a cohort of 399 AML patients. Remarkably, overall HLA-DR/DQ immunopeptidome diversity was preserved in AML with low HLA-DR expression (HLA-DRlow AML) and was associated with a shift in HLA-DR/DQ allotype abundances (HLA-DQ to HLA-DR/DQ ligand ratio 0.36 vs 0.19 in HLA-DRlow and HLA-DRhigh patients, respectively). Consistent with unimpaired antigenicity, survival was similar in HLA-DRlow and HLA-DRhigh patients. Demonstrating for the first time that overall HLA-DR/DQ antigen presentation is preserved in HLA-DRlow AML, our findings provide a rationale for the non-inferior outcome observed in HLA-DRlow AML patients.
Collapse
Affiliation(s)
- Malte Roerden
- Department of Hematology, Oncology, Clinical Immunology and Rheumatology, University Hospital Tübingen, Tübingen, Germany.,Institute for Cell Biology, Department of Immunology, University of Tübingen, Tübingen, Germany.,Cluster of Excellence iFIT (EXC2180) "Image-Guided and Functionally Instructed Tumor Therapies", University of Tübingen, Tübingen, Germany
| | - Melanie Märklin
- Cluster of Excellence iFIT (EXC2180) "Image-Guided and Functionally Instructed Tumor Therapies", University of Tübingen, Tübingen, Germany.,Clinical Collaboration Unit Translational Immunology, German Cancer Consortium (DKTK), Department of Internal Medicine, University Hospital Tübingen, Tübingen, Germany
| | - Helmut R Salih
- Cluster of Excellence iFIT (EXC2180) "Image-Guided and Functionally Instructed Tumor Therapies", University of Tübingen, Tübingen, Germany.,Clinical Collaboration Unit Translational Immunology, German Cancer Consortium (DKTK), Department of Internal Medicine, University Hospital Tübingen, Tübingen, Germany
| | - Wolfgang A Bethge
- Department of Hematology, Oncology, Clinical Immunology and Rheumatology, University Hospital Tübingen, Tübingen, Germany
| | - Reinhild Klein
- Department of Hematology, Oncology, Clinical Immunology and Rheumatology, University Hospital Tübingen, Tübingen, Germany
| | - Hans-Georg Rammensee
- Institute for Cell Biology, Department of Immunology, University of Tübingen, Tübingen, Germany.,Cluster of Excellence iFIT (EXC2180) "Image-Guided and Functionally Instructed Tumor Therapies", University of Tübingen, Tübingen, Germany.,German Cancer Consortium (DKTK), DKFZ partner site Tübingen, Tübingen, Germany
| | - Annika Nelde
- Institute for Cell Biology, Department of Immunology, University of Tübingen, Tübingen, Germany.,Cluster of Excellence iFIT (EXC2180) "Image-Guided and Functionally Instructed Tumor Therapies", University of Tübingen, Tübingen, Germany.,Clinical Collaboration Unit Translational Immunology, German Cancer Consortium (DKTK), Department of Internal Medicine, University Hospital Tübingen, Tübingen, Germany
| | - Juliane S Walz
- Institute for Cell Biology, Department of Immunology, University of Tübingen, Tübingen, Germany.,Cluster of Excellence iFIT (EXC2180) "Image-Guided and Functionally Instructed Tumor Therapies", University of Tübingen, Tübingen, Germany.,Clinical Collaboration Unit Translational Immunology, German Cancer Consortium (DKTK), Department of Internal Medicine, University Hospital Tübingen, Tübingen, Germany
| |
Collapse
|
66
|
Mei S, Li F, Xiang D, Ayala R, Faridi P, Webb GI, Illing PT, Rossjohn J, Akutsu T, Croft NP, Purcell AW, Song J. Anthem: a user customised tool for fast and accurate prediction of binding between peptides and HLA class I molecules. Brief Bioinform 2021; 22:6102669. [PMID: 33454737 DOI: 10.1093/bib/bbaa415] [Citation(s) in RCA: 35] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2020] [Revised: 11/29/2020] [Accepted: 12/16/2020] [Indexed: 12/17/2022] Open
Abstract
Neopeptide-based immunotherapy has been recognised as a promising approach for the treatment of cancers. For neopeptides to be recognised by CD8+ T cells and induce an immune response, their binding to human leukocyte antigen class I (HLA-I) molecules is a necessary first step. Most epitope prediction tools thus rely on the prediction of such binding. With the use of mass spectrometry, the scale of naturally presented HLA ligands that could be used to develop such predictors has been expanded. However, there are rarely efforts that focus on the integration of these experimental data with computational algorithms to efficiently develop up-to-date predictors. Here, we present Anthem for accurate HLA-I binding prediction. In particular, we have developed a user-friendly framework to support the development of customisable HLA-I binding prediction models to meet challenges associated with the rapidly increasing availability of large amounts of immunopeptidomic data. Our extensive evaluation, using both independent and experimental datasets shows that Anthem achieves an overall similar or higher area under curve value compared with other contemporary tools. It is anticipated that Anthem will provide a unique opportunity for the non-expert user to analyse and interpret their own in-house or publicly deposited datasets.
Collapse
Affiliation(s)
- Shutao Mei
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Australia
| | - Fuyi Li
- Department of Microbiology and Immunology, The Peter Doherty Institute for Infection and Immunity, The University of Melbourne, Australia
| | - Dongxu Xiang
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Australia
| | - Rochelle Ayala
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Australia
| | - Pouya Faridi
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Australia
| | | | - Patricia T Illing
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Australia
| | - Jamie Rossjohn
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Australia
| | - Tatsuya Akutsu
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Japan
| | - Nathan P Croft
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Australia
| | - Anthony W Purcell
- Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Australia
| | - Jiangning Song
- Monash Biomedicine Discovery Institute and Biochemistry and Molecular Biology, Monash University, Australia
| |
Collapse
|
67
|
Koşaloğlu-Yalçın Z, Sidney J, Chronister W, Peters B, Sette A. Comparison of HLA ligand elution data and binding predictions reveals varying prediction performance for the multiple motifs recognized by HLA-DQ2.5. Immunology 2020; 162:235-247. [PMID: 33064841 PMCID: PMC7808151 DOI: 10.1111/imm.13279] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2020] [Revised: 10/06/2020] [Accepted: 10/07/2020] [Indexed: 12/02/2022] Open
Abstract
Binding prediction tools are commonly used to identify peptides presented on MHC class II molecules. Recently, a wealth of data in the form of naturally eluted ligands has become available and discrepancies between ligand elution data and binding predictions have been reported. Quantitative metrics for such comparisons are currently lacking. In this study, we assessed how efficiently MHC class II binding predictions can identify naturally eluted peptides, and investigated instances with discrepancies between the two methods in detail. We found that, in general, MHC class II eluted ligands are predicted to bind to their reported restriction element with high affinity. But, for several studies reporting an increased number of ligands that were not predicted to bind, we found that the reported MHC restriction was ambiguous. Additional analyses determined that most of the ligands predicted to not bind, are predicted to bind other co‐expressed MHC class II molecules. For selected alleles, we addressed discrepancies between elution data and binding predictions by experimental measurements and found that predicted and measured affinities correlate well. For DQA1*05:01/DQB1*02:01 (DQ2.5) however, binding predictions did miss several peptides that were determined experimentally to be binders. For these peptides and several known DQ2.5 binders, we determined key residues for conferring DQ2.5 binding capacity, which revealed that DQ2.5 utilizes two different binding motifs, of which only one is predicted effectively. These findings have important implications for the interpretation of ligand elution data and for the improvement of MHC class II binding predictions.
Collapse
Affiliation(s)
| | - John Sidney
- La Jolla Institute for Immunology, La Jolla, CA, USA
| | | | - Bjoern Peters
- La Jolla Institute for Immunology, La Jolla, CA, USA.,Department of Medicine, University of California, San Diego, La Jolla, CA, USA
| | - Alessandro Sette
- La Jolla Institute for Immunology, La Jolla, CA, USA.,Department of Medicine, University of California, San Diego, La Jolla, CA, USA
| |
Collapse
|
68
|
Attermann AS, Barra C, Reynisson B, Schultz HS, Leurs U, Lamberth K, Nielsen M. Improved prediction of HLA antigen presentation hotspots: Applications for immunogenicity risk assessment of therapeutic proteins. Immunology 2020; 162:208-219. [PMID: 33010039 DOI: 10.1111/imm.13274] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2020] [Revised: 09/15/2020] [Accepted: 09/16/2020] [Indexed: 12/11/2022] Open
Abstract
Immunogenicity risk assessment is a critical element in protein drug development. Currently, the risk assessment is most often performed using MHC-associated peptide proteomics (MAPPs) and/or T-cell activation assays. However, this is a highly costly procedure that encompasses limited sensitivity imposed by sample sizes, the MHC repertoire of the tested donor cohort and the experimental procedures applied. Recent work has suggested that these techniques could be complemented by accurate, high-throughput and cost-effective prediction of in silico models. However, this work covered a very limited set of therapeutic proteins and eluted ligand (EL) data. Here, we resolved these limitations by showcasing, in a broader setting, the versatility of in silico models for assessment of protein drug immunogenicity. A method for prediction of MHC class II antigen presentation was developed on the hereto largest available mass spectrometry (MS) HLA-DR EL data set. Using independent test sets, the performance of the method for prediction of HLA-DR antigen presentation hotspots was benchmarked. In particular, the method was showcased on a set of protein sequences including four therapeutic proteins and demonstrated to accurately predict the experimental MS hotspot regions at a significantly lower false-positive rate compared with other methods. This gain in performance was particularly pronounced when compared to the NetMHCIIpan-3.2 method trained on binding affinity data. These results suggest that in silico methods trained on MS HLA EL data can effectively and accurately be used to complement MAPPs assays for the risk assessment of protein drugs.
Collapse
Affiliation(s)
| | - Carolina Barra
- Department of Health Technology, Technical University of Denmark, Lyngby, Denmark
| | - Birkir Reynisson
- Department of Health Technology, Technical University of Denmark, Lyngby, Denmark
| | - Heidi Schiøler Schultz
- Assay, Analysis & Characterisation, Global Research Technologies, Novo Nordisk A/S, Måløv, Denmark
| | - Ulrike Leurs
- Assay, Analysis & Characterisation, Global Research Technologies, Novo Nordisk A/S, Måløv, Denmark
| | - Kasper Lamberth
- Assay, Analysis & Characterisation, Global Research Technologies, Novo Nordisk A/S, Måløv, Denmark
| | - Morten Nielsen
- Department of Health Technology, Technical University of Denmark, Lyngby, Denmark.,Instituto de Investigaciones Biotecnológicas, Universidad Nacional de San Martín, Buenos Aires, Argentina
| |
Collapse
|
69
|
Stryhn A, Kongsgaard M, Rasmussen M, Harndahl MN, Østerbye T, Bassi MR, Thybo S, Gabriel M, Hansen MB, Nielsen M, Christensen JP, Randrup Thomsen A, Buus S. A Systematic, Unbiased Mapping of CD8 + and CD4 + T Cell Epitopes in Yellow Fever Vaccinees. Front Immunol 2020; 11:1836. [PMID: 32983097 PMCID: PMC7489334 DOI: 10.3389/fimmu.2020.01836] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2020] [Accepted: 07/08/2020] [Indexed: 12/30/2022] Open
Abstract
Examining CD8+ and CD4+ T cell responses after primary Yellow Fever vaccination in a cohort of 210 volunteers, we have identified and tetramer-validated 92 CD8+ and 50 CD4+ T cell epitopes, many inducing strong and prevalent (i.e., immunodominant) T cell responses. Restricted by 40 and 14 HLA-class I and II allotypes, respectively, these responses have wide population coverage and might be of considerable academic, diagnostic and therapeutic interest. The broad coverage of epitopes and HLA overcame the otherwise confounding effects of HLA diversity and non-HLA background providing the first evidence of T cell immunodomination in humans. Also, double-staining of CD4+ T cells with tetramers representing the same HLA-binding core, albeit with different flanking regions, demonstrated an extensive diversification of the specificities of many CD4+ T cell responses. We suggest that this could reduce the risk of pathogen escape, and that multi-tetramer staining is required to reveal the true magnitude and diversity of CD4+ T cell responses. Our T cell epitope discovery approach uses a combination of (1) overlapping peptides representing the entire Yellow Fever virus proteome to search for peptides containing CD4+ and/or CD8+ T cell epitopes, (2) predictors of peptide-HLA binding to suggest epitopes and their restricting HLA allotypes, (3) generation of peptide-HLA tetramers to identify T cell epitopes, and (4) analysis of ex vivo T cell responses to validate the same. This approach is systematic, exhaustive, and can be done in any individual of any HLA haplotype. It is all-inclusive in the sense that it includes all protein antigens and peptide epitopes, and encompasses both CD4+ and CD8+ T cell epitopes. It is efficient and, importantly, reduces the false discovery rate. The unbiased nature of the T cell epitope discovery approach presented here should support the refinement of future peptide-HLA class I and II predictors and tetramer technologies, which eventually should cover all HLA class I and II isotypes. We believe that future investigations of emerging pathogens (e.g., SARS-CoV-2) should include population-wide T cell epitope discovery using blood samples from patients, convalescents and/or long-term survivors, who might all hold important information on T cell epitopes and responses.
Collapse
Affiliation(s)
- Anette Stryhn
- Department of Immunology and Microbiology, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Michael Kongsgaard
- Department of Immunology and Microbiology, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Michael Rasmussen
- Department of Immunology and Microbiology, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Mikkel Nors Harndahl
- Department of Immunology and Microbiology, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Thomas Østerbye
- Department of Immunology and Microbiology, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Maria Rosaria Bassi
- Department of Immunology and Microbiology, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Søren Thybo
- Department of Infectious Diseases, Copenhagen University Hospital, Copenhagen, Denmark
| | | | - Morten Bagge Hansen
- Department of Clinical Immunology, Copenhagen University Hospital, Copenhagen, Denmark
| | - Morten Nielsen
- Department of Health Technology, The Technical University of Denmark, Lyngby, Denmark
- Instituto de Investigaciones Biotecnológicas, Universidad Nacional de San Martín, Buenos Aires, Argentina
| | - Jan Pravsgaard Christensen
- Department of Immunology and Microbiology, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Allan Randrup Thomsen
- Department of Immunology and Microbiology, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Soren Buus
- Department of Immunology and Microbiology, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
70
|
Immunoinformatic-Based Prediction of Candidate Epitopes for the Diagnosis and Control of Paratuberculosis (Johne's Disease). Pathogens 2020; 9:pathogens9090705. [PMID: 32867087 PMCID: PMC7558617 DOI: 10.3390/pathogens9090705] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2020] [Revised: 08/19/2020] [Accepted: 08/24/2020] [Indexed: 01/22/2023] Open
Abstract
Paratuberculosis is an infectious disease of ruminants caused by Mycobacterium avium subsp. paratuberculosis (MAP). MAP is an intracellular pathogen with a possible zoonotic potential since it has been successfully isolated from the intestine and blood of Crohn’s disease patients.Since no cure is available, after the detection of the disease, animal culling is the sole applicable containment strategy. However, the difficult detection of the disease in its subclinical form, facilitates its spread raising the need for the development of effective diagnosis and vaccination strategies. The prompt identification and isolation of the infected animals in the subclinical stage would prevent the spread of the infection.In the present study, an immunoinformatic approach has been used to investigate the immunogenic properties of 10 MAP proteins. These proteins were chosen according to a previously published immunoproteomics approach. For each previously-described immunoreactive protein, we predicted the epitopes capable of eliciting an immune response by binding both B-cells and/or class I MHC antigens. The retrieved peptide sequences were analyzed for their specificity and cross-reactivity. The final aim is to employ the discovered peptides sequences as a filtered library useful for early-stage diagnosis and/or to be used in novel multi-subunit or recombinant vaccine formulations.
Collapse
|
71
|
Wendorff M, Garcia Alvarez HM, Østerbye T, ElAbd H, Rosati E, Degenhardt F, Buus S, Franke A, Nielsen M. Unbiased Characterization of Peptide-HLA Class II Interactions Based on Large-Scale Peptide Microarrays; Assessment of the Impact on HLA Class II Ligand and Epitope Prediction. Front Immunol 2020; 11:1705. [PMID: 32903714 PMCID: PMC7438773 DOI: 10.3389/fimmu.2020.01705] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2020] [Accepted: 06/25/2020] [Indexed: 12/12/2022] Open
Abstract
Human Leukocyte Antigen class II (HLA-II) molecules present peptides to T lymphocytes and play an important role in adaptive immune responses. Characterizing the binding specificity of single HLA-II molecules has profound impacts for understanding cellular immunity, identifying the cause of autoimmune diseases, for immunotherapeutics, and vaccine development. Here, novel high-density peptide microarray technology combined with machine learning techniques were used to address this task at an unprecedented level of high-throughput. Microarrays with over 200,000 defined peptides were assayed with four exemplary HLA-II molecules. Machine learning was applied to mine the signals. The comparison of identified binding motifs, and power for predicting eluted ligands and CD4+ epitope datasets to that obtained using NetMHCIIpan-3.2, confirmed a high quality of the chip readout. These results suggest that the proposed microarray technology offers a novel and unique platform for large-scale unbiased interrogation of peptide binding preferences of HLA-II molecules.
Collapse
Affiliation(s)
- Mareike Wendorff
- Genetics & Bioinformatics, Institute of Clinical Molecular Biology, Christian-Albrechts-University of Kiel, Kiel, Germany
| | | | - Thomas Østerbye
- Department of Immunology and Microbiology, University of Copenhagen, Copenhagen, Denmark
| | - Hesham ElAbd
- Genetics & Bioinformatics, Institute of Clinical Molecular Biology, Christian-Albrechts-University of Kiel, Kiel, Germany
| | - Elisa Rosati
- Genetics & Bioinformatics, Institute of Clinical Molecular Biology, Christian-Albrechts-University of Kiel, Kiel, Germany
| | - Frauke Degenhardt
- Genetics & Bioinformatics, Institute of Clinical Molecular Biology, Christian-Albrechts-University of Kiel, Kiel, Germany
| | - Søren Buus
- Department of Immunology and Microbiology, University of Copenhagen, Copenhagen, Denmark
| | - Andre Franke
- Genetics & Bioinformatics, Institute of Clinical Molecular Biology, Christian-Albrechts-University of Kiel, Kiel, Germany
| | - Morten Nielsen
- IIBIO, UNSAM-CONICET, Buenos Aires, Argentina.,Department of Health Technology, Technical University of Denmark, Lyngby, Denmark
| |
Collapse
|
72
|
Reynisson B, Alvarez B, Paul S, Peters B, Nielsen M. NetMHCpan-4.1 and NetMHCIIpan-4.0: improved predictions of MHC antigen presentation by concurrent motif deconvolution and integration of MS MHC eluted ligand data. Nucleic Acids Res 2020; 48:W449-W454. [PMID: 32406916 PMCID: PMC7319546 DOI: 10.1093/nar/gkaa379] [Citation(s) in RCA: 1116] [Impact Index Per Article: 223.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2020] [Revised: 04/17/2020] [Accepted: 04/29/2020] [Indexed: 12/12/2022] Open
Abstract
Major histocompatibility complex (MHC) molecules are expressed on the cell surface, where they present peptides to T cells, which gives them a key role in the development of T-cell immune responses. MHC molecules come in two main variants: MHC Class I (MHC-I) and MHC Class II (MHC-II). MHC-I predominantly present peptides derived from intracellular proteins, whereas MHC-II predominantly presents peptides from extracellular proteins. In both cases, the binding between MHC and antigenic peptides is the most selective step in the antigen presentation pathway. Therefore, the prediction of peptide binding to MHC is a powerful utility to predict the possible specificity of a T-cell immune response. Commonly MHC binding prediction tools are trained on binding affinity or mass spectrometry-eluted ligands. Recent studies have however demonstrated how the integration of both data types can boost predictive performances. Inspired by this, we here present NetMHCpan-4.1 and NetMHCIIpan-4.0, two web servers created to predict binding between peptides and MHC-I and MHC-II, respectively. Both methods exploit tailored machine learning strategies to integrate different training data types, resulting in state-of-the-art performance and outperforming their competitors. The servers are available at http://www.cbs.dtu.dk/services/NetMHCpan-4.1/ and http://www.cbs.dtu.dk/services/NetMHCIIpan-4.0/.
Collapse
Affiliation(s)
- Birkir Reynisson
- Department of Bio and Health Informatics, Technical University of Denmark, Kgs. Lyngby, DK 28002, Denmark
| | - Bruno Alvarez
- Instituto de Investigaciones Biotecnológicas, Universidad Nacional de San Martín, Buenos Aires, BA 16503, Argentina
| | - Sinu Paul
- La Jolla Institute for Immunology, La Jolla, CA 92037, USA
| | - Bjoern Peters
- La Jolla Institute for Immunology, La Jolla, CA 92037, USA
- Department of Medicine, University of California, San Diego, CA 92093, USA
| | - Morten Nielsen
- Department of Bio and Health Informatics, Technical University of Denmark, Kgs. Lyngby, DK 28002, Denmark
- Instituto de Investigaciones Biotecnológicas, Universidad Nacional de San Martín, Buenos Aires, BA 16503, Argentina
| |
Collapse
|
73
|
Abstract
Immunoinformatics is a discipline that applies methods of computer science to study and model the immune system. A fundamental question addressed by immunoinformatics is how to understand the rules of antigen presentation by MHC molecules to T cells, a process that is central to adaptive immune responses to infections and cancer. In the modern era of personalized medicine, the ability to model and predict which antigens can be presented by MHC is key to manipulating the immune system and designing strategies for therapeutic intervention. Since the MHC is both polygenic and extremely polymorphic, each individual possesses a personalized set of MHC molecules with different peptide-binding specificities, and collectively they present a unique individualized peptide imprint of the ongoing protein metabolism. Mapping all MHC allotypes is an enormous undertaking that cannot be achieved without a strong bioinformatics component. Computational tools for the prediction of peptide-MHC binding have thus become essential in most pipelines for T cell epitope discovery and an inescapable component of vaccine and cancer research. Here, we describe the development of several such tools, from pioneering efforts to the current state-of-the-art methods, that have allowed for accurate predictions of peptide binding of all MHC molecules, even including those that have not yet been characterized experimentally.
Collapse
Affiliation(s)
- Morten Nielsen
- Department of Health Technology, Technical University of Denmark, DK-2800 Kongens Lyngby, Denmark
- Instituto de Investigaciones Biotecnológicas, Universidad Nacional de San Martín, CP 1650 San Martin, Buenos Aires, Argentina
| | - Massimo Andreatta
- Instituto de Investigaciones Biotecnológicas, Universidad Nacional de San Martín, CP 1650 San Martin, Buenos Aires, Argentina
| | - Bjoern Peters
- Division of Vaccine Discovery, La Jolla Institute for Immunology, La Jolla, California 92037, USA
- Department of Medicine, University of California, San Diego, La Jolla, California 92093, USA
| | - Søren Buus
- Department of Immunology and Microbiology, Faculty of Health Sciences, University of Copenhagen, DK-2200 Copenhagen, Denmark
| |
Collapse
|
74
|
Barra C, Ackaert C, Reynisson B, Schockaert J, Jessen LE, Watson M, Jang A, Comtois-Marotte S, Goulet JP, Pattijn S, Paramithiotis E, Nielsen M. Immunopeptidomic Data Integration to Artificial Neural Networks Enhances Protein-Drug Immunogenicity Prediction. Front Immunol 2020; 11:1304. [PMID: 32655572 PMCID: PMC7325480 DOI: 10.3389/fimmu.2020.01304] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2020] [Accepted: 05/22/2020] [Indexed: 01/17/2023] Open
Abstract
Recombinant DNA technology has, in the last decades, contributed to a vast expansion of the use of protein drugs as pharmaceutical agents. However, such biological drugs can lead to the formation of anti-drug antibodies (ADAs) that may result in adverse effects, including allergic reactions and compromised therapeutic efficacy. Production of ADAs is most often associated with activation of CD4 T cell responses resulting from proteolysis of the biotherapeutic and loading of drug-specific peptides into major histocompatibility complex (MHC) class II on professional antigen-presenting cells. Recently, readouts from MHC-associated peptide proteomics (MAPPs) assays have been shown to correlate with the presence of CD4 T cell epitopes. However, the limited sensitivity of MAPPs challenges its use as an immunogenicity biomarker. In this work, MAPPs data was used to construct an artificial neural network (ANN) model for MHC class II antigen presentation. Using Infliximab and Rituximab as showcase stories, the model demonstrated an unprecedented performance for predicting MAPPs and CD4 T cell epitopes in the context of protein-drug immunogenicity, complementing results from MAPPs assays and outperforming conventional prediction models trained on binding affinity data.
Collapse
Affiliation(s)
- Carolina Barra
- Immunoinformatics and Machine Learning, DTU Health Technology, Danish Technical University, Lyngby, Denmark
| | | | - Birkir Reynisson
- Immunoinformatics and Machine Learning, DTU Health Technology, Danish Technical University, Lyngby, Denmark
| | | | - Leon Eyrich Jessen
- Immunoinformatics and Machine Learning, DTU Health Technology, Danish Technical University, Lyngby, Denmark
| | | | - Anne Jang
- Caprion Biosciences, Montreal, QC, Canada
| | | | | | | | | | - Morten Nielsen
- Immunoinformatics and Machine Learning, DTU Health Technology, Danish Technical University, Lyngby, Denmark
- IIBIO-UNSAM, Universidad Nacional de San Martin, Buenos Aires, Argentina
| |
Collapse
|
75
|
Uncovering the Tumor Antigen Landscape: What to Know about the Discovery Process. Cancers (Basel) 2020; 12:cancers12061660. [PMID: 32585818 PMCID: PMC7352969 DOI: 10.3390/cancers12061660] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2020] [Revised: 06/11/2020] [Accepted: 06/20/2020] [Indexed: 12/14/2022] Open
Abstract
According to the latest available data, cancer is the second leading cause of death, highlighting the need for novel cancer therapeutic approaches. In this context, immunotherapy is emerging as a reliable first-line treatment for many cancers, particularly metastatic melanoma. Indeed, cancer immunotherapy has attracted great interest following the recent clinical approval of antibodies targeting immune checkpoint molecules, such as PD-1, PD-L1, and CTLA-4, that release the brakes of the immune system, thus reviving a field otherwise poorly explored. Cancer immunotherapy mainly relies on the generation and stimulation of cytotoxic CD8 T lymphocytes (CTLs) within the tumor microenvironment (TME), priming T cells and establishing efficient and durable anti-tumor immunity. Therefore, there is a clear need to define and identify immunogenic T cell epitopes to use in therapeutic cancer vaccines. Naturally presented antigens in the human leucocyte antigen-1 (HLA-I) complex on the tumor surface are the main protagonists in evocating a specific anti-tumor CD8+ T cell response. However, the methodologies for their identification have been a major bottleneck for their reliable characterization. Consequently, the field of antigen discovery has yet to improve. The current review is intended to define what are today known as tumor antigens, with a main focus on CTL antigenic peptides. We also review the techniques developed and employed to date for antigen discovery, exploring both the direct elution of HLA-I peptides and the in silico prediction of epitopes. Finally, the last part of the review analyses the future challenges and direction of the antigen discovery field.
Collapse
|
76
|
Reynisson B, Barra C, Kaabinejadian S, Hildebrand WH, Peters B, Nielsen M. Improved Prediction of MHC II Antigen Presentation through Integration and Motif Deconvolution of Mass Spectrometry MHC Eluted Ligand Data. J Proteome Res 2020; 19:2304-2315. [PMID: 32308001 DOI: 10.1021/acs.jproteome.9b00874] [Citation(s) in RCA: 271] [Impact Index Per Article: 54.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Major histocompatibility complex II (MHC II) molecules play a vital role in the onset and control of cellular immunity. In a highly selective process, MHC II presents peptides derived from exogenous antigens on the surface of antigen-presenting cells for T cell scrutiny. Understanding the rules defining this presentation holds critical insights into the regulation and potential manipulation of the cellular immune system. Here, we apply the NNAlign_MA machine learning framework to analyze and integrate large-scale eluted MHC II ligand mass spectrometry (MS) data sets to advance prediction of CD4+ epitopes. NNAlign_MA allows integration of mixed data types, handling ligands with multiple potential allele annotations, encoding of ligand context, leveraging information between data sets, and has pan-specific power allowing accurate predictions outside the set of molecules included in the training data. Applying this framework, we identified accurate binding motifs of more than 50 MHC class II molecules described by MS data, particularly expanding coverage for DP and DQ beyond that obtained using current MS motif deconvolution techniques. Furthermore, in large-scale benchmarking, the final model termed NetMHCIIpan-4.0 demonstrated improved performance beyond current state-of-the-art predictors for ligand and CD4+ T cell epitope prediction. These results suggest that NNAlign_MA and NetMHCIIpan-4.0 are powerful tools for analysis of immunopeptidome MS data, prediction of T cell epitopes, and development of personalized immunotherapies.
Collapse
Affiliation(s)
- Birkir Reynisson
- Department of Health Technology, Technical University of Denmark, Lyngby 2800, Denmark
| | - Carolina Barra
- Department of Health Technology, Technical University of Denmark, Lyngby 2800, Denmark
| | | | - William H Hildebrand
- Department of Microbiology and Immunology, University of Oklahoma Health Sciences Center, Oklahoma City, Oklahoma 73104, United States
| | - Bjoern Peters
- Division of Vaccine Discovery, La Jolla Institute for Allergy and Immunology, La Jolla, California 92037, United States.,Department of Medicine, University of California, San Diego, San Diego, California 92093, United States
| | - Morten Nielsen
- Department of Health Technology, Technical University of Denmark, Lyngby 2800, Denmark.,Instituto de Investigaciones Biotecnológicas, Universidad Nacional de San Martín, San Martín CP1650, Argentina
| |
Collapse
|
77
|
Paul S, Grifoni A, Peters B, Sette A. Major Histocompatibility Complex Binding, Eluted Ligands, and Immunogenicity: Benchmark Testing and Predictions. Front Immunol 2020; 10:3151. [PMID: 32117208 PMCID: PMC7012937 DOI: 10.3389/fimmu.2019.03151] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2019] [Accepted: 12/30/2019] [Indexed: 01/01/2023] Open
Abstract
Antidrug antibody (ADA) responses impact drug safety, potency, and efficacy. It is generally assumed that ADA responses are associated with human leukocyte antigen (HLA) class II-restricted CD4+ T-cell reactivity. Although this review does not address ADA responses per se, the analysis presented here is relevant to the topic, because measuring or predicting CD4+ T-cell reactivity is a common strategy to address ADA and immunogenicity concerns. Because human CD4+ T-cell reactivity relies on the recognition of peptides bound to HLA class II, prediction, or measurement of the capacity of different peptides to bind or be natural ligands of HLA class II is used as a predictor of CD4+ T-cell reactivity and ADA development. Thus, three different interconnected variables are commonly utilized in predicting T-cell reactivity: major histocompatibility complex (MHC) binding, capacity to be generated as natural HLA ligands, and T-cell immunogenicity. To provide the scientific community with guidance in the relative merit of different approaches, it is necessary to clearly define what outcomes are being considered. Thus, the accuracy of HLA binding predictions varies as a function of what the outcome predicted is, whether it is binding itself, natural processing, or T-cell immunogenicity. Furthermore, it is necessary that the accuracy of prediction is based on rigorous benchmarking, grounded by fair, objective, transparent, and experimental criteria. In this review, we provide our perspective on how different variables and methodologies predict each of the various outcomes and point out knowledge gaps and areas to be addressed by further experimental work.
Collapse
Affiliation(s)
- Sinu Paul
- Division of Vaccine Discovery, La Jolla Institute for Immunology, La Jolla, CA, United States
| | - Alba Grifoni
- Division of Vaccine Discovery, La Jolla Institute for Immunology, La Jolla, CA, United States
| | - Bjoern Peters
- Division of Vaccine Discovery, La Jolla Institute for Immunology, La Jolla, CA, United States
- Department of Medicine, University of California, San Diego, San Diego, CA, United States
| | - Alessandro Sette
- Division of Vaccine Discovery, La Jolla Institute for Immunology, La Jolla, CA, United States
- Department of Medicine, University of California, San Diego, San Diego, CA, United States
| |
Collapse
|