1
|
Abrams ED, Basu A, Zavorka Thomas ME, Henrickson SE, Abraham RS. Expanding the diagnostic toolbox for complex genetic immune disorders. J Allergy Clin Immunol 2025; 155:255-274. [PMID: 39581295 DOI: 10.1016/j.jaci.2024.11.022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2024] [Revised: 10/29/2024] [Accepted: 11/12/2024] [Indexed: 11/26/2024]
Abstract
Laboratory-based immunology evaluation is essential to the diagnostic workup of patients with complex immune disorders, and is as essential, if not more so, depending on the context, as genetic testing, because it enables identification of aberrant pathways amenable to therapeutic intervention and clarifies variants of uncertain significance. There have been considerable advances in techniques and instrumentation in the clinical laboratory in the past 2 decades, although there are still "miles to go." One of the goals of the clinical laboratory is to ensure advanced diagnostic testing is widely accessible to physicians and thus patients, through reference laboratories, particularly in the context of academic medical centers. This ensures a greater likelihood of translating research discoveries into the diagnostic laboratory, on the basis of patient care needs rather than a sole emphasis on commercial utility. However, these advances are under threat from burdensome regulatory oversight that can compromise, at best, and curtail, at worst, the ability to rapidly diagnose rare immune disorders and ensure delivery of precision medicine. This review discusses the clinical utility of diagnostic immunology tools, beyond cellular immunophenotyping of lymphocyte subsets, which can be used in conjunction with clinical and other laboratory data for diagnosis as well as monitoring of therapeutic response in patients with genetic immunologic diseases.
Collapse
Affiliation(s)
- Eric D Abrams
- Division of Allergy and Immunology, Department of Pediatrics, Children's Hospital of Philadelphia, Philadelphia, Pa
| | - Amrita Basu
- Diagnostic Immunology Laboratory, Department of Pathology and Laboratory Medicine, Nationwide Children's Hospital, Columbus, Ohio
| | - Megan E Zavorka Thomas
- Diagnostic Immunology Laboratory, Department of Pathology and Laboratory Medicine, Nationwide Children's Hospital, Columbus, Ohio
| | - Sarah E Henrickson
- Division of Allergy and Immunology, Department of Pediatrics, Children's Hospital of Philadelphia, Philadelphia, Pa; Institute for Immunology and Immune Health, University of Pennsylvania, Philadelphia, Pa; Department of Microbiology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pa
| | - Roshini S Abraham
- Diagnostic Immunology Laboratory, Department of Pathology and Laboratory Medicine, Nationwide Children's Hospital, Columbus, Ohio.
| |
Collapse
|
2
|
Hanna SJ, Bonami RH, Corrie B, Westley M, Posgai AL, Luning Prak ET, Breden F, Michels AW, Brusko TM. The Type 1 Diabetes T Cell Receptor and B Cell Receptor Repository in the AIRR Data Commons: a practical guide for access, use and contributions through the Type 1 Diabetes AIRR Consortium. Diabetologia 2025; 68:186-202. [PMID: 39467874 PMCID: PMC11663175 DOI: 10.1007/s00125-024-06298-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/26/2024] [Accepted: 08/19/2024] [Indexed: 10/30/2024]
Abstract
Human molecular genetics has brought incredible insights into the variants that confer risk for the development of tissue-specific autoimmune diseases, including type 1 diabetes. The hallmark cell-mediated immune destruction that is characteristic of type 1 diabetes is closely linked with risk conferred by the HLA class II gene locus, in combination with a broad array of additional candidate genes influencing islet-resident beta cells within the pancreas, as well as function, phenotype and trafficking of immune cells to tissues. In addition to the well-studied germline SNP variants, there are critical contributions conferred by T cell receptor (TCR) and B cell receptor (BCR) genes that undergo somatic recombination to yield the Adaptive Immune Receptor Repertoire (AIRR) responsible for autoimmunity in type 1 diabetes. We therefore created the T1D TCR/BCR Repository (The Type 1 Diabetes T Cell Receptor and B Cell Receptor Repository) to study these highly variable and dynamic gene rearrangements. In addition to processed TCR and BCR sequences, the T1D TCR/BCR Repository includes detailed metadata (e.g. participant demographics, disease-associated parameters and tissue type). We introduce the Type 1 Diabetes AIRR Consortium goals and outline methods to use and deposit data to this comprehensive repository. Our ultimate goal is to facilitate research community access to rich, carefully annotated immune AIRR datasets to enable new scientific inquiry and insight into the natural history and pathogenesis of type 1 diabetes.
Collapse
MESH Headings
- Diabetes Mellitus, Type 1/immunology
- Diabetes Mellitus, Type 1/genetics
- Humans
- Receptors, Antigen, T-Cell/genetics
- Receptors, Antigen, T-Cell/metabolism
- Receptors, Antigen, B-Cell/genetics
- Receptors, Antigen, B-Cell/metabolism
- Autoimmunity
Collapse
Affiliation(s)
- Stephanie J Hanna
- Division of Infection and Immunity, Cardiff University School of Medicine, Cardiff, UK.
| | - Rachel H Bonami
- Department of Medicine, Division of Rheumatology and Immunology, Vanderbilt University Medical Center, Nashville, TN, USA
- Department of Pathology, Microbiology, and Immunology, Vanderbilt University Medical Center, Nashville, TN, USA
- Vanderbilt Center for Immunobiology, Nashville, TN, USA
- Vanderbilt Institute for Infection, Immunology, and Inflammation, Nashville, TN, USA
| | - Brian Corrie
- Department of Biological Sciences, Simon Fraser University, Burnaby, BC, Canada
- iReceptor Genomic Services, Summerland, BC, Canada
| | | | - Amanda L Posgai
- Department of Pathology, Immunology, and Laboratory Medicine, College of Medicine, Diabetes Institute, University of Florida, Gainesville, FL, USA
| | - Eline T Luning Prak
- Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Felix Breden
- Department of Biological Sciences, Simon Fraser University, Burnaby, BC, Canada
- iReceptor Genomic Services, Summerland, BC, Canada
| | - Aaron W Michels
- Barbara Davis Center for Diabetes, University of Colorado School of Medicine, Aurora, CO, USA.
| | - Todd M Brusko
- Department of Pathology, Immunology, and Laboratory Medicine, College of Medicine, Diabetes Institute, University of Florida, Gainesville, FL, USA.
- Department of Pediatrics, College of Medicine, Diabetes Institute, University of Florida, Gainesville, FL, USA.
- Department of Biochemistry and Molecular Biology, College of Medicine, Diabetes Institute, University of Florida, Gainesville, FL, USA.
| |
Collapse
|
3
|
Jagota M, Hsu C, Mazumder T, Sung K, DeWitt WS, Listgarten J, Matsen FA, Ye CJ, Song YS. Learning antibody sequence constraints from allelic inclusion. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.10.22.619760. [PMID: 39484623 PMCID: PMC11526943 DOI: 10.1101/2024.10.22.619760] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/03/2024]
Abstract
Antibodies and B-cell receptors (BCRs) are produced by B cells, and are built of a heavy chain and a light chain. Although each B cell could express two different heavy chains and four different light chains, usually only a unique pair of heavy chain and light chain is expressed-a phenomenon known as allelic exclusion. However, a small fraction of naive-B cells violate allelic exclusion by expressing two productive light chains, one of which has impaired function; this has been called allelic inclusion. We demonstrate that these B cells can be used to learn constraints on antibody sequence. Using large-scale single-cell sequencing data from humans, we find examples of light chain allelic inclusion in thousands of naive-B cells, which is an order of magnitude larger than existing datasets. We train machine learning models to identify the abnormal sequences in these cells. The resulting models correlate with antibody properties that they were not trained on, including polyreactivity, surface expression, and mutation usage in affinity maturation. These correlations are larger than what is achieved by existing antibody modeling approaches, indicating that allelic inclusion data contains useful new information. We also investigate the impact of similar selection forces on the heavy chain in mouse, and observe that pairing with the surrogate light chain significantly restricts heavy chain diversity.
Collapse
Affiliation(s)
- Milind Jagota
- Computer Science Division, UC Berkeley, Berkeley, CA USA
| | - Chloe Hsu
- Computer Science Division, UC Berkeley, Berkeley, CA USA
| | - Thomas Mazumder
- Division of Rheumatology, Department of Medicine, UCSF, San Francisco, CA, USA
| | - Kevin Sung
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
| | | | | | - Frederick A. Matsen
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
| | - Chun Jimmie Ye
- Division of Rheumatology, Department of Medicine, UCSF, San Francisco, CA, USA
- Chan Zuckerberg Biohub, San Francisco, CA, USA
- Parker Institute for Cancer Immunotherapy, UCSF, San Francisco, CA, USA
- Institute for Human Genetics, UCSF, San Francisco, CA, USA
- Bakar Computational Health Sciences Institute, UCSF, San Francisco, California, USA
- Department of Epidemiology and Biostatistics, UCSF, San Francisco, CA, USA
| | - Yun S. Song
- Computer Science Division, UC Berkeley, Berkeley, CA USA
- Department of Statistics, UC Berkeley, Berkeley, CA, USA October 23, 2024
| |
Collapse
|
4
|
Wang M, Patsenker J, Li H, Kluger Y, Kleinstein SH. Supervised fine-tuning of pre-trained antibody language models improves antigen specificity prediction. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.13.593807. [PMID: 38798340 PMCID: PMC11118465 DOI: 10.1101/2024.05.13.593807] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2024]
Abstract
Antibodies play a crucial role in adaptive immune responses by determining B cell specificity to antigens and focusing immune function on target pathogens. Accurate prediction of antibody-antigen specificity directly from antibody sequencing data would be a great aid in understanding immune responses, guiding vaccine design, and developing antibody-based therapeutics. In this study, we present a method of supervised fine-tuning for antibody language models, which improves on previous results in binding specificity prediction to SARS-CoV-2 spike protein and influenza hemagglutinin. We perform supervised fine-tuning on four pre-trained antibody language models to predict specificity to these antigens and demonstrate that fine-tuned language model classifiers exhibit enhanced predictive accuracy compared to classifiers trained on pre-trained model embeddings. The change of model attention activations after supervised fine-tuning suggested that this performance was driven by an increased model focus on the complementarity determining regions (CDRs). Application of the supervised fine-tuned models to BCR repertoire data demonstrated that these models could recognize the specific responses elicited by influenza and SARS-CoV-2 vaccination. Overall, our study highlights the benefits of supervised fine-tuning on pre-trained antibody language models as a mechanism to improve antigen specificity prediction.
Collapse
Affiliation(s)
- Meng Wang
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, United States of America
| | - Jonathan Patsenker
- Program in Applied Mathematics, Yale University, New Haven, Connecticut, United States of America
| | - Henry Li
- Program in Applied Mathematics, Yale University, New Haven, Connecticut, United States of America
| | - Yuval Kluger
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, United States of America
- Program in Applied Mathematics, Yale University, New Haven, Connecticut, United States of America
- Department of Pathology, Yale School of Medicine, New Haven, Connecticut, United States of America
| | - Steven H Kleinstein
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, United States of America
- Department of Pathology, Yale School of Medicine, New Haven, Connecticut, United States of America
- Department of Immunobiology, Yale School of Medicine, New Haven, Connecticut, United States of America
| |
Collapse
|
5
|
Barton J, Gaspariunas A, Galson JD, Leem J. Building Representation Learning Models for Antibody Comprehension. Cold Spring Harb Perspect Biol 2024; 16:a041462. [PMID: 38012013 PMCID: PMC10910360 DOI: 10.1101/cshperspect.a041462] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]
Abstract
Antibodies are versatile proteins with both the capacity to bind a broad range of targets and a proven track record as some of the most successful therapeutics. However, the development of novel antibody therapeutics is a lengthy and costly process. It is challenging to predict the functional and biophysical properties of antibodies from their amino acid sequence alone, requiring numerous experiments for full characterization. Machine learning, specifically deep representation learning, has emerged as a family of methods that can complement wet lab approaches and accelerate the overall discovery and engineering process. Here, we review advances in antibody sequence representation learning, and how this has improved antibody structure prediction and facilitated antibody optimization. We discuss challenges in the development and implementation of such models, such as the lack of publicly available, well-curated antibody function data and highlight opportunities for improvement. These and future advances in machine learning for antibody sequences have the potential to increase the success rate in developing new therapeutics, resulting in broader access to transformative medicines and improved patient outcomes.
Collapse
Affiliation(s)
- Justin Barton
- Alchemab Therapeutics Ltd, London N1C 4AX, United Kingdom
| | | | - Jacob D Galson
- Alchemab Therapeutics Ltd, London N1C 4AX, United Kingdom
| | - Jinwoo Leem
- Alchemab Therapeutics Ltd, London N1C 4AX, United Kingdom
| |
Collapse
|
6
|
Mikelov A, Nefediev G, Tashkeev A, Rodriguez OL, Ortmans DA, Skatova V, Izraelson M, Davydov A, Poslavsky S, Rahmouni S, Watson CT, Chudakov D, Boyd SD, Bolotin D. Ultrasensitive allele inference from immune repertoire sequencing data with MiXCR. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.10.561703. [PMID: 38014266 PMCID: PMC10680553 DOI: 10.1101/2023.10.10.561703] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]
Abstract
Allelic variability in the adaptive immune receptor loci, which harbor the gene segments that encode B cell and T cell receptors (BCR/TCR), has been shown to be of critical importance for immune responses to pathogens and vaccines. In recent years, B cell and T cell receptor repertoire sequencing (Rep-Seq) has become widespread in immunology research making it the most readily available source of information about allelic diversity in immunoglobulin (IG) and T cell receptor (TR) loci in different populations. Here we present a novel algorithm for extra-sensitive and specific variable (V) and joining (J) gene allele inference and genotyping allowing reconstruction of individual high-quality gene segment libraries. The approach can be applied for inferring allelic variants from peripheral blood lymphocyte BCR and TCR repertoire sequencing data, including hypermutated isotype-switched BCR sequences, thus allowing high-throughput genotyping and novel allele discovery from a wide variety of existing datasets. The developed algorithm is a part of the MiXCR software ( https://mixcr.com ) and can be incorporated into any pipeline utilizing upstream processing with MiXCR. We demonstrate the accuracy of this approach using Rep-Seq paired with long-read genomic sequencing data, comparing it to a widely used algorithm, TIgGER. We applied the algorithm to a large set of IG heavy chain (IGH) Rep-Seq data from 450 donors of ancestrally diverse population groups, and to the largest reported full-length TCR alpha and beta chain (TRA; TRB) Rep-Seq dataset, representing 134 individuals. This allowed us to assess the genetic diversity of genes within the IGH, TRA and TRB loci in different populations and demonstrate the connection between antibody repertoire gene usage and the number of allelic variants present in the population. Finally we established a database of allelic variants of V and J genes inferred from Rep-Seq data and their population frequencies with free public access at https://vdj.online .
Collapse
|
7
|
Nagano Y, Chain B. tidytcells: standardizer for TR/MH nomenclature. Front Immunol 2023; 14:1276106. [PMID: 37954585 PMCID: PMC10634431 DOI: 10.3389/fimmu.2023.1276106] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Accepted: 10/09/2023] [Indexed: 11/14/2023] Open
Abstract
T cell receptors (TR) underpin the diversity and specificity of T cell activity. As such, TR repertoire data is valuable both as an adaptive immune biomarker, and as a way to identify candidate therapeutic TR. Analysis of TR repertoires relies heavily on computational analysis, and therefore it is of vital importance that the data is standardized and computer-readable. However in practice, the usage of different abbreviations and non-standard nomenclature in different datasets makes this data pre-processing non-trivial. tidytcells is a lightweight, platform-independent Python package that provides easy-to-use standardization tools specifically designed for TR nomenclature. The software is open-sourced under the MIT license and is available to install from the Python Package Index (PyPI). At the time of publishing, tidytcells is on version 2.0.0.
Collapse
Affiliation(s)
- Yuta Nagano
- Division of Medicine, Faculty of Medical Scienecs, University College London (UCL), London, United Kingdom
| | - Benjamin Chain
- Division of Infection and Immunity, Faculty of Medical Sciences, University College London (UCL), London, United Kingdom
| |
Collapse
|
8
|
Zhao Y, He B, Xu F, Li C, Xu Z, Su X, He H, Huang Y, Rossjohn J, Song J, Yao J. DeepAIR: A deep learning framework for effective integration of sequence and 3D structure to enable adaptive immune receptor analysis. SCIENCE ADVANCES 2023; 9:eabo5128. [PMID: 37556545 PMCID: PMC10411891 DOI: 10.1126/sciadv.abo5128] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Accepted: 07/06/2023] [Indexed: 08/11/2023]
Abstract
Structural docking between the adaptive immune receptors (AIRs), including T cell receptors (TCRs) and B cell receptors (BCRs), and their cognate antigens are one of the most fundamental processes in adaptive immunity. However, current methods for predicting AIR-antigen binding largely rely on sequence-derived features of AIRs, omitting the structure features that are essential for binding affinity. In this study, we present a deep learning framework, termed DeepAIR, for the accurate prediction of AIR-antigen binding by integrating both sequence and structure features of AIRs. DeepAIR achieves a Pearson's correlation of 0.813 in predicting the binding affinity of TCR, and a median area under the receiver-operating characteristic curve (AUC) of 0.904 and 0.942 in predicting the binding reactivity of TCR and BCR, respectively. Meanwhile, using TCR and BCR repertoire, DeepAIR correctly identifies every patient with nasopharyngeal carcinoma and inflammatory bowel disease in test data. Thus, DeepAIR improves the AIR-antigen binding prediction that facilitates the study of adaptive immunity.
Collapse
Affiliation(s)
- Yu Zhao
- AI Lab, Tencent, Shenzhen, China
| | - Bing He
- AI Lab, Tencent, Shenzhen, China
| | - Fan Xu
- AI Lab, Tencent, Shenzhen, China
| | - Chen Li
- Biomedicine Discovery Institute and Monash Data Futures Institute, Monash University, Melbourne, VIC 3800, Australia
| | | | | | | | | | - Jamie Rossjohn
- Infection and Immunity Program and Department of Biochemistry and Molecular Biology, Biomedicine Discovery Institute, Monash University, Melbourne, VIC 3800, Australia
- Institute of Infection and Immunity, Cardiff University School of Medicine, Heath Park, Cardiff, UK
| | - Jiangning Song
- AI Lab, Tencent, Shenzhen, China
- Biomedicine Discovery Institute and Monash Data Futures Institute, Monash University, Melbourne, VIC 3800, Australia
| | | |
Collapse
|