1
|
Voss K, Kaur KM, Banerjee R, Breden F, Pennell M. Applying phylogenetic methods for species delimitation to distinguish B-cell clonal families. Front Immunol 2024; 15:1505032. [PMID: 39687606 PMCID: PMC11646844 DOI: 10.3389/fimmu.2024.1505032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2024] [Accepted: 11/07/2024] [Indexed: 12/18/2024] Open
Abstract
The adaptive immune system generates a diverse array of B-cell receptors through the processes of V(D)J recombination and somatic hypermutation. B-cell receptors that bind to an antigen will undergo clonal expansion, creating a Darwinian evolutionary dynamic within individuals. A key step in studying these dynamics is to identify sequences derived from the same ancestral V(D)J recombination event (i.e. a clonal family). There are a number of widely used methods for accomplishing this task but a major limitation of all of them is that they rely, at least in part, on the ability to map sequences to a germline reference set. This requirement is particularly problematic in non-model systems where we often know little about the germline allelic diversity in the study population. Recognizing that delimiting B-cell clonal families is analogous to delimiting species from single locus data, we propose a novel strategy of reconstructing the phylogenetic tree of all B-cell sequences in a sample and using a popular species delimitation method, multi-rate Poisson Tree Processes (mPTP), to delimit clonal families. Using extensive simulations, we show that not only does this phylogenetically explicit approach perform well for the purpose of delimiting clonal families when no reference allele set is available, it performs similarly to state-of-the-art techniques developed specifically for B-cell data even when we have a complete reference allele set. Additionally, our analysis of an empirical dataset shows that mPTP performs similarly to leading methods in the field. These findings demonstrate the utility of using off-the-shelf phylogenetic techniques for analyzing B-cell clonal dynamics in non-model systems, and suggests that phylogenetic inference techniques may be potentially combined with mapping based approaches for even more robust inferences, even in model systems.
Collapse
Affiliation(s)
- Katalin Voss
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, United States
| | - Katrina M. Kaur
- Department of Zoology, University of British Columbia, Vancouver, BC, Canada
| | - Rituparna Banerjee
- Bioinformatics Graduate Program, Faculty of Science, University of British Columbia, Vancouver, BC, Canada
| | - Felix Breden
- Department of Biological Sciences, Simon Fraser University, Burnaby, BC, Canada
| | - Matt Pennell
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, United States
- Department of Biological Sciences, University of Southern California, Los Angeles, CA, United States
| |
Collapse
|
2
|
Sung K, Johnson MM, Dumm W, Simon N, Haddox H, Fukuyama J, Matsen FA. Thrifty wide-context models of B cell receptor somatic hypermutation. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.11.26.625407. [PMID: 39651125 PMCID: PMC11623647 DOI: 10.1101/2024.11.26.625407] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/11/2024]
Abstract
Somatic hypermutation (SHM) is the diversity-generating process in antibody affinity maturation. Probabilistic models of SHM are needed for analyzing rare mutations, for understanding the selective forces guiding affinity maturation, and for understanding the underlying biochemical process. High throughput data offers the potential to develop and fit models of SHM on relevant data sets. In this paper we model SHM using modern frameworks. We are motivated by recent work suggesting the importance of a wider context for SHM, however, assigning an independent rate to each k-mer leads to an exponential proliferation of parameters. Thus, using convolutions on 3-mer embeddings, we develop "thrifty" models of SHM that have fewer free parameters than a 5-mer model and yet have a significantly wider context. These offer a slight performance improvement over a 5-mer model. We also find that a per-site effect is not necessary to explain SHM patterns given nucleotide context. Also, the two current methods for fitting an SHM model - on out-of-frame sequence data and on synonymous mutations - produce significantly different results, and augmenting out-of-frame data with synonymous mutations does not aid out-of-sample performance.
Collapse
|
3
|
Jagota M, Hsu C, Mazumder T, Sung K, DeWitt WS, Listgarten J, Matsen FA, Ye CJ, Song YS. Learning antibody sequence constraints from allelic inclusion. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.10.22.619760. [PMID: 39484623 PMCID: PMC11526943 DOI: 10.1101/2024.10.22.619760] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/03/2024]
Abstract
Antibodies and B-cell receptors (BCRs) are produced by B cells, and are built of a heavy chain and a light chain. Although each B cell could express two different heavy chains and four different light chains, usually only a unique pair of heavy chain and light chain is expressed-a phenomenon known as allelic exclusion. However, a small fraction of naive-B cells violate allelic exclusion by expressing two productive light chains, one of which has impaired function; this has been called allelic inclusion. We demonstrate that these B cells can be used to learn constraints on antibody sequence. Using large-scale single-cell sequencing data from humans, we find examples of light chain allelic inclusion in thousands of naive-B cells, which is an order of magnitude larger than existing datasets. We train machine learning models to identify the abnormal sequences in these cells. The resulting models correlate with antibody properties that they were not trained on, including polyreactivity, surface expression, and mutation usage in affinity maturation. These correlations are larger than what is achieved by existing antibody modeling approaches, indicating that allelic inclusion data contains useful new information. We also investigate the impact of similar selection forces on the heavy chain in mouse, and observe that pairing with the surrogate light chain significantly restricts heavy chain diversity.
Collapse
Affiliation(s)
- Milind Jagota
- Computer Science Division, UC Berkeley, Berkeley, CA USA
| | - Chloe Hsu
- Computer Science Division, UC Berkeley, Berkeley, CA USA
| | - Thomas Mazumder
- Division of Rheumatology, Department of Medicine, UCSF, San Francisco, CA, USA
| | - Kevin Sung
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
| | | | | | - Frederick A. Matsen
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
| | - Chun Jimmie Ye
- Division of Rheumatology, Department of Medicine, UCSF, San Francisco, CA, USA
- Chan Zuckerberg Biohub, San Francisco, CA, USA
- Parker Institute for Cancer Immunotherapy, UCSF, San Francisco, CA, USA
- Institute for Human Genetics, UCSF, San Francisco, CA, USA
- Bakar Computational Health Sciences Institute, UCSF, San Francisco, California, USA
- Department of Epidemiology and Biostatistics, UCSF, San Francisco, CA, USA
| | - Yun S. Song
- Computer Science Division, UC Berkeley, Berkeley, CA USA
- Department of Statistics, UC Berkeley, Berkeley, CA, USA October 23, 2024
| |
Collapse
|
4
|
Ruiz F, Foreman WB, Lilly M, Baharani VA, Depierreux DM, Chohan V, Taylor AL, Guenthoer J, Ralph D, Matsen IV FA, Chu HY, Bieniasz PD, Côté M, Starr TN, Overbaugh J. Delineating the functional activity of antibodies with cross-reactivity to SARS-CoV-2, SARS-CoV-1 and related sarbecoviruses. PLoS Pathog 2024; 20:e1012650. [PMID: 39466880 PMCID: PMC11542851 DOI: 10.1371/journal.ppat.1012650] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2024] [Revised: 11/07/2024] [Accepted: 10/09/2024] [Indexed: 10/30/2024] Open
Abstract
The recurring spillover of pathogenic coronaviruses and demonstrated capacity of sarbecoviruses, such SARS-CoV-2, to rapidly evolve in humans underscores the need to better understand immune responses to this virus family. For this purpose, we characterized the functional breadth and potency of antibodies targeting the receptor binding domain (RBD) of the spike glycoprotein that exhibited cross-reactivity against SARS-CoV-2 variants, SARS-CoV-1 and sarbecoviruses from diverse clades and animal origins with spillover potential. One neutralizing antibody, C68.61, showed remarkable neutralization breadth against both SARS-CoV-2 variants and viruses from different sarbecovirus clades. C68.61, which targets a conserved RBD class 5 epitope, did not select for escape variants of SARS-CoV-2 or SARS-CoV-1 in culture nor have predicted escape variants among circulating SARS-CoV-2 strains, suggesting this epitope is functionally constrained. We identified 11 additional SARS-CoV-2/SARS-CoV-1 cross-reactive antibodies that target the more sequence conserved class 4 and class 5 epitopes within RBD that show activity against a subset of diverse sarbecoviruses with one antibody binding every single sarbecovirus RBD tested. A subset of these antibodies exhibited Fc-mediated effector functions as potent as antibodies that impact infection outcome in animal models. Thus, our study identified antibodies targeting conserved regions across SARS-CoV-2 variants and sarbecoviruses that may serve as therapeutics for pandemic preparedness as well as blueprints for the design of immunogens capable of eliciting cross-neutralizing responses.
Collapse
Affiliation(s)
- Felicitas Ruiz
- Human Biology Division, Fred Hutchinson Cancer Center, Seattle, Washington, United States of America
- Molecular and Cellular Biology Graduate Program, University of Washington, Seattle, Washington, United States of America
| | - William B. Foreman
- Department of Biochemistry, University of Utah School of Medicine, Salt Lake City, Utah, United States of America
| | - Michelle Lilly
- Human Biology Division, Fred Hutchinson Cancer Center, Seattle, Washington, United States of America
| | - Viren A. Baharani
- Laboratory of Retrovirology, The Rockefeller University, New York, New York, United States of America
- Laboratory of Molecular Immunology, The Rockefeller University, New York, New York, United States of America
| | - Delphine M. Depierreux
- Human Biology Division, Fred Hutchinson Cancer Center, Seattle, Washington, United States of America
| | - Vrasha Chohan
- Human Biology Division, Fred Hutchinson Cancer Center, Seattle, Washington, United States of America
| | - Ashley L. Taylor
- Department of Biochemistry, University of Utah School of Medicine, Salt Lake City, Utah, United States of America
| | - Jamie Guenthoer
- Human Biology Division, Fred Hutchinson Cancer Center, Seattle, Washington, United States of America
| | - Duncan Ralph
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, Washington, United States of America
| | - Frederick A. Matsen IV
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, Washington, United States of America
- Howard Hughes Medical Institute, Seattle, Washington, United States of America
| | - Helen Y. Chu
- Division of Allergy and Infectious Diseases, University of Washington, Seattle, Washington, United States of America
| | - Paul D. Bieniasz
- Laboratory of Retrovirology, The Rockefeller University, New York, New York, United States of America
- Howard Hughes Medical Institute, The Rockefeller University, New York, New York, United States of America
| | - Marceline Côté
- Department of Biochemistry, Microbiology and Immunology, and Centre for Infection, Immunity and Inflammation, University of Ottawa, Ottawa, Canada
| | - Tyler N. Starr
- Department of Biochemistry, University of Utah School of Medicine, Salt Lake City, Utah, United States of America
| | - Julie Overbaugh
- Human Biology Division, Fred Hutchinson Cancer Center, Seattle, Washington, United States of America
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, Washington, United States of America
| |
Collapse
|
5
|
Abbate MF, Dupic T, Vigne E, Shahsavarian MA, Walczak AM, Mora T. Computational detection of antigen-specific B cell receptors following immunization. Proc Natl Acad Sci U S A 2024; 121:e2401058121. [PMID: 39163333 PMCID: PMC11363332 DOI: 10.1073/pnas.2401058121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2024] [Accepted: 07/10/2024] [Indexed: 08/22/2024] Open
Abstract
B cell receptors (BCRs) play a crucial role in recognizing and fighting foreign antigens. High-throughput sequencing enables in-depth sampling of the BCRs repertoire after immunization. However, only a minor fraction of BCRs actively participate in any given infection. To what extent can we accurately identify antigen-specific sequences directly from BCRs repertoires? We present a computational method grounded on sequence similarity, aimed at identifying statistically significant responsive BCRs. This method leverages well-known characteristics of affinity maturation and expected diversity. We validate its effectiveness using longitudinally sampled human immune repertoire data following influenza vaccination and SARS-CoV-2 infections. We show that different lineages converge to the same responding Complementarity Determining Region 3, demonstrating convergent selection within an individual. The outcomes of this method hold promise for application in vaccine development, personalized medicine, and antibody-derived therapeutics.
Collapse
Affiliation(s)
- Maria Francesca Abbate
- Laboratoire de physique de l’École normale supérieure, CNRS, Paris Sciences et Lettres University, Sorbonne Université, and Université Paris-Cité, Paris75005, France
- Large Molecule Research, Sanofi, Vitry-sur-Seine94 400, France
| | - Thomas Dupic
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA02138
| | | | | | - Aleksandra M. Walczak
- Laboratoire de physique de l’École normale supérieure, CNRS, Paris Sciences et Lettres University, Sorbonne Université, and Université Paris-Cité, Paris75005, France
| | - Thierry Mora
- Laboratoire de physique de l’École normale supérieure, CNRS, Paris Sciences et Lettres University, Sorbonne Université, and Université Paris-Cité, Paris75005, France
| |
Collapse
|
6
|
Spisak N, Athènes G, Dupic T, Mora T, Walczak AM. Combining mutation and recombination statistics to infer clonal families in antibody repertoires. eLife 2024; 13:e86181. [PMID: 39120133 PMCID: PMC11441979 DOI: 10.7554/elife.86181] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2023] [Accepted: 07/22/2024] [Indexed: 08/10/2024] Open
Abstract
B-cell repertoires are characterized by a diverse set of receptors of distinct specificities generated through two processes of somatic diversification: V(D)J recombination and somatic hypermutations. B-cell clonal families stem from the same V(D)J recombination event, but differ in their hypermutations. Clonal families identification is key to understanding B-cell repertoire function, evolution, and dynamics. We present HILARy (high-precision inference of lineages in antibody repertoires), an efficient, fast, and precise method to identify clonal families from single- or paired-chain repertoire sequencing datasets. HILARy combines probabilistic models that capture the receptor generation and selection statistics with adapted clustering methods to achieve consistently high inference accuracy. It automatically leverages the phylogenetic signal of shared mutations in difficult repertoire subsets. Exploiting the high sensitivity of the method, we find the statistics of evolutionary properties such as the site frequency spectrum and dN/dS ratio do not depend on the junction length. We also identify a broad range of selection pressures spanning two orders of magnitude.
Collapse
Affiliation(s)
- Natanael Spisak
- Laboratoire de physique de l’École normale supérieure, CNRS, PSL University, Sorbonne Université and Université de ParisParisFrance
| | - Gabriel Athènes
- Laboratoire de physique de l’École normale supérieure, CNRS, PSL University, Sorbonne Université and Université de ParisParisFrance
- Saber Bio SAS, Institut du Cerveau, iPEPS The Healthtech HubParisFrance
| | - Thomas Dupic
- Department of Organismic and Evolutionary Biology, Harvard UniversityCambridgeUnited States
| | - Thierry Mora
- Laboratoire de physique de l’École normale supérieure, CNRS, PSL University, Sorbonne Université and Université de ParisParisFrance
| | - Aleksandra M Walczak
- Laboratoire de physique de l’École normale supérieure, CNRS, PSL University, Sorbonne Université and Université de ParisParisFrance
| |
Collapse
|
7
|
Guenthoer J, Garrett ME, Lilly M, Depierreux DM, Ruiz F, Chi M, Stoddard CI, Chohan V, Yaffe ZA, Sung K, Ralph D, Chu HY, Matsen FA, Overbaugh J. The S2 subunit of spike encodes diverse targets for functional antibody responses to SARS-CoV-2. PLoS Pathog 2024; 20:e1012383. [PMID: 39093891 PMCID: PMC11324185 DOI: 10.1371/journal.ppat.1012383] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2024] [Revised: 08/14/2024] [Accepted: 07/01/2024] [Indexed: 08/04/2024] Open
Abstract
The SARS-CoV-2 virus responsible for the COVID-19 global pandemic has exhibited a striking capacity for viral evolution that drives continued evasion from vaccine and infection-induced immune responses. Mutations in the receptor binding domain of the S1 subunit of the spike glycoprotein have led to considerable escape from antibody responses, reducing the efficacy of vaccines and monoclonal antibody (mAb) therapies. Therefore, there is a need to interrogate more constrained regions of spike, such as the S2 subdomain. Here, we present a collection of S2 mAbs from two SARS-CoV-2 convalescent individuals that target multiple regions in S2, including regions outside of those commonly reported. One of the S2 mAbs, C20.119, which bound to a highly conserved epitope in the fusion peptide, was able to broadly neutralize across SARS-CoV-2 variants, SARS-CoV-1, and closely related zoonotic sarbecoviruses. The majority of the mAbs were non-neutralizing; however, many of them could mediate antibody-dependent cellular cytotoxicity (ADCC) at levels similar to the S1-targeting mAb S309 that was previously authorized for treatment of SARS-CoV-2 infections. Several of the mAbs with ADCC function also bound to spike trimers from other human coronaviruses (HCoVs), such as MERS-CoV and HCoV-HKU1. Our findings suggest S2 mAbs can target diverse epitopes in S2, including functional mAbs with HCoV and sarbecovirus breadth that likely target functionally constrained regions of spike. These mAbs could be developed for potential future pandemics, while also providing insight into ideal epitopes for eliciting a broad HCoV response.
Collapse
Affiliation(s)
- Jamie Guenthoer
- Human Biology Division, Fred Hutchinson Cancer Center, Seattle, Washington, United States of America
| | - Meghan E. Garrett
- Human Biology Division, Fred Hutchinson Cancer Center, Seattle, Washington, United States of America
| | - Michelle Lilly
- Human Biology Division, Fred Hutchinson Cancer Center, Seattle, Washington, United States of America
| | - Delphine M. Depierreux
- Human Biology Division, Fred Hutchinson Cancer Center, Seattle, Washington, United States of America
| | - Felicitas Ruiz
- Human Biology Division, Fred Hutchinson Cancer Center, Seattle, Washington, United States of America
| | - Margaret Chi
- Human Biology Division, Fred Hutchinson Cancer Center, Seattle, Washington, United States of America
| | - Caitlin I. Stoddard
- Human Biology Division, Fred Hutchinson Cancer Center, Seattle, Washington, United States of America
| | - Vrasha Chohan
- Human Biology Division, Fred Hutchinson Cancer Center, Seattle, Washington, United States of America
| | - Zak A. Yaffe
- Human Biology Division, Fred Hutchinson Cancer Center, Seattle, Washington, United States of America
| | - Kevin Sung
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, Washington, United States of America
| | - Duncan Ralph
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, Washington, United States of America
| | - Helen Y. Chu
- Division of Allergy and Infectious Diseases, University of Washington, Seattle, Washington, United States of America
| | - Frederick A. Matsen
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, Washington, United States of America
- Howard Hughes Medical Institute, Seattle, Washington, United States of America
| | - Julie Overbaugh
- Human Biology Division, Fred Hutchinson Cancer Center, Seattle, Washington, United States of America
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, Washington, United States of America
| |
Collapse
|
8
|
Wang K, Cai L, Wang H, Shan S, Hu X, Zhang J. Protocol for fast clonal family inference and analysis from large-scale B cell receptor repertoire sequencing data. STAR Protoc 2024; 5:102969. [PMID: 38502687 PMCID: PMC10963638 DOI: 10.1016/j.xpro.2024.102969] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Revised: 01/26/2024] [Accepted: 03/03/2024] [Indexed: 03/21/2024] Open
Abstract
The expeditious identification and comprehensive analysis of clonal families from extensive B cell receptor (BCR) repertoire sequencing data are imperative for elucidating the intricacies of B cell immune responses. Here, we introduce a computational pipeline designed to swiftly deduce clonal families from bulk BCR heavy-chain sequencing data, accompanied by a suite of functional modules tailored to streamline post-clustering analysis. The outlined methodology encompasses guidelines for software installation, meticulous data preparation, and the systematic inference and analysis of clonal families. For complete details on the use and execution of this protocol, please refer to Wang et al.1.
Collapse
Affiliation(s)
- Kaixuan Wang
- Academy of Medical Engineering and Translational Medicine, Tianjin University, Tianjin, China
| | - Linru Cai
- Academy of Medical Engineering and Translational Medicine, Tianjin University, Tianjin, China
| | - Hao Wang
- Academy of Medical Engineering and Translational Medicine, Tianjin University, Tianjin, China; Georgia Tech Shenzhen Institute (GTSI), Tianjin University, Shenzhen, Guangdong, China
| | - Shiwen Shan
- Academy of Medical Engineering and Translational Medicine, Tianjin University, Tianjin, China
| | - Xihao Hu
- GV20 Therapeutics, Cambridge, MA, USA
| | - Jian Zhang
- Academy of Medical Engineering and Translational Medicine, Tianjin University, Tianjin, China.
| |
Collapse
|
9
|
Voss K, Kaur KM, Banerjee R, Breden F, Pennell M. Evaluating methods for B-cell clonal family assignment. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.29.596491. [PMID: 38853833 PMCID: PMC11160721 DOI: 10.1101/2024.05.29.596491] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/11/2024]
Abstract
The adaptive immune response relies on a diverse repertoire of B-cell receptors, each of which is characterized by a distinct sequence resulting from VDJ-recombination. Upon binding to an antigen, B-cells undergo clonal expansion and in a process unique to B-cells the overall binding affinity of the repertoire is further enhanced by somatic hypermutations in the receptor sequence. For B-cell repertoires it is therefore particularly important to analyze the dynamics of clonal expansion and patterns of somatic hypermutations and thus it is necessary to group the sequences into distinct clones to determine the number and identity of expanding clonal families responding to an antigen. Multiple methods are currently used to identify clones from sequences, employing distinct approaches to the problem. Until now there has not been an extensive comparison of how well these methods perform under the same conditions. Furthermore, since this is fundamentally a phylogenetics problem, we speculated that the mPTP method, which delimits species based on an analysis of changes in the underlying process of diversification, might perform as well as or better than existing methods. Here we conducted extensive simulations of B-cell repertoires under a diverse set of conditions and studied errors in clonal assignment and in downstream ancestral state reconstruction. We demonstrated that SCOPer-H consistently yielded superior results across parameters. However, this approach relies on a good reference assembly for the germline immunoglobulin genes which is lacking for many species. Using mPTP had lower error rates than tailor-made immunogenetic methods and should therefore be considered by researchers studying antibody evolution in non-model organisms without a reference genome.
Collapse
Affiliation(s)
- Katalin Voss
- Department of Quantitative and Computational Biology, University of Southern California, USA
| | - Katrina M. Kaur
- Department of Zoology, University of British Columbia, Canada
| | - Rituparna Banerjee
- Bioinformatics Graduate Program, Faculty of Science, University of British Columbia, Canada
| | - Felix Breden
- Department of Biological Sciences, Simon Fraser University, Canada
| | - Matt Pennell
- Department of Quantitative and Computational Biology, University of Southern California, USA
- Department of Biological Sciences, University of Southern California, USA
| |
Collapse
|
10
|
Balashova D, van Schaik BDC, Stratigopoulou M, Guikema JEJ, Caniels TG, Claireaux M, van Gils MJ, Musters A, Anang DC, de Vries N, Greiff V, van Kampen AHC. Systematic evaluation of B-cell clonal family inference approaches. BMC Immunol 2024; 25:13. [PMID: 38331731 PMCID: PMC11370117 DOI: 10.1186/s12865-024-00600-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2023] [Accepted: 01/18/2024] [Indexed: 02/10/2024] Open
Abstract
The reconstruction of clonal families (CFs) in B-cell receptor (BCR) repertoire analysis is a crucial step to understand the adaptive immune system and how it responds to antigens. The BCR repertoire of an individual is formed throughout life and is diverse due to several factors such as gene recombination and somatic hypermutation. The use of Adaptive Immune Receptor Repertoire sequencing (AIRR-seq) using next generation sequencing enabled the generation of full BCR repertoires that also include rare CFs. The reconstruction of CFs from AIRR-seq data is challenging and several approaches have been developed to solve this problem. Currently, most methods use the heavy chain (HC) only, as it is more variable than the light chain (LC). CF reconstruction options include the definition of appropriate sequence similarity measures, the use of shared mutations among sequences, and the possibility of reconstruction without preliminary clustering based on V- and J-gene annotation. In this study, we aimed to systematically evaluate different approaches for CF reconstruction and to determine their impact on various outcome measures such as the number of CFs derived, the size of the CFs, and the accuracy of the reconstruction. The methods were compared to each other and to a method that groups sequences based on identical junction sequences and another method that only determines subclones. We found that after accounting for data set variability, in particular sequencing depth and mutation load, the reconstruction approach has an impact on part of the outcome measures, including the number of CFs. Simulations indicate that unique junctions and subclones should not be used as substitutes for CF and that more complex methods do not outperform simpler methods. Also, we conclude that different approaches differ in their ability to correctly reconstruct CFs when not considering the LC and to identify shared CFs. The results showed the effect of different approaches on the reconstruction of CFs and highlighted the importance of choosing an appropriate method.
Collapse
Affiliation(s)
- Daria Balashova
- Amsterdam UMC location University of Amsterdam, Epidemiology and Data Science, Meibergdreef 9, Amsterdam, Netherlands
- Amsterdam Public Health, Methodology, Amsterdam, The Netherlands
- Amsterdam Infection and Immunity, Inflammatory Diseases, Amsterdam, The Netherlands
| | - Barbera D C van Schaik
- Amsterdam UMC location University of Amsterdam, Epidemiology and Data Science, Meibergdreef 9, Amsterdam, Netherlands
- Amsterdam Public Health, Methodology, Amsterdam, The Netherlands
- Amsterdam Infection and Immunity, Inflammatory Diseases, Amsterdam, The Netherlands
| | - Maria Stratigopoulou
- Cancer Center Amsterdam, Amsterdam, The Netherlands
- Amsterdam UMC location University of Amsterdam, Medical Microbiology and Infection Prevention, Meibergdreef 9, Amsterdam, Netherlands
| | - Jeroen E J Guikema
- Cancer Center Amsterdam, Amsterdam, The Netherlands
- Amsterdam UMC location University of Amsterdam, Pathology, Lymphoma and Myeloma Center Amsterdam, Meibergdreef 9, Amsterdam, Netherlands
| | - Tom G Caniels
- Amsterdam UMC location University of Amsterdam, Medical Microbiology and Infection Prevention, Meibergdreef 9, Amsterdam, Netherlands
- Amsterdam Infection and Immunity, Infectious Diseases, Amsterdam, The Netherlands
| | - Mathieu Claireaux
- Amsterdam UMC location University of Amsterdam, Medical Microbiology and Infection Prevention, Meibergdreef 9, Amsterdam, Netherlands
- Amsterdam Infection and Immunity, Infectious Diseases, Amsterdam, The Netherlands
| | - Marit J van Gils
- Amsterdam UMC location University of Amsterdam, Medical Microbiology and Infection Prevention, Meibergdreef 9, Amsterdam, Netherlands
- Amsterdam Infection and Immunity, Infectious Diseases, Amsterdam, The Netherlands
| | - Anne Musters
- Amsterdam UMC location University of Amsterdam, Experimental Immunology, Meibergdreef 9, Amsterdam, Netherlands
- Amsterdam Rheumatology & Immunology Center, Amsterdam, The Netherlands
| | - Dornatien C Anang
- Amsterdam UMC location University of Amsterdam, Experimental Immunology, Meibergdreef 9, Amsterdam, Netherlands
- Amsterdam Rheumatology & Immunology Center, Amsterdam, The Netherlands
| | - Niek de Vries
- Amsterdam UMC location University of Amsterdam, Experimental Immunology, Meibergdreef 9, Amsterdam, Netherlands
- Amsterdam Rheumatology & Immunology Center, Amsterdam, The Netherlands
| | - Victor Greiff
- Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
| | - Antoine H C van Kampen
- Amsterdam UMC location University of Amsterdam, Epidemiology and Data Science, Meibergdreef 9, Amsterdam, Netherlands.
- Amsterdam Public Health, Methodology, Amsterdam, The Netherlands.
- Amsterdam Infection and Immunity, Inflammatory Diseases, Amsterdam, The Netherlands.
- Biosystems Data Analysis, Swammerdam Institute for Life Sciences, University of Amsterdam, Amsterdam, The Netherlands.
| |
Collapse
|
11
|
Saunders KO, Counts J, Thakur B, Stalls V, Edwards R, Manne K, Lu X, Mansouri K, Chen Y, Parks R, Barr M, Sutherland L, Bal J, Havill N, Chen H, Machiele E, Jamieson N, Hora B, Kopp M, Janowska K, Anasti K, Jiang C, Van Itallie E, Venkatayogi S, Eaton A, Henderson R, Barbosa C, Alam SM, Santra S, Weissman D, Moody MA, Cain DW, Tam YK, Lewis M, Williams WB, Wiehe K, Montefiori DC, Acharya P, Haynes BF. Vaccine induction of CD4-mimicking HIV-1 broadly neutralizing antibody precursors in macaques. Cell 2024; 187:79-94.e24. [PMID: 38181743 PMCID: PMC10860651 DOI: 10.1016/j.cell.2023.12.002] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Revised: 09/08/2023] [Accepted: 12/01/2023] [Indexed: 01/07/2024]
Abstract
The CD4-binding site (CD4bs) is a conserved epitope on HIV-1 envelope (Env) that can be targeted by protective broadly neutralizing antibodies (bnAbs). HIV-1 vaccines have not elicited CD4bs bnAbs for many reasons, including the occlusion of CD4bs by glycans, expansion of appropriate naive B cells with immunogens, and selection of functional antibody mutations. Here, we demonstrate that immunization of macaques with a CD4bs-targeting immunogen elicits neutralizing bnAb precursors with structural and genetic features of CD4-mimicking bnAbs. Structures of the CD4bs nAb bound to HIV-1 Env demonstrated binding angles and heavy-chain interactions characteristic of all known human CD4-mimicking bnAbs. Macaque nAb were derived from variable and joining gene segments orthologous to the genes of human VH1-46-class bnAb. This vaccine study initiated in primates the B cells from which CD4bs bnAbs can derive, accomplishing the key first step in the development of an effective HIV-1 vaccine.
Collapse
Affiliation(s)
- Kevin O Saunders
- Duke Human Vaccine Institute, Duke University School of Medicine, Durham, NC 27710, USA; Department of Surgery, Duke University School of Medicine, Durham, NC 27710, USA; Department of Immunology, Duke University School of Medicine, Durham, NC 27710, USA; Department of Molecular Genetics and Microbiology, Duke University School of Medicine, Durham, NC 27710, USA.
| | - James Counts
- Duke Human Vaccine Institute, Duke University School of Medicine, Durham, NC 27710, USA; Department of Medicine, Duke University School of Medicine, Durham, NC 27710, USA
| | - Bhishem Thakur
- Duke Human Vaccine Institute, Duke University School of Medicine, Durham, NC 27710, USA; Department of Medicine, Duke University School of Medicine, Durham, NC 27710, USA
| | - Victoria Stalls
- Duke Human Vaccine Institute, Duke University School of Medicine, Durham, NC 27710, USA; Department of Medicine, Duke University School of Medicine, Durham, NC 27710, USA
| | - Robert Edwards
- Duke Human Vaccine Institute, Duke University School of Medicine, Durham, NC 27710, USA; Department of Medicine, Duke University School of Medicine, Durham, NC 27710, USA
| | - Kartik Manne
- Duke Human Vaccine Institute, Duke University School of Medicine, Durham, NC 27710, USA; Department of Medicine, Duke University School of Medicine, Durham, NC 27710, USA
| | - Xiaozhi Lu
- Duke Human Vaccine Institute, Duke University School of Medicine, Durham, NC 27710, USA; Department of Medicine, Duke University School of Medicine, Durham, NC 27710, USA
| | - Katayoun Mansouri
- Duke Human Vaccine Institute, Duke University School of Medicine, Durham, NC 27710, USA; Department of Medicine, Duke University School of Medicine, Durham, NC 27710, USA
| | - Yue Chen
- Duke Human Vaccine Institute, Duke University School of Medicine, Durham, NC 27710, USA; Department of Medicine, Duke University School of Medicine, Durham, NC 27710, USA
| | - Rob Parks
- Duke Human Vaccine Institute, Duke University School of Medicine, Durham, NC 27710, USA; Department of Medicine, Duke University School of Medicine, Durham, NC 27710, USA
| | - Maggie Barr
- Duke Human Vaccine Institute, Duke University School of Medicine, Durham, NC 27710, USA; Department of Medicine, Duke University School of Medicine, Durham, NC 27710, USA
| | - Laura Sutherland
- Duke Human Vaccine Institute, Duke University School of Medicine, Durham, NC 27710, USA; Department of Medicine, Duke University School of Medicine, Durham, NC 27710, USA
| | - Joena Bal
- Duke Human Vaccine Institute, Duke University School of Medicine, Durham, NC 27710, USA; Department of Medicine, Duke University School of Medicine, Durham, NC 27710, USA
| | - Nicholas Havill
- Duke Human Vaccine Institute, Duke University School of Medicine, Durham, NC 27710, USA; Department of Biology, Davidson College, Davidson, NC 28035, USA
| | - Haiyan Chen
- Duke Human Vaccine Institute, Duke University School of Medicine, Durham, NC 27710, USA; Department of Medicine, Duke University School of Medicine, Durham, NC 27710, USA
| | - Emily Machiele
- Duke Human Vaccine Institute, Duke University School of Medicine, Durham, NC 27710, USA; Department of Medicine, Duke University School of Medicine, Durham, NC 27710, USA
| | - Nolan Jamieson
- Duke Human Vaccine Institute, Duke University School of Medicine, Durham, NC 27710, USA; Department of Medicine, Duke University School of Medicine, Durham, NC 27710, USA
| | - Bhavna Hora
- Duke Human Vaccine Institute, Duke University School of Medicine, Durham, NC 27710, USA; Department of Medicine, Duke University School of Medicine, Durham, NC 27710, USA
| | - Megan Kopp
- Duke Human Vaccine Institute, Duke University School of Medicine, Durham, NC 27710, USA; Department of Medicine, Duke University School of Medicine, Durham, NC 27710, USA
| | - Katarzyna Janowska
- Duke Human Vaccine Institute, Duke University School of Medicine, Durham, NC 27710, USA; Department of Medicine, Duke University School of Medicine, Durham, NC 27710, USA
| | - Kara Anasti
- Duke Human Vaccine Institute, Duke University School of Medicine, Durham, NC 27710, USA; Department of Medicine, Duke University School of Medicine, Durham, NC 27710, USA
| | - Chuancang Jiang
- Duke Human Vaccine Institute, Duke University School of Medicine, Durham, NC 27710, USA; Department of Medicine, Duke University School of Medicine, Durham, NC 27710, USA
| | - Elizabeth Van Itallie
- Duke Human Vaccine Institute, Duke University School of Medicine, Durham, NC 27710, USA; Department of Medicine, Duke University School of Medicine, Durham, NC 27710, USA
| | - Sravani Venkatayogi
- Duke Human Vaccine Institute, Duke University School of Medicine, Durham, NC 27710, USA; Department of Medicine, Duke University School of Medicine, Durham, NC 27710, USA
| | - Amanda Eaton
- Duke Human Vaccine Institute, Duke University School of Medicine, Durham, NC 27710, USA; Department of Medicine, Duke University School of Medicine, Durham, NC 27710, USA
| | - Rory Henderson
- Duke Human Vaccine Institute, Duke University School of Medicine, Durham, NC 27710, USA; Department of Medicine, Duke University School of Medicine, Durham, NC 27710, USA
| | | | - S Munir Alam
- Duke Human Vaccine Institute, Duke University School of Medicine, Durham, NC 27710, USA; Department of Medicine, Duke University School of Medicine, Durham, NC 27710, USA
| | - Sampa Santra
- Beth Israel Deaconess Medical Center, Boston, MA 02215, USA
| | - Drew Weissman
- Department of Microbiology, University of Pennsylvania, Philadelphia, PA 19104, USA; Department of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - M Anthony Moody
- Duke Human Vaccine Institute, Duke University School of Medicine, Durham, NC 27710, USA; Department of Pediatrics, Duke University School of Medicine, Durham, NC 27710, USA
| | - Derek W Cain
- Duke Human Vaccine Institute, Duke University School of Medicine, Durham, NC 27710, USA; Department of Medicine, Duke University School of Medicine, Durham, NC 27710, USA
| | | | | | - Wilton B Williams
- Duke Human Vaccine Institute, Duke University School of Medicine, Durham, NC 27710, USA; Department of Surgery, Duke University School of Medicine, Durham, NC 27710, USA; Department of Immunology, Duke University School of Medicine, Durham, NC 27710, USA
| | - Kevin Wiehe
- Duke Human Vaccine Institute, Duke University School of Medicine, Durham, NC 27710, USA; Department of Medicine, Duke University School of Medicine, Durham, NC 27710, USA
| | - David C Montefiori
- Duke Human Vaccine Institute, Duke University School of Medicine, Durham, NC 27710, USA; Department of Surgery, Duke University School of Medicine, Durham, NC 27710, USA
| | - Priyamvada Acharya
- Duke Human Vaccine Institute, Duke University School of Medicine, Durham, NC 27710, USA; Department of Surgery, Duke University School of Medicine, Durham, NC 27710, USA; Department of Biochemistry, Duke University School of Medicine, Durham, NC 27710, USA.
| | - Barton F Haynes
- Duke Human Vaccine Institute, Duke University School of Medicine, Durham, NC 27710, USA; Department of Immunology, Duke University School of Medicine, Durham, NC 27710, USA; Department of Medicine, Duke University School of Medicine, Durham, NC 27710, USA.
| |
Collapse
|
12
|
Hoehn KB, Kleinstein SH. B cell phylogenetics in the single cell era. Trends Immunol 2024; 45:62-74. [PMID: 38151443 PMCID: PMC10872299 DOI: 10.1016/j.it.2023.11.004] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2023] [Revised: 11/22/2023] [Accepted: 11/24/2023] [Indexed: 12/29/2023]
Abstract
The widespread availability of single-cell RNA sequencing (scRNA-seq) has led to the development of new methods for understanding immune responses. Single-cell transcriptome data can now be paired with B cell receptor (BCR) sequences. However, RNA from BCRs cannot be analyzed like most other genes because BCRs are genetically diverse within individuals. In humans, BCRs are shaped through recombination followed by mutation and selection for antigen binding. As these processes co-occur with cell division, B cells can be studied using phylogenetic trees representing the mutations within a clone. B cell trees can link experimental timepoints, tissues, or cellular subtypes. Here, we review the current state and potential of how B cell phylogenetics can be combined with single-cell data to understand immune responses.
Collapse
Affiliation(s)
- Kenneth B Hoehn
- Department of Biomedical Data Science, Geisel School of Medicine at Dartmouth, Hanover, NH 03755, USA.
| | - Steven H Kleinstein
- Department of Pathology, Yale School of Medicine, New Haven, CT 06520, USA; Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA; Department of Immunobiology, Yale School of Medicine, New Haven, CT 06520, USA
| |
Collapse
|
13
|
Wang K, Hu X, Zhang J. Fast clonal family inference from large-scale B cell repertoire sequencing data. CELL REPORTS METHODS 2023; 3:100601. [PMID: 37788671 PMCID: PMC10626204 DOI: 10.1016/j.crmeth.2023.100601] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/24/2023] [Revised: 07/31/2023] [Accepted: 09/08/2023] [Indexed: 10/05/2023]
Abstract
Advances in high-throughput sequencing technologies have facilitated the large-scale characterization of B cell receptor (BCR) repertoires. However, the vast amount and high diversity of the BCR sequences pose challenges for efficient and biologically meaningful analysis. Here, we introduce fastBCR, an efficient computational approach for inferring B cell clonal families from massive BCR heavy chain sequences. We demonstrate that fastBCR substantially reduces the running time while ensuring high accuracy on simulated datasets with diverse numbers of B cell lineages and varying mutation rates. We apply fastBCR to real BCR sequencing data from peripheral blood samples of COVID-19 patients, showing that the inferred clonal families display disease-associated features, as well as corresponding antigen-binding specificity and affinity. Overall, our results demonstrate the advantages of fastBCR for analyzing BCR repertoire data, which will facilitate the identification of disease-associated antibodies and improve our understanding of the B cell immune response.
Collapse
Affiliation(s)
- Kaixuan Wang
- Academy of Medical Engineering and Translational Medicine, Tianjin University, Tianjin, China
| | - Xihao Hu
- GV20 Therapeutics, Cambridge, MA, USA
| | - Jian Zhang
- Academy of Medical Engineering and Translational Medicine, Tianjin University, Tianjin, China.
| |
Collapse
|
14
|
Mathews J, Van Itallie E, Wiehe K, Schmidler SC. Computing the inducibility of B cell lineages under a context-dependent model of affinity maturation: Applications to sequential vaccine design. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.13.562156. [PMID: 37905016 PMCID: PMC10614816 DOI: 10.1101/2023.10.13.562156] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/02/2023]
Abstract
A key challenge in B cell lineage-based vaccine design is understanding the inducibility of target neutralizing antibodies. We approach this problem through the use of detailed stochastic modeling of the somatic hypermutation process that occurs during affinity maturation. Under such a model, sequence mutation rates are context-dependent, rendering standard probability calculations for sequence evolution intractable. We develop an algorithmic approach to rapid, accurate approximation of key marginal sequence likelihoods required to inform modern sequential vaccine design strategies. These calculated probabilities are used to define an inducibility index for selecting among potential targets for immunogen design. We apply this approach to the problem of choosing targets for the design of boosting immunogens aimed at elicitation of the HIV broadly-neutralizing antibody DH270min11.
Collapse
Affiliation(s)
- Joseph Mathews
- Department of Statistical Science, Duke University, Durham, NC, USA
| | | | - Kevin Wiehe
- Duke Human Vaccine Institute, Duke University School of Medicine, Durham, NC, USA
| | - Scott C. Schmidler
- Department of Statistical Science, Duke University, Durham, NC, USA
- Department of Computer Science, Duke University, Durham, NC, USA
- Program in Computational Biology and Bioinformatics, Duke University, Durham, NC, USA
| |
Collapse
|
15
|
Vieira MC, Palm AKE, Stamper CT, Tepora ME, Nguyen KD, Pham TD, Boyd SD, Wilson PC, Cobey S. Germline-encoded specificities and the predictability of the B cell response. PLoS Pathog 2023; 19:e1011603. [PMID: 37624867 PMCID: PMC10484431 DOI: 10.1371/journal.ppat.1011603] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2023] [Revised: 09/07/2023] [Accepted: 08/07/2023] [Indexed: 08/27/2023] Open
Abstract
Antibodies result from the competition of B cell lineages evolving under selection for improved antigen recognition, a process known as affinity maturation. High-affinity antibodies to pathogens such as HIV, influenza, and SARS-CoV-2 are frequently reported to arise from B cells whose receptors, the precursors to antibodies, are encoded by particular immunoglobulin alleles. This raises the possibility that the presence of particular germline alleles in the B cell repertoire is a major determinant of the quality of the antibody response. Alternatively, initial differences in germline alleles' propensities to form high-affinity receptors might be overcome by chance events during affinity maturation. We first investigate these scenarios in simulations: when germline-encoded fitness differences are large relative to the rate and effect size variation of somatic mutations, the same germline alleles persistently dominate the response of different individuals. In contrast, if germline-encoded advantages can be easily overcome by subsequent mutations, allele usage becomes increasingly divergent over time, a pattern we then observe in mice experimentally infected with influenza virus. We investigated whether affinity maturation might nonetheless strongly select for particular amino acid motifs across diverse genetic backgrounds, but we found no evidence of convergence to similar CDR3 sequences or amino acid substitutions. These results suggest that although germline-encoded specificities can lead to similar immune responses between individuals, diverse evolutionary routes to high affinity limit the genetic predictability of responses to infection and vaccination.
Collapse
Affiliation(s)
- Marcos C. Vieira
- Department of Ecology and Evolution, University of Chicago, Chicago, United States of America
| | - Anna-Karin E. Palm
- Department of Medicine, Section of Rheumatology, University of Chicago, Chicago, United States of America
| | - Christopher T. Stamper
- Center for Infectious Medicine, Department of Medicine Huddinge, Karolinska Institutet, Karolinska University Hospital Huddinge, Stockholm, Sweden
- Committee on Immunology, University of Chicago, Chicago, United States of America
| | - Micah E. Tepora
- Department of Medicine, Section of Rheumatology, University of Chicago, Chicago, United States of America
| | - Khoa D. Nguyen
- Department of Pathology, Stanford University School of Medicine, Stanford, United States of America
| | - Tho D. Pham
- Department of Pathology, Stanford University School of Medicine, Stanford, United States of America
| | - Scott D. Boyd
- Department of Pathology, Stanford University School of Medicine, Stanford, United States of America
| | - Patrick C. Wilson
- Department of Medicine, Section of Rheumatology, University of Chicago, Chicago, United States of America
- Gale and Ira Drukier Institute for Children’s Health, Weill Cornell Medicine, New York City, United States of America
| | - Sarah Cobey
- Department of Ecology and Evolution, University of Chicago, Chicago, United States of America
| |
Collapse
|
16
|
Jeusset L, Abdollahi N, Verny T, Armand M, De Septenville A, Davi F, Bernardes JS. ViCloD, an interactive web tool for visualizing B cell repertoires and analyzing intraclonal diversities: application to human B-cell tumors. NAR Genom Bioinform 2023; 5:lqad064. [PMID: 37388820 PMCID: PMC10304752 DOI: 10.1093/nargab/lqad064] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2023] [Revised: 05/25/2023] [Accepted: 06/26/2023] [Indexed: 07/01/2023] Open
Abstract
High throughput sequencing of adaptive immune receptor repertoire (AIRR-seq) has provided numerous human immunoglobulin (IG) sequences allowing specific B cell receptor (BCR) studies such as the antigen-driven evolution of antibodies (soluble forms of the membrane-bound IG part of the BCR). AIRR-seq data allows researchers to examine intraclonal differences caused primarily by somatic hypermutations in IG genes and affinity maturation. Exploring this essential adaptive immunity process could help elucidate the generation of antibodies with high affinity or broadly neutralizing activities. Retracing their evolutionary history could also clarify how vaccines or pathogen exposition drive the humoral immune response, and unravel the clonal architecture of B cell tumors. Computational methods are necessary for large-scale analysis of AIRR-seq properties. However, there is no efficient and interactive tool for analyzing intraclonal diversity, permitting users to explore adaptive immune receptor repertoires in biological and clinical applications. Here we present ViCloD, a web server for large-scale visual analysis of repertoire clonality and intraclonal diversity. ViCloD uses preprocessed data in the format defined by the Adaptive Immune Receptor Repertoire (AIRR) Community. Then, it performs clonal grouping and evolutionary analyses, producing a collection of useful plots for clonal lineage inspection. The web server presents diverse functionalities, including repertoire navigation, clonal abundance analysis, and intraclonal evolutionary tree reconstruction. Users can download the analyzed data in different table formats and save the generated plots as images. ViCloD is a simple, versatile, and user-friendly tool that can help researchers and clinicians to analyze B cell intraclonal diversity. Moreover, its pipeline is optimized to process hundreds of thousands of sequences within a few minutes, allowing an efficient investigation of large and complex repertoires.
Collapse
Affiliation(s)
- Lucile Jeusset
- Sorbonne Université, CNRS, UMR 7238, Laboratoire de Biologie Computationnelle et Quantitative, Paris, France
- Sorbonne Université, AP-HP, Hôpital Pitié-Salpêtrière, Department of Biological Hematology, Paris, France
| | - Nika Abdollahi
- Sorbonne Université, CNRS, UMR 7238, Laboratoire de Biologie Computationnelle et Quantitative, Paris, France
- IMGT, the international ImMunoGeneTics Information System, CNRS, Institute of Human Genetics, Montpellier University, France
| | - Thibaud Verny
- Sorbonne Université, CNRS, UMR 7238, Laboratoire de Biologie Computationnelle et Quantitative, Paris, France
- Ecole des Mines ParisTech, Paris, France
| | - Marine Armand
- Sorbonne Université, AP-HP, Hôpital Pitié-Salpêtrière, Department of Biological Hematology, Paris, France
| | | | - Frédéric Davi
- Sorbonne Université, AP-HP, Hôpital Pitié-Salpêtrière, Department of Biological Hematology, Paris, France
| | - Juliana Silva Bernardes
- Sorbonne Université, CNRS, UMR 7238, Laboratoire de Biologie Computationnelle et Quantitative, Paris, France
| |
Collapse
|
17
|
Pelissier A, Luo S, Stratigopoulou M, Guikema JEJ, Rodríguez Martínez M. Exploring the impact of clonal definition on B-cell diversity: implications for the analysis of immune repertoires. Front Immunol 2023; 14:1123968. [PMID: 37138881 PMCID: PMC10150052 DOI: 10.3389/fimmu.2023.1123968] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Accepted: 03/13/2023] [Indexed: 05/05/2023] Open
Abstract
The adaptive immune system has the extraordinary ability to produce a broad range of immunoglobulins that can bind a wide variety of antigens. During adaptive immune responses, activated B cells duplicate and undergo somatic hypermutation in their B-cell receptor (BCR) genes, resulting in clonal families of diversified B cells that can be related back to a common ancestor. Advances in high-throughput sequencing technologies have enabled the high-throughput characterization of B-cell repertoires, however, the accurate identification of clonally related BCR sequences remains a major challenge. In this study, we compare three different clone identification methods on both simulated and experimental data, and investigate their impact on the characterization of B-cell diversity. We observe that different methods lead to different clonal definitions, which affects the quantification of clonal diversity in repertoire data. Our analyses show that direct comparisons between clonal clusterings and clonal diversity of different repertoires should be avoided if different clone identification methods were used to define the clones. Despite this variability, the diversity indices inferred from the repertoires' clonal characterization across samples show similar patterns of variation regardless of the clonal identification method used. We find the Shannon entropy to be the most robust in terms of the variability of diversity rank across samples. Our analysis also suggests that the traditional germline gene alignment-based method for clonal identification remains the most accurate when the complete information about the sequence is known, but that alignment-free methods may be preferred for shorter sequencing read lengths. We make our implementation freely available as a Python library cdiversity.
Collapse
Affiliation(s)
- Aurelien Pelissier
- IBM Research Europe, Rüschlikon, Switzerland
- Department of Biosystems Science and Engineering, ETH Zurich, Basel, Switzerland
| | - Siyuan Luo
- IBM Research Europe, Rüschlikon, Switzerland
- Department of Biosystems Science and Engineering, ETH Zurich, Basel, Switzerland
| | - Maria Stratigopoulou
- Department of Pathology, Amsterdam University Medical Centers, location AMC, Lymphoma and Myeloma Center Amsterdam (LYMMCARE), Amsterdam, Netherlands
| | - Jeroen E. J. Guikema
- Department of Pathology, Amsterdam University Medical Centers, location AMC, Lymphoma and Myeloma Center Amsterdam (LYMMCARE), Amsterdam, Netherlands
| | | |
Collapse
|
18
|
Abdollahi N, Jeusset L, de Septenville A, Davi F, Bernardes JS. Reconstructing B cell lineage trees with minimum spanning tree and genotype abundances. BMC Bioinformatics 2023; 24:70. [PMID: 36849917 PMCID: PMC9972711 DOI: 10.1186/s12859-022-05112-z] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2022] [Accepted: 12/13/2022] [Indexed: 03/01/2023] Open
Abstract
B cell receptor (BCR) genes exposed to an antigen undergo somatic hypermutations and Darwinian antigen selection, generating a large BCR-antibody diversity. This process, known as B cell affinity maturation, increases antibody affinity, forming a specific B cell lineage that includes the unmutated ancestor and mutated variants. In a B cell lineage, cells with a higher antigen affinity will undergo clonal expansion, while those with a lower affinity will not proliferate and probably be eliminated. Therefore, cellular (genotype) abundance provides a valuable perspective on the ongoing evolutionary process. Phylogenetic tree inference is often used to reconstruct B cell lineage trees and represents the evolutionary dynamic of BCR affinity maturation. However, such methods should process B-cell population data derived from experimental sampling that might contain different cellular abundances. There are a few phylogenetic methods for tracing the evolutionary events occurring in B cell lineages; best-performing solutions are time-demanding and restricted to analysing a reduced number of sequences, while time-efficient methods do not consider cellular abundances. We propose ClonalTree, a low-complexity and accurate approach to construct B-cell lineage trees that incorporates genotype abundances into minimum spanning tree (MST) algorithms. Using both simulated and experimental data, we demonstrate that ClonalTree outperforms MST-based algorithms and achieves a comparable performance to a method that explores tree-generating space exhaustively. Furthermore, ClonalTree has a lower running time, being more convenient for building B-cell lineage trees from high-throughput BCR sequencing data, mainly in biomedical applications, where a lower computational time is appreciable. It is hundreds to thousands of times faster than exhaustive approaches, enabling the analysis of a large set of sequences within minutes or seconds and without loss of accuracy. The source code is freely available at github.com/julibinho/ClonalTree.
Collapse
Affiliation(s)
- Nika Abdollahi
- grid.462844.80000 0001 2308 1657UMR 7238, Laboratoire de Biologie Computationnelle et Quantitative, Sorbonne University, Paris, France ,grid.121334.60000 0001 2097 0141IMGT®, The International ImMunoGeneTics Information System, CNRS, Institute of Human Genetics, Montpellier University, Montpellier, France
| | - Lucile Jeusset
- grid.462844.80000 0001 2308 1657UMR 7238, Laboratoire de Biologie Computationnelle et Quantitative, Sorbonne University, Paris, France ,grid.462844.80000 0001 2308 1657AP-HP, Hôpital Pitié-Salpêtrière, Department of Biological Hematology, Sorbonne University, Paris, France
| | - Anne de Septenville
- grid.462844.80000 0001 2308 1657AP-HP, Hôpital Pitié-Salpêtrière, Department of Biological Hematology, Sorbonne University, Paris, France
| | - Frederic Davi
- grid.462844.80000 0001 2308 1657AP-HP, Hôpital Pitié-Salpêtrière, Department of Biological Hematology, Sorbonne University, Paris, France
| | - Juliana Silva Bernardes
- UMR 7238, Laboratoire de Biologie Computationnelle et Quantitative, Sorbonne University, Paris, France.
| |
Collapse
|
19
|
Ralph DK, Matsen FA. Inference of B cell clonal families using heavy/light chain pairing information. PLoS Comput Biol 2022; 18:e1010723. [PMID: 36441808 PMCID: PMC9731466 DOI: 10.1371/journal.pcbi.1010723] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2022] [Revised: 12/08/2022] [Accepted: 11/09/2022] [Indexed: 11/29/2022] Open
Abstract
Next generation sequencing of B cell receptor (BCR) repertoires has become a ubiquitous tool for understanding the antibody-mediated immune response: it is now common to have large volumes of sequence data coding for both the heavy and light chain subunits of the BCR. However, until the recent development of high throughput methods of preserving heavy/light chain pairing information, these samples contained no explicit information on which heavy chain sequence pairs with which light chain sequence. One of the first steps in analyzing such BCR repertoire samples is grouping sequences into clonally related families, where each stems from a single rearrangement event. Many methods of accomplishing this have been developed, however, none so far has taken full advantage of the newly-available pairing information. This information can dramatically improve clustering performance, especially for the light chain. The light chain has traditionally been challenging for clonal family inference because of its low diversity and consequent abundance of non-clonal families with indistinguishable naive rearrangements. Here we present a method of incorporating this pairing information into the clustering process in order to arrive at a more accurate partition of the data into clonally related families. We also demonstrate two methods of fixing imperfect pairing information, which may allow for simplified sample preparation and increased sequencing depth. Finally, we describe several other improvements to the partis software package.
Collapse
Affiliation(s)
- Duncan K. Ralph
- Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
- * E-mail:
| | - Frederick A. Matsen
- Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
- Department of Statistics, University of Washington, Seattle, Washington, United States of America
- Howard Hughes Medical Institute, Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
| |
Collapse
|
20
|
Abdollahi N, Jeusset L, De Septenville AL, Ripoche H, Davi F, Bernardes JS. A multi-objective based clustering for inferring BCR clonal lineages from high-throughput B cell repertoire data. PLoS Comput Biol 2022; 18:e1010411. [PMID: 36037250 PMCID: PMC9462827 DOI: 10.1371/journal.pcbi.1010411] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2021] [Revised: 09/09/2022] [Accepted: 07/18/2022] [Indexed: 11/30/2022] Open
Abstract
The adaptive B cell response is driven by the expansion, somatic hypermutation, and selection of B cell clonal lineages. A high number of clonal lineages in a B cell population indicates a highly diverse repertoire, while clonal size distribution and sequence diversity reflect antigen selective pressure. Identifying clonal lineages is fundamental to many repertoire studies, including repertoire comparisons, clonal tracking, and statistical analysis. Several methods have been developed to group sequences from high-throughput B cell repertoire data. Current methods use clustering algorithms to group clonally-related sequences based on their similarities or distances. Such approaches create groups by optimizing a single objective that typically minimizes intra-clonal distances. However, optimizing several objective functions can be advantageous and boost the algorithm convergence rate. Here we propose MobiLLe, a new method based on multi-objective clustering. Our approach requires V(D)J annotations to obtain the initial groups and iteratively applies two objective functions that optimize cohesion and separation within clonal lineages simultaneously. We show that our method greatly improves clonal lineage grouping on simulated benchmarks with varied mutation rates compared to other tools. When applied to experimental repertoires generated from high-throughput sequencing, its clustering results are comparable to the most performing tools and can reproduce the results of previous publications. The method based on multi-objective clustering can accurately identify clonally-related antibody sequences and presents the lowest running time among state-of-art tools. All these features constitute an attractive option for repertoire analysis, particularly in the clinical context. MobiLLe can potentially help unravel the mechanisms involved in developing and evolving B cell malignancies. High-throughput sequencing can produce a large set of sequences and has profoundly changed our ability to study immune repertoires, particularly B cell receptor sequences. An important application is the analysis of the clonal lineage composition of B cell populations; it is the starting point of many immune repertoire studies, for instance, to differentiate between healthy individuals and those with lymphoid malignancies or other diseases. Several computational methods have been developed to identify clonal lineages from a set of B cell receptor sequences. Most of them apply clustering algorithms and optimize a single objective function that typically minimizes intra-clonal distances. However, optimizing several objective functions in parallel can benefit and increase the clustering performance and efficiency. We propose MobiLLe, the first multi-objective clonal lineage grouping method, which simultaneously optimizes two objective functions for minimizing intra-clonal diversity and maximizing inter-clonal differences. Our approach greatly improved clonal grouping on simulated benchmarks and performed comparably to the most powerful and recent methods on experimental samples. MobiLLe is computationally more efficient than existing tools and does not require any training process or hyper-parameter optimization. It can easily manage large-scale experimental repertoires, providing useful plots to help researchers detect clonally-related sequences in high-throughput B cell repertoire data.
Collapse
Affiliation(s)
- Nika Abdollahi
- Sorbonne Université, CNRS, UMR 7238, Laboratoire de Biologie Computationnelle et Quantitative, Paris, France
| | - Lucile Jeusset
- Sorbonne Université, CNRS, UMR 7238, Laboratoire de Biologie Computationnelle et Quantitative, Paris, France
- Sorbonne Université, AP-HP, Hôpital Pitié-Salpêtrière, UMR_S 1138 Department of Hematology, Paris, France
| | | | - Hugues Ripoche
- Sorbonne Université, CNRS, UMR 7238, Laboratoire de Biologie Computationnelle et Quantitative, Paris, France
| | - Frédéric Davi
- Sorbonne Université, AP-HP, Hôpital Pitié-Salpêtrière, UMR_S 1138 Department of Hematology, Paris, France
| | - Juliana Silva Bernardes
- Sorbonne Université, CNRS, UMR 7238, Laboratoire de Biologie Computationnelle et Quantitative, Paris, France
- * E-mail:
| |
Collapse
|
21
|
Affinity maturation for an optimal balance between long-term immune coverage and short-term resource constraints. Proc Natl Acad Sci U S A 2022; 119:2113512119. [PMID: 35177475 PMCID: PMC8872716 DOI: 10.1073/pnas.2113512119] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/07/2022] [Indexed: 12/15/2022] Open
Abstract
Humoral immunity relies on the mutation and selection of B cells to better recognize pathogens. This affinity maturation process produces cells with diverse recognition capabilities. Examining optimal immune strategies that maximize the long-term immune coverage at a minimal metabolic cost, we show when the immune system should mount a de novo response rather than rely on existing memory cells. Our theory recapitulates known modes of the B cell response, predicts the empirical form of the distribution of clone sizes, and rationalizes as a trade-off between metabolic and immune costs the antigenic imprinting effects that limit the efficacy of vaccines (original antigenic sin). Our predictions provide a framework to interpret experimental results that could be used to inform vaccination strategies. In order to target threatening pathogens, the adaptive immune system performs a continuous reorganization of its lymphocyte repertoire. Following an immune challenge, the B cell repertoire can evolve cells of increased specificity for the encountered strain. This process of affinity maturation generates a memory pool whose diversity and size remain difficult to predict. We assume that the immune system follows a strategy that maximizes the long-term immune coverage and minimizes the short-term metabolic costs associated with affinity maturation. This strategy is defined as an optimal decision process on a finite dimensional phenotypic space, where a preexisting population of cells is sequentially challenged with a neutrally evolving strain. We show that the low specificity and high diversity of memory B cells—a key experimental result—can be explained as a strategy to protect against pathogens that evolve fast enough to escape highly potent but narrow memory. This plasticity of the repertoire drives the emergence of distinct regimes for the size and diversity of the memory pool, depending on the density of de novo responding cells and on the mutation rate of the strain. The model predicts power-law distributions of clonotype sizes observed in data and rationalizes antigenic imprinting as a strategy to minimize metabolic costs while keeping good immune protection against future strains.
Collapse
|
22
|
Bulk gDNA Sequencing of Antibody Heavy-Chain Gene Rearrangements for Detection and Analysis of B-Cell Clone Distribution: A Method by the AIRR Community. Methods Mol Biol 2022; 2453:317-343. [PMID: 35622334 PMCID: PMC9374196 DOI: 10.1007/978-1-0716-2115-8_18] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
Abstract
In this method we illustrate how to amplify, sequence, and analyze antibody/immunoglobulin (IG) heavy-chain gene rearrangements from genomic DNA that is derived from bulk populations of cells by next-generation sequencing (NGS). We focus on human source material and illustrate how bulk gDNA-based sequencing can be used to examine clonal architecture and networks in different samples that are sequenced from the same individual. Although bulk gDNA-based sequencing can be performed on both IG heavy (IGH) or kappa/lambda light (IGK/IGL) chains, we focus here on IGH gene rearrangements because IG heavy chains are more diverse, tend to harbor higher levels of somatic hypermutations (SHM), and are more reliable for clone identification and tracking. We also provide a procedure, including code, and detailed instructions for processing and annotation of the NGS data. From these data we show how to identify expanded clones, visualize the overall clonal landscape, and track clonal lineages in different samples from the same individual. This method has a broad range of applications, including the identification and monitoring of expanded clones, the analysis of blood and tissue-based clonal networks, and the study of immune responses including clonal evolution.
Collapse
|
23
|
Marquez S, Babrak L, Greiff V, Hoehn KB, Lees WD, Luning Prak ET, Miho E, Rosenfeld AM, Schramm CA, Stervbo U. Adaptive Immune Receptor Repertoire (AIRR) Community Guide to Repertoire Analysis. Methods Mol Biol 2022; 2453:297-316. [PMID: 35622333 PMCID: PMC9761518 DOI: 10.1007/978-1-0716-2115-8_17] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
Adaptive immune receptor repertoires (AIRRs) are rich with information that can be mined for insights into the workings of the immune system. Gene usage, CDR3 properties, clonal lineage structure, and sequence diversity are all capable of revealing the dynamic immune response to perturbation by disease, vaccination, or other interventions. Here we focus on a conceptual introduction to the many aspects of repertoire analysis and orient the reader toward the uses and advantages of each. Along the way, we note some of the many software tools that have been developed for these investigations and link the ideas discussed to chapters on methods provided elsewhere in this volume.
Collapse
Affiliation(s)
- Susanna Marquez
- Department of Pathology, Yale School of Medicine, New Haven, CT, USA
| | - Lmar Babrak
- Institute of Biomedical Engineering and Medical Informatics, School of Life Sciences, FHNW University of Applied Sciences and Arts Northwestern Switzerland, Muttenz, Switzerland
| | - Victor Greiff
- Department of Immunology, University of Oslo, Oslo University Hospital, Oslo, Norway
| | - Kenneth B Hoehn
- Department of Pathology, Yale School of Medicine, New Haven, CT, USA
| | - William D Lees
- Institute of Structural and Molecular Biology, Birkbeck College, University of London, London, UK
| | - Eline T Luning Prak
- Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Enkelejda Miho
- Institute of Biomedical Engineering and Medical Informatics, School of Life Sciences, FHNW University of Applied Sciences and Arts Northwestern Switzerland, Muttenz, Switzerland
- SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
- aiNET GmbH, Basel, Switzerland
| | - Aaron M Rosenfeld
- Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Chaim A Schramm
- Vaccine Research Center, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, USA.
| | - Ulrik Stervbo
- Center for Translational Medicine, Immunology, and Transplantation, Medical Department I, Marien Hospital Herne, University Hospital of the Ruhr-University Bochum, Herne, Germany.
- Immundiagnostik, Marien Hospital Herne, University Hospital of the Ruhr-University Bochum, Herne, Germany.
| |
Collapse
|
24
|
Slabodkin A, Chernigovskaya M, Mikocziova I, Akbar R, Scheffer L, Pavlović M, Bashour H, Snapkov I, Mehta BB, Weber CR, Gutierrez-Marcos J, Sollid LM, Haff IH, Sandve GK, Robert PA, Greiff V. Individualized VDJ recombination predisposes the available Ig sequence space. Genome Res 2021; 31:2209-2224. [PMID: 34815307 PMCID: PMC8647828 DOI: 10.1101/gr.275373.121] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2021] [Accepted: 10/20/2021] [Indexed: 11/25/2022]
Abstract
The process of recombination between variable (V), diversity (D), and joining (J) immunoglobulin (Ig) gene segments determines an individual's naive Ig repertoire and, consequently, (auto)antigen recognition. VDJ recombination follows probabilistic rules that can be modeled statistically. So far, it remains unknown whether VDJ recombination rules differ between individuals. If these rules differed, identical (auto)antigen-specific Ig sequences would be generated with individual-specific probabilities, signifying that the available Ig sequence space is individual specific. We devised a sensitivity-tested distance measure that enables inter-individual comparison of VDJ recombination models. We discovered, accounting for several sources of noise as well as allelic variation in Ig sequencing data, that not only unrelated individuals but also human monozygotic twins and even inbred mice possess statistically distinguishable immunoglobulin recombination models. This suggests that, in addition to genetic, there is also nongenetic modulation of VDJ recombination. We demonstrate that population-wide individualized VDJ recombination can result in orders of magnitude of difference in the probability to generate (auto)antigen-specific Ig sequences. Our findings have implications for immune receptor-based individualized medicine approaches relevant to vaccination, infection, and autoimmunity.
Collapse
Affiliation(s)
- Andrei Slabodkin
- Department of Immunology and Oslo University Hospital, University of Oslo, 0372 Oslo, Norway
| | - Maria Chernigovskaya
- Department of Immunology and Oslo University Hospital, University of Oslo, 0372 Oslo, Norway
| | - Ivana Mikocziova
- Department of Immunology and Oslo University Hospital, University of Oslo, 0372 Oslo, Norway
| | - Rahmad Akbar
- Department of Immunology and Oslo University Hospital, University of Oslo, 0372 Oslo, Norway
| | - Lonneke Scheffer
- Department of Informatics, University of Oslo, 0373 Oslo, Norway
| | - Milena Pavlović
- Department of Informatics, University of Oslo, 0373 Oslo, Norway
| | - Habib Bashour
- School of Life Sciences, University of Warwick, Coventry CV4 7AL, United Kingdom
| | - Igor Snapkov
- Department of Immunology and Oslo University Hospital, University of Oslo, 0372 Oslo, Norway
| | - Brij Bhushan Mehta
- Department of Immunology and Oslo University Hospital, University of Oslo, 0372 Oslo, Norway
| | - Cédric R Weber
- Department of Biosystems Science and Engineering, ETH Zurich, 4058 Basel, Switzerland
| | | | - Ludvig M Sollid
- Department of Immunology and Oslo University Hospital, University of Oslo, 0372 Oslo, Norway
| | | | | | - Philippe A Robert
- Department of Immunology and Oslo University Hospital, University of Oslo, 0372 Oslo, Norway
| | - Victor Greiff
- Department of Immunology and Oslo University Hospital, University of Oslo, 0372 Oslo, Norway
| |
Collapse
|
25
|
Shipley MM, Mangala Prasad V, Doepker LE, Dingens A, Ralph DK, Harkins E, Dhar A, Arenz D, Chohan V, Weight H, Mandaliya K, Bloom JD, Matsen FA, Lee KK, Overbaugh JM. Functional development of a V3/glycan-specific broadly neutralizing antibody isolated from a case of HIV superinfection. eLife 2021; 10:68110. [PMID: 34263727 PMCID: PMC8376252 DOI: 10.7554/elife.68110] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2021] [Accepted: 07/14/2021] [Indexed: 12/12/2022] Open
Abstract
Stimulating broadly neutralizing antibodies (bnAbs) directly from germline remains a barrier for HIV vaccines. HIV superinfection elicits bnAbs more frequently than single infection, providing clues of how to elicit such responses. We used longitudinal antibody sequencing and structural studies to characterize bnAb development from a superinfection case. BnAb QA013.2 bound initial and superinfecting viral Env, despite its probable naive progenitor only recognizing the superinfecting strain, suggesting both viruses influenced this lineage. A 4.15 Å cryo-EM structure of QA013.2 bound to native-like trimer showed recognition of V3 signatures (N301/N332 and GDIR). QA013.2 relies less on CDRH3 and more on framework and CDRH1 for affinity and breadth compared to other V3/glycan-specific bnAbs. Antigenic profiling revealed that viral escape was achieved by changes in the structurally-defined epitope and by mutations in V1. These results highlight shared and novel properties of QA013.2 relative to other V3/glycan-specific bnAbs in the setting of sequential, diverse antigens.
Collapse
Affiliation(s)
- Mackenzie M Shipley
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, United States
| | - Vidya Mangala Prasad
- Department of Medicinal Chemistry, University of Washington, Seattle, United States
| | - Laura E Doepker
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, United States
| | - Adam Dingens
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, United States
| | - Duncan K Ralph
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, United States
| | - Elias Harkins
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, United States
| | - Amrit Dhar
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, United States
| | - Dana Arenz
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, United States
| | - Vrasha Chohan
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, United States
| | - Haidyn Weight
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, United States
| | - Kishor Mandaliya
- Coast Provincial General Hospital, Women's Health Project, Mombasa, Kenya
| | - Jesse D Bloom
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, United States.,Department of Genome Sciences, University of Washington, Seattle, United States.,Howard Hughes Medical Institute, Chevy Chase, United States
| | - Frederick A Matsen
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, United States
| | - Kelly K Lee
- Department of Medicinal Chemistry, University of Washington, Seattle, United States
| | - Julie M Overbaugh
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, United States
| |
Collapse
|
26
|
Bhardwaj V, Pevzner PA, Rashtchian C, Safonova Y. Trace Reconstruction Problems in Computational Biology. IEEE TRANSACTIONS ON INFORMATION THEORY 2021; 67:3295-3314. [PMID: 34176957 PMCID: PMC8224466 DOI: 10.1109/tit.2020.3030569] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
The problem of reconstructing a string from its error-prone copies, the trace reconstruction problem, was introduced by Vladimir Levenshtein two decades ago. While there has been considerable theoretical work on trace reconstruction, practical solutions have only recently started to emerge in the context of two rapidly developing research areas: immunogenomics and DNA data storage. In immunogenomics, traces correspond to mutated copies of genes, with mutations generated naturally by the adaptive immune system. In DNA data storage, traces correspond to noisy copies of DNA molecules that encode digital data, with errors being artifacts of the data retrieval process. In this paper, we introduce several new trace generation models and open questions relevant to trace reconstruction for immunogenomics and DNA data storage, survey theoretical results on trace reconstruction, and highlight their connections to computational biology. Throughout, we discuss the applicability and shortcomings of known solutions and suggest future research directions.
Collapse
Affiliation(s)
- Vinnu Bhardwaj
- Electrical and Computer Engineering Department, University of California San Diego, La Jolla, USA
| | - Pavel A. Pevzner
- Computer Science and Engineering Department, University of California San Diego, La Jolla, USA
| | - Cyrus Rashtchian
- Computer Science and Engineering Department, University of California San Diego, La Jolla, USA
- Qualcomm Institute, University of California San Diego, La Jolla, USA
| | - Yana Safonova
- Computer Science and Engineering Department, University of California San Diego, La Jolla, USA
| |
Collapse
|
27
|
Lindenbaum O, Nouri N, Kluger Y, Kleinstein SH. Alignment free identification of clones in B cell receptor repertoires. Nucleic Acids Res 2021; 49:e21. [PMID: 33330933 PMCID: PMC7913774 DOI: 10.1093/nar/gkaa1160] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2020] [Revised: 11/10/2020] [Accepted: 11/13/2020] [Indexed: 11/22/2022] Open
Abstract
Following antigenic challenge, activated B cells rapidly expand and undergo somatic hypermutation, yielding groups of clonally related B cells with diversified immunoglobulin receptors. Inference of clonal relationships based on the receptor sequence is an essential step in many adaptive immune receptor repertoire sequencing studies. These relationships are typically identified by a multi-step process that involves: (i) grouping sequences based on shared V and J gene assignments, and junction lengths and (ii) clustering these sequences using a junction-based distance. However, this approach is sensitive to the initial gene assignments, which are error-prone, and fails to identify clonal relatives whose junction length has changed through accumulation of indels. Through defining a translation-invariant feature space in which we cluster the sequences, we develop an alignment free clonal identification method that does not require gene assignments and is not restricted to a fixed junction length. This alignment free approach has higher sensitivity compared to a typical junction-based distance method without loss of specificity and PPV. While the alignment free procedure identifies clones that are broadly consistent with the junction-based distance method, it also identifies clones with characteristics (multiple V or J gene assignments or junction lengths) that are not detectable with the junction-based distance method.
Collapse
Affiliation(s)
- Ofir Lindenbaum
- Program in Applied Mathematics, Yale University, New Haven, CT, USA
| | - Nima Nouri
- Department of Pathology, Yale School of Medicine, New Haven, CT, USA.,Center for Medical Informatics, Yale University, New Haven, CT 06511, USA
| | - Yuval Kluger
- Program in Applied Mathematics, Yale University, New Haven, CT, USA.,Department of Pathology, Yale School of Medicine, New Haven, CT, USA.,Interdepartmental Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA
| | - Steven H Kleinstein
- Department of Pathology, Yale School of Medicine, New Haven, CT, USA.,Interdepartmental Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA.,Department of Immunobiology, Yale School of Medicine, New Haven, CT, USA
| |
Collapse
|
28
|
Ripoll DR, Chaudhury S, Wallqvist A. Using the antibody-antigen binding interface to train image-based deep neural networks for antibody-epitope classification. PLoS Comput Biol 2021; 17:e1008864. [PMID: 33780441 PMCID: PMC8032195 DOI: 10.1371/journal.pcbi.1008864] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2020] [Revised: 04/08/2021] [Accepted: 03/10/2021] [Indexed: 12/05/2022] Open
Abstract
High-throughput B-cell sequencing has opened up new avenues for investigating complex mechanisms underlying our adaptive immune response. These technological advances drive data generation and the need to mine and analyze the information contained in these large datasets, in particular the identification of therapeutic antibodies (Abs) or those associated with disease exposure and protection. Here, we describe our efforts to use artificial intelligence (AI)-based image-analyses for prospective classification of Abs based solely on sequence information. We hypothesized that Abs recognizing the same part of an antigen share a limited set of features at the binding interface, and that the binding site regions of these Abs share share common structure and physicochemical property patterns that can serve as a "fingerprint" to recognize uncharacterized Abs. We combined large-scale sequence-based protein-structure predictions to generate ensembles of 3-D Ab models, reduced the Ab binding interface to a 2-D image (fingerprint), used pre-trained convolutional neural networks to extract features, and trained deep neural networks (DNNs) to classify Abs. We evaluated this approach using Ab sequences derived from human HIV and Ebola viral infections to differentiate between two Abs, Abs belonging to specific B-cell family lineages, and Abs with different epitope preferences. In addition, we explored a different type of DNN method to detect one class of Abs from a larger pool of Abs. Testing on Ab sets that had been kept aside during model training, we achieved average prediction accuracies ranging from 71-96% depending on the complexity of the classification task. The high level of accuracies reached during these classification tests suggests that the DNN models were able to learn a series of structural patterns shared by Abs belonging to the same class. The developed methodology provides a means to apply AI-based image recognition techniques to analyze high-throughput B-cell sequencing datasets (repertoires) for Ab classification.
Collapse
Affiliation(s)
- Daniel R. Ripoll
- DoD Biotechnology High Performance Computing Software Applications Institute, Telemedicine and Advanced Technology Research Center, U.S. Army Medical Research and Development Command, Fort Detrick, Maryland, United States of America
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Inc. (HJF), Bethesda, Maryland, United States of America
| | - Sidhartha Chaudhury
- DoD Biotechnology High Performance Computing Software Applications Institute, Telemedicine and Advanced Technology Research Center, U.S. Army Medical Research and Development Command, Fort Detrick, Maryland, United States of America
- Center for Enabling Capabilities, Walter Reed Army Institute of Research, Silver Spring, Maryland, United States of America
| | - Anders Wallqvist
- DoD Biotechnology High Performance Computing Software Applications Institute, Telemedicine and Advanced Technology Research Center, U.S. Army Medical Research and Development Command, Fort Detrick, Maryland, United States of America
| |
Collapse
|
29
|
Doepker LE, Danon S, Harkins E, Ralph DK, Yaffe Z, Garrett ME, Dhar A, Wagner C, Stumpf MM, Arenz D, Williams JA, Jaoko W, Mandaliya K, Lee KK, Matsen FA, Overbaugh JM. Development of antibody-dependent cell cytotoxicity function in HIV-1 antibodies. eLife 2021; 10:e63444. [PMID: 33427196 PMCID: PMC7884072 DOI: 10.7554/elife.63444] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2020] [Accepted: 01/08/2021] [Indexed: 11/27/2022] Open
Abstract
A prerequisite for the design of an HIV vaccine that elicits protective antibodies is understanding the developmental pathways that result in desirable antibody features. The development of antibodies that mediate antibody-dependent cellular cytotoxicity (ADCC) is particularly relevant because such antibodies have been associated with HIV protection in humans. We reconstructed the developmental pathways of six human HIV-specific ADCC antibodies using longitudinal antibody sequencing data. Most of the inferred naive antibodies did not mediate detectable ADCC. Gain of antigen binding and ADCC function typically required mutations in complementarity determining regions of one or both chains. Enhancement of ADCC potency often required additional mutations in framework regions. Antigen binding affinity and ADCC activity were correlated, but affinity alone was not sufficient to predict ADCC potency. Thus, elicitation of broadly active ADCC antibodies may require mutations that enable high-affinity antigen recognition along with mutations that optimize factors contributing to functional ADCC activity.
Collapse
Affiliation(s)
- Laura E Doepker
- Human Biology Division, Fred Hutchinson Cancer Research CenterSeattleUnited States
| | - Sonja Danon
- Human Biology Division, Fred Hutchinson Cancer Research CenterSeattleUnited States
| | - Elias Harkins
- Public Health Sciences Division, Fred Hutchinson Cancer Research CenterSeattleUnited States
| | - Duncan K Ralph
- Public Health Sciences Division, Fred Hutchinson Cancer Research CenterSeattleUnited States
| | - Zak Yaffe
- Human Biology Division, Fred Hutchinson Cancer Research CenterSeattleUnited States
- Medical Scientist Training Program, University of Washington School of MedicineSeattleUnited States
| | - Meghan E Garrett
- Human Biology Division, Fred Hutchinson Cancer Research CenterSeattleUnited States
- Molecular and Cellular Biology Graduate Program, University of Washington and Fred Hutchinson Cancer Research CenterSeattleUnited States
| | - Amrit Dhar
- Public Health Sciences Division, Fred Hutchinson Cancer Research CenterSeattleUnited States
- Department of Statistics, University of WashingtonSeattleUnited States
| | - Cassia Wagner
- Medical Scientist Training Program, University of Washington School of MedicineSeattleUnited States
| | - Megan M Stumpf
- Human Biology Division, Fred Hutchinson Cancer Research CenterSeattleUnited States
| | - Dana Arenz
- Human Biology Division, Fred Hutchinson Cancer Research CenterSeattleUnited States
| | - James A Williams
- Department of Medicinal Chemistry, University of WashingtonSeattleUnited States
| | - Walter Jaoko
- Department of Medicinal Microbiology, University of NairobiNairobiKenya
| | - Kishor Mandaliya
- Coast Provincial General Hospital, Women’s Health ProjectMombasaKenya
| | - Kelly K Lee
- Department of Medicinal Chemistry, University of WashingtonSeattleUnited States
| | - Frederick A Matsen
- Public Health Sciences Division, Fred Hutchinson Cancer Research CenterSeattleUnited States
| | - Julie M Overbaugh
- Human Biology Division, Fred Hutchinson Cancer Research CenterSeattleUnited States
- Public Health Sciences Division, Fred Hutchinson Cancer Research CenterSeattleUnited States
| |
Collapse
|
30
|
Ralph DK, Matsen FA. Using B cell receptor lineage structures to predict affinity. PLoS Comput Biol 2020; 16:e1008391. [PMID: 33175831 PMCID: PMC7682889 DOI: 10.1371/journal.pcbi.1008391] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2020] [Revised: 11/23/2020] [Accepted: 08/30/2020] [Indexed: 11/18/2022] Open
Abstract
We are frequently faced with a large collection of antibodies, and want to select those with highest affinity for their cognate antigen. When developing a first-line therapeutic for a novel pathogen, for instance, we might look for such antibodies in patients that have recovered. There exist effective experimental methods of accomplishing this, such as cell sorting and baiting; however they are time consuming and expensive. Next generation sequencing of B cell receptor (BCR) repertoires offers an additional source of sequences that could be tapped if we had a reliable method of selecting those coding for the best antibodies. In this paper we introduce a method that uses evolutionary information from the family of related sequences that share a naive ancestor to predict the affinity of each resulting antibody for its antigen. When combined with information on the identity of the antigen, this method should provide a source of effective new antibodies. We also introduce a method for a related task: given an antibody of interest and its inferred ancestral lineage, which branches in the tree are likely to harbor key affinity-increasing mutations? We evaluate the performance of these methods on a wide variety of simulated samples, as well as two real data samples. These methods are implemented as part of continuing development of the partis BCR inference package, available at https://github.com/psathyrella/partis. Comments Please post comments or questions on this paper as new issues at https://git.io/Jvxkn.
Collapse
Affiliation(s)
- Duncan K. Ralph
- Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
| | | |
Collapse
|
31
|
Abstract
Advances in reading, writing, and editing DNA are providing unprecedented insights into the complexity of immunological systems. This combination of systems and synthetic biology methods is enabling the quantitative and precise understanding of molecular recognition in adaptive immunity, thus providing a framework for reprogramming immune responses for translational medicine. In this review, we will highlight state-of-the-art methods such as immune repertoire sequencing, immunoinformatics, and immunogenomic engineering and their application toward adaptive immunity. We showcase novel and interdisciplinary approaches that have the promise of transforming the design and breadth of molecular and cellular immunotherapies.
Collapse
Affiliation(s)
- Lucia Csepregi
- Department of Biosystems Science and Engineering, ETH Zurich, 4058 Basel, Switzerland
| | - Roy A. Ehling
- Department of Biosystems Science and Engineering, ETH Zurich, 4058 Basel, Switzerland
| | - Bastian Wagner
- Department of Biosystems Science and Engineering, ETH Zurich, 4058 Basel, Switzerland
| | - Sai T. Reddy
- Department of Biosystems Science and Engineering, ETH Zurich, 4058 Basel, Switzerland
| |
Collapse
|
32
|
Zhao F, Joyce C, Burns A, Nogal B, Cottrell CA, Ramos A, Biddle T, Pauthner M, Nedellec R, Qureshi H, Mason R, Landais E, Briney B, Ward AB, Burton DR, Sok D. Mapping Neutralizing Antibody Epitope Specificities to an HIV Env Trimer in Immunized and in Infected Rhesus Macaques. Cell Rep 2020; 32:108122. [PMID: 32905766 PMCID: PMC7487785 DOI: 10.1016/j.celrep.2020.108122] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2020] [Revised: 06/29/2020] [Accepted: 08/18/2020] [Indexed: 01/25/2023] Open
Abstract
BG505 SOSIP is a well-characterized near-native recombinant HIV Envelope (Env) trimer that holds promise as part of a sequential HIV immunogen regimen to induce broadly neutralizing antibodies (bnAbs). Rhesus macaques are considered the most appropriate pre-clinical animal model for monitoring antibody (Ab) responses. Accordingly, we report here the isolation of 45 BG505 autologous neutralizing antibodies (nAbs) with multiple specificities from SOSIP-immunized and BG505 SHIV-infected rhesus macaques. We associate the most potent neutralization with two epitopes: the C3/V5 and V1/V3 regions. We show that all of the nAbs bind in close proximity to known bnAb epitopes and might therefore sterically hinder elicitation of bnAbs. We also identify a "public clonotype" that targets the immunodominant C3/V5 nAb epitope, which suggests that common antibody rearrangements might help determine humoral responses to Env immunogens. The results highlight important considerations for vaccine design in anticipation of results of the BG505 SOSIP trimer in clinical trials.
Collapse
Affiliation(s)
- Fangzhu Zhao
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA 92037, USA; Center for HIV/AIDS Vaccine Development (CHAVD), The Scripps Research Institute, La Jolla, CA 92037, USA; IAVI Neutralizing Antibody Center, The Scripps Research Institute, La Jolla, CA 92037, USA
| | - Collin Joyce
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA 92037, USA; Center for HIV/AIDS Vaccine Development (CHAVD), The Scripps Research Institute, La Jolla, CA 92037, USA; IAVI Neutralizing Antibody Center, The Scripps Research Institute, La Jolla, CA 92037, USA
| | - Alison Burns
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA 92037, USA; Center for HIV/AIDS Vaccine Development (CHAVD), The Scripps Research Institute, La Jolla, CA 92037, USA; IAVI Neutralizing Antibody Center, The Scripps Research Institute, La Jolla, CA 92037, USA
| | - Bartek Nogal
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA 92037, USA
| | - Christopher A Cottrell
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA 92037, USA
| | - Alejandra Ramos
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA 92037, USA; Center for HIV/AIDS Vaccine Development (CHAVD), The Scripps Research Institute, La Jolla, CA 92037, USA; IAVI Neutralizing Antibody Center, The Scripps Research Institute, La Jolla, CA 92037, USA; IAVI, New York, NY 10004, USA
| | - Trevor Biddle
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA 92037, USA; Center for HIV/AIDS Vaccine Development (CHAVD), The Scripps Research Institute, La Jolla, CA 92037, USA; IAVI Neutralizing Antibody Center, The Scripps Research Institute, La Jolla, CA 92037, USA
| | - Matthias Pauthner
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA 92037, USA; Center for HIV/AIDS Vaccine Development (CHAVD), The Scripps Research Institute, La Jolla, CA 92037, USA; IAVI Neutralizing Antibody Center, The Scripps Research Institute, La Jolla, CA 92037, USA
| | - Rebecca Nedellec
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA 92037, USA; Center for HIV/AIDS Vaccine Development (CHAVD), The Scripps Research Institute, La Jolla, CA 92037, USA; IAVI Neutralizing Antibody Center, The Scripps Research Institute, La Jolla, CA 92037, USA
| | - Huma Qureshi
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA 92037, USA; Center for HIV/AIDS Vaccine Development (CHAVD), The Scripps Research Institute, La Jolla, CA 92037, USA; IAVI Neutralizing Antibody Center, The Scripps Research Institute, La Jolla, CA 92037, USA; IAVI, New York, NY 10004, USA
| | - Rosemarie Mason
- Vaccine Research Center, National Institutes of Health, Bethesda, MD 20892, USA
| | - Elise Landais
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA 92037, USA; Center for HIV/AIDS Vaccine Development (CHAVD), The Scripps Research Institute, La Jolla, CA 92037, USA; IAVI Neutralizing Antibody Center, The Scripps Research Institute, La Jolla, CA 92037, USA; IAVI, New York, NY 10004, USA
| | - Bryan Briney
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA 92037, USA; Center for HIV/AIDS Vaccine Development (CHAVD), The Scripps Research Institute, La Jolla, CA 92037, USA; Center for Viral Systems Biology, The Scripps Research Institute, La Jolla, CA 92037, USA
| | - Andrew B Ward
- Center for HIV/AIDS Vaccine Development (CHAVD), The Scripps Research Institute, La Jolla, CA 92037, USA; IAVI Neutralizing Antibody Center, The Scripps Research Institute, La Jolla, CA 92037, USA; Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA 92037, USA
| | - Dennis R Burton
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA 92037, USA; Center for HIV/AIDS Vaccine Development (CHAVD), The Scripps Research Institute, La Jolla, CA 92037, USA; IAVI Neutralizing Antibody Center, The Scripps Research Institute, La Jolla, CA 92037, USA; Ragon Institute of Massachusetts General Hospital, Massachusetts Institute of Technology, and Harvard, Cambridge, MA 02114, USA.
| | - Devin Sok
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA 92037, USA; Center for HIV/AIDS Vaccine Development (CHAVD), The Scripps Research Institute, La Jolla, CA 92037, USA; IAVI Neutralizing Antibody Center, The Scripps Research Institute, La Jolla, CA 92037, USA; IAVI, New York, NY 10004, USA.
| |
Collapse
|
33
|
Dhar A, Ralph DK, Minin VN, Matsen FA. A Bayesian phylogenetic hidden Markov model for B cell receptor sequence analysis. PLoS Comput Biol 2020; 16:e1008030. [PMID: 32804924 PMCID: PMC7451993 DOI: 10.1371/journal.pcbi.1008030] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2019] [Revised: 08/27/2020] [Accepted: 06/08/2020] [Indexed: 11/24/2022] Open
Abstract
The human body generates a diverse set of high affinity antibodies, the soluble form of B cell receptors (BCRs), that bind to and neutralize invading pathogens. The natural development of BCRs must be understood in order to design vaccines for highly mutable pathogens such as influenza and HIV. BCR diversity is induced by naturally occurring combinatorial "V(D)J" rearrangement, mutation, and selection processes. Most current methods for BCR sequence analysis focus on separately modeling the above processes. Statistical phylogenetic methods are often used to model the mutational dynamics of BCR sequence data, but these techniques do not consider all the complexities associated with B cell diversification such as the V(D)J rearrangement process. In particular, standard phylogenetic approaches assume the DNA bases of the progenitor (or "naive") sequence arise independently and according to the same distribution, ignoring the complexities of V(D)J rearrangement. In this paper, we introduce a novel approach to Bayesian phylogenetic inference for BCR sequences that is based on a phylogenetic hidden Markov model (phylo-HMM). This technique not only integrates a naive rearrangement model with a phylogenetic model for BCR sequence evolution but also naturally accounts for uncertainty in all unobserved variables, including the phylogenetic tree, via posterior distribution sampling.
Collapse
MESH Headings
- Bayes Theorem
- Computational Biology
- Gene Rearrangement, B-Lymphocyte/genetics
- Humans
- Markov Chains
- Models, Genetic
- Phylogeny
- Receptors, Antigen, B-Cell/classification
- Receptors, Antigen, B-Cell/genetics
- Receptors, Antigen, B-Cell/immunology
- Sequence Analysis, DNA/methods
- Somatic Hypermutation, Immunoglobulin/genetics
- Vaccines
Collapse
Affiliation(s)
- Amrit Dhar
- Department of Statistics, University of Washington, Seattle, Washington, United States of America
- Computational Biology, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
| | - Duncan K. Ralph
- Computational Biology, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
| | - Vladimir N. Minin
- Department of Statistics, University of California, Irvine, California, United States of America
| | - Frederick A. Matsen
- Computational Biology, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
| |
Collapse
|
34
|
Foglierini M, Pappas L, Lanzavecchia A, Corti D, Perez L. AncesTree: An interactive immunoglobulin lineage tree visualizer. PLoS Comput Biol 2020; 16:e1007731. [PMID: 32649725 PMCID: PMC7375605 DOI: 10.1371/journal.pcbi.1007731] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2020] [Revised: 07/22/2020] [Accepted: 06/14/2020] [Indexed: 12/22/2022] Open
Abstract
High-throughput sequencing of human immunoglobulin genes allows analysis of antibody repertoires and the reconstruction of clonal lineage evolution. The study of antibodies (Abs) affinity maturation is of specific interest to understand the generation of Abs with high affinity or broadly neutralizing activities. Moreover, phylogenic analysis enables the identification of the key somatic mutations required to achieve optimal antigen binding. The Immcantation framework provides a start-to-finish set of analytical methods for high-throughput adaptive immune receptor repertoire sequencing (AIRR-Seq; Rep-Seq) data. Furthermore, Immcantation's Change-O package has developed IgPhyML, an algorithm designed to build specifically immunoglobulin (Ig) phylogenic trees. Meanwhile Phylip, an algorithm that has been originally developed for applications in ecology and macroevolution, can also be used for the phylogenic reconstruction of antibodies maturation pathway. To complement Ig lineages made by IgPhyML or Dnaml (Phylip), we developed AncesTree, a graphic user interface (GUI) that aims to give researchers the opportunity to interactively explore antibodies clonal evolution. AncesTree displays interactive immunoglobulins phylogenic tree, Ig related mutations and sequence alignments using additional information coming from specialized antibody tools. The GUI is a Java standalone application allowing interaction with Ig tree that can run under Windows, Linux and Mac OS.
Collapse
Affiliation(s)
- Mathilde Foglierini
- Università della Svizzera italiana, Faculty of Biomedical Sciences, Institute for Research in Biomedicine, Bellinzona, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Leontios Pappas
- Università della Svizzera italiana, Faculty of Biomedical Sciences, Institute for Research in Biomedicine, Bellinzona, Switzerland
| | - Antonio Lanzavecchia
- Università della Svizzera italiana, Faculty of Biomedical Sciences, Institute for Research in Biomedicine, Bellinzona, Switzerland
| | - Davide Corti
- Humabs Biomed SA, Vir Biotechnology, Bellinzona, Switzerland
| | - Laurent Perez
- Università della Svizzera italiana, Faculty of Biomedical Sciences, Institute for Research in Biomedicine, Bellinzona, Switzerland
| |
Collapse
|
35
|
Nouri N, Kleinstein SH. Somatic hypermutation analysis for improved identification of B cell clonal families from next-generation sequencing data. PLoS Comput Biol 2020; 16:e1007977. [PMID: 32574157 PMCID: PMC7347241 DOI: 10.1371/journal.pcbi.1007977] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2019] [Revised: 07/09/2020] [Accepted: 05/21/2020] [Indexed: 01/11/2023] Open
Abstract
Adaptive immune receptor repertoire sequencing (AIRR-Seq) offers the possibility of identifying and tracking B cell clonal expansions during adaptive immune responses. Members of a B cell clone are descended from a common ancestor and share the same initial V(D)J rearrangement, but their B cell receptor (BCR) sequence may differ due to the accumulation of somatic hypermutations (SHMs). Clonal relationships are learned from AIRR-seq data by analyzing the BCR sequence, with the most common methods focused on the highly diverse junction region. However, clonally related cells often share SHMs which have been accumulated during affinity maturation. Here, we investigate whether shared SHMs in the V and J segments of the BCR can be leveraged along with the junction sequence to improve the ability to identify clonally related sequences. We develop independent distance functions that capture junction similarity and shared mutations, and combine these in a spectral clustering framework to infer the BCR clonal relationships. Using both simulated and experimental data, we show that this model improves both the sensitivity and specificity for identifying B cell clones. Source code for this method is freely available in the SCOPer (Spectral Clustering for clOne Partitioning) R package (version 0.2 or newer) in the Immcantation framework: www.immcantation.org under the AGPLv3 license.
Collapse
Affiliation(s)
- Nima Nouri
- Department of Pathology, Yale School of Medicine, New Haven, Connecticut, United States of America
- Center for Medical Informatics, Yale School of Medicine, New Haven, Connecticut, United States of America
| | - Steven H. Kleinstein
- Department of Pathology, Yale School of Medicine, New Haven, Connecticut, United States of America
- Center for Medical Informatics, Yale School of Medicine, New Haven, Connecticut, United States of America
- Interdepartmental Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, United States of America
| |
Collapse
|
36
|
Inter- and intraspecies comparison of phylogenetic fingerprints and sequence diversity of immunoglobulin variable genes. Immunogenetics 2020; 72:279-294. [PMID: 32367185 DOI: 10.1007/s00251-020-01164-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2020] [Accepted: 04/13/2020] [Indexed: 10/24/2022]
Abstract
Protection and neutralization of a vast array of pathogens is accomplished by the tremendous diversity of the B cell receptor (BCR) repertoire. For jawed vertebrates, this diversity is initiated via the somatic recombination of immunoglobulin (Ig) germline elements. While it is clear that the number of these germline segments differs from species to species, the extent of cross-species sequence diversity remains largely uncharacterized. Here we use extensive computational and statistical methods to investigate the sequence diversity and evolutionary relationship between Ig variable (V), diversity (D), and joining (J) germline segments across nine commonly studied species ranging from zebrafish to human. Metrics such as guanine-cytosine (GC) content showed low redundancy across Ig germline genes within a given species. Other comparisons, including amino acid motifs, evolutionary selection, and sequence diversity, revealed species-specific properties. Additionally, we showed that the germline-encoded diversity differs across antibody (recombined V-D-J) repertoires of various B cell subsets. To facilitate future comparative immunogenomics analysis, we created VDJgermlines, an R package that contains the germline sequences from multiple species. Our study informs strategies for the humanization and engineering of therapeutic antibodies.
Collapse
|
37
|
Complete characterization of the human immune cell transcriptome using accurate full-length cDNA sequencing. Genome Res 2020; 30:589-601. [PMID: 32312742 PMCID: PMC7197476 DOI: 10.1101/gr.257188.119] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2019] [Accepted: 04/03/2020] [Indexed: 01/06/2023]
Abstract
The human immune system relies on highly complex and diverse transcripts and the proteins they encode. These include transcripts encoding human leukocyte antigen (HLA) receptors as well as B cell and T cell receptors (BCR and TCR). Determining which alleles an individual possesses for each HLA gene (high-resolution HLA typing) is essential to establish donor–recipient compatibility in organ and bone marrow transplantations. In turn, the repertoires of millions of unique BCR and TCR transcripts in each individual carry a vast amount of health-relevant information. Both short-read RNA-seq-based HLA typing and BCR/TCR repertoire sequencing (AIRR-seq) currently rely on our incomplete knowledge of the genetic diversity at HLA and BCR/TCR loci. Here, we generated over 10,000,000 full-length cDNA sequences at a median accuracy of 97.9% using our nanopore sequencing-based Rolling Circle Amplification to Concatemeric Consensus (R2C2) protocol. We used this data set to (1) show that deep and accurate full-length cDNA sequencing can be used to provide isoform-level transcriptome analysis for more than 9000 loci, (2) generate accurate sequences of HLA alleles, and (3) extract detailed AIRR data for the analysis of the adaptive immune system. The HLA and AIRR analysis approaches we introduce here are untargeted and therefore do not require prior knowledge of the composition or genetic diversity of HLA and BCR/TCR loci.
Collapse
|
38
|
Doepker LE, Simonich CA, Ralph D, Shipley MM, Garrett M, Gobillot T, Vigdorovich V, Sather DN, Nduati R, Matsen FA, Overbaugh JM. Diversity and Function of Maternal HIV-1-Specific Antibodies at the Time of Vertical Transmission. J Virol 2020; 94:e01594-19. [PMID: 32075936 PMCID: PMC7163126 DOI: 10.1128/jvi.01594-19] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2019] [Accepted: 02/08/2020] [Indexed: 12/21/2022] Open
Abstract
Infants of HIV-positive mothers can acquire HIV infection by various routes, but even in the absence of antiviral treatment, the majority of these infants do not become infected. There is evidence that maternal antibodies provide some protection from infection, but gestational maternal antibodies have not yet been characterized in detail. One of the most studied vertically infected infants is BG505, as the virus from this infant yielded an Envelope protein that was successfully developed as a stable trimer. Here, we isolated and characterized 39 HIV-specific neutralizing monoclonal antibodies (nAbs) from MG505, the mother of BG505, at a time point just prior to vertical transmission. These nAbs belonged to 21 clonal families and employed a variety of VH genes. Many were specific for the HIV-1 Env V3 loop, and this V3 specificity correlated with measurable antibody-dependent cellular cytotoxicity (ADCC) activity. The isolated nAbs did not recapitulate the full breadth of heterologous or autologous virus neutralization by contemporaneous plasma. Notably, we found that the V3-targeting nAb families neutralized one particular maternal Env variant, even though all tested variants had low V3 sequence diversity and were measurably bound by these nAbs. None of the nAbs neutralized BG505 transmitted virus. Furthermore, the MG505 nAb families were found at relatively low frequencies within the maternal B cell repertoire; all were less than 0.25% of total IgG sequences. Our findings illustrate an example of the diversity of HIV-1 nAbs within one mother, cumulatively resulting in a collection of antibody specificities that can contribute to the transmission bottleneck.IMPORTANCE Mother-to-child-transmission of HIV-1 offers a unique setting in which maternal antibodies both within the mother and passively transferred to the infant are present at the time of viral exposure. Untreated HIV-exposed human infants are infected at a rate of 30 to 40%, meaning that some infants do not get infected despite continued exposure to virus. Since the potential of HIV-specific immune responses to provide protection against HIV is a central goal of HIV vaccine design, understanding the nature of maternal antibodies may provide insights into immune mechanisms of protection. In this study, we isolated and characterized HIV-specific antibodies from the mother of an infant whose transmitted virus has been well studied.
Collapse
Affiliation(s)
- Laura E Doepker
- Human Biology Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
| | - Cassandra A Simonich
- Human Biology Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
- Medical Scientist Training Program, University of Washington, Seattle, Washington, USA
| | - Duncan Ralph
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
| | - Mackenzie M Shipley
- Human Biology Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
| | - Meghan Garrett
- Human Biology Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
- Molecular and Cellular Biology Graduate Program, University of Washington, Seattle, Washington, USA
| | - Theodore Gobillot
- Human Biology Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
- Medical Scientist Training Program, University of Washington, Seattle, Washington, USA
| | - Vladimir Vigdorovich
- Center for Global Infectious Disease Research, Seattle Children's Research Institute, Seattle, Washington, USA
| | - D Noah Sather
- Center for Global Infectious Disease Research, Seattle Children's Research Institute, Seattle, Washington, USA
- Department of Global Health, University of Washington, Seattle, Washington, USA
| | - Ruth Nduati
- Department of Pediatrics and Child Health, University of Nairobi, Nairobi, Kenya
| | - Frederick A Matsen
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
| | - Julie M Overbaugh
- Human Biology Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
| |
Collapse
|
39
|
Puelma Touzel M, Walczak AM, Mora T. Inferring the immune response from repertoire sequencing. PLoS Comput Biol 2020; 16:e1007873. [PMID: 32348312 PMCID: PMC7213749 DOI: 10.1371/journal.pcbi.1007873] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2019] [Revised: 05/11/2020] [Accepted: 04/14/2020] [Indexed: 11/19/2022] Open
Abstract
High-throughput sequencing of B- and T-cell receptors makes it possible to track immune repertoires across time, in different tissues, and in acute and chronic diseases or in healthy individuals. However, quantitative comparison between repertoires is confounded by variability in the read count of each receptor clonotype due to sampling, library preparation, and expression noise. Here, we present a general Bayesian approach to disentangle repertoire variations from these stochastic effects. Using replicate experiments, we first show how to learn the natural variability of read counts by inferring the distributions of clone sizes as well as an explicit noise model relating true frequencies of clones to their read count. We then use that null model as a baseline to infer a model of clonal expansion from two repertoire time points taken before and after an immune challenge. Applying our approach to yellow fever vaccination as a model of acute infection in humans, we identify candidate clones participating in the response.
Collapse
Affiliation(s)
- Maximilian Puelma Touzel
- Laboratoire de physique de l’École normale supérieure (PSL University), CNRS, Sorbonne Université, Université de Paris, Paris, France
- Mila, Université de Montréal, Montreal, Canada
| | - Aleksandra M. Walczak
- Laboratoire de physique de l’École normale supérieure (PSL University), CNRS, Sorbonne Université, Université de Paris, Paris, France
| | - Thierry Mora
- Laboratoire de physique de l’École normale supérieure (PSL University), CNRS, Sorbonne Université, Université de Paris, Paris, France
| |
Collapse
|
40
|
Olson BJ, Moghimi P, Schramm CA, Obraztsova A, Ralph D, Vander Heiden JA, Shugay M, Shepherd AJ, Lees W, Matsen FA. sumrep: A Summary Statistic Framework for Immune Receptor Repertoire Comparison and Model Validation. Front Immunol 2019; 10:2533. [PMID: 31736960 PMCID: PMC6838214 DOI: 10.3389/fimmu.2019.02533] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2019] [Accepted: 10/11/2019] [Indexed: 12/28/2022] Open
Abstract
The adaptive immune system generates an incredible diversity of antigen receptors for B and T cells to keep dangerous pathogens at bay. The DNA sequences coding for these receptors arise by a complex recombination process followed by a series of productivity-based filters, as well as affinity maturation for B cells, giving considerable diversity to the circulating pool of receptor sequences. Although these datasets hold considerable promise for medical and public health applications, the complex structure of the resulting adaptive immune receptor repertoire sequencing (AIRR-seq) datasets makes analysis difficult. In this paper we introduce sumrep, an R package that efficiently performs a wide variety of repertoire summaries and comparisons, and show how sumrep can be used to perform model validation. We find that summaries vary in their ability to differentiate between datasets, although many are able to distinguish between covariates such as donor, timepoint, and cell type for BCR and TCR repertoires. We show that deletion and insertion lengths resulting from V(D)J recombination tend to be more discriminative characterizations of a repertoire than summaries that describe the amino acid composition of the CDR3 region. We also find that state-of-the-art generative models excel at recapitulating gene usage and recombination statistics in a given experimental repertoire, but struggle to capture many physiochemical properties of real repertoires.
Collapse
Affiliation(s)
- Branden J Olson
- Fred Hutchinson Cancer Research Center, Seattle, WA, United States.,Department of Statistics, University of Washington, Seattle, WA, United States
| | - Pejvak Moghimi
- Department of Biological Sciences, Institute of Structural and Molecular Biology, Birkbeck, University of London, London, United Kingdom
| | - Chaim A Schramm
- Vaccine Research Center, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, United States
| | - Anna Obraztsova
- Center of Life Sciences, Skolkovo Institute of Science and Technology, Moscow, Russia.,Genomics of Adaptive Immunity Department, Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry, Moscow, Russia
| | - Duncan Ralph
- Fred Hutchinson Cancer Research Center, Seattle, WA, United States
| | - Jason A Vander Heiden
- Department of Bioinformatics and Computational Biology, Genentech, Inc., South San Francisco, CA, United States
| | - Mikhail Shugay
- Center of Life Sciences, Skolkovo Institute of Science and Technology, Moscow, Russia.,Genomics of Adaptive Immunity Department, Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry, Moscow, Russia.,Department of Molecular Technologies, Pirogov Russian National Research Medical University, Moscow, Russia
| | - Adrian J Shepherd
- Department of Biological Sciences, Institute of Structural and Molecular Biology, Birkbeck, University of London, London, United Kingdom
| | - William Lees
- Department of Biological Sciences, Institute of Structural and Molecular Biology, Birkbeck, University of London, London, United Kingdom
| | | |
Collapse
|
41
|
Nouri N, Kleinstein SH. A spectral clustering-based method for identifying clones from high-throughput B cell repertoire sequencing data. Bioinformatics 2019; 34:i341-i349. [PMID: 29949968 PMCID: PMC6022594 DOI: 10.1093/bioinformatics/bty235] [Citation(s) in RCA: 54] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
Motivation B cells derive their antigen-specificity through the expression of Immunoglobulin (Ig) receptors on their surface. These receptors are initially generated stochastically by somatic re-arrangement of the DNA and further diversified following antigen-activation by a process of somatic hypermutation, which introduces mainly point substitutions into the receptor DNA at a high rate. Recent advances in next-generation sequencing have enabled large-scale profiling of the B cell Ig repertoire from blood and tissue samples. A key computational challenge in the analysis of these data is partitioning the sequences to identify descendants of a common B cell (i.e. a clone). Current methods group sequences using a fixed distance threshold, or a likelihood calculation that is computationally-intensive. Here, we propose a new method based on spectral clustering with an adaptive threshold to determine the local sequence neighborhood. Validation using simulated and experimental datasets demonstrates that this method has high sensitivity and specificity compared to a fixed threshold that is optimized for these measures. In addition, this method works on datasets where choosing an optimal fixed threshold is difficult and is more computationally efficient in all cases. The ability to quickly and accurately identify members of a clone from repertoire sequencing data will greatly improve downstream analyses. Clonally-related sequences cannot be treated independently in statistical models, and clonal partitions are used as the basis for the calculation of diversity metrics, lineage reconstruction and selection analysis. Thus, the spectral clustering-based method here represents an important contribution to repertoire analysis. Availability and implementation Source code for this method is freely available in the SCOPe (Spectral Clustering for clOne Partitioning) R package in the Immcantation framework: www.immcantation.org under the CC BY-SA 4.0 license. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Nima Nouri
- Department of Pathology, Yale School of Medicine, New Haven, CT, USA
| | - Steven H Kleinstein
- Department of Pathology, Yale School of Medicine, New Haven, CT, USA.,Interdepartmental Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA
| |
Collapse
|
42
|
Zhou JQ, Kleinstein SH. Cutting Edge: Ig H Chains Are Sufficient to Determine Most B Cell Clonal Relationships. THE JOURNAL OF IMMUNOLOGY 2019; 203:1687-1692. [PMID: 31484734 DOI: 10.4049/jimmunol.1900666] [Citation(s) in RCA: 40] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/18/2019] [Accepted: 08/02/2019] [Indexed: 01/10/2023]
Abstract
B cell clonal expansion is vital for adaptive immunity. High-throughput BCR sequencing enables investigating this process but requires computational inference to identify clonal relationships. This inference usually relies on only the BCR H chain, as most current protocols do not preserve H:L chain pairing. The extent to which paired L chains aids inference is unknown. Using human single-cell paired BCR datasets, we assessed the ability of H chain-based clonal clustering to identify clones. Of the expanded clones identified, <20% grouped cells expressing inconsistent L chains. H chains from these misclustered clones contained more distant junction sequences and shared fewer V segment mutations than the accurate clones. This suggests that additional H chain information could be leveraged to refine clonal relationships. Conversely, L chains were insufficient to refine H chain-based clonal clusters. Overall, the BCR H chain alone is sufficient to identify clonal relationships with confidence.
Collapse
Affiliation(s)
- Julian Q Zhou
- Interdepartmental Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06511
| | - Steven H Kleinstein
- Interdepartmental Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06511; .,Department of Pathology, Yale School of Medicine, New Haven, CT 06520; and.,Department of Immunobiology, Yale School of Medicine, New Haven, CT 06520
| |
Collapse
|
43
|
Davis MM, Boyd SD. Recent progress in the analysis of αβT cell and B cell receptor repertoires. Curr Opin Immunol 2019; 59:109-114. [PMID: 31326777 PMCID: PMC7075470 DOI: 10.1016/j.coi.2019.05.012] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2019] [Accepted: 05/28/2019] [Indexed: 01/10/2023]
Abstract
T cell receptors (TCRs) and B cell receptors (BCRs) are vertebrate evolution's best answer to the threat of microbial pathogens that can evolve much faster than ourselves. These antigen receptors are generated during T cell or B cell development by combinatorial rearrangement of germline genome V, D and J gene segments, and with junctional residues capable of enormous diversity. For decades the complexity of these receptor repertoires has limited their analysis, but advances in DNA sequencing technology and an array of complementary tools have now made their study much more tractable, filling a major gap in our ability to understand immunology as a system. Here, we summarize the recent approaches and discoveries that are enabling these advances, with some suggestions as to what may lie ahead.
Collapse
Affiliation(s)
- Mark M Davis
- Institute for Immunity, Transplantation, and Infection, Stanford University School of Medicine, Stanford, CA, USA; Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, CA, USA; The Howard Hughes Medical Institute, Chevy Chase, MD, USA.
| | - Scott D Boyd
- Institute for Immunity, Transplantation, and Infection, Stanford University School of Medicine, Stanford, CA, USA; The Sean N. Parker Center for Allergy and Asthma Research at Stanford University, Stanford, CA, USA; Department of Pathology, Stanford University School of Medicine, Stanford, CA, USA
| |
Collapse
|
44
|
Ralph DK, Matsen FA. Per-sample immunoglobulin germline inference from B cell receptor deep sequencing data. PLoS Comput Biol 2019; 15:e1007133. [PMID: 31329576 PMCID: PMC6675132 DOI: 10.1371/journal.pcbi.1007133] [Citation(s) in RCA: 43] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2017] [Revised: 08/01/2019] [Accepted: 05/28/2019] [Indexed: 11/26/2022] Open
Abstract
The collection of immunoglobulin genes in an individual's germline, which gives rise to B cell receptors via recombination, is known to vary significantly across individuals. In humans, for example, each individual has only a fraction of the several hundred known V alleles. Furthermore, the currently-accepted set of known V alleles is both incomplete (particularly for non-European samples), and contains a significant number of spurious alleles. The resulting uncertainty as to which immunoglobulin alleles are present in any given sample results in inaccurate B cell receptor sequence annotations, and in particular inaccurate inferred naive ancestors. In this paper we first show that the currently widespread practice of aligning each sequence to its closest match in the full set of IMGT alleles results in a very large number of spurious alleles that are not in the sample's true set of germline V alleles. We then describe a new method for inferring each individual's germline gene set from deep sequencing data, and show that it improves upon existing methods by making a detailed comparison on a variety of simulated and real data samples. This new method has been integrated into the partis annotation and clonal family inference package, available at https://github.com/psathyrella/partis, and is run by default without affecting overall run time.
Collapse
Affiliation(s)
- Duncan K. Ralph
- Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
| | - Frederick A. Matsen
- Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
| |
Collapse
|
45
|
Feng J, Shaw DA, Minin VN, Simon N, Matsen FA. Survival analysis of DNA mutation motifs with penalized proportional hazards. Ann Appl Stat 2019; 13:1268-1294. [PMID: 33214798 PMCID: PMC7673484 DOI: 10.1214/18-aoas1233] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
Antibodies, an essential part of our immune system, develop through an intricate process to bind a wide array of pathogens. This process involves randomly mutating DNA sequences encoding these antibodies to find variants with improved binding, though mutations are not distributed uniformly across sequence sites. Immunologists observe this nonuniformity to be consistent with "mutation motifs", which are short DNA subsequences that affect how likely a given site is to experience a mutation. Quantifying the effect of motifs on mutation rates is challenging: a large number of possible motifs makes this statistical problem high dimensional, while the unobserved history of the mutation process leads to a nontrivial missing data problem. We introduce an ℓ 1-penalized proportional hazards model to infer mutation motifs and their effects. In order to estimate model parameters, our method uses a Monte Carlo EM algorithm to marginalize over the unknown ordering of mutations. We show that our method performs better on simulated data compared to current methods and leads to more parsimonious models. The application of proportional hazards to mutation processes is, to our knowledge, novel and formalizes the current methods in a statistical framework that can be easily extended to analyze the effect of other biological features on mutation rates.
Collapse
Affiliation(s)
- Jean Feng
- Department of Biostatistics, University of Washington Seattle, WA, USA
| | - David A. Shaw
- Computational Biology Program, Fred Hutchinson Cancer Research Center Seattle, WA, USA
| | - Vladimir N. Minin
- Department of Statistics, University of California, Irvine, Irvine, CA, USA
| | - Noah Simon
- Department of Biostatistics, University of Washington Seattle, WA, USA
| | - Frederick A. Matsen
- Computational Biology Program, Fred Hutchinson Cancer Research Center Seattle, WA, USA
| |
Collapse
|
46
|
Simonich CA, Doepker L, Ralph D, Williams JA, Dhar A, Yaffe Z, Gentles L, Small CT, Oliver B, Vigdorovich V, Mangala Prasad V, Nduati R, Sather DN, Lee KK, Matsen Iv FA, Overbaugh J. Kappa chain maturation helps drive rapid development of an infant HIV-1 broadly neutralizing antibody lineage. Nat Commun 2019; 10:2190. [PMID: 31097697 PMCID: PMC6522554 DOI: 10.1038/s41467-019-09481-7] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2018] [Accepted: 03/08/2019] [Indexed: 12/14/2022] Open
Abstract
HIV-infected infants develop broadly neutralizing plasma responses with more rapid kinetics than adults, suggesting the ontogeny of infant responses could better inform a path to achievable vaccine targets. Here we reconstruct the developmental lineage of BF520.1, an infant-derived HIV-specific broadly neutralizing antibody (bnAb), using computational methods developed specifically for this purpose. We find that the BF520.1 inferred naive precursor binds HIV Env. We also show that heterologous cross-clade neutralizing activity evolved in the infant within six months of infection and that, ultimately, only 2% SHM is needed to achieve the full breadth of the mature antibody. Mutagenesis and structural analyses reveal that, for this infant bnAb, substitutions in the kappa chain were critical for activity, particularly in CDRL1. Overall, the developmental pathway of this infant antibody includes features distinct from adult antibodies, including several that may be amenable to better vaccine responses.
Collapse
Affiliation(s)
- Cassandra A Simonich
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, WA, 98109, USA
- Medical Scientist Training Program, University of Washington School of Medicine, Seattle, WA, 98195, USA
| | - Laura Doepker
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, WA, 98109, USA
| | - Duncan Ralph
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA, 98109, USA
| | - James A Williams
- Department of Medicinal Chemistry, University of Washington, Seattle, WA, 98195, USA
| | - Amrit Dhar
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA, 98109, USA
- Department of Statistics, University of Washington, Seattle, WA, 98195, USA
| | - Zak Yaffe
- Medical Scientist Training Program, University of Washington School of Medicine, Seattle, WA, 98195, USA
| | - Lauren Gentles
- Department of Microbiology, University of Washington, Seattle, WA, 98195, USA
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, 98109, USA
| | - Christopher T Small
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA, 98109, USA
| | - Brian Oliver
- Center for Infectious Disease Research, Seattle, WA, 98109, USA
| | | | - Vidya Mangala Prasad
- Department of Medicinal Chemistry, University of Washington, Seattle, WA, 98195, USA
| | - Ruth Nduati
- Department of Pediatrics and Child Health, University of Nairobi, Nairobi, Kenya
| | - D Noah Sather
- Center for Infectious Disease Research, Seattle, WA, 98109, USA
| | - Kelly K Lee
- Department of Medicinal Chemistry, University of Washington, Seattle, WA, 98195, USA
| | - Frederick A Matsen Iv
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA, 98109, USA
| | - Julie Overbaugh
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, WA, 98109, USA.
- Medical Scientist Training Program, University of Washington School of Medicine, Seattle, WA, 98195, USA.
| |
Collapse
|
47
|
López-Santibáñez-Jácome L, Avendaño-Vázquez SE, Flores-Jasso CF. The Pipeline Repertoire for Ig-Seq Analysis. Front Immunol 2019; 10:899. [PMID: 31114573 PMCID: PMC6503734 DOI: 10.3389/fimmu.2019.00899] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2018] [Accepted: 04/08/2019] [Indexed: 11/22/2022] Open
Abstract
With the advent of high-throughput sequencing of immunoglobulin genes (Ig-Seq), the understanding of antibody repertoires and their dynamics among individuals and populations has become an exciting area of research. There is an increasing number of computational tools that aid in every step of the immune repertoire characterization. However, since not all tools function identically, every pipeline has its unique rationale and capabilities, creating a rich blend of useful features that may appear intimidating for newcomer laboratories with the desire to plunge into immune repertoire analysis to expand and improve their research; hence, all pipeline strengths and differences may not seem evident. In this review we provide a practical and organized list of the current set of computational tools, focusing on their most attractive features and differences in order to carry out the characterization of antibody repertoires so that the reader better decides a strategic approach for the experimental design, and computational pathways for the analyses of immune repertoires.
Collapse
Affiliation(s)
- Laura López-Santibáñez-Jácome
- Consorcio de Metabolismo de RNA, Instituto Nacional de Medicina Genómica, Mexico City, Mexico
- Maestría en Ciencia de Datos, Instituto Tecnológico Autónomo de México, Mexico City, Mexico
| | | | | |
Collapse
|
48
|
Corrie BD, Marthandan N, Zimonja B, Jaglale J, Zhou Y, Barr E, Knoetze N, Breden FMW, Christley S, Scott JK, Cowell LG, Breden F. iReceptor: A platform for querying and analyzing antibody/B-cell and T-cell receptor repertoire data across federated repositories. Immunol Rev 2019; 284:24-41. [PMID: 29944754 DOI: 10.1111/imr.12666] [Citation(s) in RCA: 107] [Impact Index Per Article: 17.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Next-generation sequencing allows the characterization of the adaptive immune receptor repertoire (AIRR) in exquisite detail. These large-scale AIRR-seq data sets have rapidly become critical to vaccine development, understanding the immune response in autoimmune and infectious disease, and monitoring novel therapeutics against cancer. However, at present there is no easy way to compare these AIRR-seq data sets across studies and institutions. The ability to combine and compare information for different disease conditions will greatly enhance the value of AIRR-seq data for improving biomedical research and patient care. The iReceptor Data Integration Platform (gateway.ireceptor.org) provides one implementation of the AIRR Data Commons envisioned by the AIRR Community (airr-community.org), an initiative that is developing protocols to facilitate sharing and comparing AIRR-seq data. The iReceptor Scientific Gateway links distributed (federated) AIRR-seq repositories, allowing sequence searches or metadata queries across multiple studies at multiple institutions, returning sets of sequences fulfilling specific criteria. We present a review of the development of iReceptor, and how it fits in with the general trend toward sharing genomic and health data, and the development of standards for describing and reporting AIRR-seq data. Researchers interested in integrating their repositories of AIRR-seq data into the iReceptor Platform are invited to contact support@ireceptor.org.
Collapse
Affiliation(s)
- Brian D Corrie
- The IRMACS Centre, Simon Fraser University, Burnaby, BC, Canada
| | - Nishanth Marthandan
- The IRMACS Centre, Simon Fraser University, Burnaby, BC, Canada.,Deptartment of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, BC, Canada
| | - Bojan Zimonja
- The IRMACS Centre, Simon Fraser University, Burnaby, BC, Canada
| | - Jerome Jaglale
- The IRMACS Centre, Simon Fraser University, Burnaby, BC, Canada
| | - Yang Zhou
- The IRMACS Centre, Simon Fraser University, Burnaby, BC, Canada
| | - Emily Barr
- The IRMACS Centre, Simon Fraser University, Burnaby, BC, Canada
| | - Nicole Knoetze
- The IRMACS Centre, Simon Fraser University, Burnaby, BC, Canada
| | | | - Scott Christley
- Department of Clinical Sciences, University of Texas Southwestern Medical Center, Dallas, TX, USA
| | - Jamie K Scott
- Deptartment of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, BC, Canada.,Faculty of Health Sciences, Simon Fraser University, Burnaby, BC, Canada
| | - Lindsay G Cowell
- Department of Clinical Sciences, University of Texas Southwestern Medical Center, Dallas, TX, USA
| | - Felix Breden
- The IRMACS Centre, Simon Fraser University, Burnaby, BC, Canada.,Department of Biological Sciences, Simon Fraser University, Burnaby, BC, Canada
| |
Collapse
|
49
|
DeWitt WS, Mesin L, Victora GD, Minin VN, Matsen FA. Using Genotype Abundance to Improve Phylogenetic Inference. Mol Biol Evol 2019; 35:1253-1265. [PMID: 29474671 PMCID: PMC5913685 DOI: 10.1093/molbev/msy020] [Citation(s) in RCA: 41] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Modern biological techniques enable very dense genetic sampling of unfolding evolutionary histories, and thus frequently sample some genotypes multiple times. This motivates strategies to incorporate genotype abundance information in phylogenetic inference. In this article, we synthesize a stochastic process model with standard sequence-based phylogenetic optimality, and show that tree estimation is substantially improved by doing so. Our method is validated with extensive simulations and an experimental single-cell lineage tracing study of germinal center B cell receptor affinity maturation.
Collapse
Affiliation(s)
- William S DeWitt
- Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA.,Department of Genome Sciences, University of Washington, Seattle, WA
| | - Luka Mesin
- Laboratory of Lymphocyte Dynamics, The Rockefeller University, New York, NY
| | - Gabriel D Victora
- Laboratory of Lymphocyte Dynamics, The Rockefeller University, New York, NY
| | | | - Frederick A Matsen
- Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA
| |
Collapse
|
50
|
Neu KE, Guthmiller JJ, Huang M, La J, Vieira MC, Kim K, Zheng NY, Cortese M, Tepora ME, Hamel NJ, Rojas KT, Henry C, Shaw D, Dulberger CL, Pulendran B, Cobey S, Khan AA, Wilson PC. Spec-seq unveils transcriptional subpopulations of antibody-secreting cells following influenza vaccination. J Clin Invest 2018; 129:93-105. [PMID: 30457979 DOI: 10.1172/jci121341] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2018] [Accepted: 10/09/2018] [Indexed: 12/25/2022] Open
Abstract
Vaccines are among the most effective public health tools for combating certain infectious diseases such as influenza. The role of the humoral immune system in vaccine-induced protection is widely appreciated; however, our understanding of how antibody specificities relate to B cell function remains limited due to the complexity of polyclonal antibody responses. To address this, we developed the Spec-seq framework, which allows for simultaneous monoclonal antibody (mAb) characterization and transcriptional profiling from the same single cell. Here, we present the first application of the Spec-seq framework, which we applied to human plasmablasts after influenza vaccination in order to characterize transcriptional differences governed by B cell receptor (BCR) isotype and vaccine reactivity. Our analysis did not find evidence of long-term transcriptional specialization between plasmablasts of different isotypes. However, we did find enhanced transcriptional similarity between clonally related B cells, as well as distinct transcriptional signatures ascribed by BCR vaccine recognition. These data suggest IgG and IgA vaccine-positive plasmablasts are largely similar, whereas IgA vaccine-negative cells appear to be transcriptionally distinct from conventional, terminally differentiated, antigen-induced peripheral blood plasmablasts.
Collapse
Affiliation(s)
- Karlynn E Neu
- The Committee on Immunology.,The Department of Medicine, Section of Rheumatology
| | | | - Min Huang
- The Department of Medicine, Section of Rheumatology
| | - Jennifer La
- The Department of Pathology, Molecular Pathogenesis and Molecular Medicine, and
| | - Marcos C Vieira
- The Department of Ecology and Evolution, The University of Chicago, Chicago, Illinois, USA
| | - Kangchon Kim
- The Department of Ecology and Evolution, The University of Chicago, Chicago, Illinois, USA
| | | | - Mario Cortese
- Emory Vaccine Center, Emory University, Atlanta, Georgia, USA
| | | | | | | | - Carole Henry
- The Department of Medicine, Section of Rheumatology
| | - Dustin Shaw
- The Committee on Immunology.,The Department of Medicine, Section of Rheumatology
| | - Charles L Dulberger
- The Department of Biochemistry and Molecular Biophysics, The University of Chicago, Chicago, Illinois, USA
| | - Bali Pulendran
- Emory Vaccine Center, Emory University, Atlanta, Georgia, USA
| | - Sarah Cobey
- The Department of Ecology and Evolution, The University of Chicago, Chicago, Illinois, USA
| | - Aly A Khan
- Toyota Technological Institute at Chicago, Chicago, Illinois, USA
| | - Patrick C Wilson
- The Committee on Immunology.,The Department of Medicine, Section of Rheumatology
| |
Collapse
|