1
|
Celentano M, DeWitt WS, Prillo S, Song YS. Exact and efficient phylodynamic simulation from arbitrarily large populations. Proc Natl Acad Sci U S A 2025; 122:e2412978122. [PMID: 40366686 PMCID: PMC12107099 DOI: 10.1073/pnas.2412978122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2024] [Accepted: 04/15/2025] [Indexed: 05/15/2025] Open
Abstract
Many biological studies involve inferring the evolutionary history of a sample of individuals from a large population and interpreting the reconstructed tree. Such an ascertained tree typically represents only a small part of a comprehensive population tree and is distorted by survivorship and sampling biases. Inferring evolutionary parameters from ascertained trees requires modeling both the underlying population dynamics and the ascertainment process. A crucial component of this phylodynamic modeling involves tree simulation, which is used to benchmark probabilistic inference methods. To simulate an ascertained tree, one must first simulate the full population tree and then prune unobserved lineages. Consequently, the computational cost is determined not by the size of the final simulated tree, but by the size of the population tree in which it is embedded. In most biological scenarios, simulations of the entire population are prohibitively expensive due to computational demands placed on lineages without sampled descendants. Here, we address this challenge by proving that, for any partially ascertained process from a general multitype birth-death-mutation-sampling model, there exists an equivalent process with complete sampling and no death, a property which we leverage to develop a highly efficient algorithm for simulating trees. Our algorithm scales linearly with the size of the final simulated tree and is independent of the population size, enabling simulations from extremely large populations beyond the reach of current methods but essential for various biological applications. We anticipate that this massive speedup will significantly advance the development of novel inference methods that require extensive training data.
Collapse
Affiliation(s)
- Michael Celentano
- Department of Statistics, University of California, Berkeley, CA94720
| | - William S. DeWitt
- Computer Science Division, University of California, Berkeley, CA94720
| | - Sebastian Prillo
- Computer Science Division, University of California, Berkeley, CA94720
| | - Yun S. Song
- Department of Statistics, University of California, Berkeley, CA94720
- Computer Science Division, University of California, Berkeley, CA94720
| |
Collapse
|
2
|
Asashima H, Akao S, Matsumoto I. Emerging roles of checkpoint molecules on B cells. Immunol Med 2025:1-12. [PMID: 39819449 DOI: 10.1080/25785826.2025.2454045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2024] [Accepted: 01/07/2025] [Indexed: 01/19/2025] Open
Abstract
Immune checkpoint molecules, including both co-inhibitory molecules and co-stimulatory molecules, are known to play critical roles in regulating T-cell responses. During the last decades, immunotherapies targeting these molecules (such as programmed cell death 1 (PD-1), and lymphocyte activation gene 3 (LAG-3)) have provided clinical benefits in many cancers. It is becoming apparent that not only T cells, but also B cells have a capacity to express some checkpoint molecules. These were originally thought to be only the markers for regulatory B cells which produce IL-10, but recent studies suggest that these molecules (especially T-cell immunoglobulin and mucin domain 1 (TIM-1), T cell immunoreceptor with Ig and ITIM domains (TIGIT), and PD-1) can regulate intrinsic B-cell activation and functions. Here, we focus on these molecules and summarize their characteristics, ligands, and functions on B cells.
Collapse
Affiliation(s)
- Hiromitsu Asashima
- Department of Rheumatology, Institute of Medicine, University of Tsukuba, Ibaraki, Japan
| | - Satoshi Akao
- Department of Rheumatology, Institute of Medicine, University of Tsukuba, Ibaraki, Japan
| | - Isao Matsumoto
- Department of Rheumatology, Institute of Medicine, University of Tsukuba, Ibaraki, Japan
| |
Collapse
|
3
|
Shaban D, Najm N, Droin L, Nijnik A. Hematopoietic Stem Cell Fates and the Cellular Hierarchy of Mammalian Hematopoiesis: from Transplantation Models to New Insights from in Situ Analyses. Stem Cell Rev Rep 2025; 21:28-44. [PMID: 39222178 DOI: 10.1007/s12015-024-10782-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/17/2024] [Indexed: 09/04/2024]
Abstract
Hematopoiesis is the process that generates the cells of the blood and immune system from hematopoietic stem and progenitor cells (HSPCs) and represents the system with the most rapid cell turnover in a mammalian organism. HSPC differentiation trajectories, their underlying molecular mechanisms, and their dysfunctions in hematologic disorders are the focal research questions of experimental hematology. While HSPC transplantations in murine models are the traditional tool in this research field, recent advances in genome editing and next generation sequencing resulted in the development of many fundamentally new approaches for the analyses of mammalian hematopoiesis in situ and at single cell resolution. The current review will cover many recent developments in this field in murine models, from the bulk lineage tracing studies of HSPC differentiation to the barcoding of individual HSPCs with Cre-recombinase, Sleeping Beauty transposase, or CRISPR/Cas9 tools, to map hematopoietic cell fates, together with their transcriptional and epigenetic states. We also address studies of the clonal dynamics of human hematopoiesis, from the tracing of HSPC clonal behaviours based on viral integration sites in gene therapy patients to the recent analyses of unperturbed human hematopoiesis based on naturally accrued mutations in either nuclear or mitochondrial genomes. Such studies are revolutionizing our understanding of HSPC biology and hematopoiesis both under homeostatic conditions and in the response to various forms of physiological stress, reveal the mechanisms responsible for the decline of hematopoietic function with age, and in the future may advance the understanding and management of the diverse disorders of hematopoiesis.
Collapse
Affiliation(s)
- Dania Shaban
- Department of Physiology, McGill University, 368 Bellini Life Sciences Complex, 3649 Promenade Sir William Osler, Montreal, QC, H3G 0B1, Canada
- McGill University Research Centre on Complex Traits, McGill University, Montreal, QC, Canada
| | - Nay Najm
- Department of Physiology, McGill University, 368 Bellini Life Sciences Complex, 3649 Promenade Sir William Osler, Montreal, QC, H3G 0B1, Canada
- McGill University Research Centre on Complex Traits, McGill University, Montreal, QC, Canada
| | - Lucie Droin
- Department of Physiology, McGill University, 368 Bellini Life Sciences Complex, 3649 Promenade Sir William Osler, Montreal, QC, H3G 0B1, Canada
- McGill University Research Centre on Complex Traits, McGill University, Montreal, QC, Canada
| | - Anastasia Nijnik
- Department of Physiology, McGill University, 368 Bellini Life Sciences Complex, 3649 Promenade Sir William Osler, Montreal, QC, H3G 0B1, Canada.
- McGill University Research Centre on Complex Traits, McGill University, Montreal, QC, Canada.
| |
Collapse
|
4
|
Voss K, Kaur KM, Banerjee R, Breden F, Pennell M. Applying phylogenetic methods for species delimitation to distinguish B-cell clonal families. Front Immunol 2024; 15:1505032. [PMID: 39687606 PMCID: PMC11646844 DOI: 10.3389/fimmu.2024.1505032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2024] [Accepted: 11/07/2024] [Indexed: 12/18/2024] Open
Abstract
The adaptive immune system generates a diverse array of B-cell receptors through the processes of V(D)J recombination and somatic hypermutation. B-cell receptors that bind to an antigen will undergo clonal expansion, creating a Darwinian evolutionary dynamic within individuals. A key step in studying these dynamics is to identify sequences derived from the same ancestral V(D)J recombination event (i.e. a clonal family). There are a number of widely used methods for accomplishing this task but a major limitation of all of them is that they rely, at least in part, on the ability to map sequences to a germline reference set. This requirement is particularly problematic in non-model systems where we often know little about the germline allelic diversity in the study population. Recognizing that delimiting B-cell clonal families is analogous to delimiting species from single locus data, we propose a novel strategy of reconstructing the phylogenetic tree of all B-cell sequences in a sample and using a popular species delimitation method, multi-rate Poisson Tree Processes (mPTP), to delimit clonal families. Using extensive simulations, we show that not only does this phylogenetically explicit approach perform well for the purpose of delimiting clonal families when no reference allele set is available, it performs similarly to state-of-the-art techniques developed specifically for B-cell data even when we have a complete reference allele set. Additionally, our analysis of an empirical dataset shows that mPTP performs similarly to leading methods in the field. These findings demonstrate the utility of using off-the-shelf phylogenetic techniques for analyzing B-cell clonal dynamics in non-model systems, and suggests that phylogenetic inference techniques may be potentially combined with mapping based approaches for even more robust inferences, even in model systems.
Collapse
Affiliation(s)
- Katalin Voss
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, United States
| | - Katrina M. Kaur
- Department of Zoology, University of British Columbia, Vancouver, BC, Canada
| | - Rituparna Banerjee
- Bioinformatics Graduate Program, Faculty of Science, University of British Columbia, Vancouver, BC, Canada
| | - Felix Breden
- Department of Biological Sciences, Simon Fraser University, Burnaby, BC, Canada
| | - Matt Pennell
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, United States
- Department of Biological Sciences, University of Southern California, Los Angeles, CA, United States
| |
Collapse
|
5
|
Foglierini M, Perez L. [AI-based discovery of broadly neutralizing antibodies against HIV-1]. Med Sci (Paris) 2024; 40:819-821. [PMID: 39656977 DOI: 10.1051/medsci/2024144] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2024] Open
Affiliation(s)
- Mathilde Foglierini
- Département de médicine, Service d'immunologie et allergie, Centre hospitalier universitaire Vaudois (CHUV) et université de Lausanne (UNIL), Faculté de biologie et médecine, Lausanne, Suisse
| | - Laurent Perez
- Département de médicine, Service d'immunologie et allergie, Centre hospitalier universitaire Vaudois (CHUV) et université de Lausanne (UNIL), Faculté de biologie et médecine, Lausanne, Suisse
| |
Collapse
|
6
|
Farnia M, Tahiri N. New generalized metric based on branch length distance to compare B cell lineage trees. Algorithms Mol Biol 2024; 19:22. [PMID: 39369262 PMCID: PMC11453055 DOI: 10.1186/s13015-024-00267-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2024] [Accepted: 09/24/2024] [Indexed: 10/07/2024] Open
Abstract
The B cell lineage tree encapsulates the successive phases of B cell differentiation and maturation, transitioning from hematopoietic stem cells to mature, antibody-secreting cells within the immune system. Mathematically, this lineage can be conceptualized as an evolutionary tree, where each node represents a distinct stage in B cell development, and the edges reflect the differentiation pathways. To compare these lineage trees, a rigorous mathematical metric is essential. Analyzing B cell lineage trees mathematically and quantifying changes in lineage attributes over time necessitates a comparison methodology capable of accurately assessing and measuring these changes. Addressing the intricacies of multiple B cell lineage tree comparisons, this study introduces a novel metric that enhances the precision of comparative analysis. This metric is formulated on principles of metric theory and evolutionary biology, quantifying the dissimilarities between lineage trees by measuring branch length distance and weight. By providing a framework for systematically classifying lineage trees, this metric facilitates the development of predictive models that are crucial for the creation of targeted immunotherapy and vaccines. To validate the effectiveness of this new metric, synthetic datasets that mimic the complexity and variability of real B cell lineage structures are employed. We demonstrated the ability of the new metric method to accurately capture the evolutionary nuances of B cell lineages.
Collapse
Affiliation(s)
- Mahsa Farnia
- Department of Computer Science, University of Sherbrooke, 2500, boul. de l'Université, Sherbrooke, QC, J1K 2R1, Canada
| | - Nadia Tahiri
- Department of Computer Science, University of Sherbrooke, 2500, boul. de l'Université, Sherbrooke, QC, J1K 2R1, Canada.
| |
Collapse
|
7
|
Engelbrecht E, Rodriguez OL, Watson CT. Addressing Technical Pitfalls in Pursuit of Molecular Factors That Mediate Immunoglobulin Gene Regulation. JOURNAL OF IMMUNOLOGY (BALTIMORE, MD. : 1950) 2024; 213:651-662. [PMID: 39007649 PMCID: PMC11333172 DOI: 10.4049/jimmunol.2400131] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/08/2024] [Accepted: 06/13/2024] [Indexed: 07/16/2024]
Abstract
The expressed Ab repertoire is a critical determinant of immune-related phenotypes. Ab-encoding transcripts are distinct from other expressed genes because they are transcribed from somatically rearranged gene segments. Human Abs are composed of two identical H and L chain polypeptides derived from genes in IGH locus and one of two L chain loci. The combinatorial diversity that results from Ab gene rearrangement and the pairing of different H and L chains contributes to the immense diversity of the baseline Ab repertoire. During rearrangement, Ab gene selection is mediated by factors that influence chromatin architecture, promoter/enhancer activity, and V(D)J recombination. Interindividual variation in the composition of the Ab repertoire associates with germline variation in IGH, implicating polymorphism in Ab gene regulation. Determining how IGH variants directly mediate gene regulation will require integration of these variants with other functional genomic datasets. In this study, we argue that standard approaches using short reads have limited utility for characterizing regulatory regions in IGH at haplotype resolution. Using simulated and chromatin immunoprecipitation sequencing reads, we define features of IGH that limit use of short reads and a single reference genome, namely 1) the highly duplicated nature of the DNA sequence in IGH and 2) structural polymorphisms that are frequent in the population. We demonstrate that personalized diploid references enhance performance of short-read data for characterizing mappable portions of the locus, while also showing that long-read profiling tools will ultimately be needed to fully resolve functional impacts of IGH germline variation on expressed Ab repertoires.
Collapse
Affiliation(s)
- Eric Engelbrecht
- Department of Biochemistry and Molecular Genetics, University of Louisville, Louisville, KY
| | - Oscar L Rodriguez
- Department of Biochemistry and Molecular Genetics, University of Louisville, Louisville, KY
| | - Corey T Watson
- Department of Biochemistry and Molecular Genetics, University of Louisville, Louisville, KY
| |
Collapse
|
8
|
Celentano M, DeWitt WS, Prillo S, Song YS. Exact and efficient phylodynamic simulation from arbitrarily large populations. ARXIV 2024:arXiv:2402.17153v2. [PMID: 38463501 PMCID: PMC10925381] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 03/12/2024]
Abstract
Many biological studies involve inferring the evolutionary history of a sample of individuals from a large population and interpreting the reconstructed tree. Such an ascertained tree typically represents only a small part of a comprehensive population tree and is distorted by survivorship and sampling biases. Inferring evolutionary parameters from ascertained trees requires modeling both the underlying population dynamics and the ascertainment process. A crucial component of this phylodynamic modeling involves tree simulation, which is used to benchmark probabilistic inference methods. To simulate an ascertained tree, one must first simulate the full population tree and then prune unobserved lineages. Consequently, the computational cost is determined not by the size of the final simulated tree, but by the size of the population tree in which it is embedded. In most biological scenarios, simulations of the entire population are prohibitively expensive due to computational demands placed on lineages without sampled descendants. Here, we address this challenge by proving that, for any partially ascertained process from a general multi-type birth-death-mutation-sampling model, there exists an equivalent process with complete sampling and no death, a property which we leverage to develop a highly efficient algorithm for simulating trees. Our algorithm scales linearly with the size of the final simulated tree and is independent of the population size, enabling simulations from extremely large populations beyond the reach of current methods but essential for various biological applications. We anticipate that this unprecedented speedup will significantly advance the development of novel inference methods that require extensive training data.
Collapse
Affiliation(s)
| | | | | | - Yun S Song
- Department of Statistics, University of California, Berkeley
- Computer Science Division, University of California, Berkeley
| |
Collapse
|
9
|
Wang K, Cai L, Wang H, Shan S, Hu X, Zhang J. Protocol for fast clonal family inference and analysis from large-scale B cell receptor repertoire sequencing data. STAR Protoc 2024; 5:102969. [PMID: 38502687 PMCID: PMC10963638 DOI: 10.1016/j.xpro.2024.102969] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Revised: 01/26/2024] [Accepted: 03/03/2024] [Indexed: 03/21/2024] Open
Abstract
The expeditious identification and comprehensive analysis of clonal families from extensive B cell receptor (BCR) repertoire sequencing data are imperative for elucidating the intricacies of B cell immune responses. Here, we introduce a computational pipeline designed to swiftly deduce clonal families from bulk BCR heavy-chain sequencing data, accompanied by a suite of functional modules tailored to streamline post-clustering analysis. The outlined methodology encompasses guidelines for software installation, meticulous data preparation, and the systematic inference and analysis of clonal families. For complete details on the use and execution of this protocol, please refer to Wang et al.1.
Collapse
Affiliation(s)
- Kaixuan Wang
- Academy of Medical Engineering and Translational Medicine, Tianjin University, Tianjin, China
| | - Linru Cai
- Academy of Medical Engineering and Translational Medicine, Tianjin University, Tianjin, China
| | - Hao Wang
- Academy of Medical Engineering and Translational Medicine, Tianjin University, Tianjin, China; Georgia Tech Shenzhen Institute (GTSI), Tianjin University, Shenzhen, Guangdong, China
| | - Shiwen Shan
- Academy of Medical Engineering and Translational Medicine, Tianjin University, Tianjin, China
| | - Xihao Hu
- GV20 Therapeutics, Cambridge, MA, USA
| | - Jian Zhang
- Academy of Medical Engineering and Translational Medicine, Tianjin University, Tianjin, China.
| |
Collapse
|
10
|
Irac SE, Soon MSF, Borcherding N, Tuong ZK. Single-cell immune repertoire analysis. Nat Methods 2024; 21:777-792. [PMID: 38637691 DOI: 10.1038/s41592-024-02243-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2023] [Accepted: 03/12/2024] [Indexed: 04/20/2024]
Abstract
Single-cell T cell and B cell antigen receptor-sequencing data analysis can potentially perform in-depth assessments of adaptive immune cells that inform on understanding immune cell development to tracking clonal expansion in disease and therapy. However, it has been extremely challenging to analyze and interpret T cells and B cells and their adaptive immune receptor repertoires at the single-cell level due to not only the complexity of the data but also the underlying biology. In this Review, we delve into the computational breakthroughs that have transformed the analysis of single-cell T cell and B cell antigen receptor-sequencing data.
Collapse
Affiliation(s)
- Sergio E Irac
- Cancer Immunoregulation and Immunotherapy, QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia
| | - Megan Sioe Fei Soon
- Ian Frazer Centre for Children's Immunotherapy Research, Child Health Research Centre, Faculty of Medicine, The University of Queensland, Brisbane, Queensland, Australia
| | - Nicholas Borcherding
- Department of Pathology and Immunology, Washington University School of Medicine, St. Louis, MO, USA
- Omniscope, Palo Alto, CA, USA
| | - Zewen Kelvin Tuong
- Ian Frazer Centre for Children's Immunotherapy Research, Child Health Research Centre, Faculty of Medicine, The University of Queensland, Brisbane, Queensland, Australia.
- Frazer Institute, Faculty of Medicine, The University of Queensland, Brisbane, QLD, Australia.
| |
Collapse
|