1
|
Jelcic I, Naghavian R, Fanaswala I, Macnair W, Esposito C, Calini D, Han Y, Marti Z, Raposo C, Sarabia Del Castillo J, Oldrati P, Erny D, Kana V, Zheleznyakova G, Al Nimer F, Tackenberg B, Reichen I, Khademi M, Piehl F, Robinson MD, Jelcic I, Sospedra M, Pelkmans L, Malhotra D, Reynolds R, Jagodic M, Martin R. T-bet+ CXCR3+ B cells drive hyperreactive B-T cell interactions in multiple sclerosis. Cell Rep Med 2025; 6:102027. [PMID: 40107244 PMCID: PMC11970401 DOI: 10.1016/j.xcrm.2025.102027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2023] [Revised: 05/16/2024] [Accepted: 02/20/2025] [Indexed: 03/22/2025]
Abstract
Multiple sclerosis (MS) is an autoimmune disease of the central nervous system (CNS). Self-peptide-dependent autoproliferation (AP) of B and T cells is a key mechanism in MS. Here, we show that pro-inflammatory B-T cell-enriched cell clusters (BTECs) form during AP and mirror features of a germinal center reaction. T-bet+CXCR3+ B cells are the main cell subset amplifying and sustaining their counterpart Th1 cells via interferon (IFN)-γ and are present in highly inflamed meningeal tissue. The underlying B cell activation signature is reflected by epigenetic modifications and receptor-ligand interactions with self-reactive T cells. AP+ CXCR3+ B cells show marked clonal evolution from memory to somatically hypermutated plasmablasts and upregulation of IFN-γ-related genes. Our data underscore a key role of T-bet+CXCR3+ B cells in the pathogenesis of MS in both the peripheral immune system and the CNS compartment, and thus they appear to be involved in both early relapsing-remitting disease and the chronic stage.
Collapse
Affiliation(s)
- Ivan Jelcic
- Neuroimmunology and MS Research Section (NIMS), Neurology Clinic, University of Zurich, University Hospital Zurich, 8091 Zurich, Switzerland; Roche Pharma Research and Early Development (pRED), Roche Innovation Center Basel, F. Hoffmann-La Roche Ltd, Basel, Switzerland.
| | - Reza Naghavian
- Neuroimmunology and MS Research Section (NIMS), Neurology Clinic, University of Zurich, University Hospital Zurich, 8091 Zurich, Switzerland
| | - Imran Fanaswala
- SIB Swiss Institute of Bioinformatics, University of Zurich, Zurich, Switzerland; Department of Molecular Life Sciences, University of Zurich, Zurich, Switzerland
| | - Will Macnair
- Roche Pharma Research and Early Development (pRED), Roche Innovation Center Basel, F. Hoffmann-La Roche Ltd, Basel, Switzerland
| | - Cinzia Esposito
- Department of Molecular Life Sciences, University of Zurich, Zurich, Switzerland
| | - Daniela Calini
- Roche Pharma Research and Early Development (pRED), Roche Innovation Center Basel, F. Hoffmann-La Roche Ltd, Basel, Switzerland
| | - Yanan Han
- Department of Clinical Neuroscience, Karolinska Institutet, Center for Molecular Medicine, Karolinska University Hospital, Stockholm, Sweden
| | - Zoe Marti
- Neuroimmunology and MS Research Section (NIMS), Neurology Clinic, University of Zurich, University Hospital Zurich, 8091 Zurich, Switzerland; Cellerys AG, Schlieren, Switzerland
| | - Catarina Raposo
- Roche Pharma Research and Early Development (pRED), Roche Innovation Center Basel, F. Hoffmann-La Roche Ltd, Basel, Switzerland
| | | | - Pietro Oldrati
- Neuroimmunology and MS Research Section (NIMS), Neurology Clinic, University of Zurich, University Hospital Zurich, 8091 Zurich, Switzerland; Cellerys AG, Schlieren, Switzerland
| | - Daniel Erny
- Neuroimmunology and MS Research Section (NIMS), Neurology Clinic, University of Zurich, University Hospital Zurich, 8091 Zurich, Switzerland; Institute of Neuropathology, University of Freiburg, Freiburg, Germany
| | - Veronika Kana
- Neuroimmunology and MS Research Section (NIMS), Neurology Clinic, University of Zurich, University Hospital Zurich, 8091 Zurich, Switzerland
| | - Galina Zheleznyakova
- Department of Clinical Neuroscience, Karolinska Institutet, Center for Molecular Medicine, Karolinska University Hospital, Stockholm, Sweden
| | - Faiez Al Nimer
- Neuroimmunology and MS Research Section (NIMS), Neurology Clinic, University of Zurich, University Hospital Zurich, 8091 Zurich, Switzerland; Department of Clinical Neuroscience, Karolinska Institutet, Center for Molecular Medicine, Karolinska University Hospital, Stockholm, Sweden
| | - Björn Tackenberg
- Product Development Medical Affairs, Neuroscience and Rare Disease, F. Hoffmann-La Roche Ltd, Basel, Switzerland
| | - Ina Reichen
- Neuroimmunology and MS Research Section (NIMS), Neurology Clinic, University of Zurich, University Hospital Zurich, 8091 Zurich, Switzerland
| | - Mohsen Khademi
- Department of Clinical Neuroscience, Karolinska Institutet, Center for Molecular Medicine, Karolinska University Hospital, Stockholm, Sweden
| | - Fredrik Piehl
- Department of Clinical Neuroscience, Karolinska Institutet, Center for Molecular Medicine, Karolinska University Hospital, Stockholm, Sweden
| | - Mark D Robinson
- SIB Swiss Institute of Bioinformatics, University of Zurich, Zurich, Switzerland; Department of Molecular Life Sciences, University of Zurich, Zurich, Switzerland
| | - Ilijas Jelcic
- Neuroimmunology and MS Research Section (NIMS), Neurology Clinic, University of Zurich, University Hospital Zurich, 8091 Zurich, Switzerland
| | - Mireia Sospedra
- Neuroimmunology and MS Research Section (NIMS), Neurology Clinic, University of Zurich, University Hospital Zurich, 8091 Zurich, Switzerland; Cellerys AG, Schlieren, Switzerland
| | - Lucas Pelkmans
- Department of Molecular Life Sciences, University of Zurich, Zurich, Switzerland
| | - Dheeraj Malhotra
- Roche Pharma Research and Early Development (pRED), Roche Innovation Center Basel, F. Hoffmann-La Roche Ltd, Basel, Switzerland
| | | | - Maja Jagodic
- Department of Clinical Neuroscience, Karolinska Institutet, Center for Molecular Medicine, Karolinska University Hospital, Stockholm, Sweden
| | - Roland Martin
- Neuroimmunology and MS Research Section (NIMS), Neurology Clinic, University of Zurich, University Hospital Zurich, 8091 Zurich, Switzerland; Institute of Experimental Immunology, University of Zurich, Zurich, Switzerland; Therapeutic Design Unit, Center for Molecular Medicine, Department of Clinical Neurosciences, Karolinska Institutet, Stockholm, Sweden; Cellerys AG, Schlieren, Switzerland.
| |
Collapse
|
2
|
Sullivan DK, Min KHJ, Hjörleifsson KE, Luebbert L, Holley G, Moses L, Gustafsson J, Bray NL, Pimentel H, Booeshaghi AS, Melsted P, Pachter L. kallisto, bustools and kb-python for quantifying bulk, single-cell and single-nucleus RNA-seq. Nat Protoc 2025; 20:587-607. [PMID: 39390263 DOI: 10.1038/s41596-024-01057-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2024] [Accepted: 07/29/2024] [Indexed: 10/12/2024]
Abstract
The term 'RNA-seq' refers to a collection of assays based on sequencing experiments that involve quantifying RNA species from bulk tissue, single cells or single nuclei. The kallisto, bustools and kb-python programs are free, open-source software tools for performing this analysis that together can produce gene expression quantification from raw sequencing reads. The quantifications can be individualized for multiple cells, multiple samples or both. Additionally, these tools allow gene expression values to be classified as originating from nascent RNA species or mature RNA species, making this workflow amenable to both cell-based and nucleus-based assays. This protocol describes in detail how to use kallisto and bustools in conjunction with a wrapper, kb-python, to preprocess RNA-seq data. Execution of this protocol requires basic familiarity with a command line environment. With this protocol, quantification of a moderately sized RNA-seq dataset can be completed within minutes.
Collapse
Affiliation(s)
- Delaney K Sullivan
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, USA
- UCLA-Caltech Medical Scientist Training Program, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
| | | | | | - Laura Luebbert
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, USA
| | | | - Lambda Moses
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, USA
| | | | | | - Harold Pimentel
- Department of Computer Science, University of California, Los Angeles, Los Angeles, CA, USA
- Department of Computational Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
| | - A Sina Booeshaghi
- Department of Bioengineering, University of California, Berkeley, Berkeley, CA, USA.
| | - Páll Melsted
- deCODE Genetics/Amgen Inc., Reykjavik, Iceland.
- School of Engineering and Natural Sciences, University of Iceland, Reykjavik, Iceland.
| | - Lior Pachter
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, USA.
- Department of Computing and Mathematical Sciences, California Institute of Technology, Pasadena, CA, USA.
| |
Collapse
|
3
|
Sullivan D, Hjörleifsson K, Swarna N, Oakes C, Holley G, Melsted P, Pachter L. Accurate quantification of nascent and mature RNAs from single-cell and single-nucleus RNA-seq. Nucleic Acids Res 2025; 53:gkae1137. [PMID: 39657125 PMCID: PMC11724275 DOI: 10.1093/nar/gkae1137] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2024] [Revised: 10/28/2024] [Accepted: 12/05/2024] [Indexed: 12/14/2024] Open
Abstract
In single-cell and single-nucleus RNA sequencing (RNA-seq), the coexistence of nascent (unprocessed) and mature (processed) messenger RNA (mRNA) poses challenges in accurate read mapping and the interpretation of count matrices. The traditional transcriptome reference, defining the "region of interest" in bulk RNA-seq, restricts its focus to mature mRNA transcripts. This restriction leads to two problems: reads originating outside of the "region of interest" are prone to mismapping within this region, and additionally, such external reads cannot be matched to specific transcript targets. Expanding the "region of interest" to encompass both nascent and mature mRNA transcript targets provides a more comprehensive framework for RNA-seq analysis. Here, we introduce the concept of distinguishing flanking k-mers (DFKs) to improve mapping of sequencing reads. We have developed an algorithm to identify DFKs, which serve as a sophisticated "background filter", enhancing the accuracy of mRNA quantification. This dual strategy of an expanded region of interest coupled with the use of DFKs enhances the precision in quantifying both mature and nascent mRNA molecules, as well as in delineating reads of ambiguous status.
Collapse
Affiliation(s)
- Delaney K Sullivan
- Division of Biology and Biological Engineering, California Institute of Technology, 1200 E California Blvd, Pasadena, CA 91125, USA
- UCLA-Caltech Medical Scientist Training Program, David Geffen School of Medicine, University of California, Los Angeles, 885 Tiverton Drive, Los Angeles, CA 90095, USA
| | - Kristján Eldjárn Hjörleifsson
- Department of Computing and Mathematical Sciences, California Institute of Technology, 1200 E California Blvd, Pasadena, CA 91125, USA
| | - Nikhila P Swarna
- Division of Biology and Biological Engineering, California Institute of Technology, 1200 E California Blvd, Pasadena, CA 91125, USA
| | - Conrad Oakes
- Division of Biology and Biological Engineering, California Institute of Technology, 1200 E California Blvd, Pasadena, CA 91125, USA
| | - Guillaume Holley
- deCODE Genetics/Amgen Inc., Sturlugata 8, 101 Reykjavík, Iceland
| | - Páll Melsted
- deCODE Genetics/Amgen Inc., Sturlugata 8, 101 Reykjavík, Iceland
- School of Engineering and Natural Sciences, University of Iceland, Sæmundargata 2, 102 Reykjavík, Iceland
| | - Lior Pachter
- Division of Biology and Biological Engineering, California Institute of Technology, 1200 E California Blvd, Pasadena, CA 91125, USA
- Department of Computing and Mathematical Sciences, California Institute of Technology, 1200 E California Blvd, Pasadena, CA 91125, USA
| |
Collapse
|
4
|
Downie JM, Musich RJ, Geraghty CM, Caraballo A, He S, Khawaled S, Lachut K, Long T, Zhou JY, Yilmaz OH, Stappenbeck T, Chan AT, Drew DA. Optimizing single-cell RNA sequencing methods for human colon biopsies: droplet-based vs. picowell-based platforms. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.24.600526. [PMID: 38979379 PMCID: PMC11230261 DOI: 10.1101/2024.06.24.600526] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/10/2024]
Abstract
Background & Aims Single-cell RNA sequencing (scRNA) has empowered many insights into gastrointestinal microenvironments. However, profiling human biopsies using droplet-based scRNA (D-scRNA) is challenging since it requires immediate processing to minimize epithelial cell damage. In contrast, picowell-based (P-scRNA) platforms permit short-term frozen storage before sequencing. We compared P- and D-scRNA platforms on cells derived from human colon biopsies. Methods Endoscopic rectosigmoid mucosal biopsies were obtained from two adults and conducted D-scRNA (10X Chromium) and P-scRNA (Honeycomb HIVE) in parallel using an individual's pool of single cells (> 10,000 cells/participant). Three experiments were performed to evaluate 1) P-scRNA with cells under specific storage conditions (immediately processed [fresh], vs. frozen at -20C vs. -80C [2 weeks]); 2) fresh P-scRNA versus fresh D-scRNA; and 3) P-scRNA stored at -80C with fresh D-scRNA. Results Significant recovery of loaded cells was achieved for fresh (80.9%) and -80C (48.5%) P-scRNA and D-scRNA (76.6%), but not -20C P-scRNA (3.7%). However, D-scRNA captures more typeable cells among recovered cells (71.5% vs. 15.8% Fresh and 18.4% -80C P-scRNA), and these cells exhibit higher gene coverage at the expense of higher mitochondrial read fractions across most cell types. Cells profiled using D-scRNA demonstrated more consistent gene expression profiles among the same cell type than those profiled using P-scRNA. Significant intra-cell-type differences were observed in profiled gene classes across platforms. Conclusions Our results highlight non-overlapping advantages of P-scRNA and D-scRNA and underscore the need for innovation to enable high-fidelity capture of colonic epithelial cells. The platform-specific variation highlights the challenges of maintaining rigor and reproducibility across studies that use different platforms.
Collapse
|
5
|
He D, Gao Y, Chan SS, Quintana-Parrilla N, Patro R. Forseti: a mechanistic and predictive model of the splicing status of scRNA-seq reads. Bioinformatics 2024; 40:i297-i306. [PMID: 38940130 PMCID: PMC11256924 DOI: 10.1093/bioinformatics/btae207] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/29/2024] Open
Abstract
MOTIVATION Short-read single-cell RNA-sequencing (scRNA-seq) has been used to study cellular heterogeneity, cellular fate, and transcriptional dynamics. Modeling splicing dynamics in scRNA-seq data is challenging, with inherent difficulty in even the seemingly straightforward task of elucidating the splicing status of the molecules from which sequenced fragments are drawn. This difficulty arises, in part, from the limited read length and positional biases, which substantially reduce the specificity of the sequenced fragments. As a result, the splicing status of many reads in scRNA-seq is ambiguous because of a lack of definitive evidence. We are therefore in need of methods that can recover the splicing status of ambiguous reads which, in turn, can lead to more accuracy and confidence in downstream analyses. RESULTS We develop Forseti, a predictive model to probabilistically assign a splicing status to scRNA-seq reads. Our model has two key components. First, we train a binding affinity model to assign a probability that a given transcriptomic site is used in fragment generation. Second, we fit a robust fragment length distribution model that generalizes well across datasets deriving from different species and tissue types. Forseti combines these two trained models to predict the splicing status of the molecule of origin of reads by scoring putative fragments that associate each alignment of sequenced reads with proximate potential priming sites. Using both simulated and experimental data, we show that our model can precisely predict the splicing status of many reads and identify the true gene origin of multi-gene mapped reads. AVAILABILITY AND IMPLEMENTATION Forseti and the code used for producing the results are available at https://github.com/COMBINE-lab/forseti under a BSD 3-clause license.
Collapse
Affiliation(s)
- Dongze He
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742, United States
- Program in Computational Biology, Bioinformatics and Genomices, University of Maryland, College Park, MD 20742, United States
| | - Yuan Gao
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742, United States
- Program in Computational Biology, Bioinformatics and Genomices, University of Maryland, College Park, MD 20742, United States
| | - Spencer Skylar Chan
- Department of Computer Science, University of Maryland, College Park, MD 20742, United States
| | | | - Rob Patro
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742, United States
- Department of Computer Science, University of Maryland, College Park, MD 20742, United States
| |
Collapse
|
6
|
He D, Mount SM, Patro R. scCensus: Off-target scRNA-seq reads reveal meaningful biology. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.29.577807. [PMID: 38352549 PMCID: PMC10862729 DOI: 10.1101/2024.01.29.577807] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/23/2024]
Abstract
Single-cell RNA-sequencing (scRNA-seq) provides unprecedented insights into cellular heterogeneity. Although scRNA-seq reads from most prevalent and popular tagged-end protocols are expected to arise from the 3' end of polyadenylated RNAs, recent studies have shown that "off-target" reads can constitute a substantial portion of the read population. In this work, we introduced scCensus, a comprehensive analysis workflow for systematically evaluating and categorizing off-target reads in scRNA-seq. We applied scCensus to seven scRNA-seq datasets. Our analysis of intergenic reads shows that these off-target reads contain information about chromatin structure and can be used to identify similar cells across modalities. Our analysis of antisense reads suggests that these reads can be used to improve gene detection and capture interesting transcriptional activities like antisense transcription. Furthermore, using splice-aware quantification, we find that spliced and unspliced reads provide distinct information about cell clusters and biomarkers, suggesting the utility of integrating signals from reads with different splicing statuses. Overall, our results suggest that off-target scRNA-seq reads contain underappreciated information about various transcriptional activities. These observations about yet-unexploited information in existing scRNA-seq data will help guide and motivate the community to improve current algorithms and analysis methods, and to develop novel approaches that utilize off-target reads to extend the reach and accuracy of single-cell data analysis pipelines.
Collapse
Affiliation(s)
- Dongze He
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742, USA
- Program in Computational Biology, Bioinformatics and Genomices, University of Maryland, College Park, MD 20742, USA
| | - Stephen M. Mount
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD 20742, USA
| | - Rob Patro
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742, USA
- Department of Computer Science, University of Maryland, College Park, MD 20742, USA
| |
Collapse
|
7
|
Sullivan DK, Min KH(J, Hjörleifsson KE, Luebbert L, Holley G, Moses L, Gustafsson J, Bray NL, Pimentel H, Booeshaghi AS, Melsted P, Pachter L. kallisto, bustools, and kb-python for quantifying bulk, single-cell, and single-nucleus RNA-seq. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.11.21.568164. [PMID: 38045414 PMCID: PMC10690192 DOI: 10.1101/2023.11.21.568164] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/05/2023]
Abstract
The term "RNA-seq" refers to a collection of assays based on sequencing experiments that involve quantifying RNA species from bulk tissue, from single cells, or from single nuclei. The kallisto, bustools, and kb-python programs are free, open-source software tools for performing this analysis that together can produce gene expression quantification from raw sequencing reads. The quantifications can be individualized for multiple cells, multiple samples, or both. Additionally, these tools allow gene expression values to be classified as originating from nascent RNA species or mature RNA species, making this workflow amenable to both cell-based and nucleus-based assays. This protocol describes in detail how to use kallisto and bustools in conjunction with a wrapper, kb-python, to preprocess RNA-seq data.
Collapse
Affiliation(s)
- Delaney K. Sullivan
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, 91125, USA
- UCLA-Caltech Medical Scientist Training Program, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, 90095, USA
| | | | | | - Laura Luebbert
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, 91125, USA
| | | | - Lambda Moses
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, 91125, USA
| | | | - Nicolas L. Bray
- Broad Institute of Harvard and MIT, Cambridge, MA, 02142, USA
| | - Harold Pimentel
- Department of Computer Science, University of California, Los Angeles, Los Angeles, CA, 90095, USA
- Department of Computational Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, 90095, USA
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, 90095, USA
| | - A. Sina Booeshaghi
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, 91125, USA
- School of Engineering and Natural Sciences, University of Iceland, Reykjavik, Iceland
| | - Páll Melsted
- deCODE Genetics/Amgen Inc., Reykjavik, Iceland
- School of Engineering and Natural Sciences, University of Iceland, Reykjavik, Iceland
| | - Lior Pachter
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, 91125, USA
- Department of Computing and Mathematical Sciences, California Institute of Technology, Pasadena, CA, 91125, USA
| |
Collapse
|