1
|
Shinde P, Soldevila F, Reyna J, Aoki M, Rasmussen M, Willemsen L, Kojima M, Ha B, Greenbaum JA, Overton JA, Guzman-Orozco H, Nili S, Orfield S, Gygi JP, da Silva Antunes R, Sette A, Grant B, Olsen LR, Konstorum A, Guan L, Ay F, Kleinstein SH, Peters B. A multi-omics systems vaccinology resource to develop and test computational models of immunity. CELL REPORTS METHODS 2024; 4:100731. [PMID: 38490204 PMCID: PMC10985234 DOI: 10.1016/j.crmeth.2024.100731] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Revised: 01/04/2024] [Accepted: 02/20/2024] [Indexed: 03/17/2024]
Abstract
Systems vaccinology studies have identified factors affecting individual vaccine responses, but comparing these findings is challenging due to varying study designs. To address this lack of reproducibility, we established a community resource for comparing Bordetella pertussis booster responses and to host annual contests for predicting patients' vaccination outcomes. We report here on our experiences with the "dry-run" prediction contest. We found that, among 20+ models adopted from the literature, the most successful model predicting vaccination outcome was based on age alone. This confirms our concerns about the reproducibility of conclusions between different vaccinology studies. Further, we found that, for newly trained models, handling of baseline information on the target variables was crucial. Overall, multiple co-inertia analysis gave the best results of the tested modeling approaches. Our goal is to engage community in these prediction challenges by making data and models available and opening a public contest in August 2024.
Collapse
Affiliation(s)
- Pramod Shinde
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, USA
| | - Ferran Soldevila
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, USA
| | - Joaquin Reyna
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, USA; Bioinformatics and Systems Biology Graduate Program, University of California, San Diego, San Diego, CA, USA
| | - Minori Aoki
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, USA
| | - Mikkel Rasmussen
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, USA; Department of Health Technology, Technical University of Denmark, Kongens Lyngby, Denmark
| | - Lisa Willemsen
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, USA
| | - Mari Kojima
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, USA
| | - Brendan Ha
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, USA
| | - Jason A Greenbaum
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, USA
| | - James A Overton
- Knocean Inc., 107 Quebec Avenue, Toronto, Ontario M6P 2T3, Canada
| | - Hector Guzman-Orozco
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, USA
| | - Somayeh Nili
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, USA
| | - Shelby Orfield
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, USA
| | - Jeremy P Gygi
- Program in Computational Biology & Bioinformatics, Yale University, New Haven, CT, USA
| | - Ricardo da Silva Antunes
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, USA
| | - Alessandro Sette
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, USA; Department of Medicine, University of California, San Diego, San Diego, CA, USA
| | - Barry Grant
- Department of Molecular Biology, School of Biological Sciences, University of California, San Diego, La Jolla, CA, USA
| | - Lars Rønn Olsen
- Department of Health Technology, Technical University of Denmark, Kongens Lyngby, Denmark
| | - Anna Konstorum
- Department of Pathology, Yale University School of Medicine, New Haven, CT, USA
| | - Leying Guan
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, USA
| | - Ferhat Ay
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, USA; Department of Medicine, University of California, San Diego, San Diego, CA, USA
| | - Steven H Kleinstein
- Program in Computational Biology & Bioinformatics, Yale University, New Haven, CT, USA; Department of Pathology, Yale University School of Medicine, New Haven, CT, USA
| | - Bjoern Peters
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, USA; Department of Medicine, University of California, San Diego, San Diego, CA, USA.
| |
Collapse
|
2
|
Robles EE, Jin Y, Smyth P, Scheuermann RH, Bui JD, Wang HY, Oak J, Qian Y. A cell-level discriminative neural network model for diagnosis of blood cancers. Bioinformatics 2023; 39:btad585. [PMID: 37756695 PMCID: PMC10563151 DOI: 10.1093/bioinformatics/btad585] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2023] [Revised: 09/12/2023] [Accepted: 09/22/2023] [Indexed: 09/29/2023] Open
Abstract
MOTIVATION Precise identification of cancer cells in patient samples is essential for accurate diagnosis and clinical monitoring but has been a significant challenge in machine learning approaches for cancer precision medicine. In most scenarios, training data are only available with disease annotation at the subject or sample level. Traditional approaches separate the classification process into multiple steps that are optimized independently. Recent methods either focus on predicting sample-level diagnosis without identifying individual pathologic cells or are less effective for identifying heterogeneous cancer cell phenotypes. RESULTS We developed a generalized end-to-end differentiable model, the Cell Scoring Neural Network (CSNN), which takes sample-level training data and predicts the diagnosis of the testing samples and the identity of the diagnostic cells in the sample, simultaneously. The cell-level density differences between samples are linked to the sample diagnosis, which allows the probabilities of individual cells being diagnostic to be calculated using backpropagation. We applied CSNN to two independent clinical flow cytometry datasets for leukemia diagnosis. In both qualitative and quantitative assessments, CSNN outperformed preexisting neural network modeling approaches for both cancer diagnosis and cell-level classification. Post hoc decision trees and 2D dot plots were generated for interpretation of the identified cancer cells, showing that the identified cell phenotypes match the cancer endotypes observed clinically in patient cohorts. Independent data clustering analysis confirmed the identified cancer cell populations. AVAILABILITY AND IMPLEMENTATION The source code of CSNN and datasets used in the experiments are publicly available on GitHub (http://github.com/erobl/csnn). Raw FCS files can be downloaded from FlowRepository (ID: FR-FCM-Z6YK).
Collapse
Affiliation(s)
- Edgar E Robles
- Department of Computer Science, University of California, Irvine, CA 92697, United States
| | - Ye Jin
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02115, United States
| | - Padhraic Smyth
- Department of Computer Science, University of California, Irvine, CA 92697, United States
| | - Richard H Scheuermann
- Department of Informatics, J. Craig Venter Institute, La Jolla, CA 92037, United States
- Department of Pathology, University of California, San Diego, CA 92093, United States
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA 92037, United States
| | - Jack D Bui
- Department of Pathology, University of California, San Diego, CA 92093, United States
| | - Huan-You Wang
- Department of Pathology, University of California, San Diego, CA 92093, United States
| | - Jean Oak
- Department of Pathology, Stanford University, Stanford, CA 94305, United States
| | - Yu Qian
- Department of Informatics, J. Craig Venter Institute, La Jolla, CA 92037, United States
| |
Collapse
|
3
|
Shinde P, Soldevila F, Reyna J, Aoki M, Rasmussen M, Willemsen L, Kojima M, Ha B, Greenbaum JA, Overton JA, Guzman-Orozco H, Nili S, Orfield S, Gygi JP, da Silva Antunes R, Sette A, Grant B, Olsen LR, Konstorum A, Guan L, Ay F, Kleinstein SH, Peters B. A systems vaccinology resource to develop and test computational models of immunity. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.28.555193. [PMID: 37693565 PMCID: PMC10491180 DOI: 10.1101/2023.08.28.555193] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/12/2023]
Abstract
Computational models that predict an individual's response to a vaccine offer the potential for mechanistic insights and personalized vaccination strategies. These models are increasingly derived from systems vaccinology studies that generate immune profiles from human cohorts pre- and post-vaccination. Most of these studies involve relatively small cohorts and profile the response to a single vaccine. The ability to assess the performance of the resulting models would be improved by comparing their performance on independent datasets, as has been done with great success in other areas of biology such as protein structure predictions. To transfer this approach to system vaccinology studies, we established a prototype platform that focuses on the evaluation of Computational Models of Immunity to Pertussis Booster vaccinations (CMI-PB). A community resource, CMI-PB generates experimental data for the explicit purpose of model evaluation, which is performed through a series of annual data releases and associated contests. We here report on our experience with the first such 'dry run' for a contest where the goal was to predict individual immune responses based on pre-vaccination multi-omic profiles. Over 30 models adopted from the literature were tested, but only one was predictive, and was based on age alone. The performance of new models built using CMI-PB training data was much better, but varied significantly based on the choice of pre-vaccination features used and the model building strategy. This suggests that previously published models developed for other vaccines do not generalize well to Pertussis Booster vaccination. Overall, these results reinforced the need for comparative analysis across models and datasets that CMI-PB aims to achieve. We are seeking wider community engagement for our first public prediction contest, which will open in early 2024.
Collapse
Affiliation(s)
- Pramod Shinde
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, USA
| | - Ferran Soldevila
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, USA
| | - Joaquin Reyna
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, USA
- Bioinformatics and Systems Biology Graduate Program, University of California, San Diego, CA, USA
| | - Minori Aoki
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, USA
| | - Mikkel Rasmussen
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, USA
- Department of Health Technology, Technical University of Denmark, Kongens Lyngby, Denmark
| | - Lisa Willemsen
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, USA
| | - Mari Kojima
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, USA
| | - Brendan Ha
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, USA
| | - Jason A Greenbaum
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, USA
| | - James A Overton
- Knocean Inc., 107 Quebec Ave. Toronto, Ontario, M6P 2T3, Canada
| | - Hector Guzman-Orozco
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, USA
| | - Somayeh Nili
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, USA
| | - Shelby Orfield
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, USA
| | - Jeremy P. Gygi
- Program in Computational Biology & Bioinformatics, Yale University, New Haven, CT, USA
| | - Ricardo da Silva Antunes
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, USA
| | - Alessandro Sette
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, USA
- Department of Medicine, University of California, San Diego, San Diego, CA, USA
| | - Barry Grant
- Department of Molecular Biology, School of Biological Sciences, University of California San Diego, La Jolla, California, USA
| | - Lars Rønn Olsen
- Department of Health Technology, Technical University of Denmark, Kongens Lyngby, Denmark
| | - Anna Konstorum
- Department of Pathology, Yale University School of Medicine, New Haven, CT, USA
| | - Leying Guan
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, USA
| | - Ferhat Ay
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, USA
- Department of Medicine, University of California, San Diego, San Diego, CA, USA
| | - Steven H. Kleinstein
- Program in Computational Biology & Bioinformatics, Yale University, New Haven, CT, USA
- Department of Pathology, Yale University School of Medicine, New Haven, CT, USA
| | - Bjoern Peters
- Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, La Jolla, CA, USA
- Department of Medicine, University of California, San Diego, San Diego, CA, USA
| |
Collapse
|
4
|
Robles EE, Jin Y, Smyth P, Scheuermann RH, Bui JD, Wang HY, Oak J, Qian Y. A cell-level discriminative neural network model for diagnosis of blood cancers. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.02.07.23285606. [PMID: 36798344 PMCID: PMC9934808 DOI: 10.1101/2023.02.07.23285606] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/12/2023]
Abstract
Motivation Precise identification of cancer cells in patient samples is essential for accurate diagnosis and clinical monitoring but has been a significant challenge in machine learning approaches for cancer precision medicine. In most scenarios, training data are only available with disease annotation at the subject or sample level. Traditional approaches separate the classification process into multiple steps that are optimized independently. Recent methods either focus on predicting sample-level diagnosis without identifying individual pathologic cells or are less effective for identifying heterogeneous cancer cell phenotypes. Results We developed a generalized end-to-end differentiable model, the Cell Scoring Neural Network (CSNN), which takes the available sample-level training data and predicts both the diagnosis of the testing samples and the identity of the diagnostic cells in the sample, simultaneously. The cell-level density differences between samples are linked to the sample diagnosis, which allows the probabilities of individual cells being diagnostic to be calculated using backpropagation. We applied CSNN to two independent clinical flow cytometry datasets for leukemia diagnosis. In both qualitative and quantitative assessments, CSNN outperformed preexisting neural network modeling approaches for both cancer diagnosis and cell-level classification. Post hoc decision trees and 2D dot plots were generated for interpretation of the identified cancer cells, showing that the identified cell phenotypes match the cancer endotypes observed clinically in patient cohorts. Independent data clustering analysis confirmed the identified cancer cell populations. Availability The source code of CSNN and datasets used in the experiments are publicly available on GitHub and FlowRepository. Contact Edgar E. Robles: roblesee@uci.edu and Yu Qian: mqian@jcvi.org. Supplementary information Supplementary data are available on GitHub and at Bioinformatics online.
Collapse
|
5
|
Draijer C, Florez-Sampedro L, Reker-Smit C, Post E, van Dijk F, Melgert BN. Explaining the polarized macrophage pool during murine allergic lung inflammation. Front Immunol 2022; 13:1056477. [PMID: 36605195 PMCID: PMC9807907 DOI: 10.3389/fimmu.2022.1056477] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2022] [Accepted: 12/02/2022] [Indexed: 12/24/2022] Open
Abstract
Introduction Differentially polarized macrophages, especially YM1+ and MHCII+ macrophages, play an important role in asthma development. The origin of these polarized macrophages has not been elucidated yet. We therefore aimed to investigate how proliferation, monocyte recruitment, and/or switching of polarization states contribute to this specific pool of polarized interstitial and alveolar macrophages during development of house dust mite (HDM)-induced allergic lung inflammation in mice. Methods Male and female mice were first treated intranasally with PKH26 to label lung-resident macrophages and were then exposed to either HDM or phosphate-buffered saline (PBS) for two weeks. Different myeloid immune cell types were quantified in lung tissue and blood using flow cytometry. Results We found that macrophage polarization only starts up in the second week of HDM exposures. Before this happened, unpolarized alveolar and interstitial macrophages transiently increased in HDM-exposed mice. This transient increase was mostly local proliferation of alveolar macrophages, while interstitial macrophages also contained unlabeled macrophages suggesting monocyte contribution. After two weeks of exposures, the number of interstitial and alveolar macrophages was similar between HDM and PBS-exposed mice, but the distribution of polarization states was remarkably different. HDM-exposed mice selectively developed YM1+ alveolar macrophages and MHCII-hi interstitial macrophages while nonpolarized macrophages were lost compared to PBS-exposed mice. Discussion In this HDM model we have shown that development of a polarized macrophage pool during allergic inflammation is first dependent on proliferation of nonpolarized tissue-resident macrophages with some help of infiltrating unlabeled cells, presumably circulating monocytes. These nonpolarized macrophages then acquire their polarized phenotype by upregulating YM1 on alveolar macrophages and MHCII on interstitial macrophages. This novel information will help us to better understand the role of macrophages in asthma and designing therapeutic strategies targeting macrophage functions.
Collapse
Affiliation(s)
- Christina Draijer
- GRIAC- Groningen Research Institute for Asthma and COPD, University Medical Center Groningen, University of Groningen, Groningen, Netherlands
| | - Laura Florez-Sampedro
- GRIAC- Groningen Research Institute for Asthma and COPD, University Medical Center Groningen, University of Groningen, Groningen, Netherlands,Department of Chemical and Pharmaceutical Biology, University of Groningen, Groningen, Netherlands,Department of Molecular Pharmacology, University of Groningen, Groningen, Netherlands
| | - Catharina Reker-Smit
- Department of Pharmacokinetics, Toxicology and Targeting, University of Groningen, Groningen, Netherlands
| | - Eduard Post
- Department of Pharmacokinetics, Toxicology and Targeting, University of Groningen, Groningen, Netherlands
| | - Fransien van Dijk
- GRIAC- Groningen Research Institute for Asthma and COPD, University Medical Center Groningen, University of Groningen, Groningen, Netherlands,Department of Molecular Pharmacology, University of Groningen, Groningen, Netherlands
| | - Barbro N. Melgert
- GRIAC- Groningen Research Institute for Asthma and COPD, University Medical Center Groningen, University of Groningen, Groningen, Netherlands,Department of Molecular Pharmacology, University of Groningen, Groningen, Netherlands,*Correspondence: Barbro N. Melgert,
| |
Collapse
|
6
|
Zhang Y, Sun H, Mandava A, Aevermann BD, Kollmann TR, Scheuermann RH, Qiu X, Qian Y. FastMix: a versatile data integration pipeline for cell type-specific biomarker inference. Bioinformatics 2022; 38:4735-4744. [PMID: 36018232 PMCID: PMC9801972 DOI: 10.1093/bioinformatics/btac585] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2022] [Revised: 08/18/2022] [Accepted: 08/25/2022] [Indexed: 01/07/2023] Open
Abstract
MOTIVATION Flow cytometry (FCM) and transcription profiling are the two widely used assays in translational immunology research. However, there is no data integration pipeline for analyzing these two types of assays together with experiment variables for biomarker inference. Current FCM data analysis mainly relies on subjective manual gating analysis, which is difficult to be directly integrated with other automated computational methods. Existing deconvolutional analysis of bulk transcriptomics relies on predefined marker genes in the transcriptomics data, which are unavailable for novel cell types and does not utilize the FCM data that provide canonical phenotypic definitions of the cell types. RESULTS We developed a novel analytics pipeline-FastMix-for computational immunology, which integrates flow cytometry, bulk transcriptomics and clinical covariates for identifying cell type-specific gene expression signatures and biomarker genes. FastMix addresses the 'large p, small n' problem in the gene expression and flow cytometry integration analysis via a linear mixed effects model (LMER) for both cross-sectional and longitudinal studies. Its novel moment-based estimator not only reduces bias in parameter estimation but also is more efficient than iterative optimization. The FastMix pipeline also includes a cutting-edge flow cytometry data analysis method-DAFi-for identifying cell populations of interest and their characteristics. Simulation studies showed that FastMix produced smaller type I/II errors than competing methods. Validation using real data of two vaccine studies showed that FastMix identified a consistent set of signature genes as in independent single-cell RNA-seq analysis, producing additional interesting findings. AVAILABILITY AND IMPLEMENTATION Source code of FastMix is publicly available at https://github.com/terrysun0302/FastMix. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | | | - Aishwarya Mandava
- Department of Informatics, J. Craig Venter Institute, La Jolla, CA 92037, USA
| | - Brian D Aevermann
- Department of Informatics, J. Craig Venter Institute, La Jolla, CA 92037, USA
| | - Tobias R Kollmann
- Systems Vaccinology, Telethon Kids Institute, Perth Children’s Hospital, University of Western Australia, Nedlands, WA 6009, Australia
| | - Richard H Scheuermann
- Department of Informatics, J. Craig Venter Institute, La Jolla, CA 92037, USA,Department of Pathology, University of California, San Diego, La Jolla, CA 92093, USA
| | - Xing Qiu
- To whom correspondence should be addressed. or
| | - Yu Qian
- To whom correspondence should be addressed. or
| |
Collapse
|
7
|
da Silva Antunes R, Soldevila F, Pomaznoy M, Babor M, Bennett J, Tian Y, Khalil N, Qian Y, Mandava A, Scheuermann RH, Cortese M, Pulendran B, Petro CD, Gilkes AP, Purcell LA, Sette A, Peters B. A system-view of Bordetella pertussis booster vaccine responses in adults primed with whole-cell versus acellular vaccine in infancy. JCI Insight 2021; 6:141023. [PMID: 33690224 PMCID: PMC8119213 DOI: 10.1172/jci.insight.141023] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2020] [Accepted: 03/03/2021] [Indexed: 01/26/2023] Open
Abstract
The increased incidence of whooping cough worldwide suggests that current vaccination against Bordetella pertussis infection has limitations in quality and duration of protection. The resurgence of infection has been linked to the introduction of acellular vaccines (aP), which have an improved safety profile compared with the previously used whole-cell (wP) vaccines. To determine immunological differences between aP and wP priming in infancy, we performed a systems approach of the immune response to booster vaccination. Transcriptomic, proteomic, cytometric, and serologic profiling revealed multiple shared immune responses with different kinetics across cohorts, including an increase of blood monocyte frequencies and strong antigen-specific IgG responses. Additionally, we found a prominent subset of aP-primed individuals (30%) with a strong differential signature, including higher levels of expression for CCL3, NFKBIA, and ICAM1. Contrary to the wP individuals, this subset displayed increased PT-specific IgE responses after boost and higher antigen-specific IgG4 and IgG3 antibodies against FHA and FIM2/3 at baseline and after boost. Overall, the results show that, while broad immune response patterns to Tdap boost overlap between aP- and wP-primed individuals, a subset of aP-primed individuals present a divergent response. These findings provide candidate targets to study the causes and correlates of waning immunity after aP vaccination.
Collapse
Affiliation(s)
- Ricardo da Silva Antunes
- Division of Vaccine Discovery, La Jolla Institute for Allergy and Immunology, La Jolla, California, USA
| | - Ferran Soldevila
- Division of Vaccine Discovery, La Jolla Institute for Allergy and Immunology, La Jolla, California, USA
| | - Mikhail Pomaznoy
- Division of Vaccine Discovery, La Jolla Institute for Allergy and Immunology, La Jolla, California, USA
| | - Mariana Babor
- Division of Vaccine Discovery, La Jolla Institute for Allergy and Immunology, La Jolla, California, USA
| | - Jason Bennett
- Division of Vaccine Discovery, La Jolla Institute for Allergy and Immunology, La Jolla, California, USA
| | - Yuan Tian
- Division of Vaccine Discovery, La Jolla Institute for Allergy and Immunology, La Jolla, California, USA
| | - Natalie Khalil
- Division of Vaccine Discovery, La Jolla Institute for Allergy and Immunology, La Jolla, California, USA
| | - Yu Qian
- J. Craig Venter Institute, La Jolla, California, USA
| | | | - Richard H Scheuermann
- J. Craig Venter Institute, La Jolla, California, USA.,University of California San Diego School of Medicine, La Jolla, California, USA
| | - Mario Cortese
- Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, California, USA
| | - Bali Pulendran
- Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, California, USA
| | | | | | - Lisa A Purcell
- Regeneron Pharmaceuticals Inc., Tarrytown, New York, USA
| | - Alessandro Sette
- Division of Vaccine Discovery, La Jolla Institute for Allergy and Immunology, La Jolla, California, USA.,Regeneron Pharmaceuticals Inc., Tarrytown, New York, USA
| | - Bjoern Peters
- Division of Vaccine Discovery, La Jolla Institute for Allergy and Immunology, La Jolla, California, USA.,University of California San Diego School of Medicine, La Jolla, California, USA
| |
Collapse
|
8
|
Cheung M, Campbell JJ, Whitby L, Thomas RJ, Braybrook J, Petzing J. Current trends in flow cytometry automated data analysis software. Cytometry A 2021; 99:1007-1021. [PMID: 33606354 DOI: 10.1002/cyto.a.24320] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2020] [Revised: 01/21/2021] [Accepted: 01/28/2021] [Indexed: 12/16/2022]
Abstract
Automated flow cytometry (FC) data analysis tools for cell population identification and characterization are increasingly being used in academic, biotechnology, pharmaceutical, and clinical laboratories. The development of these computational methods is designed to overcome reproducibility and process bottleneck issues in manual gating, however, the take-up of these tools remains (anecdotally) low. Here, we performed a comprehensive literature survey of state-of-the-art computational tools typically published by research, clinical, and biomanufacturing laboratories for automated FC data analysis and identified popular tools based on literature citation counts. Dimensionality reduction methods ranked highly, such as generic t-distributed stochastic neighbor embedding (t-SNE) and its initial Matlab-based implementation for cytometry data viSNE. Software with graphical user interfaces also ranked highly, including PhenoGraph, SPADE1, FlowSOM, and Citrus, with unsupervised learning methods outnumbering supervised learning methods, and algorithm type popularity spread across K-Means, hierarchical, density-based, model-based, and other classes of clustering algorithms. Additionally, to illustrate the actual use typically within clinical spaces alongside frequent citations, a survey issued by UK NEQAS Leucocyte Immunophenotyping to identify software usage trends among clinical laboratories was completed. The survey revealed 53% of laboratories have not yet taken up automated cell population identification methods, though among those that have, Infinicyt software is the most frequently identified. Survey respondents considered data output quality to be the most important factor when using automated FC data analysis software, followed by software speed and level of technical support. This review found differences in software usage between biomedical institutions, with tools for discovery, data exploration, and visualization more popular in academia, whereas automated tools for specialized targeted analysis that apply supervised learning methods were more used in clinical settings.
Collapse
Affiliation(s)
- Melissa Cheung
- Centre for Biological Engineering, Loughborough University, Loughborough, Leicestershire, United Kingdom
| | | | - Liam Whitby
- UK NEQAS for Leucocyte Immunophenotyping, Sheffield Teaching Hospitals NHS Foundation Trust, Sheffield, United Kingdom
| | - Robert J Thomas
- Centre for Biological Engineering, Loughborough University, Loughborough, Leicestershire, United Kingdom
| | - Julian Braybrook
- National Measurement Laboratory, LGC, Teddington, United Kingdom
| | - Jon Petzing
- Centre for Biological Engineering, Loughborough University, Loughborough, Leicestershire, United Kingdom
| |
Collapse
|
9
|
Abstract
PURPOSE OF REVIEW Artificial intelligence has pervasively transformed many industries and is beginning to shape medical practice. New use cases are being identified in subspecialty domains of medicine and, in particular, application of artificial intelligence has found its way to the practice of allergy-immunology. Here, we summarize recent developments, emerging applications and obstacles to realizing full potential. RECENT FINDINGS Artificial/augmented intelligence and machine learning are being used to reduce dimensional complexity, understand cellular interactions and advance vaccine work in the basic sciences. In genomics, bioinformatic methods are critical for variant calling and classification. For clinical work, artificial intelligence is enabling disease detection, risk profiling and decision support. These approaches are just beginning to have impact upon the field of clinical immunology and much opportunity exists for further advancement. SUMMARY This review highlights use of computational methods for analysis of large datasets across the spectrum of research and clinical care for patients with immunological disorders. Here, we discuss how big data methods are presently being used across the field clinical immunology.
Collapse
|
10
|
Validation of a hybrid approach to standardize immunophenotyping analysis in large population studies: The Health and Retirement Study. Sci Rep 2020; 10:8759. [PMID: 32472068 PMCID: PMC7260195 DOI: 10.1038/s41598-020-65016-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2019] [Accepted: 04/01/2020] [Indexed: 12/11/2022] Open
Abstract
Traditional manual gating strategies are often time-intensive, place a high burden on the analyzer, and are susceptible to bias between analyzers. Several automated gating methods have shown to exceed performance of manual gating for a limited number of cell subsets. However, many of the automated algorithms still require significant manual interventions or have yet to demonstrate their utility in large datasets. Therefore, we developed an approach that utilizes a previously published automated algorithm (OpenCyto framework) with a manually created hierarchically cell gating template implemented, along with a custom developed visualization software (FlowAnnotator) to rapidly and efficiently analyze immunophenotyping data in large population studies. This approach allows pre-defining populations that can be analyzed solely by automated analysis and incorporating manual refinement for smaller downstream populations. We validated this method with traditional manual gating strategies for 24 subsets of T cells, B cells, NK cells, monocytes and dendritic cells in 931 participants from the Health and Retirement Study (HRS). Our results show a high degree of correlation (r ≥ 0.80) for 18 (78%) of the 24 cell subsets. For the remaining subsets, the correlation was low (<0.80) primarily because of the low numbers of events recorded in these subsets. The mean difference in the absolute counts between the hybrid method and manual gating strategy of these cell subsets showed results that were very similar to the traditional manual gating method. We describe a practical method for standardization of immunophenotyping methods in large scale population studies that provides a rapid, accurate and reproducible alternative to labor intensive manual gating strategies.
Collapse
|
11
|
Unbiased analysis of peripheral blood mononuclear cells reveals CD4 T cell response to RSV matrix protein. Vaccine X 2020; 5:100065. [PMID: 32529184 PMCID: PMC7280769 DOI: 10.1016/j.jvacx.2020.100065] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2019] [Revised: 02/13/2020] [Accepted: 04/20/2020] [Indexed: 12/02/2022] Open
Abstract
Respiratory syncytial virus (RSV) is the most important cause of respiratory tract illness especially in young infants that develop severe disease requiring hospitalization, and accounting for 74,000–126,000 admissions in the United States (Rezaee et al., 2017; Resch, 2017). Observations of neonatal and infant T cells suggest that they may express different immune markers compared to T-cells from older children. Flow cytometry analysis of cellular responses using “conventional” anti-viral markers (IL2, IFN-γ, TNF, IL10 and IL4) upon RSV-peptide stimulation detected an overall low RSV response in peripheral blood. Therefore we sought an unbiased approach to identify RSV-specific immune markers using RNA-sequencing upon stimulation of infant PBMCs with overlapping peptides representing RSV antigens. To understand the cellular response using transcriptional signatures, transcription factors and cell-type specific signatures were used to investigate breadth of response across peptides. Unexpected from the ICS data, M peptide induced a response equivalent to the F-peptide and was characterized by activation of GATA2, 3, STAT3 and IRF1. This along with upregulation of several unconventional T cell signatures was only observed upon M-peptide stimulation. Moreover, signatures of natural RSV infections were identified from the data available in the public domain to investigate similarities between transcriptional signatures from PBMCs and upon peptide stimulation. This analysis also suggested activation of T cell response upon M-peptide stimulation. Hence, based on transcriptional response, markers were chosen to validate the role of M-peptide in activation of T cells. Indeed, CD4+CXCL9+ cells were identified upon M-peptide stimulation by flow cytometry. Future work using additional markers identified in this study could reveal additional unconventional T cells responding to RSV infections in infants. In conclusion, T cell responses to RSV in infants may not follow the canonical Th1/Th2 patterns of effector responses but include additional functions that may be unique to the neonatal period and correlate with clinical outcomes.
Collapse
|
12
|
Ji D, Putzel P, Qian Y, Chang I, Mandava A, Scheuermann RH, Bui JD, Wang H, Smyth P. Machine Learning of Discriminative Gate Locations for Clinical Diagnosis. Cytometry A 2020; 97:296-307. [PMID: 31691488 PMCID: PMC7079150 DOI: 10.1002/cyto.a.23906] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2019] [Revised: 08/22/2019] [Accepted: 09/18/2019] [Indexed: 01/03/2023]
Abstract
High-throughput single-cell cytometry technologies have significantly improved our understanding of cellular phenotypes to support translational research and the clinical diagnosis of hematological and immunological diseases. However, subjective and ad hoc manual gating analysis does not adequately handle the increasing volume and heterogeneity of cytometry data for optimal diagnosis. Prior work has shown that machine learning can be applied to classify cytometry samples effectively. However, many of the machine learning classification results are either difficult to interpret without using characteristics of cell populations to make the classification, or suboptimal due to the use of inaccurate cell population characteristics derived from gating boundaries. To date, little has been done to optimize both the gating boundaries and the diagnostic accuracy simultaneously. In this work, we describe a fully discriminative machine learning approach that can simultaneously learn feature representations (e.g., combinations of coordinates of gating boundaries) and classifier parameters for optimizing clinical diagnosis from cytometry measurements. The approach starts from an initial gating position and then refines the position of the gating boundaries by gradient descent until a set of globally-optimized gates across different samples are achieved. The learning procedure is constrained by regularization terms encoding domain knowledge that encourage the algorithm to seek interpretable results. We evaluate the proposed approach using both simulated and real data, producing classification results on par with those generated via human expertise, in terms of both the positions of the gating boundaries and the diagnostic accuracy. © 2019 The Authors. Cytometry Part A published by Wiley Periodicals, Inc. on behalf of International Society for Advancement of Cytometry.
Collapse
Affiliation(s)
- Disi Ji
- Department of Computer ScienceUniversity of CaliforniaIrvineCalifornia
| | - Preston Putzel
- Department of Computer ScienceUniversity of CaliforniaIrvineCalifornia
| | - Yu Qian
- InformaticsJ. Craig Venter InstituteLa JollaCalifornia
| | - Ivan Chang
- InformaticsJ. Craig Venter InstituteLa JollaCalifornia
| | | | - Richard H. Scheuermann
- InformaticsJ. Craig Venter InstituteLa JollaCalifornia
- Department of PathologyUniversity of CaliforniaSan Diego, La JollaCalifornia
| | - Jack D. Bui
- Department of PathologyUniversity of CaliforniaSan Diego, La JollaCalifornia
| | - Huan‐You Wang
- Department of PathologyUniversity of CaliforniaSan Diego, La JollaCalifornia
| | - Padhraic Smyth
- Department of Computer ScienceUniversity of CaliforniaIrvineCalifornia
| |
Collapse
|
13
|
Pedersen CB, Olsen LR. Algorithmic Clustering Of Single-Cell Cytometry Data-How Unsupervised Are These Analyses Really? Cytometry A 2019; 97:219-221. [PMID: 31688998 DOI: 10.1002/cyto.a.23917] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2019] [Accepted: 09/27/2019] [Indexed: 12/22/2022]
Affiliation(s)
- Christina Bligaard Pedersen
- Department of Health Technology, Technical University of Denmark, Kongens Lyngby, Denmark.,Center for Genomic Medicine, Rigshospitalet-Copenhagen University Hospital, Copenhagen, Denmark
| | - Lars Rønn Olsen
- Department of Health Technology, Technical University of Denmark, Kongens Lyngby, Denmark.,Center for Genomic Medicine, Rigshospitalet-Copenhagen University Hospital, Copenhagen, Denmark
| |
Collapse
|
14
|
Lee H, Sun Y, Patti-Diaz L, Hedrick M, Ehrhardt AG. High-Throughput Analysis of Clinical Flow Cytometry Data by Automated Gating. Bioinform Biol Insights 2019; 13:1177932219838851. [PMID: 30983860 PMCID: PMC6448119 DOI: 10.1177/1177932219838851] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2019] [Accepted: 02/13/2019] [Indexed: 11/30/2022] Open
Abstract
Advancements in flow cytometers with capability to measure 15 or more parameters have enabled us to characterize cell populations at unprecedented levels of detail. Beyond discovery research, there is now a growing demand to dive deeper into evaluating the immune response in clinical trials for immune modulating compounds. However, for high-volume, complex flow cytometry data generated in clinical trials, conventional manual gating remains the standard of practice. Traditional manual gating is resource intense and becomes a bottleneck and an impractical method to complete high volumes of flow cytometry data analysis. Current efforts to automate “manual gating” have shown that computational algorithms can facilitate the analysis of daunting multi-parameter data; however, a greater degree of precision in comparison with traditional manual gating is needed for wide-scale adoption of automated gating methods. In an effort to more closely follow the manual gating process, our automated gating pipeline was created to include negative controls (Fluorescence Minus One [FMO]) to enhance the reliability of gate placement. We demonstrate that use of an automated pipeline, heavily relying on FMO controls for population discrimination, can analyze multi-parameter, large-scale clinical datasets with comparable precision and accuracy to traditional manual gating.
Collapse
Affiliation(s)
- Hunjoong Lee
- Hunjoong Lee, Clinical Flow Cytometry, Translational Medicine, Bristol-Myers Squibb, Pennington, NJ 08534, USA.
| | | | | | | | | |
Collapse
|