1
|
Yi M, Zhan T, Rui H, Chervoneva I. Functional protein biomarkers based on distributions of expression levels in single-cell imaging data. Bioinformatics 2025; 41:btaf182. [PMID: 40257750 PMCID: PMC12070390 DOI: 10.1093/bioinformatics/btaf182] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2024] [Revised: 04/01/2025] [Accepted: 04/19/2025] [Indexed: 04/22/2025] Open
Abstract
MOTIVATION The intra-tumor heterogeneity of protein expression is well recognized and may provide important information for cancer prognosis and predicting treatment responses. Analytic methods that account for spatial heterogeneity remain methodologically complex and computationally demanding for single-cell protein expression. For many functional proteins, single-cell expressions vary independently of spatial localization in a substantial proportion of the tumor tissues, and incorporation of spatial information may not affect the prognostic value of such protein biomarkers. RESULTS We developed a new framework for using the distributions of functional single-cell protein expression levels as cancer biomarkers. The quantile functions of single-cell expressions are used to fully capture the heterogeneity of protein expression across all cancer cells. The quantile index (QI) biomarker is defined as an integral of an unspecified function which may depend linearly or nonlinearly on a tissue-specific quantile function. Linear and nonlinear versions of QI biomarkers based on single-cell expressions of ER, Ki67, TS, and CyclinD3 were derived and evaluated as predictors of progression-free survival or high mitotic index in a large breast cancer dataset. We evaluated performance and demonstrated the advantages of nonlinear QI biomarkers through simulation studies. AVAILABILITY AND IMPLEMENTATION The associated R package Qindex is available at https://CRAN.R-project.org/package=Qindex and R package hyper.gam is available at https://github.com/tingtingzhan/hyper.gam. Examples of R code and detailed instructions could be found in vignette quantile-index-predictor (https://CRAN.R-project.org/package=hyper.gam/vignettes/applications.html#quantile-index-predictor).
Collapse
Affiliation(s)
- Misung Yi
- Department of Statistics & Data Science, College of Software and Convergence, Dankook University, Suji-gu, Gyeonggi-do 16890, Korea
| | - Tingting Zhan
- Division of Biostatistics & Bioinformatics, Department of Pharmacology, Physiology & Cancer Biology, Sidney Kimmel Medical College, Thomas Jefferson University, Philadelphia, PA 19107, United States
| | - Hallgeir Rui
- Division of Cancer Biology, Department of Pharmacology, Physiology & Cancer Biology, Sidney Kimmel Medical College, Thomas Jefferson University, Philadelphia, PA 19107, United States
| | - Inna Chervoneva
- Division of Biostatistics & Bioinformatics, Department of Pharmacology, Physiology & Cancer Biology, Sidney Kimmel Medical College, Thomas Jefferson University, Philadelphia, PA 19107, United States
| |
Collapse
|
2
|
Soupir AC, Gadiyar IV, Helm BR, Harris CR, Vandekar SN, Peres LC, Coffey RJ, Wrobel J, Ma S, Fridley BL. Benchmarking Spatial Co-Localization Methods for Single-Cell Multiplex Imaging Data with Applications to High-Grade Serous Ovarian and Triple Negative Breast Cancer. STATISTICS AND DATA SCIENCE IN IMAGING 2025; 2:2437947. [PMID: 40051984 PMCID: PMC11883755 DOI: 10.1080/29979676.2024.2437947] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/03/2024] [Accepted: 11/26/2024] [Indexed: 03/09/2025]
Abstract
Single-cell multiplex imaging (scMI) measures cell locations and phenotypes within a tissue and can be used to understand the tumor microenvironment. In scMI studies, it is often of interest to quantify spatial co-localization of immune cells and its association with clinical outcomes; however, it remains unknown which of the many available spatial indices have adequate power to detect spatial within-sample co-localization and its association with patient outcomes, such as survival. In this study, the performance of six frequentist metrics of spatial co-localization used in scMI studies were evaluated. Simulated data was used to assess the power and type I error of these spatial metrics to detect signficant co-localization. Furthermore, these spatial co-localization methods were applied to two scMI studies - a high-grade serous ovarian cancer (HGSOC) study and triple negative breast cancer (TNBC) study - to detect within-sample co-localization between cell types and their sensitivity to detect differences in survival across samples. In the simulation study, Ripley's K had the greatest power to identify co-localization followed closely by pair correlation g; all other statistics showed little power across all simulation scenarios. In the application of the methods to cancer studies, the results consistently point to pair correlation g and Ripley's K as indices with the most power for detecting significant co-localization in scMI data. Furthermore, pair correlation g, Ripley's K, and the scLMM index were most effective for estimating between-sample associations between level of co-localization and survival.
Collapse
Affiliation(s)
- Alex C Soupir
- Department of Biostatistics & Bioinformatics, Moffitt Cancer Center
| | - Ishaan V Gadiyar
- Department of Biostatistics, Vanderbilt University Medical Center
| | - Bryan R Helm
- Department of Biostatistics, Vanderbilt University Medical Center
| | - Coleman R Harris
- Department of Biostatistics, Vanderbilt University Medical Center
| | - Simon N Vandekar
- Department of Biostatistics, Vanderbilt University Medical Center
| | - Lauren C Peres
- Department of Cancer Epidemiology, Moffitt Cancer Center
| | - Robert J Coffey
- Department of Cell and Developmental Biology, Vanderbilt University Medical Center
| | - Julia Wrobel
- Department of Biostatistics & Bioinformatics, Emory University
| | - Siyuan Ma
- Department of Biostatistics, Vanderbilt University Medical Center
| | - Brooke L Fridley
- Division of Health Services & Outcomes Research, Children's Mercy
| |
Collapse
|
3
|
Wrobel J, Soupir AC, Hayes MT, Peres LC, Vu T, Leroux A, Fridley BL. mxfda: a comprehensive toolkit for functional data analysis of single-cell spatial data. BIOINFORMATICS ADVANCES 2024; 4:vbae155. [PMID: 39552929 PMCID: PMC11568348 DOI: 10.1093/bioadv/vbae155] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/25/2024] [Revised: 10/01/2024] [Accepted: 10/13/2024] [Indexed: 11/19/2024]
Abstract
Summary Technologies that produce spatial single-cell (SC) data have revolutionized the study of tissue microstructures and promise to advance personalized treatment of cancer by revealing new insights about the tumor microenvironment. Functional data analysis (FDA) is an ideal analytic framework for connecting cell spatial relationships to patient outcomes, but can be challenging to implement. To address this need, we present mxfda, an R package for end-to-end analysis of SC spatial data using FDA. mxfda implements a suite of methods to facilitate spatial analysis of SC imaging data using FDA techniques. Availability and implementation The mxfda R package is freely available at https://cran.r-project.org/package=mxfda and has detailed documentation, including four vignettes, available at http://juliawrobel.com/mxfda/.
Collapse
Affiliation(s)
- Julia Wrobel
- Department of Biostatistics & Bioinformatics, Emory University, Atlanta, GA 30322, United States
| | - Alex C Soupir
- Department of Biostatistics & Bioinformatics, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL 33612, United States
| | - Mitchell T Hayes
- Department of Genitourinary Oncology, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL 33612, United States
| | - Lauren C Peres
- Department of Cancer Epidemiology, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL 33612, United States
| | - Thao Vu
- Department of Biostatistics & Informatics, Colorado School of Public Health, Aurora, CO 80045, United States
| | - Andrew Leroux
- Department of Biostatistics & Informatics, Colorado School of Public Health, Aurora, CO 80045, United States
| | - Brooke L Fridley
- Division of Health Services & Outcomes Research, Children’s Mercy, Kansas City, MO 64108, United States
| |
Collapse
|
4
|
Tsang AP, Krishnan SN, Eliason JN, McGue JJ, Qin A, Frankel TL, Rao A. Assessing the Tumor Immune Landscape Across Multiple Spatial Scales to Differentiate Immunotherapy Response in Metastatic Non-Small Cell Lung Cancer. J Transl Med 2024; 104:102148. [PMID: 39389312 DOI: 10.1016/j.labinv.2024.102148] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2024] [Revised: 09/16/2024] [Accepted: 10/01/2024] [Indexed: 10/12/2024] Open
Abstract
Although immune checkpoint inhibitor-based therapy has shown promising results in non-small cell lung cancer patients with high programmed death-ligand 1 expression, not all patients respond to therapy. The tumor microenvironment (TME) is complex and heterogeneous, making it challenging to understand the key agents and features that influence response to therapies. In this study, we leverage multiplex fluorescent immunohistochemistry to quantitatively assess interactions between tumor and immune cells in an effort to identify patterns occurring at multiple spatial levels of the TME. To do so, we introduce several computational methods novel to a data set of 1,269 multiplex fluorescent immunohistochemistry images from a cohort of 52 patients with metastatic non-small cell lung cancer. With the spatial G-cross function, we quantify the degree of cell interaction at an entire image level, where we see significantly increased activity of cytotoxic T cells and helper T cells with epithelial tumor cells in responders to immune checkpoint inhibitor-based (P = .022 and P < .001, respectively) and decreased activity of T-regulatory cells with epithelial tumor cells compared with nonresponders (P = .010). By leveraging spatial overlap methods, we define tumor subregions (which we call the tumor "periphery," "edge." and "center") and discover more localized immune-immune interactions influencing positive response, including those between cytotoxic T cells and helper T cells with antigen presenting cells in these subregions specifically. Finally, we trained an interpretable deep learning model that identified key cellular regions of interest that most influenced response classification (area under the curve = 0.71 ± 0.02). Assessing spatial interactions within these subregions further revealed new insights that were not significant at the whole image level, particularly the elevated association of antigen presenting cells and T-regulatory cells with one another in responder groups (P = .024). Altogether, we demonstrate that elucidating patterns of cell composition and interplay across multiple levels of spatial analyses can improve our understanding of the TME and better differentiate patient responses to immunotherapy.
Collapse
MESH Headings
- Carcinoma, Non-Small-Cell Lung/immunology
- Carcinoma, Non-Small-Cell Lung/pathology
- Carcinoma, Non-Small-Cell Lung/secondary
- Carcinoma, Non-Small-Cell Lung/therapy
- Lung Neoplasms/immunology
- Lung Neoplasms/pathology
- Lung Neoplasms/therapy
- Immune Checkpoint Inhibitors/pharmacology
- Immune Checkpoint Inhibitors/therapeutic use
- Antineoplastic Agents, Immunological/pharmacology
- Antineoplastic Agents, Immunological/therapeutic use
- Tumor Microenvironment/drug effects
- Tumor Microenvironment/immunology
- Cell Communication/drug effects
- Cell Communication/immunology
- T-Lymphocytes, Cytotoxic/drug effects
- T-Lymphocytes, Cytotoxic/immunology
- T-Lymphocytes, Helper-Inducer/drug effects
- T-Lymphocytes, Helper-Inducer/immunology
- Antigen-Presenting Cells/drug effects
- Antigen-Presenting Cells/immunology
- Deep Learning
- Area Under Curve
- Spatial Analysis
- Fluorescent Antibody Technique
- Cohort Studies
- Humans
- Treatment Outcome
Collapse
Affiliation(s)
- Ashley P Tsang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan.
| | - Santhoshi N Krishnan
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan
| | - Joel N Eliason
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan
| | - Jake J McGue
- Department of Surgery, University of Michigan, Ann Arbor, Michigan
| | - Angel Qin
- Department of Internal Medicine, Division of Hematology-Oncology, University of Michigan, Ann Arbor, Michigan
| | | | - Arvind Rao
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan; Department of Biostatistics, University of Michigan, Ann Arbor, Michigan; Department of Biomedical Engineering, University of Michigan, Ann Arbor, Michigan; Department of Radiation Oncology, University of Michigan, Ann Arbor, Michigan.
| |
Collapse
|
5
|
Samorodnitsky S, Campbell K, Ribas A, Wu MC. A Spatial Omnibus Test (SPOT) for Spatial Proteomic Data. Bioinformatics 2024; 40:btae425. [PMID: 38950184 PMCID: PMC11257711 DOI: 10.1093/bioinformatics/btae425] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2024] [Revised: 05/10/2024] [Accepted: 06/28/2024] [Indexed: 07/03/2024] Open
Abstract
MOTIVATION Spatial proteomics can reveal the spatial organization of immune cells in the tumor immune microenvironment. Relating measures of spatial clustering, such as Ripley's K or Besag's L, to patient outcomes may offer important clinical insights. However, these measures require pre-specifying a radius in which to quantify clustering, yet no consensus exists on the optimal radius which may be context-specific. RESULTS We propose a SPatial Omnibus Test (SPOT) which conducts this analysis across a range of candidate radii. At each radius, SPOT evaluates the association between the spatial summary and outcome, adjusting for confounders. SPOT then aggregates results across radii using the Cauchy combination test, yielding an omnibus P-value characterizing the overall degree of association. Using simulations, we verify that the type I error rate is controlled and show SPOT can be more powerful than alternatives. We also apply SPOT to ovarian and lung cancer studies. AVAILABILITY AND IMPLEMENTATION An R package and tutorial are provided at https://github.com/sarahsamorodnitsky/SPOT.
Collapse
Affiliation(s)
- Sarah Samorodnitsky
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle 98109, USA
- SWOG Statistics and Data Management Center, Fred Hutchinson Cancer Center, Seattle 98109, USA
| | - Katie Campbell
- Department of Medicine, Division of Hematology/Oncology, University of California Los Angeles, Los Angeles 90095, USA
| | - Antoni Ribas
- Department of Medicine, Division of Hematology/Oncology, University of California Los Angeles, Los Angeles 90095, USA
| | - Michael C Wu
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle 98109, USA
- SWOG Statistics and Data Management Center, Fred Hutchinson Cancer Center, Seattle 98109, USA
| |
Collapse
|
6
|
Kulkarni R, Patel G, Eliason J, Rao A. Tensor decomposition to identify context-aware spatial neighborhoods in the tumor microenvironment. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2024; 2024:1-4. [PMID: 40039439 DOI: 10.1109/embc53108.2024.10782693] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/06/2025]
Abstract
The tumor microenvironment (TME) is a complex ecosystem composed of diverse cell types whose spatial interactions play an important role in tumor progression. Many cell types have characteristic biomarkers that can be detected at single cell resolution through multiplex tissue imaging methods. There exist several image analysis tools to quantify such interactions, but they calculate summary statistics one image at a time. In this study, we use a scalable canonical polyadic (CP) tensor decomposition method to probe the spatial heterogeneity of cell type distributions simultaneously across samples. We use a publicly available colorectal cancer (CRC) dataset capturing 56 cell-surface markers from 35 patients across 136 images. Through analysis of tensor decomposition factors, we are able to identify tensor components corresponding to key cellular neighborhoods, such as bulk tumor and tumor edge. Wilcoxon rank sum tests on nine component decomposition identified components that varied significantly across Crohn's-like reaction (CLR) and diffuse inflammatory infiltration (DII) patient groups, including granulocyte enrichment in DII. Survival analysis identified distinct components associated with high tumor level and poor patient prognosis.
Collapse
|
7
|
Samorodnitsky S, Campbell K, Ribas A, Wu MC. A Spatial Omnibus Test (SPOT) for Spatial Proteomic Data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.08.584117. [PMID: 38559053 PMCID: PMC10979932 DOI: 10.1101/2024.03.08.584117] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/04/2024]
Abstract
Spatial proteomics can reveal the spatial organization of immune cells in the tumor immune microenvironment. Relating measures of spatial clustering, such as Ripley's K or Besag's L, to patient outcomes may offer important clinical insights. However, these measures require pre-specifying a radius in which to quantify clustering, yet no consensus exists on the optimal radius which may be context-specific. We propose a SPatial Omnibus Test (SPOT) which conducts this analysis across a range of candidate radii. At each radius, SPOT evaluates the association between the spatial summary and outcome, adjusting for confounders. SPOT then aggregates results across radii using the Cauchy combination test, yielding an omnibus p-value characterizing the overall degree of association. Using simulations, we verify that the type I error rate is controlled and show SPOT can be more powerful than alternatives. We also apply SPOT to an ovarian cancer study. An R package and tutorial is provided at https://github.com/sarahsamorodnitsky/SPOT.
Collapse
Affiliation(s)
- Sarah Samorodnitsky
- Public Health Sciences Division, Fred Hutch Cancer Center
- SWOG Statistics and Data Management Center
| | - Katie Campbell
- Medicine, Division of Hematology/Oncology, University of California Los Angeles
| | - Antoni Ribas
- Medicine, Division of Hematology/Oncology, University of California Los Angeles
| | - Michael C Wu
- Public Health Sciences Division, Fred Hutch Cancer Center
- SWOG Statistics and Data Management Center
| |
Collapse
|