1
|
Yi M, Zhan T, Rui H, Chervoneva I. Functional protein biomarkers based on distributions of expression levels in single-cell imaging data. Bioinformatics 2025; 41:btaf182. [PMID: 40257750 PMCID: PMC12070390 DOI: 10.1093/bioinformatics/btaf182] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2024] [Revised: 04/01/2025] [Accepted: 04/19/2025] [Indexed: 04/22/2025] Open
Abstract
MOTIVATION The intra-tumor heterogeneity of protein expression is well recognized and may provide important information for cancer prognosis and predicting treatment responses. Analytic methods that account for spatial heterogeneity remain methodologically complex and computationally demanding for single-cell protein expression. For many functional proteins, single-cell expressions vary independently of spatial localization in a substantial proportion of the tumor tissues, and incorporation of spatial information may not affect the prognostic value of such protein biomarkers. RESULTS We developed a new framework for using the distributions of functional single-cell protein expression levels as cancer biomarkers. The quantile functions of single-cell expressions are used to fully capture the heterogeneity of protein expression across all cancer cells. The quantile index (QI) biomarker is defined as an integral of an unspecified function which may depend linearly or nonlinearly on a tissue-specific quantile function. Linear and nonlinear versions of QI biomarkers based on single-cell expressions of ER, Ki67, TS, and CyclinD3 were derived and evaluated as predictors of progression-free survival or high mitotic index in a large breast cancer dataset. We evaluated performance and demonstrated the advantages of nonlinear QI biomarkers through simulation studies. AVAILABILITY AND IMPLEMENTATION The associated R package Qindex is available at https://CRAN.R-project.org/package=Qindex and R package hyper.gam is available at https://github.com/tingtingzhan/hyper.gam. Examples of R code and detailed instructions could be found in vignette quantile-index-predictor (https://CRAN.R-project.org/package=hyper.gam/vignettes/applications.html#quantile-index-predictor).
Collapse
Affiliation(s)
- Misung Yi
- Department of Statistics & Data Science, College of Software and Convergence, Dankook University, Suji-gu, Gyeonggi-do 16890, Korea
| | - Tingting Zhan
- Division of Biostatistics & Bioinformatics, Department of Pharmacology, Physiology & Cancer Biology, Sidney Kimmel Medical College, Thomas Jefferson University, Philadelphia, PA 19107, United States
| | - Hallgeir Rui
- Division of Cancer Biology, Department of Pharmacology, Physiology & Cancer Biology, Sidney Kimmel Medical College, Thomas Jefferson University, Philadelphia, PA 19107, United States
| | - Inna Chervoneva
- Division of Biostatistics & Bioinformatics, Department of Pharmacology, Physiology & Cancer Biology, Sidney Kimmel Medical College, Thomas Jefferson University, Philadelphia, PA 19107, United States
| |
Collapse
|
2
|
Soupir AC, Gadiyar IV, Helm BR, Harris CR, Vandekar SN, Peres LC, Coffey RJ, Wrobel J, Ma S, Fridley BL. Benchmarking Spatial Co-Localization Methods for Single-Cell Multiplex Imaging Data with Applications to High-Grade Serous Ovarian and Triple Negative Breast Cancer. STATISTICS AND DATA SCIENCE IN IMAGING 2025; 2:2437947. [PMID: 40051984 PMCID: PMC11883755 DOI: 10.1080/29979676.2024.2437947] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/03/2024] [Accepted: 11/26/2024] [Indexed: 03/09/2025]
Abstract
Single-cell multiplex imaging (scMI) measures cell locations and phenotypes within a tissue and can be used to understand the tumor microenvironment. In scMI studies, it is often of interest to quantify spatial co-localization of immune cells and its association with clinical outcomes; however, it remains unknown which of the many available spatial indices have adequate power to detect spatial within-sample co-localization and its association with patient outcomes, such as survival. In this study, the performance of six frequentist metrics of spatial co-localization used in scMI studies were evaluated. Simulated data was used to assess the power and type I error of these spatial metrics to detect signficant co-localization. Furthermore, these spatial co-localization methods were applied to two scMI studies - a high-grade serous ovarian cancer (HGSOC) study and triple negative breast cancer (TNBC) study - to detect within-sample co-localization between cell types and their sensitivity to detect differences in survival across samples. In the simulation study, Ripley's K had the greatest power to identify co-localization followed closely by pair correlation g; all other statistics showed little power across all simulation scenarios. In the application of the methods to cancer studies, the results consistently point to pair correlation g and Ripley's K as indices with the most power for detecting significant co-localization in scMI data. Furthermore, pair correlation g, Ripley's K, and the scLMM index were most effective for estimating between-sample associations between level of co-localization and survival.
Collapse
Affiliation(s)
- Alex C Soupir
- Department of Biostatistics & Bioinformatics, Moffitt Cancer Center
| | - Ishaan V Gadiyar
- Department of Biostatistics, Vanderbilt University Medical Center
| | - Bryan R Helm
- Department of Biostatistics, Vanderbilt University Medical Center
| | - Coleman R Harris
- Department of Biostatistics, Vanderbilt University Medical Center
| | - Simon N Vandekar
- Department of Biostatistics, Vanderbilt University Medical Center
| | - Lauren C Peres
- Department of Cancer Epidemiology, Moffitt Cancer Center
| | - Robert J Coffey
- Department of Cell and Developmental Biology, Vanderbilt University Medical Center
| | - Julia Wrobel
- Department of Biostatistics & Bioinformatics, Emory University
| | - Siyuan Ma
- Department of Biostatistics, Vanderbilt University Medical Center
| | - Brooke L Fridley
- Division of Health Services & Outcomes Research, Children's Mercy
| |
Collapse
|
3
|
Samorodnitsky S, Campbell K, Little A, Ling W, Zhao N, Chen YC, Wu MC. Detecting Clinically Relevant Topological Structures in Multiplexed Spatial Proteomics Imaging Using TopKAT. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.12.18.628976. [PMID: 39764056 PMCID: PMC11702633 DOI: 10.1101/2024.12.18.628976] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 01/16/2025]
Abstract
Novel multiplexed spatial proteomics imaging platforms expose the spatial architecture of cells in the tumor microenvironment (TME). The diverse cell population in the TME, including its spatial context, has been shown to have important clinical implications, correlating with disease prognosis and treatment response. The accelerating implementation of spatial proteomic technologies motivates new statistical models to test if cell-level images associate with patient-level endpoints. Few existing methods can robustly characterize the geometry of the spatial arrangement of cells and also yield both a valid and powerful test for association with patient-level outcomes. We propose a topology-based approach that combines persistent homology with kernel testing to determine if topological structures created by cells predict continuous, binary, or survival clinical endpoints. We term our method TopKAT (Topological Kernel Association Test) and show that it can be more powerful than statistical tests grounded in the spatial point process model, particularly when cells arise along the boundary of a ring. We demonstrate the properties of TopKAT through simulation studies and apply it to two studies of triple negative breast cancer where we show that TopKAT recovers clinically relevant topological structures in the spatial distribution of immune and tumor cells.
Collapse
Affiliation(s)
- Sarah Samorodnitsky
- Public Health Sciences Division, Fred Hutchinson Cancer Center
- SWOG Statistics and Data Management Center
| | - Katie Campbell
- Medicine, Division of Hematology/Oncology, University of California Los Angeles
| | - Amarise Little
- Public Health Sciences Division, Fred Hutchinson Cancer Center
- SWOG Statistics and Data Management Center
| | - Wodan Ling
- Population Health Sciences, Weill Cornell Medical College
| | - Ni Zhao
- Department of Biostatistics, Bloomberg School of Public Health, Johns Hopkins University
| | - Yen-Chi Chen
- Department of Statistics, University of Washington
| | - Michael C. Wu
- Public Health Sciences Division, Fred Hutchinson Cancer Center
- SWOG Statistics and Data Management Center
| |
Collapse
|
4
|
Sarkar S, Möller A, Hartebrodt A, Erdmann M, Ostalecki C, Baur A, Blumenthal DB. Spatial cell graph analysis reveals skin tissue organization characteristic for cutaneous T cell lymphoma. NPJ Syst Biol Appl 2024; 10:143. [PMID: 39622929 PMCID: PMC11612425 DOI: 10.1038/s41540-024-00474-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2024] [Accepted: 11/22/2024] [Indexed: 12/06/2024] Open
Abstract
Cutaneous T-cell lymphomas (CTCLs) are non-Hodgkin lymphomas caused by malignant T cells which migrate to the skin and lead to rash-like lesions which can be difficult to distinguish from inflammatory skin conditions like atopic dermatitis (AD) and psoriasis (PSO). To characterize CTCL in comparison to these differential diagnoses, we carried out multi-antigen imaging on 69 skin tissue samples (21 CTCL, 23 AD, 25 PSO). The resulting protein abundance maps were then analyzed via scoring functions to quantify the heterogeneity of the individual cells' neighborhoods within spatial graphs inferred from the cells' positions in the tissue samples. Our analyses reveal characteristic patterns of skin tissue organization in CTCL as compared to AD and PSO, including a combination of increased local entropy and egophily in T-cell neighborhoods. These results could not only pave the way for high-precision diagnosis of CTCL, but may also facilitate further insights into cellular disease mechanisms.
Collapse
Affiliation(s)
- Suryadipto Sarkar
- Biomedical Network Science Lab, Department Artificial Intelligence in Biomedical Engineering (AIBE), Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU), Erlangen, Germany
| | - Anna Möller
- Biomedical Network Science Lab, Department Artificial Intelligence in Biomedical Engineering (AIBE), Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU), Erlangen, Germany
| | - Anne Hartebrodt
- Biomedical Network Science Lab, Department Artificial Intelligence in Biomedical Engineering (AIBE), Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU), Erlangen, Germany
| | - Michael Erdmann
- Department of Dermatology, Uniklinikum Erlangen, Deutsches Zentrum Immuntherapie (DZI), Comprehensive Cancer Center Erlangen-European Metropolitan Area of Nuremberg (CCC ER-EMN), Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU), Erlangen, Germany
| | - Christian Ostalecki
- Department of Dermatology, Uniklinikum Erlangen, Deutsches Zentrum Immuntherapie (DZI), Comprehensive Cancer Center Erlangen-European Metropolitan Area of Nuremberg (CCC ER-EMN), Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU), Erlangen, Germany
| | - Andreas Baur
- Department of Dermatology, Uniklinikum Erlangen, Deutsches Zentrum Immuntherapie (DZI), Comprehensive Cancer Center Erlangen-European Metropolitan Area of Nuremberg (CCC ER-EMN), Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU), Erlangen, Germany
| | - David B Blumenthal
- Biomedical Network Science Lab, Department Artificial Intelligence in Biomedical Engineering (AIBE), Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU), Erlangen, Germany.
| |
Collapse
|
5
|
Wrobel J, Soupir AC, Hayes MT, Peres LC, Vu T, Leroux A, Fridley BL. mxfda: a comprehensive toolkit for functional data analysis of single-cell spatial data. BIOINFORMATICS ADVANCES 2024; 4:vbae155. [PMID: 39552929 PMCID: PMC11568348 DOI: 10.1093/bioadv/vbae155] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/25/2024] [Revised: 10/01/2024] [Accepted: 10/13/2024] [Indexed: 11/19/2024]
Abstract
Summary Technologies that produce spatial single-cell (SC) data have revolutionized the study of tissue microstructures and promise to advance personalized treatment of cancer by revealing new insights about the tumor microenvironment. Functional data analysis (FDA) is an ideal analytic framework for connecting cell spatial relationships to patient outcomes, but can be challenging to implement. To address this need, we present mxfda, an R package for end-to-end analysis of SC spatial data using FDA. mxfda implements a suite of methods to facilitate spatial analysis of SC imaging data using FDA techniques. Availability and implementation The mxfda R package is freely available at https://cran.r-project.org/package=mxfda and has detailed documentation, including four vignettes, available at http://juliawrobel.com/mxfda/.
Collapse
Affiliation(s)
- Julia Wrobel
- Department of Biostatistics & Bioinformatics, Emory University, Atlanta, GA 30322, United States
| | - Alex C Soupir
- Department of Biostatistics & Bioinformatics, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL 33612, United States
| | - Mitchell T Hayes
- Department of Genitourinary Oncology, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL 33612, United States
| | - Lauren C Peres
- Department of Cancer Epidemiology, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL 33612, United States
| | - Thao Vu
- Department of Biostatistics & Informatics, Colorado School of Public Health, Aurora, CO 80045, United States
| | - Andrew Leroux
- Department of Biostatistics & Informatics, Colorado School of Public Health, Aurora, CO 80045, United States
| | - Brooke L Fridley
- Division of Health Services & Outcomes Research, Children’s Mercy, Kansas City, MO 64108, United States
| |
Collapse
|
6
|
Mi H, Sivagnanam S, Ho WJ, Zhang S, Bergman D, Deshpande A, Baras AS, Jaffee EM, Coussens LM, Fertig EJ, Popel AS. Computational methods and biomarker discovery strategies for spatial proteomics: a review in immuno-oncology. Brief Bioinform 2024; 25:bbae421. [PMID: 39179248 PMCID: PMC11343572 DOI: 10.1093/bib/bbae421] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2024] [Revised: 07/11/2024] [Accepted: 08/09/2024] [Indexed: 08/26/2024] Open
Abstract
Advancements in imaging technologies have revolutionized our ability to deeply profile pathological tissue architectures, generating large volumes of imaging data with unparalleled spatial resolution. This type of data collection, namely, spatial proteomics, offers invaluable insights into various human diseases. Simultaneously, computational algorithms have evolved to manage the increasing dimensionality of spatial proteomics inherent in this progress. Numerous imaging-based computational frameworks, such as computational pathology, have been proposed for research and clinical applications. However, the development of these fields demands diverse domain expertise, creating barriers to their integration and further application. This review seeks to bridge this divide by presenting a comprehensive guideline. We consolidate prevailing computational methods and outline a roadmap from image processing to data-driven, statistics-informed biomarker discovery. Additionally, we explore future perspectives as the field moves toward interfacing with other quantitative domains, holding significant promise for precision care in immuno-oncology.
Collapse
Affiliation(s)
- Haoyang Mi
- Department of Biomedical Engineering, Johns Hopkins University School of Medicine, Baltimore, MD 21205, United States
| | - Shamilene Sivagnanam
- The Knight Cancer Institute, Oregon Health and Science University, Portland, OR 97201, United States
- Department of Cell, Development and Cancer Biology, Oregon Health and Science University, Portland, OR 97201, United States
| | - Won Jin Ho
- Department of Oncology, Johns Hopkins University School of Medicine, MD 21205, United States
- Convergence Institute, Johns Hopkins University, Baltimore, MD 21205, United States
| | - Shuming Zhang
- Department of Biomedical Engineering, Johns Hopkins University School of Medicine, Baltimore, MD 21205, United States
| | - Daniel Bergman
- Department of Oncology, Johns Hopkins University School of Medicine, MD 21205, United States
- Convergence Institute, Johns Hopkins University, Baltimore, MD 21205, United States
| | - Atul Deshpande
- Department of Oncology, Johns Hopkins University School of Medicine, MD 21205, United States
- Convergence Institute, Johns Hopkins University, Baltimore, MD 21205, United States
- Bloomberg-Kimmel Institute for Cancer Immunotherapy, Johns Hopkins University School of Medicine, Baltimore, MD 21205, United States
| | - Alexander S Baras
- Bloomberg-Kimmel Institute for Cancer Immunotherapy, Johns Hopkins University School of Medicine, Baltimore, MD 21205, United States
- Department of Pathology, Johns Hopkins University School of Medicine, MD 21205, United States
- The Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, Baltimore, MD 21205, United States
| | - Elizabeth M Jaffee
- Department of Oncology, Johns Hopkins University School of Medicine, MD 21205, United States
- Convergence Institute, Johns Hopkins University, Baltimore, MD 21205, United States
- Bloomberg-Kimmel Institute for Cancer Immunotherapy, Johns Hopkins University School of Medicine, Baltimore, MD 21205, United States
| | - Lisa M Coussens
- The Knight Cancer Institute, Oregon Health and Science University, Portland, OR 97201, United States
- Department of Cell, Development and Cancer Biology, Oregon Health and Science University, Portland, OR 97201, United States
- Brenden-Colson Center for Pancreatic Care, Oregon Health and Science University, Portland, OR 97201, United States
| | - Elana J Fertig
- Department of Biomedical Engineering, Johns Hopkins University School of Medicine, Baltimore, MD 21205, United States
- Department of Oncology, Johns Hopkins University School of Medicine, MD 21205, United States
- Convergence Institute, Johns Hopkins University, Baltimore, MD 21205, United States
- Bloomberg-Kimmel Institute for Cancer Immunotherapy, Johns Hopkins University School of Medicine, Baltimore, MD 21205, United States
- Department of Applied Mathematics and Statistics, Johns Hopkins University Whiting School of Engineering, Baltimore, MD 21218, United States
| | - Aleksander S Popel
- Department of Biomedical Engineering, Johns Hopkins University School of Medicine, Baltimore, MD 21205, United States
- Department of Oncology, Johns Hopkins University School of Medicine, MD 21205, United States
| |
Collapse
|
7
|
Samorodnitsky S, Campbell K, Ribas A, Wu MC. A Spatial Omnibus Test (SPOT) for Spatial Proteomic Data. Bioinformatics 2024; 40:btae425. [PMID: 38950184 PMCID: PMC11257711 DOI: 10.1093/bioinformatics/btae425] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2024] [Revised: 05/10/2024] [Accepted: 06/28/2024] [Indexed: 07/03/2024] Open
Abstract
MOTIVATION Spatial proteomics can reveal the spatial organization of immune cells in the tumor immune microenvironment. Relating measures of spatial clustering, such as Ripley's K or Besag's L, to patient outcomes may offer important clinical insights. However, these measures require pre-specifying a radius in which to quantify clustering, yet no consensus exists on the optimal radius which may be context-specific. RESULTS We propose a SPatial Omnibus Test (SPOT) which conducts this analysis across a range of candidate radii. At each radius, SPOT evaluates the association between the spatial summary and outcome, adjusting for confounders. SPOT then aggregates results across radii using the Cauchy combination test, yielding an omnibus P-value characterizing the overall degree of association. Using simulations, we verify that the type I error rate is controlled and show SPOT can be more powerful than alternatives. We also apply SPOT to ovarian and lung cancer studies. AVAILABILITY AND IMPLEMENTATION An R package and tutorial are provided at https://github.com/sarahsamorodnitsky/SPOT.
Collapse
Affiliation(s)
- Sarah Samorodnitsky
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle 98109, USA
- SWOG Statistics and Data Management Center, Fred Hutchinson Cancer Center, Seattle 98109, USA
| | - Katie Campbell
- Department of Medicine, Division of Hematology/Oncology, University of California Los Angeles, Los Angeles 90095, USA
| | - Antoni Ribas
- Department of Medicine, Division of Hematology/Oncology, University of California Los Angeles, Los Angeles 90095, USA
| | - Michael C Wu
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle 98109, USA
- SWOG Statistics and Data Management Center, Fred Hutchinson Cancer Center, Seattle 98109, USA
| |
Collapse
|
8
|
Samorodnitsky S, Campbell K, Ribas A, Wu MC. A Spatial Omnibus Test (SPOT) for Spatial Proteomic Data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.08.584117. [PMID: 38559053 PMCID: PMC10979932 DOI: 10.1101/2024.03.08.584117] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/04/2024]
Abstract
Spatial proteomics can reveal the spatial organization of immune cells in the tumor immune microenvironment. Relating measures of spatial clustering, such as Ripley's K or Besag's L, to patient outcomes may offer important clinical insights. However, these measures require pre-specifying a radius in which to quantify clustering, yet no consensus exists on the optimal radius which may be context-specific. We propose a SPatial Omnibus Test (SPOT) which conducts this analysis across a range of candidate radii. At each radius, SPOT evaluates the association between the spatial summary and outcome, adjusting for confounders. SPOT then aggregates results across radii using the Cauchy combination test, yielding an omnibus p-value characterizing the overall degree of association. Using simulations, we verify that the type I error rate is controlled and show SPOT can be more powerful than alternatives. We also apply SPOT to an ovarian cancer study. An R package and tutorial is provided at https://github.com/sarahsamorodnitsky/SPOT.
Collapse
Affiliation(s)
- Sarah Samorodnitsky
- Public Health Sciences Division, Fred Hutch Cancer Center
- SWOG Statistics and Data Management Center
| | - Katie Campbell
- Medicine, Division of Hematology/Oncology, University of California Los Angeles
| | - Antoni Ribas
- Medicine, Division of Hematology/Oncology, University of California Los Angeles
| | - Michael C Wu
- Public Health Sciences Division, Fred Hutch Cancer Center
- SWOG Statistics and Data Management Center
| |
Collapse
|