1
|
Laubscher E, Wang X, Razin N, Dougherty T, Xu RJ, Ombelets L, Pao E, Graf W, Moffitt JR, Yue Y, Van Valen D. Accurate single-molecule spot detection for image-based spatial transcriptomics with weakly supervised deep learning. Cell Syst 2024; 15:475-482.e6. [PMID: 38754367 PMCID: PMC11995858 DOI: 10.1016/j.cels.2024.04.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2023] [Revised: 02/05/2024] [Accepted: 04/18/2024] [Indexed: 05/18/2024]
Abstract
Image-based spatial transcriptomics methods enable transcriptome-scale gene expression measurements with spatial information but require complex, manually tuned analysis pipelines. We present Polaris, an analysis pipeline for image-based spatial transcriptomics that combines deep-learning models for cell segmentation and spot detection with a probabilistic gene decoder to quantify single-cell gene expression accurately. Polaris offers a unifying, turnkey solution for analyzing spatial transcriptomics data from multiplexed error-robust FISH (MERFISH), sequential fluorescence in situ hybridization (seqFISH), or in situ RNA sequencing (ISS) experiments. Polaris is available through the DeepCell software library (https://github.com/vanvalenlab/deepcell-spots) and https://www.deepcell.org.
Collapse
Affiliation(s)
- Emily Laubscher
- Division of Chemistry and Chemical Engineering, Caltech, Pasadena, CA 91125, USA
| | - Xuefei Wang
- Division of Biology and Biological Engineering, Caltech, Pasadena, CA 91125, USA
| | - Nitzan Razin
- Division of Biology and Biological Engineering, Caltech, Pasadena, CA 91125, USA
| | - Tom Dougherty
- Division of Biology and Biological Engineering, Caltech, Pasadena, CA 91125, USA
| | - Rosalind J Xu
- Program in Cellular and Molecular Medicine, Boston Children's Hospital, Boston, MA 02115, USA; Department of Microbiology, Blavatnik Institute, Harvard Medical School, Boston, MA 02115, USA; Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA 02115, USA
| | - Lincoln Ombelets
- Division of Chemistry and Chemical Engineering, Caltech, Pasadena, CA 91125, USA
| | - Edward Pao
- Division of Biology and Biological Engineering, Caltech, Pasadena, CA 91125, USA
| | - William Graf
- Division of Biology and Biological Engineering, Caltech, Pasadena, CA 91125, USA
| | - Jeffrey R Moffitt
- Program in Cellular and Molecular Medicine, Boston Children's Hospital, Boston, MA 02115, USA; Department of Microbiology, Blavatnik Institute, Harvard Medical School, Boston, MA 02115, USA; Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - Yisong Yue
- Division of Computational and Mathematical Sciences, Caltech, Pasadena, CA 91125, USA
| | - David Van Valen
- Division of Biology and Biological Engineering, Caltech, Pasadena, CA 91125, USA; Howard Hughes Medical Institute, Chevy Chase, MD 20815, USA.
| |
Collapse
|
2
|
Laubscher E, Wang X(J, Razin N, Dougherty T, Xu RJ, Ombelets L, Pao E, Graf W, Moffitt JR, Yue Y, Van Valen D. Accurate single-molecule spot detection for image-based spatial transcriptomics with weakly supervised deep learning. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.09.03.556122. [PMID: 37732188 PMCID: PMC10508757 DOI: 10.1101/2023.09.03.556122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/22/2023]
Abstract
Image-based spatial transcriptomics methods enable transcriptome-scale gene expression measurements with spatial information but require complex, manually-tuned analysis pipelines. We present Polaris, an analysis pipeline for image-based spatial transcriptomics that combines deep learning models for cell segmentation and spot detection with a probabilistic gene decoder to quantify single-cell gene expression accurately. Polaris offers a unifying, turnkey solution for analyzing spatial transcriptomics data from MERFSIH, seqFISH, or ISS experiments. Polaris is available through the DeepCell software library (https://github.com/vanvalenlab/deepcell-spots) and https://www.deepcell.org.
Collapse
Affiliation(s)
- Emily Laubscher
- Division of Chemistry and Chemical Engineering, Caltech, Pasadena, CA
| | | | - Nitzan Razin
- Division of Biology and Biological Engineering, Caltech, Pasadena, CA
| | - Tom Dougherty
- Division of Biology and Biological Engineering, Caltech, Pasadena, CA
| | - Rosalind J. Xu
- Program in Cellular and Molecular Medicine, Boston Children’s Hospital, Boston MA
- Department of Microbiology, Blavatnik Institute, Harvard Medical School, Boston, MA
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA
| | - Lincoln Ombelets
- Division of Chemistry and Chemical Engineering, Caltech, Pasadena, CA
| | - Edward Pao
- Division of Biology and Biological Engineering, Caltech, Pasadena, CA
| | - William Graf
- Division of Biology and Biological Engineering, Caltech, Pasadena, CA
| | - Jeffrey R. Moffitt
- Program in Cellular and Molecular Medicine, Boston Children’s Hospital, Boston MA
- Department of Microbiology, Blavatnik Institute, Harvard Medical School, Boston, MA
- Broad Institute of Harvard and MIT, Cambridge, MA
| | - Yisong Yue
- Division of Computational and Mathematical Sciences, Caltech, Pasadena, CA
| | - David Van Valen
- Division of Biology and Biological Engineering, Caltech, Pasadena, CA
- Howard Hughes Medical Institute, Chevy Chase, MD
| |
Collapse
|
3
|
Wagner SJ, Matek C, Shetab Boushehri S, Boxberg M, Lamm L, Sadafi A, Winter DJE, Marr C, Peng T. Built to Last? Reproducibility and Reusability of Deep Learning Algorithms in Computational Pathology. Mod Pathol 2024; 37:100350. [PMID: 37827448 DOI: 10.1016/j.modpat.2023.100350] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2023] [Revised: 10/02/2023] [Accepted: 10/03/2023] [Indexed: 10/14/2023]
Abstract
Recent progress in computational pathology has been driven by deep learning. While code and data availability are essential to reproduce findings from preceding publications, ensuring a deep learning model's reusability is more challenging. For that, the codebase should be well-documented and easy to integrate into existing workflows and models should be robust toward noise and generalizable toward data from different sources. Strikingly, only a few computational pathology algorithms have been reused by other researchers so far, let alone employed in a clinical setting. To assess the current state of reproducibility and reusability of computational pathology algorithms, we evaluated peer-reviewed articles available in PubMed, published between January 2019 and March 2021, in 5 use cases: stain normalization; tissue type segmentation; evaluation of cell-level features; genetic alteration prediction; and inference of grading, staging, and prognostic information. We compiled criteria for data and code availability and statistical result analysis and assessed them in 160 publications. We found that only one-quarter (41 of 160 publications) made code publicly available. Among these 41 studies, three-quarters (30 of 41) analyzed their results statistically, half of them (20 of 41) released their trained model weights, and approximately a third (16 of 41) used an independent cohort for evaluation. Our review is intended for both pathologists interested in deep learning and researchers applying algorithms to computational pathology challenges. We provide a detailed overview of publications with published code in the field, list reusable data handling tools, and provide criteria for reproducibility and reusability.
Collapse
Affiliation(s)
- Sophia J Wagner
- Helmholtz AI, Helmholtz Munich-German Research Center for Environmental Health, Neuherberg, Germany; School of Computation, Information and Technology, Technical University of Munich, Garching, Germany
| | - Christian Matek
- Institute of AI for Health, Helmholtz Munich-German Research Center for Environmental Health, Neuherberg, Germany; Institute of Pathology, University Hospital Erlangen, Erlangen, Germany
| | - Sayedali Shetab Boushehri
- School of Computation, Information and Technology, Technical University of Munich, Garching, Germany; Institute of AI for Health, Helmholtz Munich-German Research Center for Environmental Health, Neuherberg, Germany; Data & Analytics (D&A), Roche Pharma Research and Early Development (pRED), Roche Innovation Center Munich, Germany
| | - Melanie Boxberg
- Institute of Pathology, Technical University Munich, Munich, Germany; Institute of Pathology Munich-North, Munich, Germany
| | - Lorenz Lamm
- Helmholtz AI, Helmholtz Munich-German Research Center for Environmental Health, Neuherberg, Germany; Helmholtz Pioneer Campus, Helmholtz Munich-German Research Center for Environmental Health, Neuherberg, Germany
| | - Ario Sadafi
- School of Computation, Information and Technology, Technical University of Munich, Garching, Germany; Institute of AI for Health, Helmholtz Munich-German Research Center for Environmental Health, Neuherberg, Germany
| | - Dominik J E Winter
- Institute of AI for Health, Helmholtz Munich-German Research Center for Environmental Health, Neuherberg, Germany; School of Life Sciences, Technical University of Munich, Weihenstephan, Germany
| | - Carsten Marr
- Institute of AI for Health, Helmholtz Munich-German Research Center for Environmental Health, Neuherberg, Germany.
| | - Tingying Peng
- Helmholtz AI, Helmholtz Munich-German Research Center for Environmental Health, Neuherberg, Germany.
| |
Collapse
|
4
|
A generalizable and robust deep learning algorithm for mitosis detection in multicenter breast histopathological images. Med Image Anal 2023; 84:102703. [PMID: 36481608 DOI: 10.1016/j.media.2022.102703] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2022] [Revised: 09/16/2022] [Accepted: 11/21/2022] [Indexed: 11/24/2022]
Abstract
Mitosis counting of biopsies is an important biomarker for breast cancer patients, which supports disease prognostication and treatment planning. Developing a robust mitotic cell detection model is highly challenging due to its complex growth pattern and high similarities with non-mitotic cells. Most mitosis detection algorithms have poor generalizability across image domains and lack reproducibility and validation in multicenter settings. To overcome these issues, we propose a generalizable and robust mitosis detection algorithm (called FMDet), which is independently tested on multicenter breast histopathological images. To capture more refined morphological features of cells, we convert the object detection task as a semantic segmentation problem. The pixel-level annotations for mitotic nuclei are obtained by taking the intersection of the masks generated from a well-trained nuclear segmentation model and the bounding boxes provided by the MIDOG 2021 challenge. In our segmentation framework, a robust feature extractor is developed to capture the appearance variations of mitotic cells, which is constructed by integrating a channel-wise multi-scale attention mechanism into a fully convolutional network structure. Benefiting from the fact that the changes in the low-level spectrum do not affect the high-level semantic perception, we employ a Fourier-based data augmentation method to reduce domain discrepancies by exchanging the low-frequency spectrum between two domains. Our FMDet algorithm has been tested in the MIDOG 2021 challenge and ranked first place. Further, our algorithm is also externally validated on four independent datasets for mitosis detection, which exhibits state-of-the-art performance in comparison with previously published results. These results demonstrate that our algorithm has the potential to be deployed as an assistant decision support tool in clinical practice. Our code has been released at https://github.com/Xiyue-Wang/1st-in-MICCAI-MIDOG-2021-challenge.
Collapse
|
5
|
Ragab M, Farouk S. Sabir M. Arithmetic Optimization with Deep Learning Enabled Anomaly Detection in燬mart City. COMPUTERS, MATERIALS & CONTINUA 2022; 73:381-395. [DOI: 10.32604/cmc.2022.027327] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Accepted: 03/30/2022] [Indexed: 10/28/2024]
|
6
|
Su R, van der Sluijs M, Cornelissen SA, Lycklama G, Hofmeijer J, Majoie CB, van Doormaal PJ, van Es AC, Ruijters D, Niessen WJ, van der Lugt A, van Walsum T. Spatio-temporal deep learning for automatic detection of intracranial vessel perforation in digital subtraction angiography during endovascular thrombectomy. Med Image Anal 2022; 77:102377. [DOI: 10.1016/j.media.2022.102377] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2021] [Revised: 01/17/2022] [Accepted: 01/19/2022] [Indexed: 11/16/2022]
|
7
|
Yakimovich A, Beaugnon A, Huang Y, Ozkirimli E. Labels in a haystack: Approaches beyond supervised learning in biomedical applications. PATTERNS (NEW YORK, N.Y.) 2021; 2:100383. [PMID: 34950904 PMCID: PMC8672145 DOI: 10.1016/j.patter.2021.100383] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
Recent advances in biomedical machine learning demonstrate great potential for data-driven techniques in health care and biomedical research. However, this potential has thus far been hampered by both the scarcity of annotated data in the biomedical domain and the diversity of the domain's subfields. While unsupervised learning is capable of finding unknown patterns in the data by design, supervised learning requires human annotation to achieve the desired performance through training. With the latter performing vastly better than the former, the need for annotated datasets is high, but they are costly and laborious to obtain. This review explores a family of approaches existing between the supervised and the unsupervised problem setting. The goal of these algorithms is to make more efficient use of the available labeled data. The advantages and limitations of each approach are addressed and perspectives are provided.
Collapse
Affiliation(s)
- Artur Yakimovich
- Roche Pharma International Informatics, Roche Products Limited, Welwyn Garden City, UK
| | - Anaël Beaugnon
- Roche Pharma International Informatics, Roche, Boulogne-Billancourt, France
| | - Yi Huang
- Roche Pharma International Informatics, Roche (China) Holding Ltd., Shanghai, China
| | - Elif Ozkirimli
- Roche Pharma International Informatics, F. Hoffmann-La Roche AG, Kaiseraugst, Switzerland
| |
Collapse
|
8
|
Hallou A, Yevick HG, Dumitrascu B, Uhlmann V. Deep learning for bioimage analysis in developmental biology. Development 2021; 148:dev199616. [PMID: 34490888 PMCID: PMC8451066 DOI: 10.1242/dev.199616] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Deep learning has transformed the way large and complex image datasets can be processed, reshaping what is possible in bioimage analysis. As the complexity and size of bioimage data continues to grow, this new analysis paradigm is becoming increasingly ubiquitous. In this Review, we begin by introducing the concepts needed for beginners to understand deep learning. We then review how deep learning has impacted bioimage analysis and explore the open-source resources available to integrate it into a research project. Finally, we discuss the future of deep learning applied to cell and developmental biology. We analyze how state-of-the-art methodologies have the potential to transform our understanding of biological systems through new image-based analysis and modelling that integrate multimodal inputs in space and time.
Collapse
Affiliation(s)
- Adrien Hallou
- Cavendish Laboratory, Department of Physics, University of Cambridge, Cambridge, CB3 0HE, UK
- Wellcome Trust/Cancer Research UK Gurdon Institute, University of Cambridge, Cambridge, CB2 1QN, UK
- Wellcome Trust/Medical Research Council Stem Cell Institute, University of Cambridge, Cambridge, CB2 1QR, UK
| | - Hannah G. Yevick
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, 02142, USA
| | - Bianca Dumitrascu
- Computer Laboratory, Cambridge, University of Cambridge, Cambridge, CB3 0FD, UK
| | - Virginie Uhlmann
- European Bioinformatics Institute, European Molecular Biology Laboratory, Cambridge, CB10 1SD, UK
| |
Collapse
|