1
|
Foran DJ, Durbin EB, Chen W, Sadimin E, Sharma A, Banerjee I, Kurc T, Li N, Stroup AM, Harris G, Gu A, Schymura M, Gupta R, Bremer E, Balsamo J, DiPrima T, Wang F, Abousamra S, Samaras D, Hands I, Ward K, Saltz JH. An Expandable Informatics Framework for Enhancing Central Cancer Registries with Digital Pathology Specimens, Computational Imaging Tools, and Advanced Mining Capabilities. J Pathol Inform 2022; 13:5. [PMID: 35136672 PMCID: PMC8794027 DOI: 10.4103/jpi.jpi_31_21] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2021] [Accepted: 04/30/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Population-based state cancer registries are an authoritative source for cancer statistics in the United States. They routinely collect a variety of data, including patient demographics, primary tumor site, stage at diagnosis, first course of treatment, and survival, on every cancer case that is reported across all U.S. states and territories. The goal of our project is to enrich NCI's Surveillance, Epidemiology, and End Results (SEER) registry data with high-quality population-based biospecimen data in the form of digital pathology, machine-learning-based classifications, and quantitative histopathology imaging feature sets (referred to here as Pathomics features). MATERIALS AND METHODS As part of the project, the underlying informatics infrastructure was designed, tested, and implemented through close collaboration with several participating SEER registries to ensure consistency with registry processes, computational scalability, and ability to support creation of population cohorts that span multiple sites. Utilizing computational imaging algorithms and methods to both generate indices and search for matches makes it possible to reduce inter- and intra-observer inconsistencies and to improve the objectivity with which large image repositories are interrogated. RESULTS Our team has created and continues to expand a well-curated repository of high-quality digitized pathology images corresponding to subjects whose data are routinely collected by the collaborating registries. Our team has systematically deployed and tested key, visual analytic methods to facilitate automated creation of population cohorts for epidemiological studies and tools to support visualization of feature clusters and evaluation of whole-slide images. As part of these efforts, we are developing and optimizing advanced search and matching algorithms to facilitate automated, content-based retrieval of digitized specimens based on their underlying image features and staining characteristics. CONCLUSION To meet the challenges of this project, we established the analytic pipelines, methods, and workflows to support the expansion and management of a growing repository of high-quality digitized pathology and information-rich, population cohorts containing objective imaging and clinical attributes to facilitate studies that seek to discriminate among different subtypes of disease, stratify patient populations, and perform comparisons of tumor characteristics within and across patient cohorts. We have also successfully developed a suite of tools based on a deep-learning method to perform quantitative characterizations of tumor regions, assess infiltrating lymphocyte distributions, and generate objective nuclear feature measurements. As part of these efforts, our team has implemented reliable methods that enable investigators to systematically search through large repositories to automatically retrieve digitized pathology specimens and correlated clinical data based on their computational signatures.
Collapse
Affiliation(s)
- David J. Foran
- Center for Biomedical Informatics, Rutgers Cancer Institute of New Jersey, New Brunswick, NJ, USA
- Department of Pathology and Laboratory Medicine, Rutgers-Robert Wood Johnson Medical School, Piscataway, NJ, USA
| | - Eric B. Durbin
- Kentucky Cancer Registry, Markey Cancer Center, University of Kentucky, Lexington, KY, USA
- Division of Biomedical Informatics, Department of Internal Medicine, College of Medicine, Lexington, KY, USA
| | - Wenjin Chen
- Center for Biomedical Informatics, Rutgers Cancer Institute of New Jersey, New Brunswick, NJ, USA
| | - Evita Sadimin
- Center for Biomedical Informatics, Rutgers Cancer Institute of New Jersey, New Brunswick, NJ, USA
- Department of Pathology and Laboratory Medicine, Rutgers-Robert Wood Johnson Medical School, Piscataway, NJ, USA
| | - Ashish Sharma
- Department of Biomedical Informatics, Emory University School of Medicine, Atlanta, GA, USA
| | - Imon Banerjee
- Department of Biomedical Informatics, Emory University School of Medicine, Atlanta, GA, USA
| | - Tahsin Kurc
- Department of Biomedical Informatics, Stony Brook University, Stony Brook, NY, USA
| | - Nan Li
- Department of Biomedical Informatics, Emory University School of Medicine, Atlanta, GA, USA
| | - Antoinette M. Stroup
- New Jersey State Cancer Registry, Rutgers Cancer Institute of New Jersey, New Brunswick, NJ, USA
| | - Gerald Harris
- New Jersey State Cancer Registry, Rutgers Cancer Institute of New Jersey, New Brunswick, NJ, USA
| | - Annie Gu
- Department of Biomedical Informatics, Emory University School of Medicine, Atlanta, GA, USA
| | - Maria Schymura
- New York State Cancer Registry, New York State Department of Health, Albany, NY, USA
| | - Rajarsi Gupta
- Department of Biomedical Informatics, Stony Brook University, Stony Brook, NY, USA
| | - Erich Bremer
- Department of Biomedical Informatics, Stony Brook University, Stony Brook, NY, USA
| | - Joseph Balsamo
- Department of Biomedical Informatics, Stony Brook University, Stony Brook, NY, USA
| | - Tammy DiPrima
- Department of Biomedical Informatics, Stony Brook University, Stony Brook, NY, USA
| | - Feiqiao Wang
- Department of Biomedical Informatics, Stony Brook University, Stony Brook, NY, USA
| | - Shahira Abousamra
- Department of Computer Science, Stony Brook University, Stony Brook, NY, USA
| | - Dimitris Samaras
- Department of Computer Science, Stony Brook University, Stony Brook, NY, USA
| | - Isaac Hands
- Division of Biomedical Informatics, Department of Internal Medicine, College of Medicine, Lexington, KY, USA
| | - Kevin Ward
- Georgia State Cancer Registry, Georgia Department of Public Health, Atlanta, GA, USA
| | - Joel H. Saltz
- Department of Biomedical Informatics, Stony Brook University, Stony Brook, NY, USA
| |
Collapse
|
2
|
Niazi MKK, Senaras C, Pennell M, Arole V, Tozbikian G, Gurcan MN. Relationship between the Ki67 index and its area based approximation in breast cancer. BMC Cancer 2018; 18:867. [PMID: 30176814 PMCID: PMC6122570 DOI: 10.1186/s12885-018-4735-5] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2017] [Accepted: 08/08/2018] [Indexed: 12/14/2022] Open
Abstract
BACKGROUND The Ki67 Index has been extensively studied as a prognostic biomarker in breast cancer. However, its clinical adoption is largely hampered by the lack of a standardized method to assess Ki67 that limits inter-laboratory reproducibility. It is important to standardize the computation of the Ki67 Index before it can be effectively used in clincial practice. METHOD In this study, we develop a systematic approach towards standardization of the Ki67 Index. We first create the ground truth consisting of tumor positive and tumor negative nuclei by registering adjacent breast tissue sections stained with Ki67 and H&E. The registration is followed by segmentation of positive and negative nuclei within tumor regions from Ki67 images. The true Ki67 Index is then approximated with a linear model of the area of positive to the total area of tumor nuclei. RESULTS When tested on 75 images of Ki67 stained breast cancer biopsies, the proposed method resulted in an average root mean square error of 3.34. In comparison, an expert pathologist resulted in an average root mean square error of 9.98 and an existing automated approach produced an average root mean square error of 5.64. CONCLUSIONS We show that it is possible to approximate the true Ki67 Index accurately without detecting individual nuclei and also statically demonstrate the weaknesses of commonly adopted approaches that use both tumor and non-tumor regions together while compensating for the latter with higher order approximations.
Collapse
Affiliation(s)
| | - Caglar Senaras
- Center for Biomedical Informatics, Wake Forest School of Medicine, Winston-Salem, USA
| | - Michael Pennell
- Division of Biostatistics, College of Public Health, The Ohio State University, Columbus, USA
| | - Vidya Arole
- Department of Biomedical Informatics, The Ohio State University, Columbus, USA
| | - Gary Tozbikian
- Department of Pathology, The Ohio State University, Columbus, USA
| | - Metin N. Gurcan
- Center for Biomedical Informatics, Wake Forest School of Medicine, Winston-Salem, USA
| |
Collapse
|
3
|
Bashshur RL, Krupinski EA, Thrall JH, Bashshur N. The Empirical Foundations of Teleradiology and Related Applications: A Review of the Evidence. Telemed J E Health 2016; 22:868-898. [PMID: 27585301 PMCID: PMC5107673 DOI: 10.1089/tmj.2016.0149] [Citation(s) in RCA: 43] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2016] [Accepted: 07/10/2016] [Indexed: 02/06/2023] Open
Abstract
INTRODUCTION Radiology was founded on a technological discovery by Wilhelm Roentgen in 1895. Teleradiology also had its roots in technology dating back to 1947 with the successful transmission of radiographic images through telephone lines. Diagnostic radiology has become the eye of medicine in terms of diagnosing and treating injury and disease. This article documents the empirical foundations of teleradiology. METHODS A selective review of the credible literature during the past decade (2005-2015) was conducted, using robust research design and adequate sample size as criteria for inclusion. FINDINGS The evidence regarding feasibility of teleradiology and related information technology applications has been well documented for several decades. The majority of studies focused on intermediate outcomes, as indicated by comparability between teleradiology and conventional radiology. A consistent trend of concordance between the two modalities was observed in terms of diagnostic accuracy and reliability. Additional benefits include reductions in patient transfer, rehospitalization, and length of stay.
Collapse
Affiliation(s)
| | | | - James H. Thrall
- Department of Radiology, Massachusetts General Hospital, Harvard, Boston, Massachusetts
| | - Noura Bashshur
- University of Michigan Health System, Ann Arbor, Michigan
| |
Collapse
|
4
|
Zhang X, Dou H, Ju T, Xu J, Zhang S. Fusing Heterogeneous Features From Stacked Sparse Autoencoder for Histopathological Image Analysis. IEEE J Biomed Health Inform 2016; 20:1377-83. [DOI: 10.1109/jbhi.2015.2461671] [Citation(s) in RCA: 53] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
5
|
Xing F, Yang L. Robust Nucleus/Cell Detection and Segmentation in Digital Pathology and Microscopy Images: A Comprehensive Review. IEEE Rev Biomed Eng 2016; 9:234-63. [PMID: 26742143 PMCID: PMC5233461 DOI: 10.1109/rbme.2016.2515127] [Citation(s) in RCA: 213] [Impact Index Per Article: 26.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
Digital pathology and microscopy image analysis is widely used for comprehensive studies of cell morphology or tissue structure. Manual assessment is labor intensive and prone to interobserver variations. Computer-aided methods, which can significantly improve the objectivity and reproducibility, have attracted a great deal of interest in recent literature. Among the pipeline of building a computer-aided diagnosis system, nucleus or cell detection and segmentation play a very important role to describe the molecular morphological information. In the past few decades, many efforts have been devoted to automated nucleus/cell detection and segmentation. In this review, we provide a comprehensive summary of the recent state-of-the-art nucleus/cell segmentation approaches on different types of microscopy images including bright-field, phase-contrast, differential interference contrast, fluorescence, and electron microscopies. In addition, we discuss the challenges for the current methods and the potential future work of nucleus/cell detection and segmentation.
Collapse
|
6
|
Kurc T, Qi X, Wang D, Wang F, Teodoro G, Cooper L, Nalisnik M, Yang L, Saltz J, Foran DJ. Scalable analysis of Big pathology image data cohorts using efficient methods and high-performance computing strategies. BMC Bioinformatics 2015; 16:399. [PMID: 26627175 PMCID: PMC4667532 DOI: 10.1186/s12859-015-0831-6] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2015] [Accepted: 11/16/2015] [Indexed: 11/16/2022] Open
Abstract
BACKGROUND We describe a suite of tools and methods that form a core set of capabilities for researchers and clinical investigators to evaluate multiple analytical pipelines and quantify sensitivity and variability of the results while conducting large-scale studies in investigative pathology and oncology. The overarching objective of the current investigation is to address the challenges of large data sizes and high computational demands. RESULTS The proposed tools and methods take advantage of state-of-the-art parallel machines and efficient content-based image searching strategies. The content based image retrieval (CBIR) algorithms can quickly detect and retrieve image patches similar to a query patch using a hierarchical analysis approach. The analysis component based on high performance computing can carry out consensus clustering on 500,000 data points using a large shared memory system. CONCLUSIONS Our work demonstrates efficient CBIR algorithms and high performance computing can be leveraged for efficient analysis of large microscopy images to meet the challenges of clinically salient applications in pathology. These technologies enable researchers and clinical investigators to make more effective use of the rich informational content contained within digitized microscopy specimens.
Collapse
Affiliation(s)
- Tahsin Kurc
- Department of Biomedical Informatics, Stony Brook University, Stony Brook, USA.
| | - Xin Qi
- Department of Pathology & Laboratory Medicine, Rutgers -- Robert Wood Johnson Medical School, New Brunswick, USA.
- Rutgers Cancer Institute of New Jersey, New Brunswick, USA.
| | - Daihou Wang
- Department of Electrical and Computer Engineering, Rutgers University, New Brunswick, USA.
| | - Fusheng Wang
- Department of Biomedical Informatics, Stony Brook University, Stony Brook, USA.
- Department of Computer Science, Stony Brook University, Stony Brook, USA.
| | - George Teodoro
- Department of Biomedical Informatics, Stony Brook University, Stony Brook, USA.
- Department of Computer Science, University of Brasilia, Brasília, Brazil.
| | - Lee Cooper
- Department of Biomedical Informatics, Emory University, Atlanta, USA.
| | - Michael Nalisnik
- Department of Biomedical Informatics, Emory University, Atlanta, USA.
| | - Lin Yang
- Department of Biomedical Engineering, University of Florida, Gainesville, USA.
| | - Joel Saltz
- Department of Biomedical Informatics, Stony Brook University, Stony Brook, USA.
| | - David J Foran
- Department of Pathology & Laboratory Medicine, Rutgers -- Robert Wood Johnson Medical School, New Brunswick, USA.
- Rutgers Cancer Institute of New Jersey, New Brunswick, USA.
| |
Collapse
|
7
|
Wang Y, McManus DT, Arthur K, Johnston BT, Kennedy AJ, Coleman HG, Murray LJ, Hamilton PW. Whole slide image cytometry: a novel method to detect abnormal DNA content in Barrett's esophagus. J Transl Med 2015; 95:1319-30. [PMID: 26237272 DOI: 10.1038/labinvest.2015.98] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2015] [Revised: 06/12/2015] [Accepted: 06/15/2015] [Indexed: 12/20/2022] Open
Abstract
Barrett's esophagus (BE) is a precursor of esophageal adenocarcinoma (EAC). Both low-grade dysplasia (LGD) and high-grade dysplasia (HGD) are associated with an increased risk of progression to EAC. However, histological interpretation and grading of dysplasia (particularly LGD) is subjective and poorly reproducible. This study has combined whole slide imaging with DNA image cytometry to provide a novel method for the detection of abnormal DNA content through image analysis of tissue sections. A total of 20 cases were evaluated, including 8 negative for dysplasia (NFD), 6 LGD, and 6 HGD. Feulgen-stained esophageal sections were scanned in their entirety. Barrett's mucosa was interactively chosen for automatic nuclei segmentation where irrelevant cell types were ignored. The combined DNA content histogram for all nuclei within selected image regions was then obtained. In addition, three histogram measurements were computed, including xER-5C, 2cDI, and DNA-MG. Visual evaluation suggested the shape of DNA content histograms from NFD, LGD, and HGD cases exhibiting identifiable differences. The histogram measurements, xER-5C, 2cDI, and DNA-MG, were shown to be effective in differentiating metaplastic from dysplastic cases with statistical significance. Moreover, they also successfully separated NFD, LGD, and HGD patients with statistical significance. Whole slide image cytometry is a novel and effective method for the detection of abnormal DNA content in BE. Compared with histological review, it is more objective. Compared with flow cytometry and cytology-preparation image cytometry, it is low cost, simple to use, only requires a single 1 μm section, and facilitates selection of tissue and topographical correlation. Whole slide image cytometry can detect differences in DNA content between NFD, LGD, and HGD patients in this cross-sectional study. Abnormal DNA content detection by whole slide image cytometry is a promising biomarker of progression that could affect future diagnostics in BE.
Collapse
Affiliation(s)
- Yinhai Wang
- Finland Institute for Molecular Medicine (FIMM), University of Helsinki, Helsinki, Finland
| | - Damian T McManus
- Belfast HSC Trust, Belfast, UK
- Centre for Cancer Research and Cell Biology (CCRCB), Queen's University Belfast, Belfast, UK
| | - Kenneth Arthur
- Centre for Cancer Research and Cell Biology (CCRCB), Queen's University Belfast, Belfast, UK
| | | | | | - Helen G Coleman
- Centre for Public Health, Queen's University Belfast, Belfast, UK
| | - Liam J Murray
- Centre for Public Health, Queen's University Belfast, Belfast, UK
| | - Peter W Hamilton
- Centre for Cancer Research and Cell Biology (CCRCB), Queen's University Belfast, Belfast, UK
| |
Collapse
|
8
|
Zhang X, Liu W, Dundar M, Badve S, Zhang S. Towards large-scale histopathological image analysis: hashing-based image retrieval. IEEE TRANSACTIONS ON MEDICAL IMAGING 2015; 34:496-506. [PMID: 25314696 DOI: 10.1109/tmi.2014.2361481] [Citation(s) in RCA: 77] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
Automatic analysis of histopathological images has been widely utilized leveraging computational image-processing methods and modern machine learning techniques. Both computer-aided diagnosis (CAD) and content-based image-retrieval (CBIR) systems have been successfully developed for diagnosis, disease detection, and decision support in this area. Recently, with the ever-increasing amount of annotated medical data, large-scale and data-driven methods have emerged to offer a promise of bridging the semantic gap between images and diagnostic information. In this paper, we focus on developing scalable image-retrieval techniques to cope intelligently with massive histopathological images. Specifically, we present a supervised kernel hashing technique which leverages a small amount of supervised information in learning to compress a 10 000-dimensional image feature vector into only tens of binary bits with the informative signatures preserved. These binary codes are then indexed into a hash table that enables real-time retrieval of images in a large database. Critically, the supervised information is employed to bridge the semantic gap between low-level image features and high-level diagnostic information. We build a scalable image-retrieval framework based on the supervised hashing technique and validate its performance on several thousand histopathological images acquired from breast microscopic tissues. Extensive evaluations are carried out in terms of image classification (i.e., benign versus actionable categorization) and retrieval tests. Our framework achieves about 88.1% classification accuracy as well as promising time efficiency. For example, the framework can execute around 800 queries in only 0.01 s, comparing favorably with other commonly used dimensionality reduction and feature selection methods.
Collapse
|
9
|
Wang F, Kong J, Gao J, Cooper LAD, Kurc T, Zhou Z, Adler D, Vergara-Niedermayr C, Katigbak B, Brat DJ, Saltz JH. A high-performance spatial database based approach for pathology imaging algorithm evaluation. J Pathol Inform 2013; 4:5. [PMID: 23599905 PMCID: PMC3624706 DOI: 10.4103/2153-3539.108543] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2012] [Accepted: 12/06/2012] [Indexed: 11/04/2022] Open
Abstract
BACKGROUND Algorithm evaluation provides a means to characterize variability across image analysis algorithms, validate algorithms by comparison with human annotations, combine results from multiple algorithms for performance improvement, and facilitate algorithm sensitivity studies. The sizes of images and image analysis results in pathology image analysis pose significant challenges in algorithm evaluation. We present an efficient parallel spatial database approach to model, normalize, manage, and query large volumes of analytical image result data. This provides an efficient platform for algorithm evaluation. Our experiments with a set of brain tumor images demonstrate the application, scalability, and effectiveness of the platform. CONTEXT The paper describes an approach and platform for evaluation of pathology image analysis algorithms. The platform facilitates algorithm evaluation through a high-performance database built on the Pathology Analytic Imaging Standards (PAIS) data model. AIMS (1) Develop a framework to support algorithm evaluation by modeling and managing analytical results and human annotations from pathology images; (2) Create a robust data normalization tool for converting, validating, and fixing spatial data from algorithm or human annotations; (3) Develop a set of queries to support data sampling and result comparisons; (4) Achieve high performance computation capacity via a parallel data management infrastructure, parallel data loading and spatial indexing optimizations in this infrastructure. MATERIALS AND METHODS WE HAVE CONSIDERED TWO SCENARIOS FOR ALGORITHM EVALUATION: (1) algorithm comparison where multiple result sets from different methods are compared and consolidated; and (2) algorithm validation where algorithm results are compared with human annotations. We have developed a spatial normalization toolkit to validate and normalize spatial boundaries produced by image analysis algorithms or human annotations. The validated data were formatted based on the PAIS data model and loaded into a spatial database. To support efficient data loading, we have implemented a parallel data loading tool that takes advantage of multi-core CPUs to accelerate data injection. The spatial database manages both geometric shapes and image features or classifications, and enables spatial sampling, result comparison, and result aggregation through expressive structured query language (SQL) queries with spatial extensions. To provide scalable and efficient query support, we have employed a shared nothing parallel database architecture, which distributes data homogenously across multiple database partitions to take advantage of parallel computation power and implements spatial indexing to achieve high I/O throughput. RESULTS Our work proposes a high performance, parallel spatial database platform for algorithm validation and comparison. This platform was evaluated by storing, managing, and comparing analysis results from a set of brain tumor whole slide images. The tools we develop are open source and available to download. CONCLUSIONS Pathology image algorithm validation and comparison are essential to iterative algorithm development and refinement. One critical component is the support for queries involving spatial predicates and comparisons. In our work, we develop an efficient data model and parallel database approach to model, normalize, manage and query large volumes of analytical image result data. Our experiments demonstrate that the data partitioning strategy and the grid-based indexing result in good data distribution across database nodes and reduce I/O overhead in spatial join queries through parallel retrieval of relevant data and quick subsetting of datasets. The set of tools in the framework provide a full pipeline to normalize, load, manage and query analytical results for algorithm evaluation.
Collapse
Affiliation(s)
- Fusheng Wang
- Department of Biomedical Informatics, Emory University, USA ; Center for Comprehensive Informatics, Emory University, USA
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
10
|
Bhattacharyya D, Bandyopadhyay SK, Kim TH. Diagnosis of breast cancer by tissue analysis. Chin J Cancer Res 2013; 25:39-45. [PMID: 23372340 DOI: 10.3978/j.issn.1000-9604.2012.12.02] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2012] [Accepted: 12/14/2012] [Indexed: 11/14/2022] Open
Abstract
In this paper, we propose a technique to locate abnormal growth of cells in breast tissue and suggest further pathological test, when require. We compare normal breast tissue with malignant invasive breast tissue by a series of image processing steps. Normal ductal epithelial cells and ductal/lobular invasive carcinogenic cells also consider for comparison here in this paper. In fact, features of cancerous breast tissue (invasive) are extracted and analyses with normal breast tissue. We also suggest the breast cancer recognition technique through image processing and prevention by controlling p53 gene mutation to some extent.
Collapse
Affiliation(s)
- Debnath Bhattacharyya
- Computer Science and Engineering Department, NFET, NSHM Knowledge Campus - Durgapur, Durgapur - 713212, India
| | | | | |
Collapse
|
11
|
Wang F, Kong J, Cooper L, Pan T, Kurc T, Chen W, Sharma A, Niedermayr C, Oh TW, Brat D, Farris AB, Foran DJ, Saltz J. A data model and database for high-resolution pathology analytical image informatics. J Pathol Inform 2011; 2:32. [PMID: 21845230 PMCID: PMC3153692 DOI: 10.4103/2153-3539.83192] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2011] [Accepted: 06/01/2011] [Indexed: 11/14/2022] Open
Abstract
BACKGROUND The systematic analysis of imaged pathology specimens often results in a vast amount of morphological information at both the cellular and sub-cellular scales. While microscopy scanners and computerized analysis are capable of capturing and analyzing data rapidly, microscopy image data remain underutilized in research and clinical settings. One major obstacle which tends to reduce wider adoption of these new technologies throughout the clinical and scientific communities is the challenge of managing, querying, and integrating the vast amounts of data resulting from the analysis of large digital pathology datasets. This paper presents a data model, which addresses these challenges, and demonstrates its implementation in a relational database system. CONTEXT This paper describes a data model, referred to as Pathology Analytic Imaging Standards (PAIS), and a database implementation, which are designed to support the data management and query requirements of detailed characterization of micro-anatomic morphology through many interrelated analysis pipelines on whole-slide images and tissue microarrays (TMAs). AIMS (1) Development of a data model capable of efficiently representing and storing virtual slide related image, annotation, markup, and feature information. (2) Development of a database, based on the data model, capable of supporting queries for data retrieval based on analysis and image metadata, queries for comparison of results from different analyses, and spatial queries on segmented regions, features, and classified objects. SETTINGS AND DESIGN The work described in this paper is motivated by the challenges associated with characterization of micro-scale features for comparative and correlative analyses involving whole-slides tissue images and TMAs. Technologies for digitizing tissues have advanced significantly in the past decade. Slide scanners are capable of producing high-magnification, high-resolution images from whole slides and TMAs within several minutes. Hence, it is becoming increasingly feasible for basic, clinical, and translational research studies to produce thousands of whole-slide images. Systematic analysis of these large datasets requires efficient data management support for representing and indexing results from hundreds of interrelated analyses generating very large volumes of quantifications such as shape and texture and of classifications of the quantified features. MATERIALS AND METHODS We have designed a data model and a database to address the data management requirements of detailed characterization of micro-anatomic morphology through many interrelated analysis pipelines. The data model represents virtual slide related image, annotation, markup and feature information. The database supports a wide range of metadata and spatial queries on images, annotations, markups, and features. RESULTS We currently have three databases running on a Dell PowerEdge T410 server with CentOS 5.5 Linux operating system. The database server is IBM DB2 Enterprise Edition 9.7.2. The set of databases consists of 1) a TMA database containing image analysis results from 4740 cases of breast cancer, with 641 MB storage size; 2) an algorithm validation database, which stores markups and annotations from two segmentation algorithms and two parameter sets on 18 selected slides, with 66 GB storage size; and 3) an in silico brain tumor study database comprising results from 307 TCGA slides, with 365 GB storage size. The latter two databases also contain human-generated annotations and markups for regions and nuclei. CONCLUSIONS Modeling and managing pathology image analysis results in a database provide immediate benefits on the value and usability of data in a research study. The database provides powerful query capabilities, which are otherwise difficult or cumbersome to support by other approaches such as programming languages. Standardized, semantic annotated data representation and interfaces also make it possible to more efficiently share image data and analysis results.
Collapse
Affiliation(s)
- Fusheng Wang
- Center for Comprehensive Informatics, Emory University, USA
| | - Jun Kong
- Center for Comprehensive Informatics, Emory University, USA
| | - Lee Cooper
- Center for Comprehensive Informatics, Emory University, USA
| | - Tony Pan
- Center for Comprehensive Informatics, Emory University, USA
| | - Tahsin Kurc
- Center for Comprehensive Informatics, Emory University, USA
| | - Wenjin Chen
- Center for Biomedical Imaging and Informatics, Georgia State University, USA
| | - Ashish Sharma
- Center for Comprehensive Informatics, Emory University, USA
| | | | - Tae W Oh
- Department of Computer Information Systems, Georgia State University, USA
| | - Daniel Brat
- Department of Pathology and Laboratory Medicine, School of Medicine, Emory University, USA
| | - Alton B Farris
- Department of Pathology and Laboratory Medicine, School of Medicine, Emory University, USA
| | - David J Foran
- Center for Biomedical Imaging and Informatics, The Cancer Institute of New Jersey, UMDNJ-Robert Wood Johnson Medical School, USA
| | - Joel Saltz
- Center for Comprehensive Informatics, Emory University, USA
| |
Collapse
|
12
|
Foran DJ, Yang L, Chen W, Hu J, Goodell LA, Reiss M, Wang F, Kurc T, Pan T, Sharma A, Saltz JH. ImageMiner: a software system for comparative analysis of tissue microarrays using content-based image retrieval, high-performance computing, and grid technology. J Am Med Inform Assoc 2011; 18:403-15. [PMID: 21606133 PMCID: PMC3128405 DOI: 10.1136/amiajnl-2011-000170] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2011] [Accepted: 04/09/2011] [Indexed: 11/03/2022] Open
Abstract
OBJECTIVE AND DESIGN The design and implementation of ImageMiner, a software platform for performing comparative analysis of expression patterns in imaged microscopy specimens such as tissue microarrays (TMAs), is described. ImageMiner is a federated system of services that provides a reliable set of analytical and data management capabilities for investigative research applications in pathology. It provides a library of image processing methods, including automated registration, segmentation, feature extraction, and classification, all of which have been tailored, in these studies, to support TMA analysis. The system is designed to leverage high-performance computing machines so that investigators can rapidly analyze large ensembles of imaged TMA specimens. To support deployment in collaborative, multi-institutional projects, ImageMiner features grid-enabled, service-based components so that multiple instances of ImageMiner can be accessed remotely and federated. RESULTS The experimental evaluation shows that: (1) ImageMiner is able to support reliable detection and feature extraction of tumor regions within imaged tissues; (2) images and analysis results managed in ImageMiner can be searched for and retrieved on the basis of image-based features, classification information, and any correlated clinical data, including any metadata that have been generated to describe the specified tissue and TMA; and (3) the system is able to reduce computation time of analyses by exploiting computing clusters, which facilitates analysis of larger sets of tissue samples.
Collapse
Affiliation(s)
- David J Foran
- Center for Biomedical Imaging & Informatics, UMDNJ-Robert Wood Johnson Medical School, New Brunswick, New Jersey, USA
- The Cancer Institute of New Jersey, University of Medicine and Dentistry of New Jersey, New Brunswick, New Jersey, USA
| | - Lin Yang
- Center for Biomedical Imaging & Informatics, UMDNJ-Robert Wood Johnson Medical School, New Brunswick, New Jersey, USA
| | - Wenjin Chen
- Center for Biomedical Imaging & Informatics, UMDNJ-Robert Wood Johnson Medical School, New Brunswick, New Jersey, USA
- The Cancer Institute of New Jersey, University of Medicine and Dentistry of New Jersey, New Brunswick, New Jersey, USA
| | - Jun Hu
- Center for Biomedical Imaging & Informatics, UMDNJ-Robert Wood Johnson Medical School, New Brunswick, New Jersey, USA
- The Cancer Institute of New Jersey, University of Medicine and Dentistry of New Jersey, New Brunswick, New Jersey, USA
| | - Lauri A Goodell
- Center for Biomedical Imaging & Informatics, UMDNJ-Robert Wood Johnson Medical School, New Brunswick, New Jersey, USA
| | - Michael Reiss
- The Cancer Institute of New Jersey, University of Medicine and Dentistry of New Jersey, New Brunswick, New Jersey, USA
| | - Fusheng Wang
- Center for Comprehensive Informatics, Emory University School of Medicine, Atlanta, Georgia, USA
| | - Tahsin Kurc
- Center for Comprehensive Informatics, Emory University School of Medicine, Atlanta, Georgia, USA
- Department of Biomedical Engineering, Emory University, Atlanta, Georgia, USA
| | - Tony Pan
- Center for Comprehensive Informatics, Emory University School of Medicine, Atlanta, Georgia, USA
| | - Ashish Sharma
- Center for Comprehensive Informatics, Emory University School of Medicine, Atlanta, Georgia, USA
- Department of Biomedical Engineering, Emory University, Atlanta, Georgia, USA
| | - Joel H Saltz
- Center for Comprehensive Informatics, Emory University School of Medicine, Atlanta, Georgia, USA
| |
Collapse
|
13
|
Ultra-fast processing of gigapixel Tissue MicroArray images using High Performance Computing. Cell Oncol (Dordr) 2011; 34:495-507. [DOI: 10.1007/s13402-011-0046-4] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/30/2010] [Indexed: 11/25/2022] Open
|
14
|
Yang L, Tuzel O, Chen W, Meer P, Salaru G, Goodell LA, Foran DJ. PathMiner: a Web-based tool for computer-assisted diagnostics in pathology. ACTA ACUST UNITED AC 2009; 13:291-9. [PMID: 19171530 DOI: 10.1109/titb.2008.2008801] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Large-scale, multisite collaboration has become indispensable for a wide range of research and clinical activities that rely on the capacity of individuals to dynamically acquire, share, and assess images and correlated data. In this paper, we report the development of a Web-based system, PathMiner , for interactive telemedicine, intelligent archiving, and automated decision support in pathology. The PathMiner system supports network-based submission of queries and can automatically locate and retrieve digitized pathology specimens along with correlated molecular studies of cases from "ground-truth" databases that exhibit spectral and spatial profiles consistent with a given query image. The statistically most probable diagnosis is provided to the individual who is seeking decision support. To test the system under real-case scenarios, a pipeline infrastructure was developed and a network-based test laboratory was established at strategic sites at the University of Medicine and Dentistry of New Jersey-Robert Wood Johnson Medical School, Robert Wood Johnson University Hospital, the University of Pennsylvania School of Medicine, Hospital of the University of Pennsylvania, The Cancer Institute of New Jersey, and Rutgers University. The average five-class classification accuracy of the system was 93.18% based on a tenfold cross validation on a close dataset containing 3691 imaged specimens. We also conducted prospective performance studies with the PathMiner system in real applications in which the specimens exhibited large variations in staining characters compared with the training data. The average five-class classification accuracy in this open-set experiment was 87.22%. We also provide the comparative results with the previous literature and the PathMiner system shows superior performance.
Collapse
Affiliation(s)
- Lin Yang
- Department of Electrical and Computer Engineering, Rutgers University, Piscataway, NJ 08854, USA.
| | | | | | | | | | | | | |
Collapse
|
15
|
Cukierski WJ, Foran DJ. Using Betweenness Centrality to Identify Manifold Shortcuts. PROCEEDINGS. IEEE INTERNATIONAL CONFERENCE ON DATA MINING 2008; 2008:949-958. [PMID: 20607142 PMCID: PMC2895570 DOI: 10.1109/icdmw.2008.39] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]
Abstract
High-dimensional data presents a challenge to tasks of pattern recognition and machine learning. Dimensionality reduction (DR) methods remove the unwanted variance and make these tasks tractable. Several nonlinear DR methods, such as the well known ISOMAP algorithm, rely on a neighborhood graph to compute geodesic distances between data points. These graphs can contain unwanted edges which connect disparate regions of one or more manifolds. This topological sensitivity is well known [1], [2], [3], yet handling high-dimensional, noisy data in the absence of a priori manifold knowledge, remains an open and difficult problem. This work introduces a divisive, edge-removal method based on graph betweenness centrality which can robustly identify manifold-shorting edges. The problem of graph construction in high dimension is discussed and the proposed algorithm is fit into the ISOMAP workflow. ROC analysis is performed and the performance is tested on synthetic and real datasets.
Collapse
Affiliation(s)
- William J. Cukierski
- Dept. of Biomedical Engineering, Rutgers University and the University of Medicine and Dentistry of New Jersey
| | - David J. Foran
- Dept. of Pathology, University of Medicine and Dentistry of New Jersey
| |
Collapse
|
16
|
Fuchs TJ, Lange T, Wild PJ, Moch H, Buhmann JM. Weakly Supervised Cell Nuclei Detection and Segmentation on Tissue Microarrays of Renal Clear Cell Carcinoma. LECTURE NOTES IN COMPUTER SCIENCE 2008. [DOI: 10.1007/978-3-540-69321-5_18] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
|