1
|
Baloi A, Costea C, Gutt R, Balacescu O, Turcu F, Belean B. Hexagonal-Grid-Layout Image Segmentation Using Shock Filters: Computational Complexity Case Study for Microarray Image Analysis Related to Machine Learning Approaches. SENSORS (BASEL, SWITZERLAND) 2023; 23:2582. [PMID: 36904788 PMCID: PMC10007319 DOI: 10.3390/s23052582] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/02/2023] [Revised: 02/17/2023] [Accepted: 02/21/2023] [Indexed: 06/18/2023]
Abstract
Hexagonal grid layouts are advantageous in microarray technology; however, hexagonal grids appear in many fields, especially given the rise of new nanostructures and metamaterials, leading to the need for image analysis on such structures. This work proposes a shock-filter-based approach driven by mathematical morphology for the segmentation of image objects disposed in a hexagonal grid. The original image is decomposed into a pair of rectangular grids, such that their superposition generates the initial image. Within each rectangular grid, the shock-filters are once again used to confine the foreground information for each image object into an area of interest. The proposed methodology was successfully applied for microarray spot segmentation, whereas its character of generality is underlined by the segmentation results obtained for two other types of hexagonal grid layouts. Considering the segmentation accuracy through specific quality measures for microarray images, such as the mean absolute error and the coefficient of variation, high correlations of our computed spot intensity features with the annotated reference values were found, indicating the reliability of the proposed approach. Moreover, taking into account that the shock-filter PDE formalism is targeting the one-dimensional luminance profile function, the computational complexity to determine the grid is minimized. The order of growth for the computational complexity of our approach is at least one order of magnitude lower when compared with state-of-the-art microarray segmentation approaches, ranging from classical to machine learning ones.
Collapse
Affiliation(s)
- Aurel Baloi
- Research Center for Integrated Analysis and Territorial Management, University of Bucharest, 4-12 Regina Elisabeta, 030018 Bucharest, Romania
- Faculty of Administration and Business, University of Bucharest, 030018 Bucharest, Romania
| | - Carmen Costea
- Department of Mathematics, Faculty of Automation and Computer Science, Technical University of Cluj-Napoca, 400114 Cluj-Napoca, Romania
| | - Robert Gutt
- Center of Advanced Research and Technologies for Alternative Energies, National Institute for Research and Development of Isotopic and Molecular Technologies, 400293 Cluj-Napoca, Romania
| | - Ovidiu Balacescu
- Department of Genetics, Genomics and Experimental Pathology, The Oncology Institute, Prof. Dr. Ion Chiricuta, 400015 Cluj-Napoca, Romania
| | - Flaviu Turcu
- Center of Advanced Research and Technologies for Alternative Energies, National Institute for Research and Development of Isotopic and Molecular Technologies, 400293 Cluj-Napoca, Romania
- Faculty of Physics, Babes-Bolyai University, 400084 Cluj-Napoca, Romania
| | - Bogdan Belean
- Center of Advanced Research and Technologies for Alternative Energies, National Institute for Research and Development of Isotopic and Molecular Technologies, 400293 Cluj-Napoca, Romania
| |
Collapse
|
2
|
Joseph SM, Sathidevi PS. An Automated cDNA Microarray Image Analysis for the Determination of Gene Expression Ratios. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:136-150. [PMID: 34910637 DOI: 10.1109/tcbb.2021.3135650] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
This paper proposes a fully automated technique for cDNA microarray image analysis. Initially, an effective preprocessing stage combined with gridding is built to get the individual spot regions of images. Current work begins with the proposal of a new rule to get the foreground (spot) and background regions in the spot blocks, which uses TV-L1 image denoising, spot block binarization, and finds the most accurate spot label by measuring the centroid differences of labelled regions in the block with that of the spot block centroid. The credibility of the segmentation rule on real images is evaluated by metrics: mean absolute error (MAE) and coefficient of variation (CV) and on synthetic images by metrics: probability of error (PE) and discrepancy distance (DD). The performance values on real and synthetic datasets reveal better results than the competitive methods. After the segmentation, prior to the spot intensity extraction, background intensity correction and flagging of noisy spots are executed. Using the lowess method, intensities are normalized, and gene expression ratios are determined. To comprehend the linearities of red and green intensities and to discern up and down-regulated genes (abnormal), fold-change factor, scatter and box plots are also used to represent the gene expression levels.
Collapse
|
3
|
Shao G, Li D, Zhang J, Yang J, Shangguan Y. Automatic microarray image segmentation with clustering-based algorithms. PLoS One 2019; 14:e0210075. [PMID: 30668601 PMCID: PMC6342330 DOI: 10.1371/journal.pone.0210075] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2018] [Accepted: 11/20/2018] [Indexed: 12/27/2022] Open
Abstract
Image segmentation, as a key step of microarray image processing, is crucial for obtaining the spot expressions simultaneously. However, state-of-art clustering-based segmentation algorithms are sensitive to noises. To solve this problem and improve the segmentation accuracy, in this article, several improvements are introduced into the fast and simple clustering methods (K-means and Fuzzy C means). Firstly, a contrast enhancement algorithm is implemented in image preprocessing to improve the gridding precision. Secondly, the data-driven means are proposed for cluster center initialization, instead of usual random setting. The third improvement is that the multi features, including intensity features, spatial features, and shape features, are implemented in feature selection to replace the sole pixel intensity feature used in the traditional clustering-based methods to avoid taking noises as spot pixels. Moreover, the principal component analysis is adopted for various feature extraction. Finally, an adaptive adjustment algorithm is proposed based on data mining and learning for further dealing with the missing spots or low contrast spots. Experiments on real and simulation data sets indicate that the proposed improvements made our proposed method obtains higher segmented precision than the traditional K-means and Fuzzy C means clustering methods.
Collapse
Affiliation(s)
- Guifang Shao
- Department of Automation, Xiamen University, Xiamen, China
- * E-mail:
| | - Dongyao Li
- Department of Automation, Xiamen University, Xiamen, China
| | - Junfa Zhang
- Department of Automation, Xiamen University, Xiamen, China
| | - Jianbo Yang
- Department of Automation, Xiamen University, Xiamen, China
| | - Yali Shangguan
- Department of Automation, Xiamen University, Xiamen, China
| |
Collapse
|
4
|
Katsigiannis S, Zacharia E, Maroulis D. MIGS-GPU: Microarray Image Gridding and Segmentation on the GPU. IEEE J Biomed Health Inform 2016; 21:867-874. [PMID: 26960232 DOI: 10.1109/jbhi.2016.2537922] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Complementary DNA (cDNA) microarray is a powerful tool for simultaneously studying the expression level of thousands of genes. Nevertheless, the analysis of microarray images remains an arduous and challenging task due to the poor quality of the images that often suffer from noise, artifacts, and uneven background. In this study, the MIGS-GPU [Microarray Image Gridding and Segmentation on Graphics Processing Unit (GPU)] software for gridding and segmenting microarray images is presented. MIGS-GPU's computations are performed on the GPU by means of the compute unified device architecture (CUDA) in order to achieve fast performance and increase the utilization of available system resources. Evaluation on both real and synthetic cDNA microarray images showed that MIGS-GPU provides better performance than state-of-the-art alternatives, while the proposed GPU implementation achieves significantly lower computational times compared to the respective CPU approaches. Consequently, MIGS-GPU can be an advantageous and useful tool for biomedical laboratories, offering a user-friendly interface that requires minimum input in order to run.
Collapse
|
5
|
Hernández-Cabronero M, Blanes I, Pinho AJ, Marcellin MW, Serra-Sagristà J. Analysis-Driven Lossy Compression of DNA Microarray Images. IEEE TRANSACTIONS ON MEDICAL IMAGING 2016; 35:654-664. [PMID: 26462084 DOI: 10.1109/tmi.2015.2489262] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
DNA microarrays are one of the fastest-growing new technologies in the field of genetic research, and DNA microarray images continue to grow in number and size. Since analysis techniques are under active and ongoing development, storage, transmission and sharing of DNA microarray images need be addressed, with compression playing a significant role. However, existing lossless coding algorithms yield only limited compression performance (compression ratios below 2:1), whereas lossy coding methods may introduce unacceptable distortions in the analysis process. This work introduces a novel Relative Quantizer (RQ), which employs non-uniform quantization intervals designed for improved compression while bounding the impact on the DNA microarray analysis. This quantizer constrains the maximum relative error introduced into quantized imagery, devoting higher precision to pixels critical to the analysis process. For suitable parameter choices, the resulting variations in the DNA microarray analysis are less than half of those inherent to the experimental variability. Experimental results reveal that appropriate analysis can still be performed for average compression ratios exceeding 4.5:1.
Collapse
|
6
|
Belean B, Borda M, Ackermann J, Koch I, Balacescu O. Unsupervised image segmentation for microarray spots with irregular contours and inner holes. BMC Bioinformatics 2015; 16:412. [PMID: 26698293 PMCID: PMC4690322 DOI: 10.1186/s12859-015-0842-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2015] [Accepted: 12/09/2015] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND Microarray analysis represents a powerful way to test scientific hypotheses on the functionality of cells. The measurements consider the whole genome, and the large number of generated data requires sophisticated analysis. To date, no gold-standard for the analysis of microarray images has been established. Due to the lack of a standard approach there is a strong need to identify new processing algorithms. METHODS We propose a novel approach based on hyperbolic partial differential equations (PDEs) for unsupervised spot segmentation. Prior to segmentation, morphological operations were applied for the identification of co-localized groups of spots. A grid alignment was performed to determine the borderlines between rows and columns of spots. PDEs were applied to detect the inflection points within each column and row; vertical and horizontal luminance profiles were evolved respectively. The inflection points of the profiles determined borderlines that confined a spot within adapted rectangular areas. A subsequent k-means clustering determined the pixels of each individual spot and its local background. RESULTS We evaluated the approach for a data set of microarray images taken from the Stanford Microarray Database (SMD). The data set is based on two studies on global gene expression profiles of Arabidopsis Thaliana. We computed values for spot intensity, regression ratio, and coefficient of determination. For spots with irregular contours and inner holes, we found intensity values that were significantly different from those determined by the GenePix Pro microarray analysis software. We determined the set of differentially expressed genes from our intensities and identified more activated genes than were predicted by the GenePix software. CONCLUSIONS Our method represents a worthwhile alternative and complement to standard approaches used in industry and academy. We highlight the importance of our spot segmentation approach, which identified supplementary important genes, to better explains the molecular mechanisms that are activated in a defense responses to virus and pathogen infection.
Collapse
Affiliation(s)
- Bogdan Belean
- CETATEA Research Centre, National Institute for Research and Development of Isotopic and Molecular Technologies - INCDTIM, 67 - 103 Donat, Cluj-Napoca, Romania.
| | - Monica Borda
- Department of Communication, Technical University of Cluj-Napoca, Baritiu 26-28, Cluj-Napoca, Romania.
| | - Jörg Ackermann
- Molecular Bioinformatics Group, Institute of Computer Science, Faculty of Computer Science and Mathematics, Cluster of Excellence Frankfurt "Macromolecular Complexes", Johann Wolfgang Goethe-University, Baritiu 26-28, Frankfurt am Main, Germany.
| | - Ina Koch
- Molecular Bioinformatics Group, Institute of Computer Science, Faculty of Computer Science and Mathematics, Cluster of Excellence Frankfurt "Macromolecular Complexes", Johann Wolfgang Goethe-University, Baritiu 26-28, Frankfurt am Main, Germany.
| | - Ovidiu Balacescu
- Department of Functional Genomics and Experimental Pathology, The Oncology Institute "Prof. Dr. Ion Chiricuta", Cluj-Napoca, Romania.
| |
Collapse
|
7
|
Saberkari H, Bahrami S, Shamsi M, Amoshahy MJ, Ghavifekr HB, Sedaaghi MH. Fully Automated Complementary DNA Microarray Segmentation using a Novel Fuzzy-based Algorithm. JOURNAL OF MEDICAL SIGNALS & SENSORS 2015; 5:182-91. [PMID: 26284175 PMCID: PMC4528357 DOI: 10.4103/2228-7477.161494] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2015] [Accepted: 06/13/2015] [Indexed: 11/29/2022]
Abstract
DNA microarray is a powerful approach to study simultaneously, the expression of 1000 of genes in a single experiment. The average value of the fluorescent intensity could be calculated in a microarray experiment. The calculated intensity values are very close in amount to the levels of expression of a particular gene. However, determining the appropriate position of every spot in microarray images is a main challenge, which leads to the accurate classification of normal and abnormal (cancer) cells. In this paper, first a preprocessing approach is performed to eliminate the noise and artifacts available in microarray cells using the nonlinear anisotropic diffusion filtering method. Then, the coordinate center of each spot is positioned utilizing the mathematical morphology operations. Finally, the position of each spot is exactly determined through applying a novel hybrid model based on the principle component analysis and the spatial fuzzy c-means clustering (SFCM) algorithm. Using a Gaussian kernel in SFCM algorithm will lead to improving the quality in complementary DNA microarray segmentation. The performance of the proposed algorithm has been evaluated on the real microarray images, which is available in Stanford Microarray Databases. Results illustrate that the accuracy of microarray cells segmentation in the proposed algorithm reaches to 100% and 98% for noiseless/noisy cells, respectively.
Collapse
Affiliation(s)
- Hamidreza Saberkari
- Department of Electrical Engineering, Sahand University of Technology, Tabriz, Iran
| | - Sheyda Bahrami
- Department of Electrical Engineering, Sahand University of Technology, Tabriz, Iran
| | - Mousa Shamsi
- Department of Electrical Engineering, Sahand University of Technology, Tabriz, Iran
| | | | | | | |
Collapse
|
8
|
Shao G, Li T, Zuo W, Wu S, Liu T. A Combinational Clustering Based Method for cDNA Microarray Image Segmentation. PLoS One 2015; 10:e0133025. [PMID: 26241767 PMCID: PMC4524615 DOI: 10.1371/journal.pone.0133025] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2014] [Accepted: 06/22/2015] [Indexed: 12/03/2022] Open
Abstract
Microarray technology plays an important role in drawing useful biological conclusions by analyzing thousands of gene expressions simultaneously. Especially, image analysis is a key step in microarray analysis and its accuracy strongly depends on segmentation. The pioneering works of clustering based segmentation have shown that k-means clustering algorithm and moving k-means clustering algorithm are two commonly used methods in microarray image processing. However, they usually face unsatisfactory results because the real microarray image contains noise, artifacts and spots that vary in size, shape and contrast. To improve the segmentation accuracy, in this article we present a combination clustering based segmentation approach that may be more reliable and able to segment spots automatically. First, this new method starts with a very simple but effective contrast enhancement operation to improve the image quality. Then, an automatic gridding based on the maximum between-class variance is applied to separate the spots into independent areas. Next, among each spot region, the moving k-means clustering is first conducted to separate the spot from background and then the k-means clustering algorithms are combined for those spots failing to obtain the entire boundary. Finally, a refinement step is used to replace the false segmentation and the inseparable ones of missing spots. In addition, quantitative comparisons between the improved method and the other four segmentation algorithms--edge detection, thresholding, k-means clustering and moving k-means clustering--are carried out on cDNA microarray images from six different data sets. Experiments on six different data sets, 1) Stanford Microarray Database (SMD), 2) Gene Expression Omnibus (GEO), 3) Baylor College of Medicine (BCM), 4) Swiss Institute of Bioinformatics (SIB), 5) Joe DeRisi's individual tiff files (DeRisi), and 6) University of California, San Francisco (UCSF), indicate that the improved approach is more robust and sensitive to weak spots. More importantly, it can obtain higher segmentation accuracy in the presence of noise, artifacts and weakly expressed spots compared with the other four methods.
Collapse
Affiliation(s)
- Guifang Shao
- Department of Automation, Xiamen University, Xiamen, P.R. China
| | - Tiejun Li
- Information Engineering College, Jimei University, Xiamen, P.R. China
| | - Wangda Zuo
- Department of Civil, Architectural and Environmental Engineering, University of Miami, Coral Gables, United States of America
| | - Shunxiang Wu
- Department of Automation, Xiamen University, Xiamen, P.R. China
| | - Tundong Liu
- Department of Automation, Xiamen University, Xiamen, P.R. China
| |
Collapse
|
9
|
Katsigiannis S, Zacharia E, Maroulis D. Grow-cut based automatic cDNA microarray image segmentation. IEEE Trans Nanobioscience 2014; 14:138-45. [PMID: 25438323 DOI: 10.1109/tnb.2014.2369961] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Complementary DNA (cDNA) microarray is a well-established tool for simultaneously studying the expression level of thousands of genes. Segmentation of microarray images is one of the main stages in a microarray experiment. However, it remains an arduous and challenging task due to the poor quality of images. Images suffer from noise, artifacts, and uneven background, while spots depicted on images can be poorly contrasted and deformed. In this paper, an original approach for the segmentation of cDNA microarray images is proposed. First, a preprocessing stage is applied in order to reduce the noise levels of the microarray image. Then, the grow-cut algorithm is applied separately to each spot location, employing an automated seed selection procedure, in order to locate the pixels belonging to spots. Application on datasets containing synthetic and real microarray images shows that the proposed algorithm performs better than other previously proposed methods. Moreover, in order to exploit the independence of the segmentation task for each separate spot location, both a multithreaded CPU and a graphics processing unit (GPU) implementation were evaluated.
Collapse
|
10
|
Low-complexity PDE-based approach for automatic microarray image processing. Med Biol Eng Comput 2014; 53:99-110. [DOI: 10.1007/s11517-014-1214-2] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2013] [Accepted: 10/20/2014] [Indexed: 10/24/2022]
|
11
|
FPGA based system for automatic cDNA microarray image processing. Comput Med Imaging Graph 2012; 36:419-29. [DOI: 10.1016/j.compmedimag.2012.01.002] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2011] [Revised: 11/23/2011] [Accepted: 01/26/2012] [Indexed: 11/23/2022]
|
12
|
Zacharia E, Maroulis DE. 3-D Spot Modeling for Automatic Segmentation of cDNA Microarray Images. IEEE Trans Nanobioscience 2010; 9:181-92. [DOI: 10.1109/tnb.2010.2050900] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|