1
|
Efficient Deep Feature Based Semantic Image Retrieval. Neural Process Lett 2023. [DOI: 10.1007/s11063-022-11079-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
|
2
|
Liu Y, Tian M, Xu C, Zhao L. Neural network feature learning based on image self-encoding. INT J ADV ROBOT SYST 2020. [DOI: 10.1177/1729881420921653] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
With the rapid development of information technology and the arrival of the era of big data, people’s access to information is increasingly relying on information such as images. Today, image data are showing an increasing trend in the form of an index. How to use deep learning models to extract valuable information from massive data is very important. In the face of such a situation, people cannot accurately and timely find out the information they need. Therefore, the research on image retrieval technology is very important. Image retrieval is an important technology in the field of computer vision image processing. It realizes fast and accurate query of similar images in image database. The excellent feature representation not only can represent the category information of the image but also capture the relevant semantic information of the image. If the neural network feature learning expression is combined with the image retrieval field, it will definitely improve the application of image retrieval technology. To solve the above problems, this article studies the problems encountered in deep learning neural network feature learning based on image self-encoding and discusses its feature expression in the field of image retrieval. By adding the spatial relationship information obtained by image self-encoding in the neural network training process, the feature expression ability of the selected neural network is improved, and the neural network feature learning based on image coding is successfully applied to the popular field of image retrieval.
Collapse
Affiliation(s)
- Yangyang Liu
- School of Mechanical Engineering and Electronic Information, China University of Geosciences, Wuhan, Hubei, China
| | - Minghua Tian
- College of Electronic Information Engineering, Inner Mongolia University, Hohhot, Neimenggu, China
| | - Chang Xu
- School of Mechanical Engineering and Electronic Information, China University of Geosciences, Wuhan, Hubei, China
| | - Lixiang Zhao
- School of Mechanical Engineering and Electronic Information, China University of Geosciences, Wuhan, Hubei, China
| |
Collapse
|
3
|
Xu S, An X. ML2S-SVM: multi-label least-squares support vector machine classifiers. ELECTRONIC LIBRARY 2019. [DOI: 10.1108/el-09-2019-0207] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Purpose
Image classification is becoming a supporting technology in several image-processing tasks. Due to rich semantic information contained in the images, it is very popular for an image to have several labels or tags. This paper aims to develop a novel multi-label classification approach with superior performance.
Design/methodology/approach
Many multi-label classification problems share two main characteristics: label correlations and label imbalance. However, most of current methods are devoted to either model label relationship or to only deal with unbalanced problem with traditional single-label methods. In this paper, multi-label classification problem is regarded as an unbalanced multi-task learning problem. Multi-task least-squares support vector machine (MTLS-SVM) is generalized for this problem, renamed as multi-label LS-SVM (ML2S-SVM).
Findings
Experimental results on the emotions, scene, yeast and bibtex data sets indicate that the ML2S-SVM is competitive with respect to the state-of-the-art methods in terms of Hamming loss and instance-based F1 score. The values of resulting parameters largely influence the performance of ML2S-SVM, so it is necessary for users to identify proper parameters in advance.
Originality/value
On the basis of MTLS-SVM, a novel multi-label classification approach, ML2S-SVM, is put forward. This method can overcome the unbalanced problem but also explicitly models arbitrary order correlations among labels by allowing multiple labels to share a subspace. In addition, the multi-label classification approach has a wider range of applications. That is to say, it is not limited to the field of image classification.
Collapse
|
4
|
Alkhatib M, Hafiane A. Robust Adaptive Median Binary Pattern for Noisy Texture Classification and Retrieval. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2019; 28:5407-5418. [PMID: 31107648 DOI: 10.1109/tip.2019.2916742] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Texture is an important characteristic for different computer vision tasks and applications. Local binary pattern (LBP) is considered one of the most efficient texture descriptors yet. However, LBP has some notable limitations, in particular its sensitivity to noise. In this paper, we address these criteria by introducing a novel texture descriptor, robust adaptive median binary pattern (RAMBP). RAMBP is based on a process involving classification of noisy pixels, adaptive analysis window, scale analysis, and a comparison of image medians. The proposed method handles images with highly noisy textures and increases the discriminative properties by capturing microstructure and macrostructure texture information. The method was evaluated on popular texture datasets for classification and retrieval tasks and under different high noise conditions. Without any training or prior knowledge of the noise type, RAMBP achieved the best classification compared to state-of-the-art techniques. It scored more than 90% under 50% impulse noise densities, more than 95% under Gaussian noised textures with a standard deviation σ = 5 , more than 99% under Gaussian blurred textures with a standard deviation σ = 1.25 , and more than 90% for mixed noise. The proposed method yielded competitive results and proved to be one of the best descriptors in noise-free texture classification. Furthermore, RAMBP showed high performance for the problem of noisy texture retrieval providing high scores of recall and precision measures for textures with high noise levels. Finally, compared with the state-of-the-art methods, RAMBP achieves a good running time with low feature dimensionality.
Collapse
|
5
|
Zhao Y, Belkasim S, Arteta A, Lee S. The Stability and Noise Tolerance of Cartesian Zernike Moments Invariants. PATTERN RECOGNITION AND IMAGE ANALYSIS 2019. [DOI: 10.1134/s1054661818040296] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
6
|
Jeena Jacob I, Srinivasagan KG, Ebby Darney P, Jayapriya K. Deep learned Inter-Channel Colored Texture Pattern: a new chromatic-texture descriptor. Pattern Anal Appl 2019. [DOI: 10.1007/s10044-019-00780-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
7
|
Sarwar A, Mehmood Z, Saba T, Qazi KA, Adnan A, Jamal H. A novel method for content-based image retrieval to improve the effectiveness of the bag-of-words model using a support vector machine. J Inf Sci 2018. [DOI: 10.1177/0165551518782825] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
The advancements in the multimedia technologies result in the growth of the image databases. To retrieve images from such image databases using visual attributes of the images is a challenging task due to the close visual appearance among the visual attributes of these images, which also introduces the issue of the semantic gap. In this article, we recommend a novel method established on the bag-of-words (BoW) model, which perform visual words integration of the local intensity order pattern (LIOP) feature and local binary pattern variance (LBPV) feature to reduce the issue of the semantic gap and enhance the performance of the content-based image retrieval (CBIR). The recommended method uses LIOP and LBPV features to build two smaller size visual vocabularies (one from each feature), which are integrated together to build a larger size of the visual vocabulary, which also contains complementary features of both descriptors. Because for efficient CBIR, the smaller size of the visual vocabulary improves the recall, while the bigger size of the visual vocabulary improves the precision or accuracy of the CBIR. The comparative analysis of the recommended method is performed on three image databases, namely, WANG-1K, WANG-1.5K and Holidays. The experimental analysis of the recommended method on these image databases proves its robust performance as compared with the recent CBIR methods.
Collapse
Affiliation(s)
- Amna Sarwar
- Department of Software Engineering, University of Engineering and Technology – Taxila, Pakistan
| | - Zahid Mehmood
- Department of Software Engineering, University of Engineering and Technology – Taxila, Pakistan
| | - Tanzila Saba
- College of Computer & Information Sciences, Prince Sultan University, Riyadh, Saudi Arabia
| | - Khurram Ashfaq Qazi
- Department of Software Engineering, University of Engineering and Technology – Taxila, Pakistan
| | - Ahmed Adnan
- Department of Computer Science, University of Engineering and Technology – Taxila, Pakistan
| | - Habibullah Jamal
- Faculty of Engineering Sciences, Ghulam Ishaq Khan Institute of Engineering Sciences and Technology, Topi, Pakistan
| |
Collapse
|
8
|
Chen YW, Huang X, Chen D, Han XH. Generic and Specific Impressions Estimation and Their Application to KANSEI-Based Clothing Fabric Image Retrieval. INT J PATTERN RECOGN 2018. [DOI: 10.1142/s0218001418540241] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Current image retrieval techniques are mainly based on text or visual contents. However, both text-based and contents-based methods lack the capability of utilizing human intuition and KANSEI (impression). In this paper, we proposed an impression-based image retrieval method in order to realize the image retrieval according to our impression presented by impression keywords. We first propose a generic and specific impressions estimation method based on machine learning and then apply it to impression-based clothing fabric image retrieval. We use a semantic differential (SD) method to measure the user’s impressions such as brightness and warmth while they view a cloth fabric image. We also extract both global and local features of cloth fabric images such as color and texture using computer vision techniques. Then we use support vector regression to model the mapping functions between the generic impression (or specific impression) and image features. The learnt mapping functions are used to estimate the generic and specific impressions of cloth fabric images. The retrieval is done by comparing the query impression with the estimated impression of images in the database.
Collapse
Affiliation(s)
- Yen-Wei Chen
- College of Information Science and Engineering, Ritsumeikan University, Kusatsu, Gunma, Japan
- College of Computer Science and Technology, Zhejiang University, Zhejiang, P. R. China
| | - Xinyin Huang
- School of Education, Soochow University, Suzhou, Jiangsu, P. R. China
| | - Dingye Chen
- College of Information Science and Engineering, Ritsumeikan University, Kusatsu, Gunma, Japan
| | - Xian-Hua Han
- Artificial Intelligence Research Center, Yamaguchi University, Yamaguchi, Japan
| |
Collapse
|
9
|
Hwang KH, Lee H, Koh G, Willrett D, Rubin DL. Building and Querying RDF/OWL Database of Semantically Annotated Nuclear Medicine Images. J Digit Imaging 2018; 30:4-10. [PMID: 27785632 DOI: 10.1007/s10278-016-9916-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
As the use of positron emission tomography-computed tomography (PET-CT) has increased rapidly, there is a need to retrieve relevant medical images that can assist image interpretation. However, the images themselves lack the explicit information needed for query. We constructed a semantically structured database of nuclear medicine images using the Annotation and Image Markup (AIM) format and evaluated the ability the AIM annotations to improve image search. We created AIM annotation templates specific to the nuclear medicine domain and used them to annotate 100 nuclear medicine PET-CT studies in AIM format using controlled vocabulary. We evaluated image retrieval from 20 specific clinical queries. As the gold standard, two nuclear medicine physicians manually retrieved the relevant images from the image database using free text search of radiology reports for the same queries. We compared query results with the manually retrieved results obtained by the physicians. The query performance indicated a 98 % recall for simple queries and a 89 % recall for complex queries. In total, the queries provided 95 % (75 of 79 images) recall, 100 % precision, and an F1 score of 0.97 for the 20 clinical queries. Three of the four images missed by the queries required reasoning for successful retrieval. Nuclear medicine images augmented using semantic annotations in AIM enabled high recall and precision for simple queries, helping physicians to retrieve the relevant images. Further study using a larger data set and the implementation of an inference engine may improve query results for more complex queries.
Collapse
Affiliation(s)
- Kyung Hoon Hwang
- Department of Nuclear Medicine, Gachon University Gil Medical Center, Incheon, South Korea
| | - Haejun Lee
- Department of Nuclear Medicine, Gachon University Gil Medical Center, Incheon, South Korea
| | - Geon Koh
- Department of Nuclear Medicine, Gachon University Gil Medical Center, Incheon, South Korea
| | - Debra Willrett
- Department of Radiology, Stanford University, Stanford, CA, USA
| | - Daniel L Rubin
- Department of Radiology, Stanford University, Stanford, CA, USA. .,Department of Medicine (Biomedical Informatics Research), Stanford University, Stanford, CA, USA.
| |
Collapse
|
10
|
Sabetghadam S, Lupu M, Bierig R, Rauber A. A faceted approach to reachability analysis of graph modelled collections. INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL 2017; 7:157-171. [PMID: 30956928 PMCID: PMC6417456 DOI: 10.1007/s13735-017-0145-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/06/2017] [Revised: 10/31/2017] [Accepted: 11/28/2017] [Indexed: 06/09/2023]
Abstract
Nowadays, there is a proliferation of available information sources from different modalities-text, images, audio, video and more. Information objects are not isolated anymore. They are frequently connected via metadata, semantic links, etc. This leads to various challenges in graph-based information retrieval. This paper is concerned with the reachability analysis of multimodal graph modelled collections. We use our framework to leverage the combination of features of different modalities through our formulation of faceted search. This study highlights the effect of different facets and link types in improving reachability of relevant information objects. The experiments are performed on the Image CLEF 2011 Wikipedia collection with about 400,000 documents and images. The results demonstrate that the combination of different facets is conductive to obtain higher reachability. We obtain 373% recall gain for very hard topics by using our graph model of the collection. Further, by adding semantic links to the collection, we gain a 10% increase in the overall recall.
Collapse
Affiliation(s)
- Serwah Sabetghadam
- Institute of Software Technology and Interactive Systems, Vienna University of Technology, Vienna, Austria
| | - Mihai Lupu
- Institute of Software Technology and Interactive Systems, Vienna University of Technology, Vienna, Austria
| | - Ralf Bierig
- Department of Computer Science, Maynooth University, Maynooth, Ireland
| | - Andreas Rauber
- Institute of Software Technology and Interactive Systems, Vienna University of Technology, Vienna, Austria
| |
Collapse
|
11
|
Yu J, Yang X, Gao F, Tao D. Deep Multimodal Distance Metric Learning Using Click Constraints for Image Ranking. IEEE TRANSACTIONS ON CYBERNETICS 2017; 47:4014-4024. [PMID: 27529881 DOI: 10.1109/tcyb.2016.2591583] [Citation(s) in RCA: 105] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
How do we retrieve images accurately? Also, how do we rank a group of images precisely and efficiently for specific queries? These problems are critical for researchers and engineers to generate a novel image searching engine. First, it is important to obtain an appropriate description that effectively represent the images. In this paper, multimodal features are considered for describing images. The images unique properties are reflected by visual features, which are correlated to each other. However, semantic gaps always exist between images visual features and semantics. Therefore, we utilize click feature to reduce the semantic gap. The second key issue is learning an appropriate distance metric to combine these multimodal features. This paper develops a novel deep multimodal distance metric learning (Deep-MDML) method. A structured ranking model is adopted to utilize both visual and click features in distance metric learning (DML). Specifically, images and their related ranking results are first collected to form the training set. Multimodal features, including click and visual features, are collected with these images. Next, a group of autoencoders is applied to obtain initially a distance metric in different visual spaces, and an MDML method is used to assign optimal weights for different modalities. Next, we conduct alternating optimization to train the ranking model, which is used for the ranking of new queries with click features. Compared with existing image ranking methods, the proposed method adopts a new ranking model to use multimodal features, including click features and visual features in DML. We operated experiments to analyze the proposed Deep-MDML in two benchmark data sets, and the results validate the effects of the method.
Collapse
|
12
|
Ge M, Persia F. A Survey of Multimedia Recommender Systems: Challenges and Opportunities. INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING 2017. [DOI: 10.1142/s1793351x17500039] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Multimedia information has been extensively growing from a variety of sources such as cameras or video recorders. In order to select the useful multimedia objects, multimedia recommender system has been emerging as a tool to help users choose which multimedia objects might be interesting for them. However, given the complexity of multimedia objects, it is challenging to provide effective multimedia recommendations. In this paper, we therefore conduct a survey in both the multimedia information system and recommender system communities. We further focus on the works that span the two communities, especially the research on multimedia recommender systems. Based on our review, we propose a set of research challenges, which can be used to implicate the future research directions for multimedia recommender systems. For each research challenge, we have also provided the insights of how to perform the follow-up research.
Collapse
Affiliation(s)
- Mouzhi Ge
- Faculty of Informatics, Masaryk University, Brno 60200, Czech Republic
| | - Fabio Persia
- Faculty of Computer Science, Free University of Bozen-Bolzano, Bozen-Bolzano, 39100, Italy
| |
Collapse
|
13
|
Islam SM, Banerjee M, Bhattacharyya S, Chakraborty S. Content-based image retrieval based on multiple extended fuzzy-rough framework. Appl Soft Comput 2017. [DOI: 10.1016/j.asoc.2017.03.036] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
14
|
Mukherjee A. Retrieval of Multimedia Information Using Content-Based Image Retrieval (CBIR) Techniques. Biometrics 2017. [DOI: 10.4018/978-1-5225-0983-7.ch027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
This chapter will focus on the concept of Content-based image retrieval. Searching of an image or video database based on text based description is a manual labor intensive process. Descriptions of the file are usually typed manually for each image by human operators because the automatic generation of keywords for the images is difficult without incorporation of visual information and feature extraction. This method is impractical in today's multimedia information era. “Content-based” means that the search will analyze the actual contents of the image rather than the metadata such as keywords, tags, and descriptions associated with the image. The term “content” in this context might refer to colors, shapes, textures, or any other information that can be derived from the image itself. Several important sections are highlighted in this chapter, like architectures, query techniques, multidimensional indexing, video retrieval and different application sections of CBIR.
Collapse
|
15
|
|
16
|
A new fusion approach for content based image retrieval with color histogram and local directional pattern. INT J MACH LEARN CYB 2016. [DOI: 10.1007/s13042-016-0597-9] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
17
|
Mane PP, Bawane NG. An effective technique for the content based image retrieval to reduce the semantic gap based on an optimal classifier technique. PATTERN RECOGNITION AND IMAGE ANALYSIS 2016. [DOI: 10.1134/s1054661816030159] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
18
|
A medical image retrieval scheme with relevance feedback through a medical social network. SOCIAL NETWORK ANALYSIS AND MINING 2016. [DOI: 10.1007/s13278-016-0362-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
19
|
Angelov P, Sadeghi-Tehran P. Look-a-Like: A Fast Content-Based Image Retrieval Approach Using a Hierarchically Nested Dynamically Evolving Image Clouds and Recursive Local Data Density. INT J INTELL SYST 2016. [DOI: 10.1002/int.21837] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Affiliation(s)
- Plamen Angelov
- School of Computing and Communications, Data Science Group; Lancaster University; Lancaster LA1 4WA UK
| | - Pouria Sadeghi-Tehran
- School of Computing and Communications, Data Science Group; Lancaster University; Lancaster LA1 4WA UK
| |
Collapse
|
20
|
Abstract
In the commercial use of picture collections, a heavy dependency continues to be exhibited on a concept-based image retrieval paradigm in which the query is verbalised by the client and resolved as a metadata text-matching operation. The practical and philosophical challenges posed by the indexing aspect of image metadata construction are significant and frequently expressed. Nevertheless, it has taken image digitisation to bring this particular information retrieval problem to prominence in the research agenda. Metamorphosed into a binary data structure, the digital image offers some enticing processing opportunities which content-based image retrieval techniques are exploiting with developing success. Drawing on studies of user need, this paper seeks to explain why a heavy dependency will continue to be placed on concept-based rather than content-based image retrieval techniques within archival image collections. In contrast, the promising nature of content-based techniques from the viewpoint of a growing clientele with less traditional visual information needs will also be considered. The paper concludes by offering the view that, while both concept-based and content-based approaches suffer from operational limitations, the further development of a hybrid image retrieval paradigm which combines the two approaches makes a potentially valuable contribution to the research agenda for visual image retrieval.
Collapse
|
21
|
Spatial Up-Scaling Correction for Leaf Area Index Based on the Fractal Theory. REMOTE SENSING 2016. [DOI: 10.3390/rs8030197] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
22
|
Xu Y, Lin L, Hu H, Yu H, Jin C, Wang J, Han X, Chen YW. Combined Density, Texture and Shape Features of Multi-phase Contrast-Enhanced CT Images for CBIR of Focal Liver Lesions: A Preliminary Study. INNOVATION IN MEDICINE AND HEALTHCARE 2015 2016. [DOI: 10.1007/978-3-319-23024-5_20] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/09/2023]
|
23
|
Kurc T, Qi X, Wang D, Wang F, Teodoro G, Cooper L, Nalisnik M, Yang L, Saltz J, Foran DJ. Scalable analysis of Big pathology image data cohorts using efficient methods and high-performance computing strategies. BMC Bioinformatics 2015; 16:399. [PMID: 26627175 PMCID: PMC4667532 DOI: 10.1186/s12859-015-0831-6] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2015] [Accepted: 11/16/2015] [Indexed: 11/16/2022] Open
Abstract
Background We describe a suite of tools and methods that form a core set of capabilities for researchers and clinical investigators to evaluate multiple analytical pipelines and quantify sensitivity and variability of the results while conducting large-scale studies in investigative pathology and oncology. The overarching objective of the current investigation is to address the challenges of large data sizes and high computational demands. Results The proposed tools and methods take advantage of state-of-the-art parallel machines and efficient content-based image searching strategies. The content based image retrieval (CBIR) algorithms can quickly detect and retrieve image patches similar to a query patch using a hierarchical analysis approach. The analysis component based on high performance computing can carry out consensus clustering on 500,000 data points using a large shared memory system. Conclusions Our work demonstrates efficient CBIR algorithms and high performance computing can be leveraged for efficient analysis of large microscopy images to meet the challenges of clinically salient applications in pathology. These technologies enable researchers and clinical investigators to make more effective use of the rich informational content contained within digitized microscopy specimens.
Collapse
Affiliation(s)
- Tahsin Kurc
- Department of Biomedical Informatics, Stony Brook University, Stony Brook, USA.
| | - Xin Qi
- Department of Pathology & Laboratory Medicine, Rutgers -- Robert Wood Johnson Medical School, New Brunswick, USA. .,Rutgers Cancer Institute of New Jersey, New Brunswick, USA.
| | - Daihou Wang
- Department of Electrical and Computer Engineering, Rutgers University, New Brunswick, USA.
| | - Fusheng Wang
- Department of Biomedical Informatics, Stony Brook University, Stony Brook, USA. .,Department of Computer Science, Stony Brook University, Stony Brook, USA.
| | - George Teodoro
- Department of Biomedical Informatics, Stony Brook University, Stony Brook, USA. .,Department of Computer Science, University of Brasilia, Brasília, Brazil.
| | - Lee Cooper
- Department of Biomedical Informatics, Emory University, Atlanta, USA.
| | - Michael Nalisnik
- Department of Biomedical Informatics, Emory University, Atlanta, USA.
| | - Lin Yang
- Department of Biomedical Engineering, University of Florida, Gainesville, USA.
| | - Joel Saltz
- Department of Biomedical Informatics, Stony Brook University, Stony Brook, USA.
| | - David J Foran
- Department of Pathology & Laboratory Medicine, Rutgers -- Robert Wood Johnson Medical School, New Brunswick, USA. .,Rutgers Cancer Institute of New Jersey, New Brunswick, USA.
| |
Collapse
|
24
|
Qi X, Wang D, Rodero I, Diaz-Montes J, Gensure RH, Xing F, Zhong H, Goodell L, Parashar M, Foran DJ, Yang L. Content-based histopathology image retrieval using CometCloud. BMC Bioinformatics 2014; 15:287. [PMID: 25155691 PMCID: PMC4161917 DOI: 10.1186/1471-2105-15-287] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2013] [Accepted: 08/12/2014] [Indexed: 11/12/2022] Open
Abstract
Background The development of digital imaging technology is creating extraordinary levels of accuracy that provide support for improved reliability in different aspects of the image analysis, such as content-based image retrieval, image segmentation, and classification. This has dramatically increased the volume and rate at which data are generated. Together these facts make querying and sharing non-trivial and render centralized solutions unfeasible. Moreover, in many cases this data is often distributed and must be shared across multiple institutions requiring decentralized solutions. In this context, a new generation of data/information driven applications must be developed to take advantage of the national advanced cyber-infrastructure (ACI) which enable investigators to seamlessly and securely interact with information/data which is distributed across geographically disparate resources. This paper presents the development and evaluation of a novel content-based image retrieval (CBIR) framework. The methods were tested extensively using both peripheral blood smears and renal glomeruli specimens. The datasets and performance were evaluated by two pathologists to determine the concordance. Results The CBIR algorithms that were developed can reliably retrieve the candidate image patches exhibiting intensity and morphological characteristics that are most similar to a given query image. The methods described in this paper are able to reliably discriminate among subtle staining differences and spatial pattern distributions. By integrating a newly developed dual-similarity relevance feedback module into the CBIR framework, the CBIR results were improved substantially. By aggregating the computational power of high performance computing (HPC) and cloud resources, we demonstrated that the method can be successfully executed in minutes on the Cloud compared to weeks using standard computers. Conclusions In this paper, we present a set of newly developed CBIR algorithms and validate them using two different pathology applications, which are regularly evaluated in the practice of pathology. Comparative experimental results demonstrate excellent performance throughout the course of a set of systematic studies. Additionally, we present and evaluate a framework to enable the execution of these algorithms across distributed resources. We show how parallel searching of content-wise similar images in the dataset significantly reduces the overall computational time to ensure the practical utility of the proposed CBIR algorithms.
Collapse
Affiliation(s)
- Xin Qi
- Department of Pathology and Laboratory Medicine, Rutger Robert Wood Johnson Medical School, 675 Hoes Lane, Piscataway, NJ, USA.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
25
|
Kumar A, Kim J, Cai W, Fulham M, Feng D. Content-based medical image retrieval: a survey of applications to multidimensional and multimodality data. J Digit Imaging 2013; 26:1025-39. [PMID: 23846532 PMCID: PMC3824925 DOI: 10.1007/s10278-013-9619-2] [Citation(s) in RCA: 138] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
Medical imaging is fundamental to modern healthcare, and its widespread use has resulted in the creation of image databases, as well as picture archiving and communication systems. These repositories now contain images from a diverse range of modalities, multidimensional (three-dimensional or time-varying) images, as well as co-aligned multimodality images. These image collections offer the opportunity for evidence-based diagnosis, teaching, and research; for these applications, there is a requirement for appropriate methods to search the collections for images that have characteristics similar to the case(s) of interest. Content-based image retrieval (CBIR) is an image search technique that complements the conventional text-based retrieval of images by using visual features, such as color, texture, and shape, as search criteria. Medical CBIR is an established field of study that is beginning to realize promise when applied to multidimensional and multimodality medical data. In this paper, we present a review of state-of-the-art medical CBIR approaches in five main categories: two-dimensional image retrieval, retrieval of images with three or more dimensions, the use of nonimage data to enhance the retrieval, multimodality image retrieval, and retrieval from diverse datasets. We use these categories as a framework for discussing the state of the art, focusing on the characteristics and modalities of the information used during medical image retrieval.
Collapse
Affiliation(s)
- Ashnil Kumar
- Biomedical and Multimedia Information Technology (BMIT) Research Group, School of Information Technologies, University of Sydney, Building J12, Sydney, NSW, 2006, Australia,
| | | | | | | | | |
Collapse
|
26
|
Krishnamoorthi R, Sathiya Devi S. A simple computational model for image retrieval with weighted multifeatures based on orthogonal polynomials and genetic algorithm. Neurocomputing 2013. [DOI: 10.1016/j.neucom.2012.05.030] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
27
|
García Seco de Herrera A, Markonis D, Müller H. Bag–of–Colors for Biomedical Document Image Classification. ACTA ACUST UNITED AC 2013. [DOI: 10.1007/978-3-642-36678-9_11] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/21/2023]
|
28
|
Hwang KH, Lee H, Choi D. Medical image retrieval: past and present. Healthc Inform Res 2012; 18:3-9. [PMID: 22509468 PMCID: PMC3324753 DOI: 10.4258/hir.2012.18.1.3] [Citation(s) in RCA: 51] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2012] [Revised: 03/24/2012] [Accepted: 03/26/2012] [Indexed: 11/23/2022] Open
Abstract
With the widespread dissemination of picture archiving and communication systems (PACSs) in hospitals, the amount of imaging data is rapidly increasing. Effective image retrieval systems are required to manage these complex and large image databases. The authors reviewed the past development and the present state of medical image retrieval systems including text-based and content-based systems. In order to provide a more effective image retrieval service, the intelligent content-based retrieval systems combined with semantic systems are required.
Collapse
Affiliation(s)
- Kyung Hoon Hwang
- Department of Nuclear Medicine, Gachon University Gil Hospital, Incheon, Korea
| | | | | |
Collapse
|
29
|
Mining historical manuscripts with local color patches. Knowl Inf Syst 2012. [DOI: 10.1007/s10115-011-0401-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
30
|
SHYU MEILING, CHEN SHUCHING, SUN QIBIN, YU HEATHER. OVERVIEW AND FUTURE TRENDS OF MULTIMEDIA RESEARCH FOR CONTENT ACCESS AND DISTRIBUTION. INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING 2012. [DOI: 10.1142/s1793351x07000044] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
The advances in information technology, computational capability, and communication networks have enabled large-scale data collection and distribution of vast amounts of multimedia data available to consumer and enterprise applications. With the proliferation of multimedia data and ever-growing requests for multimedia applications, reliable and efficient tools and techniques are urgently sought for multimedia content analysis and retrieval, as well as secure media streaming, distribution or communication. Though many research efforts have been devoted to these aspects, it is still far from maturity and there exist many open issues. In this paper, an overview of the challenges and issues as well as future trends on multimedia content access and distribution will be discussed. The focuses include: (i) how to bridge the gaps between the semantic meaning and low-level media characteristics; (ii) how to handle user perception subjectivity problem; (iii) how to provide multimedia network communication support for media streaming and P2P (peer-to-peer) media distribution; and (iv) how to ensure various aspects of secure media, including acquisition, processing, storage, and communication.
Collapse
Affiliation(s)
- MEI-LING SHYU
- Department of Electrical and Computer Engineering, University of Miami, Coral Gables, FL 33124, USA
| | - SHU-CHING CHEN
- School of Computing and Information Sciences, Florida International University, Miami, FL 33199, USA
| | - QIBIN SUN
- Department of Media Understanding, Institute for Infocomm Research, Singapore 119613, Singapore
| | - HEATHER YU
- Panasonic Princeton Laboratory, 2 Research Way, Princeton, NJ, 08540, USA
| |
Collapse
|
31
|
CHATZICHRISTOFIS SAVVASA, ZAGORIS KONSTANTINOS, BOUTALIS YIANNISS, PAPAMARKOS NIKOS. ACCURATE IMAGE RETRIEVAL BASED ON COMPACT COMPOSITE DESCRIPTORS AND RELEVANCE FEEDBACK INFORMATION. INT J PATTERN RECOGN 2011. [DOI: 10.1142/s0218001410007890] [Citation(s) in RCA: 62] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
In this paper a new set of descriptors appropriate for image indexing and retrieval is proposed. The proposed descriptors address the tremendously increased need for efficient content-based image retrieval (CBIR) in many application areas such as the Internet, biomedicine, commerce and education. These applications commonly store image information in large image databases where the image information cannot be accessed or used unless the database is organized to allow efficient storage, browsing and retrieval. To be applicable in the design of large image databases, the proposed descriptors are compact, with the smallest requiring only 23 bytes per image. The proposed descriptors' structure combines color and texture information which are extracted using fuzzy approaches. To evaluate the performance of the proposed descriptors, the objective Average Normalized Modified Retrieval Rank (ANMRR) is used. Experiments conducted on five benchmarking image databases demonstrate the effectiveness of the proposed descriptors in outperforming other state-of-the-art descriptors. Also, a Auto Relevance Feedback (ARF) technique is introduced which is based on the proposed descriptors. This technique readjusts the initial retrieval results based on user preferences improving the retrieval score significantly. An online demo of the image retrieval system img(Anaktisi) that implements the proposed descriptors can be found at .
Collapse
Affiliation(s)
- SAVVAS A. CHATZICHRISTOFIS
- Department of Electrical and Computer Engineering, Democritus University of Thrace, 12. Vas. Sofias, Xanthi, 67100, Greece
| | - KONSTANTINOS ZAGORIS
- Department of Electrical and Computer Engineering, Democritus University of Thrace, 12. Vas. Sofias, Xanthi, 67100, Greece
| | - YIANNIS S. BOUTALIS
- Department of Electrical and Computer Engineering, Democritus University of Thrace, 12. Vas. Sofias, Xanthi, 67100, Greece
| | - NIKOS PAPAMARKOS
- Department of Electrical and Computer Engineering, Democritus University of Thrace, 12. Vas. Sofias, Xanthi, 67100, Greece
| |
Collapse
|
32
|
KWAN PAULW, KAMEYAMA KEISUKE, GAO JUNBIN, TORAICHI KAZUO. CONTENT-BASED IMAGE RETRIEVAL OF CULTURAL HERITAGE SYMBOLS BY INTERACTION OF VISUAL PERSPECTIVES. INT J PATTERN RECOGN 2011. [DOI: 10.1142/s0218001411008816] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Content-based Image Retrieval (CBIR) has been an active area of research for retrieving similar images from large repositories, without the prerequisite of manual labeling. Most current CBIR algorithms can faithfully return a list of images that matches the visual perspective of their inventors, who might decide to use a certain combination of image features like edges, colors and textures of regions as well as their spatial distribution during processing. In practice, however, the retrieved images rarely correspond exactly to the results expected by the users, a problem that has come to be known as the semantic gap. In this paper, we propose a novel and extensible multidimensional approach called matrix of visual perspectives as a solution for addressing this semantic gap. Our approach exploits the dynamic cross-interaction (in other words, mix-and-match) of image features and similarity metrics to produce results that attempt to mimic the mental visual picture of the user. Experimental results on retrieving similar Japanese cultural heritage symbols called kamons by a prototype system confirm that the interaction of visual perspectives in the user can be effectively captured and reflected. The benefits of this approach are broader. They can be equally applicable to the development of CBIR systems for other types of images, whether cultural or noncultural, by adapting to different sets of application specific image features.
Collapse
Affiliation(s)
- PAUL W. KWAN
- School of Science and Technology, University of New England, Armidale NSW 2351, Australia
| | - KEISUKE KAMEYAMA
- Department of Computer Science, Graduate School of SIE, University of Tsukuba, 1-1-1 Tennodai Tsukuba-Shi Ibaraki 305-8573, Japan
| | - JUNBIN GAO
- School of Computing and Mathematics, Charles Sturt University, Bathurst NSW 2795, Australia
| | - KAZUO TORAICHI
- Fluency Laboratories Inc., Tsukuba Industrial Liaison and Cooperative Research Center, University of Tsukuba, 1-1-1 Tennodai Tsukuba-Shi Ibaraki 205-8573, Japan
| |
Collapse
|
33
|
YOO HUNWOO, JANG DONGSIK, SEO KWANGKYU, LEE MYUNGEUI. RETRIEVING IMAGES BY COMPARING HOMOGENEOUS COLOR AND TEXTURE OBJECTS IN THE IMAGE. INT J PATTERN RECOGN 2011. [DOI: 10.1142/s0218001404003514] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
An object-based image retrieval method is addressed in this paper. For that purpose, a new image segmentation algorithm and image comparing method between segmented objects are proposed. For image segmentation, color and textural features are extracted from each pixel in the image and these features are used as inputs into VQ (Vector Quantization) clustering method, which yields homogeneous objects in terms of color and texture. In this procedure, colors are quantized into a few dominant colors for simple representation and efficient retrieval. In the retrieval case, two comparing schemes are proposed. Comparisons between one query object and multi-objects of a database image and comparisons between multi-query objects and multi-objects of a database image are proposed. For fast retrieval, dominant object colors are key-indexed into the database.
Collapse
Affiliation(s)
- HUN-WOO YOO
- Center for Cognitive Science, Yonsei University, 134, Shinchon-Dong, Seodaemun-Ku, Seoul 120-749, Korea
| | - DONG-SIK JANG
- Department of Industrial Systems and Information Engineering, Korea University, Sungbuk-gu Anam-dong 5 Ga 1, Seoul 136-701, Korea
| | - KWANG-KYU SEO
- Department of Industrial Information and Systems Engineering, Sangmyung University, San 98-20, Anso-Dong, Chonan, Chungnam 330-720, Korea
| | - MYUNG-EUI LEE
- School of Information Technology, Korea University of Technology & Education, Chonan P.O.B 55, Chungnam, 330-708, Korea
| |
Collapse
|
34
|
Kumar N, Berg AC, Belhumeur PN, Nayar SK. Describable Visual Attributes for Face Verification and Image Search. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2011; 33:1962-1977. [PMID: 21383395 DOI: 10.1109/tpami.2011.48] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
We introduce the use of describable visual attributes for face verification and image search. Describable visual attributes are labels that can be given to an image to describe its appearance. This paper focuses on images of faces and the attributes used to describe them, although the concepts also apply to other domains. Examples of face attributes include gender, age, jaw shape, nose size, etc. The advantages of an attribute-based representation for vision tasks are manifold: They can be composed to create descriptions at various levels of specificity; they are generalizable, as they can be learned once and then applied to recognize new objects or categories without any further training; and they are efficient, possibly requiring exponentially fewer attributes (and training data) than explicitly naming each category. We show how one can create and label large data sets of real-world images to train classifiers which measure the presence, absence, or degree to which an attribute is expressed in images. These classifiers can then automatically label new images. We demonstrate the current effectiveness--and explore the future potential--of using attributes for face verification and image search via human and computational experiments. Finally, we introduce two new face data sets, named FaceTracer and PubFig, with labeled attributes and identities, respectively.
Collapse
|
35
|
Biomedical imaging modality classification using combined visual features and textual terms. Int J Biomed Imaging 2011; 2011:241396. [PMID: 21912534 PMCID: PMC3170788 DOI: 10.1155/2011/241396] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2011] [Revised: 05/12/2011] [Accepted: 07/06/2011] [Indexed: 11/18/2022] Open
Abstract
We describe an approach for the automatic modality classification in medical image retrieval task of the 2010 CLEF cross-language image retrieval campaign (ImageCLEF). This paper is focused on the process of feature
extraction from medical images and fuses the different extracted visual features and textual feature for modality classification. To extract visual features from the images, we used histogram descriptor of edge, gray, or color intensity and block-based variation as global features and SIFT histogram as local feature. For textual feature of image representation, the binary histogram of some predefined vocabulary words from image captions is used. Then, we combine the different features using normalized kernel functions for SVM classification. Furthermore, for some easy misclassified modality pairs such as CT and MR or PET and NM modalities, a local classifier is used for distinguishing samples in the pair modality to improve performance. The proposed strategy is evaluated with the provided modality dataset by ImageCLEF 2010.
Collapse
|
36
|
Qi X, Barrett S, Chang R. A noise-resilient collaborative learning approach to content-based image retrieval. INT J INTELL SYST 2011. [DOI: 10.1002/int.20503] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
37
|
Okamoto K, Dong F, Yoshida S, Hirota K. Content-Based Image Retrieval via Combination of Similarity Measures. JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS 2011. [DOI: 10.20965/jaciii.2011.p0687] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
A multiple (dis)similarity measure combination framework via normalization and weighting of measures is proposed to find suitable measure combinations in terms of retrieval accuracy and computational cost. In the combination of Manhattan and Hellinger distances, the computational time is more than 12 times faster and the retrieval accuracy improves or remains at the same level, when compared with Minkowski distance, a measure having the best retrieval accuracy in the single measure scenario. These performances are determined on a visual word based image retrieval system by using the Corel collections. Due to the reduction of computational cost and robustness of retrieval accuracy in this combination, applications include retrieval employing large number of images and categories in a database.
Collapse
|
38
|
Thang ND, Rasheed T, Lee YK, Lee S, Kim TS. Content-based facial image retrieval using constrained independent component analysis. Inf Sci (N Y) 2011. [DOI: 10.1016/j.ins.2011.03.021] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
39
|
|
40
|
Wang HJ, Chang CY. Semantic real-world image classification for image retrieval with fuzzy-ART neural network. Neural Comput Appl 2011. [DOI: 10.1007/s00521-011-0660-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
41
|
You J, Li Q, Wang J. On Hierarchical Content-Based Image Retrieval by Dynamic Indexing and Guided Search. INTERNATIONAL JOURNAL OF COGNITIVE INFORMATICS AND NATURAL INTELLIGENCE 2010. [DOI: 10.4018/jcini.2010100102] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
This paper presents a new approach to content-based image retrieval by using dynamic indexing and guided search in a hierarchical structure, and extending data mining and data warehousing techniques. The proposed algorithms include a wavelet-based scheme for multiple image feature extraction, the extension of a conventional data warehouse and an image database to an image data warehouse for dynamic image indexing. It also provides an image data schema for hierarchical image representation and dynamic image indexing, a statistically based feature selection scheme to achieve flexible similarity measures, and a feature component code to facilitate query processing and guide the search for the best matching. A series of case studies are reported, which include a wavelet-based image color hierarchy, classification of satellite images, tropical cyclone pattern recognition, and personal identification using multi-level palmprint and face features. Experimental results confirm that the new approach is feasible for content-based image retrieval.
Collapse
Affiliation(s)
- Jane You
- The Hong Kong Polytechnic University, China
| | - Qin Li
- The Hong Kong Polytechnic University, China
| | | |
Collapse
|
42
|
Fast shape matching and retrieval based on approximate dynamic space warping. ARTIFICIAL LIFE AND ROBOTICS 2010. [DOI: 10.1007/s10015-010-0814-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
43
|
Bae SH, Juang BH. IPSILON: incremental parsing for semantic indexing of latent concepts. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2010; 19:1933-1947. [PMID: 20215073 DOI: 10.1109/tip.2010.2045019] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/28/2023]
Abstract
A new framework for content-based image retrieval, which takes advantage of the source characterization property of a universal source coding scheme, is investigated. Based upon a new class of multidimensional incremental parsing algorithm, extended from the Lempel-Ziv incremental parsing code, the proposed method captures the occurrence pattern of visual elements from a given image. A linguistic processing technique, namely the latent semantic analysis (LSA) method, is then employed to identify associative ensembles of visual elements, which lay the foundation for intelligent visual information analysis. In 2-D applications, incremental parsing decomposes an image into elementary patches that are different from the conventional fixed square-block type patches. When used in compressive representations, it is amenable in schemes that do not rely on average distortion criteria, a methodology that is a departure from the conventional vector quantization. We call this methodology a parsed representation. In this article, we present our implementations of an image retrieval system, called IPSILON, with parsed representations induced by different perceptual distortion thresholds. We evaluate the effectiveness of the use of the parsed representations by comparing their performance with that of four image retrieval systems, one using the conventional vector quantization for visual information analysis under the same LSA paradigm, another using a method called SIMPLIcity which is based upon an image segmentation and integrated region matching, and the other two based upon query-by-semantic-example and query-by-visual-example. The first two of them were tested with 20,000 images of natural scenes, and the others were tested with a portion of the images. The experimental results show that the proposed parsed representation efficiently captures the salient features in visual images and the IPSILON systems outperform other systems in terms of retrieval precision and distortion robustness.
Collapse
Affiliation(s)
- Soo Hyun Bae
- Center for Signal and Image Processing, Georgia Institute of Technology, Atlanta, GA 30332-0250, USA.
| | | |
Collapse
|
44
|
Yang L, Tuzel O, Chen W, Meer P, Salaru G, Goodell LA, Foran DJ. PathMiner: a Web-based tool for computer-assisted diagnostics in pathology. ACTA ACUST UNITED AC 2009; 13:291-9. [PMID: 19171530 DOI: 10.1109/titb.2008.2008801] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Large-scale, multisite collaboration has become indispensable for a wide range of research and clinical activities that rely on the capacity of individuals to dynamically acquire, share, and assess images and correlated data. In this paper, we report the development of a Web-based system, PathMiner , for interactive telemedicine, intelligent archiving, and automated decision support in pathology. The PathMiner system supports network-based submission of queries and can automatically locate and retrieve digitized pathology specimens along with correlated molecular studies of cases from "ground-truth" databases that exhibit spectral and spatial profiles consistent with a given query image. The statistically most probable diagnosis is provided to the individual who is seeking decision support. To test the system under real-case scenarios, a pipeline infrastructure was developed and a network-based test laboratory was established at strategic sites at the University of Medicine and Dentistry of New Jersey-Robert Wood Johnson Medical School, Robert Wood Johnson University Hospital, the University of Pennsylvania School of Medicine, Hospital of the University of Pennsylvania, The Cancer Institute of New Jersey, and Rutgers University. The average five-class classification accuracy of the system was 93.18% based on a tenfold cross validation on a close dataset containing 3691 imaged specimens. We also conducted prospective performance studies with the PathMiner system in real applications in which the specimens exhibited large variations in staining characters compared with the training data. The average five-class classification accuracy in this open-set experiment was 87.22%. We also provide the comparative results with the previous literature and the PathMiner system shows superior performance.
Collapse
Affiliation(s)
- Lin Yang
- Department of Electrical and Computer Engineering, Rutgers University, Piscataway, NJ 08854, USA.
| | | | | | | | | | | | | |
Collapse
|
45
|
Li S, Lee MC, Pun CM. Complex Zernike Moments Features for Shape-Based Image Retrieval. ACTA ACUST UNITED AC 2009. [DOI: 10.1109/tsmca.2008.2007988] [Citation(s) in RCA: 144] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
46
|
Lin C, Tsai C, Roan J. Personal photo browsing and retrieval by clustering techniques. ONLINE INFORMATION REVIEW 2008. [DOI: 10.1108/14684520810923926] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
47
|
Jing Y, Baluja S. VisualRank: applying PageRank to large-scale image search. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2008; 30:1877-1890. [PMID: 18787237 DOI: 10.1109/tpami.2008.121] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
Because of the relative ease in understanding and processing text, commercial image-search systems often rely on techniques that are largely indistinguishable from text-search. Recently, academic studies have demonstrated the effectiveness of employing image-based features to provide alternative or additional signals. However, it remains uncertain whether such techniques will generalize to a large number of popular web queries, and whether the potential improvement to search quality warrants the additional computational cost. In this work, we cast the image-ranking problem into the task of identifying "authority" nodes on an inferred visual similarity graph and propose VisualRank to analyze the visual link structures among images. The images found to be "authorities" are chosen as those that answer the image-queries well. To understand the performance of such an approach in a real system, we conducted a series of large-scale experiments based on the task of retrieving images for 2000 of the most popular products queries. Our experimental results show significant improvement, in terms of user satisfaction and relevancy, in comparison to the most recent Google Image Search results. Maintaining modest computational cost is vital to ensuring that this procedure can be used in practice; we describe the techniques required to make this system practical for large scale deployment in commercial search engines.
Collapse
Affiliation(s)
- Yushi Jing
- Georgia Institute of Technology, Atlanta, USA.
| | | |
Collapse
|
48
|
Alajlan N, Kamel MS, Freeman GH. Geometry-based image retrieval in binary image databases. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2008; 30:1003-1013. [PMID: 18421106 DOI: 10.1109/tpami.2008.37] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
In this paper, a geometry-based image retrieval system is developed for multi-object images. We model both shape and topology of image objects using a structured representation called curvature tree (CT). The hierarchy of the CT reflects the inclusion relationships between the image objects. To facilitate shape-based matching, triangle-area representation (TAR) of each object is stored at the corresponding node in the CT. The similarity between two multi-object images is measured based on the maximum similarity subtree isomorphism (MSSI) between their CTs. For this purpose, we adopt a recursive algorithm to solve the MSSI problem and a very effective dynamic programming algorithm to measure the similarity between the attributed nodes. Our matching scheme agrees with many recent findings in psychology about the human perception of multi-object images. Experiments on a database of 13500 real and synthesized medical images and the MPEG-7 CE-1 database of 1400 shape images have shown the effectiveness of the proposed method.
Collapse
Affiliation(s)
- Naif Alajlan
- Department of Electrical Engineering, Engineering College, King Saud University, Saudi Arabia.
| | | | | |
Collapse
|
49
|
Suganthan PN. Shape indexing using self-organizing maps. IEEE TRANSACTIONS ON NEURAL NETWORKS 2008; 13:835-40. [PMID: 18244479 DOI: 10.1109/tnn.2002.1021884] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
In this paper, we propose a novel approach to generate the topology-preserving mapping of structural shapes using self-organizing maps (SOMs). The structural information of the geometrical shapes is captured by relational attribute vectors. These vectors are quantised using an SOM. Using this SOM, a histogram is generated for every shape. These histograms are treated as inputs to train another SOM which yields a topology-preserving mapping of the geometric shapes. By appropriately choosing the relational vectors, it is possible to generate a mapping that is invariant to some chosen transformations, such as rotation, translation, scale, affine, or perspective transformations. Experimental results using trademark objects are presented to demonstrate the performance of the proposed methodology.
Collapse
Affiliation(s)
- P N Suganthan
- Sch. of Electr. and Electron. Eng., Nanyang Technol. Univ., Singapore
| |
Collapse
|
50
|
FaceTracer: A Search Engine for Large Collections of Images with Faces. LECTURE NOTES IN COMPUTER SCIENCE 2008. [DOI: 10.1007/978-3-540-88693-8_25] [Citation(s) in RCA: 148] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/02/2022]
|