1
|
Yan N, Gao C, Liu D, Li H, Li L, Wu F. SSSIC: Semantics-to-Signal Scalable Image Coding With Learned Structural Representations. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2021; 30:8939-8954. [PMID: 34699359 DOI: 10.1109/tip.2021.3121131] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
We address the requirement of image coding for joint human-machine vision, i.e., the decoded image serves both human observation and machine analysis/understanding. Previously, human vision and machine vision have been extensively studied by image (signal) compression and (image) feature compression, respectively. Recently, for joint human-machine vision, several studies have been devoted to joint compression of images and features, but the correlation between images and features is still unclear. We identify the deep network as a powerful toolkit for generating structural image representations. From the perspective of information theory, the deep features of an image naturally form an entropy decreasing series: a scalable bitstream is achieved by compressing the features backward from a deeper layer to a shallower layer until culminating with the image signal. Moreover, we can obtain learned representations by training the deep network for a given semantic analysis task or multiple tasks and acquire deep features that are related to semantics. With the learned structural representations, we propose SSSIC, a framework to obtain an embedded bitstream that can be either partially decoded for semantic analysis or fully decoded for human vision. We implement an exemplar SSSIC scheme using coarse-to-fine image classification as the driven semantic analysis task. We also extend the scheme for object detection and instance segmentation tasks. The experimental results demonstrate the effectiveness of the proposed SSSIC framework and establish that the exemplar scheme achieves higher compression efficiency than separate compression of images and features.
Collapse
|
2
|
Chakraborty C. Performance Analysis of Compression Techniques for Chronic Wound Image Transmission Under Smartphone-Enabled Tele-Wound Network. INTERNATIONAL JOURNAL OF E-HEALTH AND MEDICAL COMMUNICATIONS 2019. [DOI: 10.4018/ijehmc.2019040101] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
The healing status of chronic wounds is important for monitoring the condition of the wounds. This article designs and discusses the implementation of smartphone-enabled compression technique under a tele-wound network (TWN) system. Nowadays, there is a huge demand for memory and bandwidth savings for clinical data processing. Wound images are captured using a smartphone through a metadata application page. Then, they are compressed and sent to the telemedical hub with a set partitioning in hierarchical tree (SPIHT) compression algorithm. The transmitted image can then be reduced, followed by an improvement in the segmentation accuracy and sensitivity. Better wound healing treatment depends on segmentation and classification accuracy. The proposed framework is evaluated in terms of rates (bits per pixel), compression ratio, peak signal to noise ratio, transmission time, mean square error and diagnostic quality under telemedicine framework. A SPIHT compression technique assisted YDbDr-Fuzzy c-means clustering considerably reduces the execution time (105s), is simple to implement, saves memory (18 KB), improves segmentation accuracy (98.39%), and yields better results than the same without using SPIHT. The results favor the possibility of developing a practical smartphone-enabled telemedicine system and show the potential for being implemented in the field of clinical evaluation and the management of chronic wounds in the future.
Collapse
Affiliation(s)
- Chinmay Chakraborty
- Dept. of Electronics & Communication Engineering, Birla Institute of Technology, Jharkhand, India
| |
Collapse
|
3
|
|
4
|
Mammeri A, Hadjou B, Khoumsi A. A Survey of Image Compression Algorithms for Visual Sensor Networks. ACTA ACUST UNITED AC 2012. [DOI: 10.5402/2012/760320] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
With the advent of visual sensor networks (VSNs), energy-aware compression algorithms have gained wide attention. That is, new strategies and mechanisms for power-efficient image compression algorithms are developed, since the application of the conventional methods is not always energy beneficial. In this paper, we provide a survey of image compression algorithms for visual sensor networks, ranging from the conventional standards such as JPEG and JPEG2000 to a new compression method, for example, compressive sensing. We provide the advantages and shortcomings of the application of these algorithms in VSN, a literature review of their application in VSN, as well as an open research issue for each compression standard/method. Moreover, factors influencing the design of compression algorithms in the context of VSN are presented. We conclude by some guidelines which concern the design of a compression method for VSN.
Collapse
Affiliation(s)
- Abdelhamid Mammeri
- Department of Electrical and Computer Engineering, University of Sherbrooke, Sherbrooke, QC, Canada J1K 2R1
| | - Brahim Hadjou
- Department of Electrical and Computer Engineering, University of Sherbrooke, Sherbrooke, QC, Canada J1K 2R1
| | - Ahmed Khoumsi
- Department of Electrical and Computer Engineering, University of Sherbrooke, Sherbrooke, QC, Canada J1K 2R1
| |
Collapse
|
5
|
Video Compression Schemes Using Edge Feature on Wireless Video Sensor Networks. JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING 2012. [DOI: 10.1155/2012/421307] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
This paper puts forward a low-complexity video compression algorithm that uses the edges of objects in the frames to estimate and compensate for motion. Based on the proposed algorithm, two schemes that balance energy consumption among nodes in a cluster on a wireless video sensor network (WVSN) are proposed. In these schemes, we divide the compression process into several small processing components, which are then distributed to multiple nodes along a path from a source node to a cluster head in a cluster. We conduct extensive computational simulations to examine the truth of our method and find that the proposed schemes not only balance energy consumption of sensor nodes by sharing of the processing tasks but also improve the quality of decoding video by using edges of objects in the frames.
Collapse
|
6
|
Lalgudi HG, Marcellin MW, Bilgin A, Oh H, Nadar MS. View compensated compression of volume rendered images for remote visualization. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2009; 18:1501-1511. [PMID: 19447720 DOI: 10.1109/tip.2009.2017151] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
Remote visualization of volumetric images has gained importance over the past few years in medical and industrial applications. Volume visualization is a computationally intensive process, often requiring hardware acceleration to achieve a real time viewing experience. One remote visualization model that can accomplish this would transmit rendered images from a server, based on viewpoint requests from a client. For constrained server-client bandwidth, an efficient compression scheme is vital for transmitting high quality rendered images. In this paper, we present a new view compensation scheme that utilizes the geometric relationship between viewpoints to exploit the correlation between successive rendered images. The proposed method obviates motion estimation between rendered images, enabling significant reduction to the complexity of a compressor. Additionally, the view compensation scheme, in conjunction with JPEG2000 performs better than AVC, the state of the art video compression standard.
Collapse
Affiliation(s)
- Hariharan G Lalgudi
- Department of Computer and Electrical Engineering, University of Arizona, tucson, AZ 85721, USA.
| | | | | | | | | |
Collapse
|
7
|
Very Low-Memory Wavelet Compression Architecture Using Strip-Based Processing for Implementation in Wireless Sensor Networks. ACTA ACUST UNITED AC 2009. [DOI: 10.1155/2009/479281] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|
8
|
Singh R, Ortega A. Reduced-complexity delayed-decision algorithm for context-based image processing systems. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2007; 16:1937-45. [PMID: 17688199 DOI: 10.1109/tip.2007.901246] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]
Abstract
It is well known that the performance of context-based image processing systems can be improved by allowing the processor (e.g., an encoder or a denoiser) a delay of several samples before making a processing decision. Often, however, for such systems, traditional delayed-decision algorithms can become computationally prohibitive due to the growth in the size of the space of possible solutions. In this paper, we propose a reduced-complexity, one-pass, delayed-decision algorithm that systematically reduces the size of the search space, while also preserving its structure. In particular, we apply the proposed algorithm to two examples of adaptive context-based image processing systems, an image coding system that employs a context-based entropy coder, and a spatially adaptive image-denoising system. For these two types of widely used systems, we show that the proposed delayed-decision search algorithm outperforms instantaneous-decision algorithms with only a small increase in complexity. We also show that the performance of the proposed algorithm is better than that of other, higher complexity, delayed-decision algorithms.
Collapse
|
9
|
Xiong C, Tian J, Liu J. Efficient architectures for two-dimensional discrete wavelet transform using lifting scheme. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2007; 16:607-14. [PMID: 17357722 DOI: 10.1109/tip.2007.891069] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/14/2023]
Abstract
Novel architectures for 1-D and 2-D discrete wavelet transform (DWT) by using lifting schemes are presented in this paper. An embedded decimation technique is exploited to optimize the architecture for 1-D DWT, which is designed to receive an input and generate an output with the low- and high-frequency components of original data being available alternately. Based on this 1-D DWT architecture, an efficient line-based architecture for 2-D DWT is further proposed by employing parallel and pipeline techniques, which is mainly composed of two horizontal filter modules and one vertical filter module, working in parallel and pipeline fashion with 100% hardware utilization. This 2-D architecture is called fast architecture (FA) that can perform J levels of decomposition for N * N image in approximately 2N2(1 - 4(-J))/3 internal clock cycles. Moreover, another efficient generic line-based 2-D architecture is proposed by exploiting the parallelism among four subband transforms in lifting-based 2-D DWT, which can perform J levels of decomposition for N * N image in approximately N2(1 - 4(-J))/3 internal clock cycles; hence, it is called high-speed architecture. The throughput rate of the latter is increased by two times when comparing with the former 2-D architecture, but only less additional hardware cost is added. Compared with the works reported in previous literature, the proposed architectures for 2-D DWT are efficient alternatives in tradeoff among hardware cost, throughput rate, output latency and control complexity, etc.
Collapse
Affiliation(s)
- Chengyi Xiong
- College of Electronic Information Engineering, South-Center University for Nationalities, Wuhan 430074, China.
| | | | | |
Collapse
|
10
|
Acharya T, Chakrabarti C. A Survey on Lifting-based Discrete Wavelet Transform Architectures. ACTA ACUST UNITED AC 2006. [DOI: 10.1007/s11266-006-4191-3] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
11
|
Bernabé G, García JM, González J. Reducing 3D Fast Wavelet Transform Execution Time Using Blocking and the Streaming SIMD Extensions. ACTA ACUST UNITED AC 2005. [DOI: 10.1007/s11265-005-6651-6] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
12
|
Oliver J, Malumbres MP. A Fast Run-Length Algorithm for Wavelet Image Coding with Reduced Memory Usage. PATTERN RECOGNITION AND IMAGE ANALYSIS 2005. [DOI: 10.1007/11492429_53] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
13
|
Skodras A, Christopoulos C, Ebrahimi T. JPEG2000: The upcoming still image compression standard. Pattern Recognit Lett 2001. [DOI: 10.1016/s0167-8655(01)00079-4] [Citation(s) in RCA: 63] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|