1
|
Wei C, Li Z, Hu T, Zhao M, Sun Z, Jia K, Feng J, Pogue BW, Paulsen KD, Jiang S. Model-Based Convolution Neural Network for 3D Near-Infrared Spectral Tomography. IEEE TRANSACTIONS ON MEDICAL IMAGING 2025; 44:2330-2340. [PMID: 40031020 DOI: 10.1109/tmi.2025.3529621] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/05/2025]
Abstract
Near-infrared spectral tomography (NIRST) is a non-invasive imaging technique that provides functional information about biological tissues. Due to diffuse light propagation in tissue and limited boundary measurements, NIRST image reconstruction presents an ill-posed and ill-conditioned computational problem that is difficult to solve. To address this challenge, we developed a reconstruction algorithm (Model-CNN) that integrates a diffusion equation model with a convolutional neural network (CNN). The CNN learns a regularization prior to restrict solutions to the space of desirable chromophore concentration images. Efficacy of Model-CNN was evaluated by training on numerical simulation data, and then applying the network to physical phantom and clinical patient NIRST data. Results demonstrated the superiority of Model-CNN over the conventional Tikhonov regularization approach and a deep learning algorithm (FC-CNN) in terms of absolute bias error (ABE) and peak signal-to-noise ratio (PSNR). Specifically, in comparison to Tikhonov regularization, Model-CNN reduced average ABE by 55% for total hemoglobin (HbT) and 70% water (H $_{\mathbf {{2}}}$ O) concentration, while improved PSNR by an average of 5.3 dB both for HbT and H $_{\mathbf {{2}}}$ O images. Meanwhile, image processing time was reduced by 82%, relative to the Tikhonov regularization. As compared to FC-CNN, the Model-CNN achieved a 91% reduction in ABE for HbT and 75% for H $_{\mathbf {{2}}}$ O images, with increases in PSNR by 7.3 dB and 4.7 dB, respectively. Notably, this Model-CNN approach was not trained on patient data; but instead, was trained on simulated phantom data with simpler geometrical shapes and optical source-detector configurations; yet, achieved superior image recovery when faced with real-world data.
Collapse
|
2
|
Yue Z, Yong H, Zhao Q, Zhang L, Meng D, Wong KYK. Deep Variational Network Toward Blind Image Restoration. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2024; 46:7011-7026. [PMID: 38349822 DOI: 10.1109/tpami.2024.3365745] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/15/2024]
Abstract
Blind image restoration (IR) is a common yet challenging problem in computer vision. Classical model-based methods and recent deep learning (DL)-based methods represent two different methodologies for this problem, each with their own merits and drawbacks. In this paper, we propose a novel blind image restoration method, aiming to integrate both the advantages of them. Specifically, we construct a general Bayesian generative model for the blind IR, which explicitly depicts the degradation process. In this proposed model, a pixel-wise non-i.i.d. Gaussian distribution is employed to fit the image noise. It is with more flexibility than the simple i.i.d. Gaussian or Laplacian distributions as adopted in most of conventional methods, so as to handle more complicated noise types contained in the image degradation. To solve the model, we design a variational inference algorithm where all the expected posteriori distributions are parameterized as deep neural networks to increase their model capability. Notably, such an inference algorithm induces a unified framework to jointly deal with the tasks of degradation estimation and image restoration. Further, the degradation information estimated in the former task is utilized to guide the latter IR process. Experiments on two typical blind IR tasks, namely image denoising and super-resolution, demonstrate that the proposed method achieves superior performance over current state-of-the-arts.
Collapse
|
3
|
Ning W, Sun D, Gao Q, Lu Y, Zhu D. Natural image restoration based on multi-scale group sparsity residual constraints. Front Neurosci 2023; 17:1293161. [PMID: 38027495 PMCID: PMC10657837 DOI: 10.3389/fnins.2023.1293161] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2023] [Accepted: 10/09/2023] [Indexed: 12/01/2023] Open
Abstract
The Group Sparse Representation (GSR) model shows excellent potential in various image restoration tasks. In this study, we propose a novel Multi-Scale Group Sparse Residual Constraint Model (MS-GSRC) which can be applied to various inverse problems, including denoising, inpainting, and compressed sensing (CS). Our new method involves the following three steps: (1) finding similar patches with an overlapping scheme for the input degraded image using a multi-scale strategy, (2) performing a group sparse coding on these patches with low-rank constraints to get an initial representation vector, and (3) under the Bayesian maximum a posteriori (MAP) restoration framework, we adopt an alternating minimization scheme to solve the corresponding equation and reconstruct the target image finally. Simulation experiments demonstrate that our proposed model outperforms in terms of both objective image quality and subjective visual quality compared to several state-of-the-art methods.
Collapse
Affiliation(s)
| | - Dong Sun
- Anhui Engineering Laboratory of Human-Robot Integration System and Intelligent Equipment, Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, School of Electrical Engineering and Automation, Anhui University, Hefei, China
| | | | | | | |
Collapse
|
4
|
Zha Z, Wen B, Yuan X, Zhou J, Zhu C, Kot AC. Low-Rankness Guided Group Sparse Representation for Image Restoration. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2023; 34:7593-7607. [PMID: 35130172 DOI: 10.1109/tnnls.2022.3144630] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
As a spotlighted nonlocal image representation model, group sparse representation (GSR) has demonstrated a great potential in diverse image restoration tasks. Most of the existing GSR-based image restoration approaches exploit the nonlocal self-similarity (NSS) prior by clustering similar patches into groups and imposing sparsity to each group coefficient, which can effectively preserve image texture information. However, these methods have imposed only plain sparsity over each individual patch of the group, while neglecting other beneficial image properties, e.g., low-rankness (LR), leads to degraded image restoration results. In this article, we propose a novel low-rankness guided group sparse representation (LGSR) model for highly effective image restoration applications. The proposed LGSR jointly utilizes the sparsity and LR priors of each group of similar patches under a unified framework. The two priors serve as the complementary priors in LGSR for effectively preserving the texture and structure information of natural images. Moreover, we apply an alternating minimization algorithm with an adaptively adjusted parameter scheme to solve the proposed LGSR-based image restoration problem. Extensive experiments are conducted to demonstrate that the proposed LGSR achieves superior results compared with many popular or state-of-the-art algorithms in various image restoration tasks, including denoising, inpainting, and compressive sensing (CS).
Collapse
|
5
|
Diao H, Zhang Y, Liu W, Ruan X, Lu H. Plug-and-Play Regulators for Image-Text Matching. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2023; 32:2322-2334. [PMID: 37071519 DOI: 10.1109/tip.2023.3266887] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Exploiting fine-grained correspondence and visual-semantic alignments has shown great potential in image-text matching. Generally, recent approaches first employ a cross-modal attention unit to capture latent region-word interactions, and then integrate all the alignments to obtain the final similarity. However, most of them adopt one-time forward association or aggregation strategies with complex architectures or additional information, while ignoring the regulation ability of network feedback. In this paper, we develop two simple but quite effective regulators which efficiently encode the message output to automatically contextualize and aggregate cross-modal representations. Specifically, we propose (i) a Recurrent Correspondence Regulator (RCR) which facilitates the cross-modal attention unit progressively with adaptive attention factors to capture more flexible correspondence, and (ii) a Recurrent Aggregation Regulator (RAR) which adjusts the aggregation weights repeatedly to increasingly emphasize important alignments and dilute unimportant ones. Besides, it is interesting that RCR and RAR are "plug-and-play": both of them can be incorporated into many frameworks based on cross-modal interaction to obtain significant benefits, and their cooperation achieves further improvements. Extensive experiments on MSCOCO and Flickr30K datasets validate that they can bring an impressive and consistent R@1 gain on multiple models, confirming the general effectiveness and generalization ability of the proposed methods.
Collapse
|
6
|
Zhong X, Liang N, Cai A, Yu X, Li L, Yan B. Super-resolution image reconstruction from sparsity regularization and deep residual-learned priors. JOURNAL OF X-RAY SCIENCE AND TECHNOLOGY 2023; 31:319-336. [PMID: 36683486 DOI: 10.3233/xst-221299] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
BACKGROUND Computed tomography (CT) plays an important role in the field of non-destructive testing. However, conventional CT images often have blurred edge and unclear texture, which is not conducive to the follow-up medical diagnosis and industrial testing work. OBJECTIVE This study aims to generate high-resolution CT images using a new CT super-resolution reconstruction method combining with the sparsity regularization and deep learning prior. METHODS The new method reconstructs CT images through a reconstruction model incorporating image gradient L0-norm minimization and deep image priors using a plug-and-play super-resolution framework. The deep learning priors are learned from a deep residual network and then plugged into the proposed new framework, and alternating direction method of multipliers is utilized to optimize the iterative solution of the model. RESULTS The simulation data analysis results show that the new method improves the signal-to-noise ratio (PSNR) by 7% and the modulation transfer function (MTF) curves show that the value of MTF50 increases by 0.02 factors compared with the result of deep plug-and-play super-resolution. Additionally, the real CT image data analysis results show that the new method improves the PSNR by 5.1% and MTF50 by 0.11 factors. CONCLUSION Both simulation and real data experiments prove that the proposed new CT super-resolution method using deep learning priors can reconstruct CT images with lower noise and better detail recovery. This method is flexible, effective and extensive for low-resolution CT image super-resolution.
Collapse
Affiliation(s)
- Xinyi Zhong
- Henan Key Laboratory of Imaging and Intelligent Processing, PLA Strategy Support Force Information Engineering University, Zhengzhou, Henan, China
| | - Ningning Liang
- Henan Key Laboratory of Imaging and Intelligent Processing, PLA Strategy Support Force Information Engineering University, Zhengzhou, Henan, China
| | - Ailong Cai
- Henan Key Laboratory of Imaging and Intelligent Processing, PLA Strategy Support Force Information Engineering University, Zhengzhou, Henan, China
| | - Xiaohuan Yu
- Henan Key Laboratory of Imaging and Intelligent Processing, PLA Strategy Support Force Information Engineering University, Zhengzhou, Henan, China
| | - Lei Li
- Henan Key Laboratory of Imaging and Intelligent Processing, PLA Strategy Support Force Information Engineering University, Zhengzhou, Henan, China
| | - Bin Yan
- Henan Key Laboratory of Imaging and Intelligent Processing, PLA Strategy Support Force Information Engineering University, Zhengzhou, Henan, China
| |
Collapse
|
7
|
Lv T, Pan Z, Wei W, Yang G, Song J, Wang X, Sun L, Li Q, Sun X. Iterative deep neural networks based on proximal gradient descent for image restoration. PLoS One 2022; 17:e0276373. [PMID: 36331931 PMCID: PMC9635693 DOI: 10.1371/journal.pone.0276373] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Accepted: 10/06/2022] [Indexed: 11/06/2022] Open
Abstract
The algorithm unfolding networks with explainability of algorithms and higher efficiency of Deep Neural Networks (DNN) have received considerable attention in solving ill-posed inverse problems. Under the algorithm unfolding network framework, we propose a novel end-to-end iterative deep neural network and its fast network for image restoration. The first one is designed making use of proximal gradient descent algorithm of variational models, which consists of denoiser and reconstruction sub-networks. The second one is its accelerated version with momentum factors. For sub-network of denoiser, we embed the Convolutional Block Attention Module (CBAM) in previous U-Net for adaptive feature refinement. Experiments on image denoising and deblurring demonstrate that competitive performances in quality and efficiency are gained by compared with several state-of-the-art networks for image restoration. Proposed unfolding DNN can be easily extended to solve other similar image restoration tasks, such as image super-resolution, image demosaicking, etc.
Collapse
Affiliation(s)
- Ting Lv
- College of Computer Science and Technology, Qingdao University, Qingdao, Shandong Province, China
| | - Zhenkuan Pan
- College of Computer Science and Technology, Qingdao University, Qingdao, Shandong Province, China
- * E-mail: (ZP); (WW)
| | - Weibo Wei
- College of Computer Science and Technology, Qingdao University, Qingdao, Shandong Province, China
- * E-mail: (ZP); (WW)
| | - Guangyu Yang
- College of Computer Science and Technology, Qingdao University, Qingdao, Shandong Province, China
| | - Jintao Song
- College of Computer Science and Technology, Qingdao University, Qingdao, Shandong Province, China
| | - Xuqing Wang
- College of Computer Science and Technology, Qingdao University, Qingdao, Shandong Province, China
| | - Lu Sun
- College of Computer Science and Technology, Qingdao University, Qingdao, Shandong Province, China
| | - Qian Li
- College of Computer Science and Technology, Qingdao University, Qingdao, Shandong Province, China
| | - Xiatao Sun
- School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| |
Collapse
|
8
|
Nair P, Chaudhury KN. Plug-and-Play Regularization Using Linear Solvers. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2022; 31:6344-6355. [PMID: 36215363 DOI: 10.1109/tip.2022.3211473] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
There has been tremendous research on the design of image regularizers over the years, from simple Tikhonov and Laplacian to sophisticated sparsity and CNN-based regularizers. Coupled with a model-based loss function, these are typically used for image reconstruction within an optimization framework. The technical challenge is to develop a regularizer that can accurately model realistic images and be optimized efficiently along with the loss function. Motivated by the recent plug-and-play paradigm for image regularization, we construct a quadratic regularizer whose reconstruction capability is competitive with state-of-the-art regularizers. The novelty of the regularizer is that, unlike classical regularizers, the quadratic objective function is derived from the observed data. Since the regularizer is quadratic, we can reduce the optimization to solving a linear system for applications such as superresolution, deblurring, inpainting, etc. In particular, we show that using iterative Krylov solvers, we can converge to the solution in few iterations, where each iteration requires an application of the forward operator and a linear denoiser. The surprising finding is that we can get close to deep learning methods in terms of reconstruction quality. To the best of our knowledge, the possibility of achieving near state-of-the-art performance using a linear solver is novel.
Collapse
|
9
|
Zhang K, Li Y, Zuo W, Zhang L, Van Gool L, Timofte R. Plug-and-Play Image Restoration With Deep Denoiser Prior. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2022; 44:6360-6376. [PMID: 34125670 DOI: 10.1109/tpami.2021.3088914] [Citation(s) in RCA: 84] [Impact Index Per Article: 28.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Recent works on plug-and-play image restoration have shown that a denoiser can implicitly serve as the image prior for model-based methods to solve many inverse problems. Such a property induces considerable advantages for plug-and-play image restoration (e.g., integrating the flexibility of model-based method and effectiveness of learning-based methods) when the denoiser is discriminatively learned via deep convolutional neural network (CNN) with large modeling capacity. However, while deeper and larger CNN models are rapidly gaining popularity, existing plug-and-play image restoration hinders its performance due to the lack of suitable denoiser prior. In order to push the limits of plug-and-play image restoration, we set up a benchmark deep denoiser prior by training a highly flexible and effective CNN denoiser. We then plug the deep denoiser prior as a modular part into a half quadratic splitting based iterative algorithm to solve various image restoration problems. We, meanwhile, provide a thorough analysis of parameter setting, intermediate results and empirical convergence to better understand the working mechanism. Experimental results on three representative image restoration tasks, including deblurring, super-resolution and demosaicing, demonstrate that the proposed plug-and-play image restoration with deep denoiser prior not only significantly outperforms other state-of-the-art model-based methods but also achieves competitive or even superior performance against state-of-the-art learning-based methods. The source code is available at https://github.com/cszn/DPIR.
Collapse
|
10
|
Zha Z, Wen B, Yuan X, Zhou J, Zhu C, Kot AC. A Hybrid Structural Sparsification Error Model for Image Restoration. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2022; 33:4451-4465. [PMID: 33625989 DOI: 10.1109/tnnls.2021.3057439] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Recent works on structural sparse representation (SSR), which exploit image nonlocal self-similarity (NSS) prior by grouping similar patches for processing, have demonstrated promising performance in various image restoration applications. However, conventional SSR-based image restoration methods directly fit the dictionaries or transforms to the internal (corrupted) image data. The trained internal models inevitably suffer from overfitting to data corruption, thus generating the degraded restoration results. In this article, we propose a novel hybrid structural sparsification error (HSSE) model for image restoration, which jointly exploits image NSS prior using both the internal and external image data that provide complementary information. Furthermore, we propose a general image restoration scheme based on the HSSE model, and an alternating minimization algorithm for a range of image restoration applications, including image inpainting, image compressive sensing and image deblocking. Extensive experiments are conducted to demonstrate that the proposed HSSE-based scheme outperforms many popular or state-of-the-art image restoration methods in terms of both objective metrics and visual perception.
Collapse
|
11
|
Wang M, Zheng S, Shi Y, Lou Y. Hybrid method for improving Tikhonov-based reconstruction quality in electrical impedance tomography. J Med Imaging (Bellingham) 2022; 9:054503. [PMID: 36267548 PMCID: PMC9574320 DOI: 10.1117/1.jmi.9.5.054503] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2022] [Accepted: 09/23/2022] [Indexed: 11/14/2022] Open
Abstract
Purpose Electrical impedance tomography (EIT) has shown its potential in the field of medical imaging. Physiological or pathological variation would cause the change of conductivity. EIT is favorable in reconstructing conductivity distribution inside the detected area. However, due to its ill-posed and nonlinear characteristics, reconstructed images suffer from low spatial resolution. Approach Tikhonov regularization method is a popular and effective approach for image reconstruction in EIT. Nevertheless, excessive smoothness is observed when reconstruction is conducted based on Tikhonov method. To improve Tikhonov-based reconstruction quality in EIT, an innovative hybrid iterative optimization method is proposed. An efficient alternating minimization algorithm is introduced to solve the optimization problem. Results To verify image reconstruction performance and anti-noise robustness of the proposed method, a series of simulation work and phantom experiments is carried out. Meanwhile, comparison is made with reconstruction results based on Landweber, Newton-Raphson, and Tikhonov methods. The reconstruction performance is also verified by quantitative comparison of blur radius and structural similarity values which further demonstrates the excellent performance of the proposed method. Conclusions In contrast to Landweber, Newton-Raphson, and Tikhonov methods, it is found that images reconstructed by the proposed method are more accurate. Even under the impact of noise, the proposed method outperforms comparison methods.
Collapse
Affiliation(s)
- Meng Wang
- Henan Normal University, College of Electronic and Electrical Engineering, Henan Key Laboratory of Optoelectronic Sensing Integrated Application, Xinxiang, China
| | - Shuo Zheng
- Henan Normal University, College of Electronic and Electrical Engineering, Henan Key Laboratory of Optoelectronic Sensing Integrated Application, Xinxiang, China
| | - Yanyan Shi
- Henan Normal University, College of Electronic and Electrical Engineering, Henan Key Laboratory of Optoelectronic Sensing Integrated Application, Xinxiang, China
- Fourth Military Medical University, School of Biomedical Engineering, Xian, China
| | - Yajun Lou
- Henan Normal University, College of Electronic and Electrical Engineering, Henan Key Laboratory of Optoelectronic Sensing Integrated Application, Xinxiang, China
| |
Collapse
|
12
|
Multi-Color Channels Based Group Sparse Model for Image Restoration. ALGORITHMS 2022. [DOI: 10.3390/a15060176] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
The group sparse representation (GSR) model combines local sparsity and nonlocal similarity in image processing, and achieves excellent results. However, the traditional GSR model and all subsequent improved GSR models convert the RGB space of the image to YCbCr space, and only extract the Y (luminance) channel of YCbCr space to change the color image to a gray image for processing. As a result, the image processing process cannot be loyal to each color channel, so the repair effect is not ideal. A new group sparse representation model based on multi-color channels is proposed in this paper. The model processes R, G and B color channels simultaneously when processing color images rather than processing a single color channel and then combining the results of different channels. The proposed multi-color-channels-based GSR model is compared with state-of-the-art methods. The experimental contrast results show that the proposed model is an effective method and can obtain good results in terms of objective quantitative metrics and subjective visual effects.
Collapse
|
13
|
Zhang M, Young GS, Tie Y, Gu X, Xu X. A New Framework of Designing Iterative Techniques for Image Deblurring. PATTERN RECOGNITION 2022; 124:108463. [PMID: 34949896 PMCID: PMC8691531 DOI: 10.1016/j.patcog.2021.108463] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
In this work we present a framework of designing iterative techniques for image deblurring in inverse problem. The new framework is based on two observations about existing methods. We used Landweber method as the basis to develop and present the new framework but note that the framework is applicable to other iterative techniques. First, we observed that the iterative steps of Landweber method consist of a constant term, which is a low-pass filtered version of the already blurry observation. We proposed a modification to use the observed image directly. Second, we observed that Landweber method uses an estimate of the true image as the starting point. This estimate, however, does not get updated over iterations. We proposed a modification that updates this estimate as the iterative process progresses. We integrated the two modifications into one framework of iteratively deblurring images. Finally, we tested the new method and compared its performance with several existing techniques, including Landweber method, Van Cittert method, GMRES (generalized minimal residual method), and LSQR (least square), to demonstrate its superior performance in image deblurring.
Collapse
Affiliation(s)
- Min Zhang
- Department of Radiology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - Geoffrey S Young
- Department of Radiology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - Yanmei Tie
- Department of Neurosurgery, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - Xianfeng Gu
- Department of Computer Science, Stony Brook University, Stony Brook, NY
| | - Xiaoyin Xu
- Department of Radiology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| |
Collapse
|
14
|
Efficient Iterative Regularization Method for Total Variation-Based Image Restoration. ELECTRONICS 2022. [DOI: 10.3390/electronics11020258] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Total variation (TV) regularization has received much attention in image restoration applications because of its advantages in denoising and preserving details. A common approach to address TV-based image restoration is to design a specific algorithm for solving typical cost function, which consists of conventional ℓ2 fidelity term and TV regularization. In this work, a novel objective function and an efficient algorithm are proposed. Firstly, a pseudoinverse transform-based fidelity term is imposed on TV regularization, and a closely-related optimization problem is established. Then, the split Bregman framework is used to decouple the complex inverse problem into subproblems to reduce computational complexity. Finally, numerical experiments show that the proposed method can obtain satisfactory restoration results with fewer iterations. Combined with the restoration effect and efficiency, this method is superior to the competitive algorithm. Significantly, the proposed method has the advantage of a simple solving structure, which can be easily extended to other image processing applications.
Collapse
|
15
|
Zhou C, Kong Y, Zhang C, Sun L, Wu D, Zhou C. A Hybrid Sparse Representation Model for Image Restoration. SENSORS (BASEL, SWITZERLAND) 2022; 22:537. [PMID: 35062497 PMCID: PMC8778763 DOI: 10.3390/s22020537] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/26/2021] [Revised: 01/03/2022] [Accepted: 01/05/2022] [Indexed: 06/14/2023]
Abstract
Group-based sparse representation (GSR) uses image nonlocal self-similarity (NSS) prior to grouping similar image patches, and then performs sparse representation. However, the traditional GSR model restores the image by training degraded images, which leads to the inevitable over-fitting of the data in the training model, resulting in poor image restoration results. In this paper, we propose a new hybrid sparse representation model (HSR) for image restoration. The proposed HSR model is improved in two aspects. On the one hand, the proposed HSR model exploits the NSS priors of both degraded images and external image datasets, making the model complementary in feature space and the plane. On the other hand, we introduce a joint sparse representation model to make better use of local sparsity and NSS characteristics of the images. This joint model integrates the patch-based sparse representation (PSR) model and GSR model, while retaining the advantages of the GSR model and the PSR model, so that the sparse representation model is unified. Extensive experimental results show that the proposed hybrid model outperforms several existing image recovery algorithms in both objective and subjective evaluations.
Collapse
Affiliation(s)
- Caiyue Zhou
- School of Cyber Science and Engineering, Qufu Normal University, Qufu 273165, China; (C.Z.); (L.S.); (D.W.)
| | - Yanfen Kong
- Department of Information Engineering, Weihai Ocean Vocational College, Rongcheng 264300, China; (Y.K.); (C.Z.)
| | - Chuanyong Zhang
- Department of Information Engineering, Weihai Ocean Vocational College, Rongcheng 264300, China; (Y.K.); (C.Z.)
| | - Lin Sun
- School of Cyber Science and Engineering, Qufu Normal University, Qufu 273165, China; (C.Z.); (L.S.); (D.W.)
| | - Dongmei Wu
- School of Cyber Science and Engineering, Qufu Normal University, Qufu 273165, China; (C.Z.); (L.S.); (D.W.)
| | - Chongbo Zhou
- School of Cyber Science and Engineering, Qufu Normal University, Qufu 273165, China; (C.Z.); (L.S.); (D.W.)
- Department of Information Engineering, Weihai Ocean Vocational College, Rongcheng 264300, China; (Y.K.); (C.Z.)
| |
Collapse
|
16
|
Kong S, Wang W, Feng X, Jia X. Deep RED Unfolding Network for Image Restoration. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2022; 31:852-867. [PMID: 34951845 DOI: 10.1109/tip.2021.3136623] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
The deep unfolding network (DUN) provides an efficient framework for image restoration. It consists of a regularization module and a data fitting module. In existing DUN models, it is common to directly use a deep convolution neural network (DCNN) as the regularization module, and perform data fitting before regularization in each iteration/stage. In this work, we present a DUN by incorporating a new regularization module, and putting the regularization module before the data fitting module. The proposed regularization model is deducted by using the regularization by denoing (RED) and plugging in it a newly designed DCNN. For the data fitting module, we use the closed-form solution with Faster Fourier Transform (FFT). The resulted DRED-DUN model has some major advantages. First, the regularization model inherits the flexibility of learned image-adaptive and interpretability of RED. Second, the DRED-DUN model is an end-to-end trainable DUN, which learns the regularization network and other parameters jointly, thus leads to better restoration performance than the plug-and-play framework. Third, extensive experiments show that, our proposed model significantly outperforms the-state-of-the-art model-based methods and learning based methods in terms of PSNR indexes as well as the visual effects. In particular, our method has much better capability in recovering salient image components such as edges and small scale textures.
Collapse
|
17
|
Fu Y, Zhang Y. Reinforcement Learning Based Plug-and-Play Method for Hyperspectral Image Reconstruction. ARTIF INTELL 2022. [DOI: 10.1007/978-3-031-20497-5_38] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
|
18
|
Iterative Reconstruction for Low-Dose CT using Deep Gradient Priors of Generative Model. IEEE TRANSACTIONS ON RADIATION AND PLASMA MEDICAL SCIENCES 2022. [DOI: 10.1109/trpms.2022.3148373] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
19
|
Liu R, Mu P, Zhang J. Investigating Customization Strategies and Convergence Behaviors of Task-Specific ADMM. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2021; 30:8278-8292. [PMID: 34559653 DOI: 10.1109/tip.2021.3113796] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Alternating Direction Method of Multiplier (ADMM) has been a popular algorithmic framework for separable optimization problems with linear constraints. For numerical ADMM fail to exploit the particular structure of the problem at hand nor the input data information, leveraging task-specific modules (e.g., neural networks and other data-driven architectures) to extend ADMM is a significant but challenging task. This work focuses on designing a flexible algorithmic framework to incorporate various task-specific modules (with no additional constraints) to improve the performance of ADMM in real-world applications. Specifically, we propose Guidance from Optimality (GO), a new customization strategy, to embed task-specific modules into ADMM (GO-ADMM). By introducing an optimality-based criterion to guide the propagation, GO-ADMM establishes an updating scheme agnostic to the choice of additional modules. The existing task-specific methods just plug their task-specific modules into the numerical iterations in a straightforward manner. Even with some restrictive constraints on the plug-in modules, they can only obtain some relatively weaker convergence properties for the resulted ADMM iterations. Fortunately, without any restrictions on the embedded modules, we prove the convergence of GO-ADMM regarding objective values and constraint violations, and derive the worst-case convergence rate measured by iteration complexity. Extensive experiments are conducted to verify the theoretical results and demonstrate the efficiency of GO-ADMM.
Collapse
|
20
|
Zhou H, Feng H, Xu W, Xu Z, Li Q, Chen Y. Deep denoiser prior based deep analytic network for lensless image restoration. OPTICS EXPRESS 2021; 29:27237-27253. [PMID: 34615144 DOI: 10.1364/oe.432544] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/28/2021] [Accepted: 07/29/2021] [Indexed: 06/13/2023]
Abstract
Mask based lensless imagers have huge application prospects due to their ultra-thin body. However, the visual perception of the restored images is poor due to the ill conditioned nature of the system. In this work, we proposed a deep analytic network by imitating the traditional optimization process as an end-to-end network. Our network combines analytic updates with a deep denoiser prior to progressively improve lensless image quality over a few iterations. The convergence is proven mathematically and verified in the results. In addition, our method is universal in non-blind restoration. We detailed the solution for the general inverse problem and conducted five groups of deblurring experiments as examples. Both experimental results demonstrate that our method achieves superior performance against the existing state-of-the-art methods.
Collapse
|
21
|
He L, Wang Y, Liu J, Wang C, Gao S. Single image restoration through ℓ2-relaxed truncated ℓ0 analysis-based sparse optimization in tight frames. Neurocomputing 2021. [DOI: 10.1016/j.neucom.2021.02.053] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
22
|
Zha Z, Wen B, Yuan X, Zhou JT, Zhou J, Zhu C. Triply Complementary Priors for Image Restoration. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2021; 30:5819-5834. [PMID: 34133279 DOI: 10.1109/tip.2021.3086049] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Recent works that utilized deep models have achieved superior results in various image restoration (IR) applications. Such approach is typically supervised, which requires a corpus of training images with distributions similar to the images to be recovered. On the other hand, the shallow methods, which are usually unsupervised remain promising performance in many inverse problems, e.g., image deblurring and image compressive sensing (CS), as they can effectively leverage nonlocal self-similarity priors of natural images. However, most of such methods are patch-based leading to the restored images with various artifacts due to naive patch aggregation in addition to the slow speed. Using either approach alone usually limits performance and generalizability in IR tasks. In this paper, we propose a joint low-rank and deep (LRD) image model, which contains a pair of triply complementary priors, namely, internal and external, shallow and deep, and non-local and local priors. We then propose a novel hybrid plug-and-play (H-PnP) framework based on the LRD model for IR. Following this, a simple yet effective algorithm is developed to solve the proposed H-PnP based IR problems. Extensive experimental results on several representative IR tasks, including image deblurring, image CS and image deblocking, demonstrate that the proposed H-PnP algorithm achieves favorable performance compared to many popular or state-of-the-art IR methods in terms of both objective and visual perception.
Collapse
|
23
|
Zha Z, Wen B, Yuan X, Zhou J, Zhu C. Image Restoration via Reconciliation of Group Sparsity and Low-Rank Models. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2021; 30:5223-5238. [PMID: 34010133 DOI: 10.1109/tip.2021.3078329] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Image nonlocal self-similarity (NSS) property has been widely exploited via various sparsity models such as joint sparsity (JS) and group sparse coding (GSC). However, the existing NSS-based sparsity models are either too restrictive, e.g., JS enforces the sparse codes to share the same support, or too general, e.g., GSC imposes only plain sparsity on the group coefficients, which limit their effectiveness for modeling real images. In this paper, we propose a novel NSS-based sparsity model, namely, low-rank regularized group sparse coding (LR-GSC), to bridge the gap between the popular GSC and JS. The proposed LR-GSC model simultaneously exploits the sparsity and low-rankness of the dictionary-domain coefficients for each group of similar patches. An alternating minimization with an adaptive adjusted parameter strategy is developed to solve the proposed optimization problem for different image restoration tasks, including image denoising, image deblocking, image inpainting, and image compressive sensing. Extensive experimental results demonstrate that the proposed LR-GSC algorithm outperforms many popular or state-of-the-art methods in terms of objective and perceptual metrics.
Collapse
|
24
|
Gavaskar RG, Athalye CD, Chaudhury KN. On Plug-and-Play Regularization Using Linear Denoisers. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2021; 30:4802-4813. [PMID: 33909564 DOI: 10.1109/tip.2021.3075092] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
In plug-and-play (PnP) regularization, the knowledge of the forward model is combined with a powerful denoiser to obtain state-of-the-art image reconstructions. This is typically done by taking a proximal algorithm such as FISTA or ADMM, and formally replacing the proximal map associated with a regularizer by nonlocal means, BM3D or a CNN denoiser. Each iterate of the resulting PnP algorithm involves some kind of inversion of the forward model followed by denoiser-induced regularization. A natural question in this regard is that of optimality, namely, do the PnP iterations minimize some f+g , where f is a loss function associated with the forward model and g is a regularizer? This has a straightforward solution if the denoiser can be expressed as a proximal map, as was shown to be the case for a class of linear symmetric denoisers. However, this result excludes kernel denoisers such as nonlocal means that are inherently non-symmetric. In this paper, we prove that a broader class of linear denoisers (including symmetric denoisers and kernel denoisers) can be expressed as a proximal map of some convex regularizer g . An algorithmic implication of this result for non-symmetric denoisers is that it necessitates appropriate modifications in the PnP updates to ensure convergence to a minimum of f+g . Apart from the convergence guarantee, the modified PnP algorithms are shown to produce good restorations.
Collapse
|
25
|
Zhang M, Ling Q. Bilateral Upsampling Network for Single Image Super-Resolution With Arbitrary Scaling Factors. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2021; 30:4395-4408. [PMID: 33848249 DOI: 10.1109/tip.2021.3071708] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Single Image Super-Resolution (SISR) is essential for many computer vision tasks. In some real-world applications, such as object recognition and image classification, the captured image size can be arbitrary while the required image size is fixed, which necessitates SISR with arbitrary scaling factors. It is a challenging problem to take a single model to accomplish the SISR task under arbitrary scaling factors. To solve that problem, this paper proposes a bilateral upsampling network which consists of a bilateral upsampling filter and a depthwise feature upsampling convolutional layer. The bilateral upsampling filter is made up of two upsampling filters, including a spatial upsampling filter and a range upsampling filter. With the introduction of the range upsampling filter, the weights of the bilateral upsampling filter can be adaptively learned under different scaling factors and different pixel values. The output of the bilateral upsampling filter is then provided to the depthwise feature upsampling convolutional layer, which upsamples the low-resolution (LR) feature map to the high-resolution (HR) feature space depthwisely and well recovers the structural information of the HR feature map. The depthwise feature upsampling convolutional layer can not only efficiently reduce the computational cost of the weight prediction of the bilateral upsampling filter, but also accurately recover the textual details of the HR feature map. Experiments on benchmark datasets demonstrate that the proposed bilateral upsampling network can achieve better performance than some state-of-the-art SISR methods.
Collapse
|
26
|
Awasthi N, Kumar Kalva S, Pramanik M, Yalavarthy PK. Dimensionality reduced plug and play priors for improving photoacoustic tomographic imaging with limited noisy data. BIOMEDICAL OPTICS EXPRESS 2021; 12:1320-1338. [PMID: 33796356 PMCID: PMC7984800 DOI: 10.1364/boe.415182] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/17/2020] [Revised: 01/23/2021] [Accepted: 01/23/2021] [Indexed: 05/03/2023]
Abstract
The reconstruction methods for solving the ill-posed inverse problem of photoacoustic tomography with limited noisy data are iterative in nature to provide accurate solutions. These methods performance is highly affected by the noise level in the photoacoustic data. A singular value decomposition (SVD) based plug and play priors method for solving photoacoustic inverse problem was proposed in this work to provide robustness to noise in the data. The method was shown to be superior as compared to total variation regularization, basis pursuit deconvolution and Lanczos Tikhonov based regularization and provided improved performance in case of noisy data. The numerical and experimental cases show that the improvement can be as high as 8.1 dB in signal to noise ratio of the reconstructed image and 67.98% in root mean square error in comparison to the state of the art methods.
Collapse
Affiliation(s)
- Navchetan Awasthi
- Department of Computational and Data Sciences, Indian Institute of Science, Bangalore 560012, India
| | - Sandeep Kumar Kalva
- School of Chemical and Biomedical Engineering, Nanyang Technological University, 637459, Singapore
| | - Manojit Pramanik
- School of Chemical and Biomedical Engineering, Nanyang Technological University, 637459, Singapore
| | - Phaneendra K. Yalavarthy
- Department of Computational and Data Sciences, Indian Institute of Science, Bangalore 560012, India
| |
Collapse
|
27
|
Nair P, Gavaskar RG, Chaudhury KN. Fixed-Point and Objective Convergence of Plug-and-Play Algorithms. IEEE TRANSACTIONS ON COMPUTATIONAL IMAGING 2021; 7:337-348. [DOI: 10.1109/tci.2021.3066053] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/19/2023]
|
28
|
Zhang D, Liang Z, Shao J. Joint image deblurring and super-resolution with attention dual supervised network. Neurocomputing 2020. [DOI: 10.1016/j.neucom.2020.05.069] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
29
|
Zha Z, Yuan X, Wen B, Zhou J, Zhu C. Group Sparsity Residual Constraint with Non-Local Priors for Image Restoration. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2020; PP:8960-8975. [PMID: 32903181 DOI: 10.1109/tip.2020.3021291] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Group sparse representation (GSR) has made great strides in image restoration producing superior performance, realized through employing a powerful mechanism to integrate the local sparsity and nonlocal self-similarity of images. However, due to some form of degradation (e.g., noise, down-sampling or pixels missing), traditional GSR models may fail to faithfully estimate sparsity of each group in an image, thus resulting in a distorted reconstruction of the original image. This motivates us to design a simple yet effective model that aims to address the above mentioned problem. Specifically, we propose group sparsity residual constraint with nonlocal priors (GSRC-NLP) for image restoration. Through introducing the group sparsity residual constraint, the problem of image restoration is further defined and simplified through attempts at reducing the group sparsity residual. Towards this end, we first obtain a good estimation of the group sparse coefficient of each original image group by exploiting the image nonlocal self-similarity (NSS) prior along with self-supervised learning scheme, and then the group sparse coefficient of the corresponding degraded image group is enforced to approximate the estimation. To make the proposed scheme tractable and robust, two algorithms, i.e., iterative shrinkage/thresholding (IST) and alternating direction method of multipliers (ADMM), are employed to solve the proposed optimization problems for different image restoration tasks. Experimental results on image denoising, image inpainting and image compressive sensing (CS) recovery, demonstrate that the proposed GSRC-NLP based image restoration algorithm is comparable to state-of-the-art denoising methods and outperforms several state-of-the-art image inpainting and image CS recovery methods in terms of both objective and perceptual quality metrics.
Collapse
|
30
|
Rastogi A, Yalavarthy PK. Comparison of iterative parametric and indirect deep learning‐based reconstruction methods in highly undersampled DCE‐MR Imaging of the breast. Med Phys 2020; 47:4838-4861. [DOI: 10.1002/mp.14447] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2020] [Revised: 07/24/2020] [Accepted: 08/03/2020] [Indexed: 12/23/2022] Open
Affiliation(s)
- Aditya Rastogi
- Department of Computational and Data Sciences Indian Institute of Science Bangalore560012 India
| | | |
Collapse
|
31
|
Jiao J, Tu WC, Liu D, He S, Lau RWH, Huang TS. FormNet: Formatted Learning for Image Restoration. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2020; 29:6302-6314. [PMID: 32365031 DOI: 10.1109/tip.2020.2990603] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
In this paper, we propose a deep CNN to tackle the image restoration problem by learning formatted information. Previous deep learning based methods directly learn the mapping from corrupted images to clean images, and may suffer from the gradient exploding/vanishing problems of deep neural networks. We propose to address the image restoration problem by learning the structured details and recovering the latent clean image together, from the shared information between the corrupted image and the latent image. In addition, instead of learning the pure difference (corruption), we propose to add a residual formatting layer and an adversarial block to format the information to structured one, which allows the network to converge faster and boosts the performance. Furthermore, we propose a cross-level loss net to ensure both pixel-level accuracy and semantic-level visual quality. Evaluations on public datasets show that the proposed method performs favorably against existing approaches quantitatively and qualitatively.
Collapse
|
32
|
Tirer T, Giryes R. Back-Projection based Fidelity Term for Ill-Posed Linear Inverse Problems. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2020; 29:6164-6179. [PMID: 32340949 DOI: 10.1109/tip.2020.2988779] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Ill-posed linear inverse problems appear in many image processing applications, such as deblurring, superresolution and compressed sensing. Many restoration strategies involve minimizing a cost function, which is composed of fidelity and prior terms, balanced by a regularization parameter. While a vast amount of research has been focused on different prior models, the fidelity term is almost always chosen to be the least squares (LS) objective, that encourages fitting the linearly transformed optimization variable to the observations. In this paper, we examine a different fidelity term, which has been implicitly used by the recently proposed iterative denoising and backward projections (IDBP) framework. This term encourages agreement between the projection of the optimization variable onto the row space of the linear operator and the pseudoinverse of the linear operator ("back-projection") applied on the observations. We analytically examine the difference between the two fidelity terms for Tikhonov regularization and identify cases (such as a badly conditioned linear operator) where the new term has an advantage over the standard LS one. Moreover, we demonstrate empirically that the behavior of the two induced cost functions for sophisticated convex and non-convex priors, such as total-variation, BM3D, and deep generative models, correlates with the obtained theoretical analysis.
Collapse
|
33
|
Zha Z, Yuan X, Wen B, Zhou J, Zhang J, Zhu C. A Benchmark for Sparse Coding: When Group Sparsity Meets Rank Minimization. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2020; 29:5094-5109. [PMID: 32167891 DOI: 10.1109/tip.2020.2972109] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Sparse coding has achieved a great success in various image processing tasks. However, a benchmark to measure the sparsity of image patch/group is missing since sparse coding is essentially an NP-hard problem. This work attempts to fill the gap from the perspective of rank minimization. We firstly design an adaptive dictionary to bridge the gap between group-based sparse coding (GSC) and rank minimization. Then, we show that under the designed dictionary, GSC and the rank minimization problems are equivalent, and therefore the sparse coefficients of each patch group can be measured by estimating the singular values of each patch group. We thus earn a benchmark to measure the sparsity of each patch group because the singular values of the original image patch groups can be easily computed by the singular value decomposition (SVD). This benchmark can be used to evaluate performance of any kind of norm minimization methods in sparse coding through analyzing their corresponding rank minimization counterparts. Towards this end, we exploit four well-known rank minimization methods to study the sparsity of each patch group and the weighted Schatten p-norm minimization (WSNM) is found to be the closest one to the real singular values of each patch group. Inspired by the aforementioned equivalence regime of rank minimization and GSC, WSNM can be translated into a non-convex weighted ℓp-norm minimization problem in GSC. By using the earned benchmark in sparse coding, the weighted ℓp-norm minimization is expected to obtain better performance than the three other norm minimization methods, i.e., ℓ1-norm, ℓp-norm and weighted ℓ1-norm. To verify the feasibility of the proposed benchmark, we compare the weighted ℓp-norm minimization against the three aforementioned norm minimization methods in sparse coding. Experimental results on image restoration applications, namely image inpainting and image compressive sensing recovery, demonstrate that the proposed scheme is feasible and outperforms many state-of-the-art methods.
Collapse
|
34
|
Gavaskar RG, Chaudhury KN. Plug-and-Play ISTA Converges With Kernel Denoisers. IEEE SIGNAL PROCESSING LETTERS 2020; 27:610-614. [DOI: 10.1109/lsp.2020.2986643] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/19/2023]
|
35
|
Gavaskar RG, Chaudhury KN. On the Proof of Fixed-Point Convergence for Plug-and-Play ADMM. IEEE SIGNAL PROCESSING LETTERS 2019; 26:1817-1821. [DOI: 10.1109/lsp.2019.2950611] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/19/2023]
|
36
|
Liu X, Chen Y, Peng Z, Wu J. Infrared Image Super-Resolution Reconstruction Based on Quaternion and High-Order Overlapping Group Sparse Total Variation. SENSORS (BASEL, SWITZERLAND) 2019; 19:E5139. [PMID: 31771234 PMCID: PMC6928699 DOI: 10.3390/s19235139] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/26/2019] [Revised: 11/20/2019] [Accepted: 11/21/2019] [Indexed: 12/18/2022]
Abstract
Owing to the limitations of imaging principles and system imaging characteristics, infrared images generally have some shortcomings, such as low resolution, insufficient details, and blurred edges. Therefore, it is of practical significance to improve the quality of infrared images. To make full use of the information on adjacent points, preserve the image structure, and avoid staircase artifacts, this paper proposes a super-resolution reconstruction method for infrared images based on quaternion total variation and high-order overlapping group sparse. The method uses a quaternion total variation method to utilize the correlation between adjacent points to improve image anti-noise ability and reconstruction effect. It uses the sparsity of a higher-order gradient to reconstruct a clear image structure and restore smooth changes. In addition, we performed regularization by using the denoising method, alternating direction method of multipliers, and fast Fourier transform theory to improve the efficiency and robustness of our method. Our experimental results show that this method has excellent performance in objective evaluation and subjective visual effects.
Collapse
Affiliation(s)
- Xingguo Liu
- School of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China;
- Laboratory of Imaging Detection and Intelligent Perception, University of Electronic Science and Technology of China, Chengdu 610054, China
- Chongqing College of Electronic Engineering, Chongqing 401331, China;
| | - Yingpin Chen
- School of Physics and Information Engineering, Minnan Normal University, Zhangzhou 363000, China;
| | - Zhenming Peng
- School of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China;
- Laboratory of Imaging Detection and Intelligent Perception, University of Electronic Science and Technology of China, Chengdu 610054, China
| | - Juan Wu
- Chongqing College of Electronic Engineering, Chongqing 401331, China;
| |
Collapse
|