1
|
Zhang Y, Zhang T, Zhu H, Chen Z, Mi S, Peng X, Geng X. Object Adaptive Self-Supervised Dense Visual Pre-Training. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2025; 34:2228-2240. [PMID: 40168204 DOI: 10.1109/tip.2025.3555073] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/03/2025]
Abstract
Self-supervised visual pre-training models have achieved significant success without employing expensive annotations. Nevertheless, most of these models focus on iconic single-instance datasets (e.g. ImageNet), ignoring the insufficient discriminative representation for non-iconic multi-instance datasets (e.g. COCO). In this paper, we propose a novel Object Adaptive Dense Pre-training (OADP) method to learn the visual representation directly on the multi-instance datasets (e.g., PASCAL VOC and COCO) for dense prediction tasks (e.g., object detection and instance segmentation). We present a novel object-aware and learning-adaptive random view augmentation to focus the contrastive learning to enhance the discrimination of object presentations from large to small scale during different learning stages. Furthermore, the representations across different scale and resolutions are integrated so that the method can learn diverse representations. In the experiment, we evaluated OADP pre-trained on PASCAL VOC and COCO. Results show that our method has better performances than most existing state-of-the-art methods when transferring to various downstream tasks, including image classification, object detection, instance segmentation and semantic segmentation.
Collapse
|
2
|
Waida H, Yamazaki K, Tokuhisa A, Wada M, Wada Y. Investigating self-supervised image denoising with denaturation. Neural Netw 2025; 184:106966. [PMID: 39700824 DOI: 10.1016/j.neunet.2024.106966] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2024] [Revised: 10/08/2024] [Accepted: 11/25/2024] [Indexed: 12/21/2024]
Abstract
Self-supervised learning for image denoising problems in the presence of denaturation for noisy data is a crucial approach in machine learning. However, theoretical understanding of the performance of the approach that uses denatured data is lacking. To provide better understanding of the approach, in this paper, we analyze a self-supervised denoising algorithm that uses denatured data in depth through theoretical analysis and numerical experiments. Through the theoretical analysis, we discuss that the algorithm finds desired solutions to the optimization problem with the population risk, while the guarantee for the empirical risk depends on the hardness of the denoising task in terms of denaturation levels. We also conduct several experiments to investigate the performance of an extended algorithm in practice. The results indicate that the algorithm training with denatured images works, and the empirical performance aligns with the theoretical results. These results suggest several insights for further improvement of self-supervised image denoising that uses denatured data in future directions.
Collapse
Affiliation(s)
- Hiroki Waida
- Department of Mathematical and Computing Science, Institute of Science Tokyo, 2-12-1 Ookayama, Meguro-ku, Tokyo, 152-8550, Japan
| | - Kimihiro Yamazaki
- Fujitsu Limited, 4-1-1 Kamikodanaka, Nakahara-ku, Kawasaki-shi, Kanagawa, 211-8588, Japan
| | - Atsushi Tokuhisa
- RIKEN Center for Computational Science, 7-1-26 Minatojima-minami-machi, Chuo-ku, Kobe, Hyogo, 650-0047, Japan
| | - Mutsuyo Wada
- Fujitsu Limited, 4-1-1 Kamikodanaka, Nakahara-ku, Kawasaki-shi, Kanagawa, 211-8588, Japan
| | - Yuichiro Wada
- Fujitsu Limited, 4-1-1 Kamikodanaka, Nakahara-ku, Kawasaki-shi, Kanagawa, 211-8588, Japan; RIKEN Center for Advanced Intelligence Project, Nihonbashi 1-chome Mitsui Building, 15th floor, 1-4-1 Nihonbashi, Chuo-ku, Tokyo, 103-0027, Japan.
| |
Collapse
|
3
|
Jiang L, Zhu B, Long W, Xu J, Wu Y, Li YW. A review of denoising methods in single-particle cryo-EM. Micron 2025; 194:103817. [PMID: 40164016 DOI: 10.1016/j.micron.2025.103817] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2024] [Revised: 02/08/2025] [Accepted: 03/11/2025] [Indexed: 04/02/2025]
Abstract
Cryo-EM has become a vital technique for resolving macromolecular structures at near-atomic resolution, enabling the visualization of proteins and large molecular complexes. However, the images are often accompanied by extremely low SNR, which poses significant challenges for subsequent processes such as particle picking and 3D reconstruction. Effective denoising methods can substantially improve SNR, making downstream analyzes more accurate and reliable. Thus, image denoising is an essential step in cryo-EM data processing. This paper comprehensively reviews recent advances in image denoising methods for single-particle analysis, covering approaches from traditional filtering methods to the latest deep learning-based strategies. By analyzing and comparing mainstream denoising methods, this review aims to advance the field of single-particle cryo-EM denoising, facilitate the acquisition of higher-quality images, and offer valuable insights for researchers new to the field.
Collapse
Affiliation(s)
- Linhua Jiang
- School of Information Engineering, Huzhou University, Huzhou, China; ISEP-Sorbonne Joint Research Lab, 10 Rue de Vanves, Paris 92130, France.
| | - Bo Zhu
- School of Information Engineering, Huzhou University, Huzhou, China.
| | - Wei Long
- School of Information Engineering, Huzhou University, Huzhou, China.
| | - Jiahao Xu
- School of Information Engineering, Huzhou University, Huzhou, China.
| | - Yi Wu
- School of Information Engineering, Huzhou University, Huzhou, China.
| | - Yao-Wang Li
- School of Life Sciences, Southern University of Science and Technology, Shenzhen, China.
| |
Collapse
|
4
|
Shafique M, Liu S, Schniter P, Ahmad R. MRI recovery with self-calibrated denoisers without fully-sampled data. MAGMA (NEW YORK, N.Y.) 2025; 38:53-66. [PMID: 39412614 PMCID: PMC11790797 DOI: 10.1007/s10334-024-01207-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/18/2024] [Revised: 08/29/2024] [Accepted: 09/17/2024] [Indexed: 02/04/2025]
Abstract
OBJECTIVE Acquiring fully sampled training data is challenging for many MRI applications. We present a self-supervised image reconstruction method, termed ReSiDe, capable of recovering images solely from undersampled data. MATERIALS AND METHODS ReSiDe is inspired by plug-and-play (PnP) methods, but unlike traditional PnP approaches that utilize pre-trained denoisers, ReSiDe iteratively trains the denoiser on the image or images that are being reconstructed. We introduce two variations of our method: ReSiDe-S and ReSiDe-M. ReSiDe-S is scan-specific and works with a single set of undersampled measurements, while ReSiDe-M operates on multiple sets of undersampled measurements and provides faster inference. Studies I, II, and III compare ReSiDe-S and ReSiDe-M against other self-supervised or unsupervised methods using data from T1- and T2-weighted brain MRI, MRXCAT digital perfusion phantom, and first-pass cardiac perfusion, respectively. RESULTS ReSiDe-S and ReSiDe-M outperform other methods in terms of peak signal-to-noise ratio and structural similarity index measure for Studies I and II, and in terms of expert scoring for Study III. DISCUSSION We present a self-supervised image reconstruction method and validate it in both static and dynamic MRI applications. These developments can benefit MRI applications where the availability of fully sampled training data is limited.
Collapse
Affiliation(s)
- Muhammad Shafique
- Biomedical Engineering, Ohio State University, Columbus, OH, 43210, USA
- Electrical and Computer Engineering, COMSATS University, Islamabad, Pakistan
| | - Sizhuo Liu
- Biomedical Engineering, Ohio State University, Columbus, OH, 43210, USA
| | - Philip Schniter
- Electrical and Computer Engineering, Ohio State University, Columbus, OH, 43210, USA
| | - Rizwan Ahmad
- Biomedical Engineering, Ohio State University, Columbus, OH, 43210, USA.
| |
Collapse
|
5
|
Lin H, Seitz S, Tan Y, Lugagne JB, Wang L, Ding G, He H, Rauwolf TJ, Dunlop MJ, Connor JH, Porco JA, Tian L, Cheng JX. Label-free nanoscopy of cell metabolism by ultrasensitive reweighted visible stimulated Raman scattering. Nat Methods 2025:10.1038/s41592-024-02575-1. [PMID: 39820753 DOI: 10.1038/s41592-024-02575-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2024] [Accepted: 11/26/2024] [Indexed: 01/19/2025]
Abstract
Super-resolution imaging of cell metabolism is hindered by the incompatibility of small metabolites with fluorescent dyes and the limited resolution of imaging mass spectrometry. We present ultrasensitive reweighted visible stimulated Raman scattering (URV-SRS), a label-free vibrational imaging technique for multiplexed nanoscopy of intracellular metabolites. We developed a visible SRS microscope with extensive pulse chirping to improve the detection limit to ~4,000 molecules and introduced a self-supervised multi-agent denoiser to suppress non-independent noise in SRS by over 7.2 dB, resulting in a 50-fold sensitivity enhancement over near-infrared SRS. Leveraging the enhanced sensitivity, we employed Fourier reweighting to amplify sub-100-nm spatial frequencies that were previously overwhelmed by noise. Validated by Fourier ring correlation, we achieved a lateral resolution of 86 nm in cell imaging. We visualized the reprogramming of metabolic nanostructures associated with virus replication in host cells and subcellular fatty acid synthesis in engineered bacteria, demonstrating its capability towards nanoscopic spatial metabolomics.
Collapse
Affiliation(s)
- Haonan Lin
- Department of Electrical and Computer Engineering, Boston University, Boston, MA, USA
- Department of Biomedical Engineering, Boston University, Boston, MA, USA
- Photonics Center, Boston University, Boston, MA, USA
| | - Scott Seitz
- Department of Virology, Immunology, and Microbiology, Boston University Chobanian & Avedisian School of Medicine, Boston, MA, USA
- National Emerging Infectious Diseases Laboratories, Boston University Chobanian & Avedisian School of Medicine, Boston, MA, USA
| | - Yuying Tan
- Department of Biomedical Engineering, Boston University, Boston, MA, USA
- Photonics Center, Boston University, Boston, MA, USA
| | - Jean-Baptiste Lugagne
- Department of Biomedical Engineering, Boston University, Boston, MA, USA
- Biological Design Center, Boston University, Boston, MA, USA
| | - Le Wang
- Department of Electrical and Computer Engineering, Boston University, Boston, MA, USA
- Photonics Center, Boston University, Boston, MA, USA
| | - Guangrui Ding
- Department of Electrical and Computer Engineering, Boston University, Boston, MA, USA
- Photonics Center, Boston University, Boston, MA, USA
| | - Hongjian He
- Department of Electrical and Computer Engineering, Boston University, Boston, MA, USA
- Photonics Center, Boston University, Boston, MA, USA
| | - Tyler J Rauwolf
- Department of Chemistry, Boston University, Boston, MA, USA
- Center for Molecular Discovery (BU-CMD), Boston University, Boston, MA, USA
| | - Mary J Dunlop
- Department of Biomedical Engineering, Boston University, Boston, MA, USA
- Biological Design Center, Boston University, Boston, MA, USA
| | - John H Connor
- Department of Virology, Immunology, and Microbiology, Boston University Chobanian & Avedisian School of Medicine, Boston, MA, USA
- National Emerging Infectious Diseases Laboratories, Boston University Chobanian & Avedisian School of Medicine, Boston, MA, USA
| | - John A Porco
- Department of Chemistry, Boston University, Boston, MA, USA
- Center for Molecular Discovery (BU-CMD), Boston University, Boston, MA, USA
| | - Lei Tian
- Department of Electrical and Computer Engineering, Boston University, Boston, MA, USA
- Department of Biomedical Engineering, Boston University, Boston, MA, USA
- Photonics Center, Boston University, Boston, MA, USA
| | - Ji-Xin Cheng
- Department of Electrical and Computer Engineering, Boston University, Boston, MA, USA.
- Department of Biomedical Engineering, Boston University, Boston, MA, USA.
- Photonics Center, Boston University, Boston, MA, USA.
- Department of Chemistry, Boston University, Boston, MA, USA.
| |
Collapse
|
6
|
Song Z, Xue L, Xu J, Zhang B, Jin C, Yang J, Zou C. Real-World Low-Dose CT Image Denoising by Patch Similarity Purification. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2024; PP:196-208. [PMID: 40030715 DOI: 10.1109/tip.2024.3515878] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/05/2025]
Abstract
Reducing the radiation dose in CT scanning is important to alleviate the damage to the human health in clinical scenes. A promising way is to replace the normal-dose CT (NDCT) imaging by low-dose CT (LDCT) imaging with lower tube voltage and tube current. This often brings severe noise to the LDCT images, which adversely affects the diagnosis accuracy. Most of existing LDCT image denoising networks are trained either with synthetic LDCT images or real-world LDCT and NDCT image pairs with huge spatial misalignment. However, the synthetic noise is very different from the complex noise in real-world LDCT images, while the huge spatial misalignment brings inaccurate predictions of tissue structures in the denoised LDCT images. To well utilize real-world LDCT and NDCT image pairs for LDCT image denoising, in this paper, we introduce a new Patch Similarity Purification (PSP) strategy to construct high-quality training dataset for network training. Specifically, our PSP strategy first perform binarization for each pair of image patches cropped from the corresponding LDCT and NDCT image pairs. For each pair of binary masks, it then computes their similarity ratio by common mask calculation, and the patch pair can be selected as a training sample if their mask similarity ratio is higher than a threshold. By using our PSP strategy, each training set of our Rabbit and Patient datasets contain hundreds of thousands of real-world LDCT and NDCT image patch pairs with negligible misalignment. Extensive experiments demonstrate the usefulness of our PSP strategy on purifying the training data and the effectiveness of training LDCT image denoising networks on our datasets. The code and dataset are provided at https://github.com/TuTusong/PSP.
Collapse
|
7
|
Arabi H, Zaidi H. Contrastive Learning vs. Self-Learning vs. Deformable Data Augmentation in Semantic Segmentation of Medical Images. JOURNAL OF IMAGING INFORMATICS IN MEDICINE 2024; 37:3217-3230. [PMID: 38858260 PMCID: PMC11612072 DOI: 10.1007/s10278-024-01159-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Revised: 05/23/2024] [Accepted: 05/24/2024] [Indexed: 06/12/2024]
Abstract
To develop a robust segmentation model, encoding the underlying features/structures of the input data is essential to discriminate the target structure from the background. To enrich the extracted feature maps, contrastive learning and self-learning techniques are employed, particularly when the size of the training dataset is limited. In this work, we set out to investigate the impact of contrastive learning and self-learning on the performance of the deep learning-based semantic segmentation. To this end, three different datasets were employed used for brain tumor and hippocampus delineation from MR images (BraTS and Decathlon datasets, respectively) and kidney segmentation from CT images (Decathlon dataset). Since data augmentation techniques are also aimed at enhancing the performance of deep learning methods, a deformable data augmentation technique was proposed and compared with contrastive learning and self-learning frameworks. The segmentation accuracy for the three datasets was assessed with and without applying data augmentation, contrastive learning, and self-learning to individually investigate the impact of these techniques. The self-learning and deformable data augmentation techniques exhibited comparable performance with Dice indices of 0.913 ± 0.030 and 0.920 ± 0.022 for kidney segmentation, 0.890 ± 0.035 and 0.898 ± 0.027 for hippocampus segmentation, and 0.891 ± 0.045 and 0.897 ± 0.040 for lesion segmentation, respectively. These two approaches significantly outperformed the contrastive learning and the original model with Dice indices of 0.871 ± 0.039 and 0.868 ± 0.042 for kidney segmentation, 0.872 ± 0.045 and 0.865 ± 0.048 for hippocampus segmentation, and 0.870 ± 0.049 and 0.860 ± 0.058 for lesion segmentation, respectively. The combination of self-learning with deformable data augmentation led to a robust segmentation model with no outliers in the outcomes. This work demonstrated the beneficial impact of self-learning and deformable data augmentation on organ and lesion segmentation, where no additional training datasets are needed.
Collapse
Affiliation(s)
- Hossein Arabi
- Division of Nuclear Medicine and Molecular Imaging, Geneva University Hospital, CH-1211, Geneva 4, Switzerland
| | - Habib Zaidi
- Division of Nuclear Medicine and Molecular Imaging, Geneva University Hospital, CH-1211, Geneva 4, Switzerland.
- Department of Nuclear Medicine and Molecular Imaging, University of Groningen, University Medical Center Groningen, 9700 RB, Groningen, Netherlands.
- Department of Nuclear Medicine, University of Southern Denmark, DK-500, Odense, Denmark.
- University Research and Innovation Center, Óbuda University, Budapest, Hungary.
| |
Collapse
|
8
|
Zhang H, Cao J, Cui H, Zhou C, Yao H, Hao Q, Wang Y. Computational ghost imaging enhanced by degradation models for under-sampling. OPTICS LETTERS 2024; 49:5296-5299. [PMID: 39270289 DOI: 10.1364/ol.532197] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/06/2024] [Accepted: 08/25/2024] [Indexed: 09/15/2024]
Abstract
Computational ghost imaging (CGI) allows two-dimensional (2D) imaging by using spatial light modulators and bucket detectors. However, most CGI methods attempt to obtain 2D images through measurements with a single sampling ratio. Here, we propose a CGI method enhanced by degradation models for under-sampling, which can be reflected by results from measurements with different sampling ratios. We utilize results from low-sampling-ratio measurements and normal-sampling-ratio measurements to train the neural network for the degradation model, which is fitted through self-supervised learning. We obtain final results by importing normal-sampling-ratio results into the neural network with optimal parameters. We experimentally demonstrate improved results from the CGI method using degradation models for under-sampling. Our proposed method would promote the development of CGI in many applications.
Collapse
|
9
|
Pan J, Chang D, Wu W, Chen Y, Wang S. Self-supervised tomographic image noise suppression via residual image prior network. Comput Biol Med 2024; 179:108837. [PMID: 38991317 DOI: 10.1016/j.compbiomed.2024.108837] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2024] [Revised: 05/29/2024] [Accepted: 06/29/2024] [Indexed: 07/13/2024]
Abstract
Computed tomography (CT) denoising is a challenging task in medical imaging that has garnered considerable attention. Supervised networks require a lot of noisy-clean image pairs, which are always unavailable in clinical settings. Existing self-supervised algorithms for suppressing noise with paired noisy images have limitations, such as ignoring the residual between similar image pairs during training and insufficiently learning the spectrum information of images. In this study, we propose a Residual Image Prior Network (RIP-Net) to sufficiently model the residual between the paired similar noisy images. Our approach offers new insights into the field by addressing the limitations of existing methods. We first establish a mathematical theorem clarifying the non-equivalence between similar-image-based self-supervised learning and supervised learning. It helps us better understand the strengths and limitations of self-supervised learning. Secondly, we introduce a novel regularization term to model a low-frequency residual image prior. This can improve the accuracy and robustness of our model. Finally, we design a well-structured denoising network capable of exploring spectrum information while simultaneously sensing context messages. The network has dual paths for modeling high and low-frequency compositions in the raw noisy image. Additionally, context perception modules capture local and global interactions to produce high-quality images. The comprehensive experiments on preclinical photon-counting CT, clinical brain CT, and low-dose CT datasets, demonstrate that our RIP-Net is superior to other unsupervised denoising methods.
Collapse
Affiliation(s)
- Jiayi Pan
- School of Biomedical Engineering, Sun Yat-Sen University, Shenzhen, Guangdong, China.
| | - Dingyue Chang
- Institute of Materials, China Academy of Engineering Physics, Mianyang, Sichuan, China
| | - Weiwen Wu
- School of Biomedical Engineering, Sun Yat-Sen University, Shenzhen, Guangdong, China.
| | - Yang Chen
- School of Computer Science and Engineering, Southeast University, Nanjing, Jiangsu, China.
| | - Shaoyu Wang
- School of Information Engineering, Nanchang University, Nanchang, Jiangxi, China.
| |
Collapse
|
10
|
Oppliger J, Denner MM, Küspert J, Frison R, Wang Q, Morawietz A, Ivashko O, Dippel AC, Zimmermann MV, Biało I, Martinelli L, Fauqué B, Choi J, Garcia-Fernandez M, Zhou KJ, Christensen NB, Kurosawa T, Momono N, Oda M, Natterer FD, Fischer MH, Neupert T, Chang J. Weak signal extraction enabled by deep neural network denoising of diffraction data. NAT MACH INTELL 2024; 6:180-186. [PMID: 38404481 PMCID: PMC10883886 DOI: 10.1038/s42256-024-00790-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2022] [Accepted: 01/08/2024] [Indexed: 02/27/2024]
Abstract
The removal or cancellation of noise has wide-spread applications in imaging and acoustics. In applications in everyday life, such as image restoration, denoising may even include generative aspects, which are unfaithful to the ground truth. For scientific use, however, denoising must reproduce the ground truth accurately. Denoising scientific data is further challenged by unknown noise profiles. In fact, such data will often include noise from multiple distinct sources, which substantially reduces the applicability of simulation-based approaches. Here we show how scientific data can be denoised by using a deep convolutional neural network such that weak signals appear with quantitative accuracy. In particular, we study X-ray diffraction and resonant X-ray scattering data recorded on crystalline materials. We demonstrate that weak signals stemming from charge ordering, insignificant in the noisy data, become visible and accurate in the denoised data. This success is enabled by supervised training of a deep neural network with pairs of measured low- and high-noise data. We additionally show that using artificial noise does not yield such quantitatively accurate results. Our approach thus illustrates a practical strategy for noise filtering that can be applied to challenging acquisition problems.
Collapse
Affiliation(s)
- Jens Oppliger
- Physik-Institut, Universität Zürich, Zurich, Switzerland
| | | | - Julia Küspert
- Physik-Institut, Universität Zürich, Zurich, Switzerland
| | - Ruggero Frison
- Physik-Institut, Universität Zürich, Zurich, Switzerland
| | - Qisi Wang
- Physik-Institut, Universität Zürich, Zurich, Switzerland
- Department of Physics, The Chinese University of Hong Kong, Hong Kong, China
| | | | - Oleh Ivashko
- Deutsches Elektronen-Synchrotron DESY, Hamburg, Germany
| | | | | | - Izabela Biało
- Physik-Institut, Universität Zürich, Zurich, Switzerland
- Faculty of Physics and Applied Computer Science, AGH University of Krakow, Krakow, Poland
| | | | - Benoît Fauqué
- JEIP, USR 3573 CNRS, Collège de France, PSL University, Paris, France
| | | | | | | | | | - Tohru Kurosawa
- Department of Physics, Hokkaido University, Sapporo, Japan
| | - Naoki Momono
- Department of Physics, Hokkaido University, Sapporo, Japan
- Department of Applied Sciences, Muroran Institute of Technology, Muroran, Japan
| | - Migaku Oda
- Department of Physics, Hokkaido University, Sapporo, Japan
| | | | | | - Titus Neupert
- Physik-Institut, Universität Zürich, Zurich, Switzerland
| | - Johan Chang
- Physik-Institut, Universität Zürich, Zurich, Switzerland
| |
Collapse
|
11
|
Kim W, Lee J, Choi JH. An unsupervised two-step training framework for low-dose computed tomography denoising. Med Phys 2024; 51:1127-1144. [PMID: 37432026 DOI: 10.1002/mp.16628] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2023] [Revised: 06/25/2023] [Accepted: 06/25/2023] [Indexed: 07/12/2023] Open
Abstract
BACKGROUND Although low-dose computed tomography (CT) imaging has been more widely adopted in clinical practice to reduce radiation exposure to patients, the reconstructed CT images tend to have more noise, which impedes accurate diagnosis. Recently, deep neural networks using convolutional neural networks to reduce noise in the reconstructed low-dose CT images have shown considerable improvement. However, they need a large number of paired normal- and low-dose CT images to fully train the network via supervised learning methods. PURPOSE To propose an unsupervised two-step training framework for image denoising that uses low-dose CT images of one dataset and unpaired high-dose CT images from another dataset. METHODS Our proposed framework trains the denoising network in two steps. In the first training step, we train the network using 3D volumes of CT images and predict the center CT slice from them. This pre-trained network is used in the second training step to train the denoising network and is combined with the memory-efficient denoising generative adversarial network (DenoisingGAN), which further enhances both objective and perceptual quality. RESULTS The experimental results on phantom and clinical datasets show superior performance over the existing traditional machine learning and self-supervised deep learning methods, and the results are comparable to the fully supervised learning methods. CONCLUSIONS We proposed a new unsupervised learning framework for low-dose CT denoising, convincingly improving noisy CT images from both objective and perceptual quality perspectives. Because our denoising framework does not require physics-based noise models or system-dependent assumptions, our proposed method can be easily reproduced; consequently, it can also be generally applicable to various CT scanners or dose levels.
Collapse
Affiliation(s)
- Wonjin Kim
- Division of Mechanical and Biomedical Engineering, Graduate Program in System Health Science and Engineering, Ewha Womans University, Seoul, Republic of Korea
| | - Jaayeon Lee
- Division of Mechanical and Biomedical Engineering, Graduate Program in System Health Science and Engineering, Ewha Womans University, Seoul, Republic of Korea
| | - Jang-Hwan Choi
- Division of Mechanical and Biomedical Engineering, Graduate Program in System Health Science and Engineering, Ewha Womans University, Seoul, Republic of Korea
| |
Collapse
|
12
|
Rafiei MH, Gauthier LV, Adeli H, Takabi D. Self-Supervised Learning for Electroencephalography. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:1457-1471. [PMID: 35867362 DOI: 10.1109/tnnls.2022.3190448] [Citation(s) in RCA: 48] [Impact Index Per Article: 48.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Decades of research have shown machine learning superiority in discovering highly nonlinear patterns embedded in electroencephalography (EEG) records compared with conventional statistical techniques. However, even the most advanced machine learning techniques require relatively large, labeled EEG repositories. EEG data collection and labeling are costly. Moreover, combining available datasets to achieve a large data volume is usually infeasible due to inconsistent experimental paradigms across trials. Self-supervised learning (SSL) solves these challenges because it enables learning from EEG records across trials with variable experimental paradigms, even when the trials explore different phenomena. It aggregates multiple EEG repositories to increase accuracy, reduce bias, and mitigate overfitting in machine learning training. In addition, SSL could be employed in situations where there is limited labeled training data, and manual labeling is costly. This article: 1) provides a brief introduction to SSL; 2) describes some SSL techniques employed in recent studies, including EEG; 3) proposes current and potential SSL techniques for future investigations in EEG studies; 4) discusses the cons and pros of different SSL techniques; and 5) proposes holistic implementation tips and potential future directions for EEG SSL practices.
Collapse
|
13
|
Yang Z, Zang D, Li H, Zhang Z, Zhang F, Han R. Self-supervised noise modeling and sparsity guided electron tomography volumetric image denoising. Ultramicroscopy 2024; 255:113860. [PMID: 37844382 DOI: 10.1016/j.ultramic.2023.113860] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2023] [Revised: 08/07/2023] [Accepted: 09/27/2023] [Indexed: 10/18/2023]
Abstract
Cryo-Electron Tomography (cryo-ET) is a revolutionary technique for visualizing macromolecular structures in near-native states. However, the physical limitations of imaging instruments lead to cryo-ET volumetric images with very low Signal-to-Noise Ratio (SNR) with complex noise, which has a side effect on the downstream analysis of the characteristics of observed macromolecules. Additionally, existing methods for image denoising are difficult to be well generalized to the complex noise in cryo-ET volumes. In this work, we propose a self-supervised deep learning model for cryo-ET volumetric image denoising based on noise modeling and sparsity guidance (NMSG), achieved by learning the noise distribution in noisy cryo-ET volumes and introducing sparsity guidance to ensure smoothness. Firstly, a Generative Adversarial Network (GAN) is utilized to learn noise distribution in cryo-ET volumes and generate noisy volumes pair from single volume. Then, a new loss function is devised to both ensure the recovery of ultrastructure and local smoothness. Experiments are done on five real cryo-ET datasets and three simulated cryo-ET datasets. The comprehensive experimental results demonstrate that our method can perform reliable denoising by training on single noisy volume, achieving better results than state-of-the-art single volume-based methods and competitive with methods trained on large-scale datasets.
Collapse
Affiliation(s)
- Zhidong Yang
- High Performance Computer Research Center, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; School of Medical Technology, Beijing Institute of Technology, Beijing, 100081, China; University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Dawei Zang
- High Performance Computer Research Center, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China
| | - Hongjia Li
- High Performance Computer Research Center, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Zhao Zhang
- Research Center for Mathematics and Interdisciplinary Sciences, Frontiers Science Center for Nonlinear Expectations (Ministry of Education), Shandong University, Qingdao 266237, China
| | - Fa Zhang
- School of Medical Technology, Beijing Institute of Technology, Beijing, 100081, China
| | - Renmin Han
- Research Center for Mathematics and Interdisciplinary Sciences, Frontiers Science Center for Nonlinear Expectations (Ministry of Education), Shandong University, Qingdao 266237, China
| |
Collapse
|
14
|
Zhou J, Dong X, Liu Q. Context-aware dynamic filtering network for confocal laser endomicroscopy image denoising. Phys Med Biol 2023; 68:195014. [PMID: 37647912 DOI: 10.1088/1361-6560/acf558] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Accepted: 08/30/2023] [Indexed: 09/01/2023]
Abstract
Objective.As an emerging diagnosis technology for gastrointestinal diseases, confocal laser endomicroscopy (CLE) is limited by the physical structure of the fiber bundle, leading to the inevitable production of various forms of noise during the imaging process. However, existing denoising methods based on hand-crafted features inefficiently deal with realistic noise in CLE images. To alleviate this challenge, we proposed context-aware kernel estimation and multi-scale dynamic fusion modules to remove realistic noise in CLE images, including multiplicative and additive white noise.Approach.Specifically, a realistic noise statistics model with random noise specific to CLE data is constructed and further used to develop a self-supervised denoised model without the participation of clean images. Secondly, context-aware kernel estimation, which improves the representation of features by similar learnable region weights, addresses the problem of the non-uniform distribution of noises in CLE images and proposes a lightweight denoised model (CLENet). Thirdly, we have developed a multi-scale dynamic fusion module that decouples and recalibrates features, providing a precise and contextually enriched representation of features. Finally, we integrated two developed modules into a U-shaped backbone to build an efficient denoising network named U-CLENet.Main Results.Both proposed methods achieve comparable or better performance with low computational complexity on two gastrointestinal disease CLE image datasets using the same training benchmark.Significance.The proposed approaches improve the visual quality of unclear CLE images for various stages of tumor development, helping to reduce the rate of misdiagnosis in clinical decision-making and achieve computer graphics-assisted diagnosis.
Collapse
Affiliation(s)
- Jingjun Zhou
- School of Biomedical Engineering, Hainan University, 570228 Haikou, People's Republic of China
| | - Xiangjiang Dong
- Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, 430074 Wuhan, People's Republic of China
| | - Qian Liu
- School of Biomedical Engineering, Hainan University, 570228 Haikou, People's Republic of China
- Key Laboratory of Biomedical Engineering of Hainan Province, School of Biomedical Engineering, Hainan University, 570228 Haikou, People's Republic of China
| |
Collapse
|
15
|
Chong X, Cheng M, Fan W, Li Q, Leung H. M-Denoiser: Unsupervised image denoising for real-world optical and electron microscopy data. Comput Biol Med 2023; 164:107308. [PMID: 37562326 DOI: 10.1016/j.compbiomed.2023.107308] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2023] [Revised: 07/04/2023] [Accepted: 07/28/2023] [Indexed: 08/12/2023]
Abstract
Real-world microscopy data have a large amount of noise due to the limited light/electron that can be used to capture images. The noise of microscopy data is composed of signal-dependent shot noise and signal-independent read noise, and the Poisson-Gaussian noise model is usually used to describe the noise distribution. Meanwhile, the noise is spatially correlated because of the data acquisition process. Due to the lack of clean ground truth, unsupervised and self-supervised denoising algorithms in computer vision shed new light on tackling such tasks by utilizing paired noisy images or one single noisy image. However, they usually make the assumption that the noise is signal-independent or pixel-wise independent, which contradicts with the actual case. Hence, we propose M-Denoiser for denoising real-world microscopy data in an unsupervised manner. Firstly, the shatter module is used to break the dependency and correlation before denoising. Secondly, a novelly designed unsupervised training loss based on a pair of noisy images is proposed for real-world microscopy data. For evaluation, we train our model on optical and electron microscopy datasets. The experimental results show that M-Denoiser achieves the best performance both quantitatively and qualitatively compared with all the baselines.
Collapse
Affiliation(s)
- Xiaoya Chong
- Department of Computer Science, City University of Hong Kong, Hong Kong, China.
| | - Min Cheng
- Noah's Ark Lab, Huawei Technologies, Hong Kong, China.
| | - Wenqi Fan
- Department of Computing, The Hong Kong Polytechnic University, Hong Kong, China.
| | - Qing Li
- Department of Computing, The Hong Kong Polytechnic University, Hong Kong, China.
| | - Howard Leung
- Department of Computer Science, City University of Hong Kong, Hong Kong, China.
| |
Collapse
|
16
|
Zhou Z, Inoue A, McCollough CH, Yu L. Self-trained deep convolutional neural network for noise reduction in CT. J Med Imaging (Bellingham) 2023; 10:044008. [PMID: 37636895 PMCID: PMC10449263 DOI: 10.1117/1.jmi.10.4.044008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2023] [Revised: 08/04/2023] [Accepted: 08/08/2023] [Indexed: 08/29/2023] Open
Abstract
Purpose Supervised deep convolutional neural network (CNN)-based methods have been actively used in clinical CT to reduce image noise. The networks of these methods are typically trained using paired high- and low-quality data from a massive number of patients and/or phantom images. This training process is tedious, and the network trained under a given condition may not be generalizable to patient images acquired and reconstructed under different conditions. We propose a self-trained deep CNN (ST_CNN) method for noise reduction in CT that does not rely on pre-existing training datasets. Approach The ST_CNN training was accomplished using extensive data augmentation in the projection domain, and the inference was applied to the data itself. Specifically, multiple independent noise insertions were applied to the original patient projection data to generate multiple realizations of low-quality projection data. Then, rotation augmentation was adopted for both the original and low-quality projection data by applying the rotation angle directly on the projection data so that images were rotated at arbitrary angles without introducing additional bias. A large number of paired low- and high-quality images from the same patient were reconstructed and paired for training the ST_CNN model. Results No significant difference was found between the ST_CNN and conventional CNN models in terms of the peak signal-to-noise ratio and structural similarity index measure. The ST_CNN model outperformed the conventional CNN model in terms of noise texture and homogeneity in liver parenchyma as well as better subjective visualization of liver lesions. The ST_CNN may sacrifice the sharpness of vessels slightly compared to the conventional CNN model but without affecting the visibility of peripheral vessels or diagnosis of vascular pathology. Conclusions The proposed ST_CNN method trained from the data itself may achieve similar image quality in comparison with conventional deep CNN denoising methods pre-trained on external datasets.
Collapse
Affiliation(s)
- Zhongxing Zhou
- Mayo Clinic, Department of Radiology, Rochester, Minnesota, United States
| | - Akitoshi Inoue
- Mayo Clinic, Department of Radiology, Rochester, Minnesota, United States
| | | | - Lifeng Yu
- Mayo Clinic, Department of Radiology, Rochester, Minnesota, United States
| |
Collapse
|
17
|
Feuerriegel GC, Weiss K, Kronthaler S, Leonhardt Y, Neumann J, Wurm M, Lenhart NS, Makowski MR, Schwaiger BJ, Woertler K, Karampinos DC, Gersing AS. Evaluation of a deep learning-based reconstruction method for denoising and image enhancement of shoulder MRI in patients with shoulder pain. Eur Radiol 2023; 33:4875-4884. [PMID: 36806569 PMCID: PMC10289918 DOI: 10.1007/s00330-023-09472-9] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2022] [Revised: 01/07/2023] [Accepted: 01/22/2023] [Indexed: 02/21/2023]
Abstract
OBJECTIVES To evaluate the diagnostic performance of an automated reconstruction algorithm combining MR imaging acquired using compressed SENSE (CS) with deep learning (DL) in order to reconstruct denoised high-quality images from undersampled MR images in patients with shoulder pain. METHODS Prospectively, thirty-eight patients (14 women, mean age 40.0 ± 15.2 years) with shoulder pain underwent morphological MRI using a pseudo-random, density-weighted k-space scheme with an acceleration factor of 2.5 using CS only. An automated DL-based algorithm (CS DL) was used to create reconstructions of the same k-space data as used for CS reconstructions. Images were analyzed by two radiologists and assessed for pathologies, image quality, and visibility of anatomical landmarks using a 4-point Likert scale. RESULTS Overall agreement for the detection of pathologies between the CS DL reconstructions and CS images was substantial to almost perfect (κ 0.95 (95% confidence interval 0.82-1.00)). Image quality and the visibility of the rotator cuff, articular cartilage, and axillary recess were overall rated significantly higher for CS DL images compared to CS (p < 0.03). Contrast-to-noise ratios were significantly higher for cartilage/fluid (CS DL 198 ± 24.3, CS 130 ± 32.2, p = 0.02) and ligament/fluid (CS DL 184 ± 17.3, CS 141 ± 23.5, p = 0.03) and SNR values were significantly higher for ligaments and muscle of the CS DL reconstructions (p < 0.04). CONCLUSION Evaluation of shoulder pathologies was feasible using a DL-based algorithm for MRI reconstruction and denoising. In clinical routine, CS DL may be beneficial in particular for reducing image noise and may be useful for the detection and better discrimination of discrete pathologies. Assessment of shoulder pathologies was feasible with improved image quality as well as higher SNR using a compressed sensing deep learning-based framework for image reconstructions and denoising. KEY POINTS • Automated deep learning-based reconstructions showed a significant increase in signal-to-noise ratio and contrast-to-noise ratio (p < 0.04) with only a slight increase of reconstruction time of 40 s compared to CS. • All pathologies were accurately detected with no loss of diagnostic information or prolongation of the scan time. • Significant improvements of the image quality as well as the visibility of the rotator cuff, articular cartilage, and axillary recess were detected.
Collapse
Affiliation(s)
- Georg C Feuerriegel
- Department of Radiology, Klinikum Rechts Der Isar, School of Medicine, Technical University of Munich, Ismaninger Strasse 22, 81675, Munich, Germany.
| | | | - Sophia Kronthaler
- Department of Radiology, Klinikum Rechts Der Isar, School of Medicine, Technical University of Munich, Ismaninger Strasse 22, 81675, Munich, Germany
| | - Yannik Leonhardt
- Department of Radiology, Klinikum Rechts Der Isar, School of Medicine, Technical University of Munich, Ismaninger Strasse 22, 81675, Munich, Germany
| | - Jan Neumann
- Department of Radiology, Klinikum Rechts Der Isar, School of Medicine, Technical University of Munich, Ismaninger Strasse 22, 81675, Munich, Germany
- Musculoskeletal Radiology Section, Klinikum Rechts Der Isar, School of Medicine, Technical University of Munich, Munich, Germany
| | - Markus Wurm
- Department of Trauma Surgery, Klinikum Rechts Der Isar, School of Medicine, Technical University of Munich, Munich, Germany
| | - Nicolas S Lenhart
- Department of Radiology, Klinikum Rechts Der Isar, School of Medicine, Technical University of Munich, Ismaninger Strasse 22, 81675, Munich, Germany
| | - Marcus R Makowski
- Department of Radiology, Klinikum Rechts Der Isar, School of Medicine, Technical University of Munich, Ismaninger Strasse 22, 81675, Munich, Germany
| | - Benedikt J Schwaiger
- Department of Neuroradiology, Klinikum Rechts Der Isar, School of Medicine, Technical University of Munich, Munich, Germany
| | - Klaus Woertler
- Department of Radiology, Klinikum Rechts Der Isar, School of Medicine, Technical University of Munich, Ismaninger Strasse 22, 81675, Munich, Germany
- Musculoskeletal Radiology Section, Klinikum Rechts Der Isar, School of Medicine, Technical University of Munich, Munich, Germany
| | - Dimitrios C Karampinos
- Department of Radiology, Klinikum Rechts Der Isar, School of Medicine, Technical University of Munich, Ismaninger Strasse 22, 81675, Munich, Germany
| | - Alexandra S Gersing
- Department of Radiology, Klinikum Rechts Der Isar, School of Medicine, Technical University of Munich, Ismaninger Strasse 22, 81675, Munich, Germany
- Department of Neuroradiology, University Hospital of Munich, LMU Munich, Munich, Germany
| |
Collapse
|
18
|
Jóźwik-Wabik P, Bernacki K, Popowicz A. Comparison of Training Strategies for Autoencoder-Based Monochromatic Image Denoising. SENSORS (BASEL, SWITZERLAND) 2023; 23:5538. [PMID: 37420705 PMCID: PMC10305082 DOI: 10.3390/s23125538] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/28/2023] [Revised: 06/07/2023] [Accepted: 06/09/2023] [Indexed: 07/09/2023]
Abstract
Monochromatic images are used mainly in cases where the intensity of the received signal is examined. The identification of the observed objects as well as the estimation of intensity emitted by them depends largely on the precision of light measurement in image pixels. Unfortunately, this type of imaging is often affected by noise, which significantly degrades the quality of the results. In order to reduce it, numerous deterministic algorithms are used, with Non-Local-Means and Block-Matching-3D being the most widespread and treated as the reference point of the current state-of-the-art. Our article focuses on the utilization of machine learning (ML) for the denoising of monochromatic images in multiple data availability scenarios, including those with no access to noise-free data. For this purpose, a simple autoencoder architecture was chosen and checked for various training approaches on two large and widely used image datasets: MNIST and CIFAR-10. The results show that the method of training as well as architecture and the similarity of images within the image dataset significantly affect the ML-based denoising. However, even without access to any clear data, the performance of such algorithms is frequently well above the current state-of-the-art; therefore, they should be considered for monochromatic image denoising.
Collapse
Affiliation(s)
| | | | - Adam Popowicz
- Faculty of Automatic Control, Electronics and Computer Science, Silesian University of Technology, Akademicka 16, 44-100 Gliwice, Poland; (P.J.-W.); (K.B.)
| |
Collapse
|
19
|
Ravi KS, Nandakumar G, Thomas N, Lim M, Qian E, Jimeno MM, Poojar P, Jin Z, Quarterman P, Srinivasan G, Fung M, Vaughan JT, Geethanath S. Accelerated MRI using intelligent protocolling and subject-specific denoising applied to Alzheimer's disease imaging. FRONTIERS IN NEUROIMAGING 2023; 2:1072759. [PMID: 37554641 PMCID: PMC10406274 DOI: 10.3389/fnimg.2023.1072759] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/17/2022] [Accepted: 03/15/2023] [Indexed: 08/10/2023]
Abstract
Magnetic Resonance Imaging (MR Imaging) is routinely employed in diagnosing Alzheimer's Disease (AD), which accounts for up to 60-80% of dementia cases. However, it is time-consuming, and protocol optimization to accelerate MR Imaging requires local expertise since each pulse sequence involves multiple configurable parameters that need optimization for contrast, acquisition time, and signal-to-noise ratio (SNR). The lack of this expertise contributes to the highly inefficient utilization of MRI services diminishing their clinical value. In this work, we extend our previous effort and demonstrate accelerated MRI via intelligent protocolling of the modified brain screen protocol, referred to as the Gold Standard (GS) protocol. We leverage deep learning-based contrast-specific image-denoising to improve the image quality of data acquired using the accelerated protocol. Since the SNR of MR acquisitions depends on the volume of the object being imaged, we demonstrate subject-specific (SS) image-denoising. The accelerated protocol resulted in a 1.94 × gain in imaging throughput. This translated to a 72.51% increase in MR Value-defined in this work as the ratio of the sum of median object-masked local SNR values across all contrasts to the protocol's acquisition duration. We also computed PSNR, local SNR, MS-SSIM, and variance of the Laplacian values for image quality evaluation on 25 retrospective datasets. The minimum/maximum PSNR gains (measured in dB) were 1.18/11.68 and 1.04/13.15, from the baseline and SS image-denoising models, respectively. MS-SSIM gains were: 0.003/0.065 and 0.01/0.066; variance of the Laplacian (lower is better): 0.104/-0.135 and 0.13/-0.143. The GS protocol constitutes 44.44% of the comprehensive AD imaging protocol defined by the European Prevention of Alzheimer's Disease project. Therefore, we also demonstrate the potential for AD-imaging via automated volumetry of relevant brain anatomies. We performed statistical analysis on these volumetric measurements of the hippocampus and amygdala from the GS and accelerated protocols, and found that 27 locations were in excellent agreement. In conclusion, accelerated brain imaging with the potential for AD imaging was demonstrated, and image quality was recovered post-acquisition using DL-based image denoising models.
Collapse
Affiliation(s)
- Keerthi Sravan Ravi
- Department of Biomedical Engineering, Columbia University in the City of New York, New York, NY, United States
- Columbia University Magnetic Resonance Research Center, Columbia University in the City of New York, New York, NY, United States
| | | | | | | | - Enlin Qian
- Department of Biomedical Engineering, Columbia University in the City of New York, New York, NY, United States
- Columbia University Magnetic Resonance Research Center, Columbia University in the City of New York, New York, NY, United States
| | - Marina Manso Jimeno
- Department of Biomedical Engineering, Columbia University in the City of New York, New York, NY, United States
- Columbia University Magnetic Resonance Research Center, Columbia University in the City of New York, New York, NY, United States
| | - Pavan Poojar
- Department of Diagnostic, Molecular and Interventional Radiology, Accessible MRI Laboratory, Biomedical Engineering and Imaging Institute, Icahn School of Medicine at Mt. Sinai, New York, NY, United States
| | - Zhezhen Jin
- Mailman School of Public Health, Columbia University in the City of New York, New York, NY, United States
| | | | | | - Maggie Fung
- MR Clinical Solutions, GE Healthcare, New York, NY, United States
| | - John Thomas Vaughan
- Department of Biomedical Engineering, Columbia University in the City of New York, New York, NY, United States
- Columbia University Magnetic Resonance Research Center, Columbia University in the City of New York, New York, NY, United States
| | - Sairam Geethanath
- Columbia University Magnetic Resonance Research Center, Columbia University in the City of New York, New York, NY, United States
- Department of Diagnostic, Molecular and Interventional Radiology, Accessible MRI Laboratory, Biomedical Engineering and Imaging Institute, Icahn School of Medicine at Mt. Sinai, New York, NY, United States
| |
Collapse
|
20
|
Cheng H, Li H, Qiu H, Wu Q, Zhang X, Meng F, Ngan KN. Unsupervised Visual Representation Learning via Multi-Dimensional Relationship Alignment. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2023; 32:1613-1626. [PMID: 37027594 DOI: 10.1109/tip.2023.3246801] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Recently, contrastive learning based on augmentation invariance and instance discrimination has made great achievements, owing to its excellent ability to learn beneficial representations without any manual annotations. However, the natural similarity among instances conflicts with instance discrimination which treats each instance as a unique individual. In order to explore the natural relationship among instances and integrate it into contrastive learning, we propose a novel approach in this paper, Relationship Alignment (RA for abbreviation), which forces different augmented views of current batch instances to main a consistent relationship with other instances. In order to perform RA effectively in existing contrastive learning framework, we design an alternating optimization algorithm where the relationship exploration step and alignment step are optimized respectively. In addition, we add an equilibrium constraint for RA to avoid the degenerate solution, and introduce the expansion handler to make it approximately satisfied in practice. In order to better capture the complex relationship among instances, we additionally propose Multi-Dimensional Relationship Alignment (MDRA for abbreviation), which aims to explore the relationship from multiple dimensions. In practice, we decompose the final high-dimensional feature space into a cartesian product of several low-dimensional subspaces and perform RA in each subspace respectively. We validate the effectiveness of our approach on multiple self-supervised learning benchmarks and get consistent improvements compared with current popular contrastive learning methods. On the most commonly used ImageNet linear evaluation protocol, our RA obtains significant improvements over other methods, our MDRA gets further improvements based on RA to achieve the best performance. The source code of our approach will be released soon.
Collapse
|
21
|
Wang W, Wen F, Yan Z, Liu P. Optimal Transport for Unsupervised Denoising Learning. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2023; 45:2104-2118. [PMID: 35471875 DOI: 10.1109/tpami.2022.3170155] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Recently, much progress has been made in unsupervised denoising learning. However, existing methods more or less rely on some assumptions on the signal and/or degradation model, which limits their practical performance. How to construct an optimal criterion for unsupervised denoising learning without any prior knowledge on the degradation model is still an open question. Toward answering this question, this work proposes a criterion for unsupervised denoising learning based on the optimal transport theory. This criterion has favorable properties, e.g., approximately maximal preservation of the information of the signal, whilst achieving perceptual reconstruction. Furthermore, though a relaxed unconstrained formulation is used in practical implementation, we prove that the relaxed formulation in theory has the same solution as the original constrained formulation. Experiments on synthetic and real-world data, including realistic photographic, microscopy, depth, and raw depth images, demonstrate that the proposed method even compares favorably with supervised methods, e.g., approaching the PSNR of supervised methods while having better perceptual quality. Particularly, for spatially correlated noise and realistic microscopy images, the proposed method not only achieves better perceptual quality but also has higher PSNR than supervised methods. Besides, it shows remarkable superiority in harsh practical conditions with complex noise, e.g., raw depth images. Code is available at https://github.com/wangweiSJTU/OTUR.
Collapse
|
22
|
Yang JS, Jeon SY, Choi JH. Acquisition of a single grid-based phase-contrast X-ray image using instantaneous frequency and noise filtering. Biomed Eng Online 2022; 21:92. [PMID: 36575491 PMCID: PMC9793636 DOI: 10.1186/s12938-022-01061-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Accepted: 12/20/2022] [Indexed: 12/28/2022] Open
Abstract
BACKGROUND To obtain phase-contrast X-ray images, single-grid imaging systems are effective, but Moire artifacts remain a significant issue. The solution for removing Moire artifacts from an image is grid rotation, which can distinguish between these artifacts and sample information within the Fourier space. However, the mechanical movement of grid rotation is slower than the real-time change in Moire artifacts. Thus, Moire artifacts generated during real-time imaging cannot be removed using grid rotation. To overcome this problem, we propose an effective method to obtain phase-contrast X-ray images using instantaneous frequency and noise filtering. RESULT The proposed phase-contrast X-ray image using instantaneous frequency and noise filtering effectively suppressed noise with Moire patterns. The proposed method also preserved the clear edge of the inner and outer boundaries and internal anatomical information from the biological sample, outperforming conventional Fourier analysis-based methods, including absorption, scattering, and phase-contrast X-ray images. In particular, when comparing the phase information for the proposed method with the x-axis gradient image from the absorption image, the proposed method correctly distinguished two different types of soft tissue and the detailed information, while the latter method did not. CONCLUSION This study successfully achieved a significant improvement in image quality for phase-contrast X-ray images using instantaneous frequency and noise filtering. This study can provide a foundation for real-time bio-imaging research using three-dimensional computed tomography.
Collapse
Affiliation(s)
- Jae-Suk Yang
- grid.255649.90000 0001 2171 7754Division of Mechanical and Biomedical Engineering, Graduate Program in System Health Science and Engineering, Ewha Womans University, Seoul, 03760 Republic of Korea
| | - Sun-Young Jeon
- grid.255649.90000 0001 2171 7754Division of Mechanical and Biomedical Engineering, Graduate Program in System Health Science and Engineering, Ewha Womans University, Seoul, 03760 Republic of Korea
| | - Jang-Hwan Choi
- grid.255649.90000 0001 2171 7754Division of Mechanical and Biomedical Engineering, Graduate Program in System Health Science and Engineering, Ewha Womans University, Seoul, 03760 Republic of Korea
| |
Collapse
|
23
|
Wang J, Liu P, Wen F. Self-Supervised Learning for RGB-Guided Depth Enhancement by Exploiting the Dependency Between RGB and Depth. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2022; 32:159-174. [PMID: 37015523 DOI: 10.1109/tip.2022.3226419] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Due to the imaging mechanism of time-of-flight (ToF) sensors, the captured depth images usually suffer from severe noise and degradation. Though many RGB-guided methods have been proposed for depth image enhancement in the past few years, yet the enhancement performance on real-world depth images is still largely unsatisfactory. Two main reasons are the complexity of realistic noise and degradation in depth images, and the difficulty in collecting noise-clean pairs for supervised enhancement learning. This work aims to develop a self-supervised learning method for RGB-guided depth image enhancement, which does not require any noisy-clean pairs but can significantly boost the enhancement performance on real-world noisy depth images. To this end, we exploit the dependency between RGB and depth images to self-supervise the learning of the enhancement model. It is achieved by maximizing the cross-modal dependency between RGB and depth to promote the enhanced depth having dependency with the RGB of the same scene as much as possible. Furthermore, we augment the cross-modal dependency maximization formulation based on the optimal transport theory to achieve further performance improvement. Experimental results on both synthetic and real-world data demonstrate that our method can significantly outperform existing state-of-the-art methods on depth denoising, multi-path interference suppression, and hole filling. Particularly, our method shows remarkable superiority over existing ones on real-world data in handling various realistic complex degradation. Code is available at https://github.com/wjcyt/SRDE.
Collapse
|
24
|
Image denoising in the deep learning era. Artif Intell Rev 2022. [DOI: 10.1007/s10462-022-10305-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
25
|
Isotropic reconstruction for electron tomography with deep learning. Nat Commun 2022; 13:6482. [PMID: 36309499 PMCID: PMC9617606 DOI: 10.1038/s41467-022-33957-8] [Citation(s) in RCA: 128] [Impact Index Per Article: 42.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2021] [Accepted: 10/05/2022] [Indexed: 12/25/2022] Open
Abstract
Cryogenic electron tomography (cryoET) allows visualization of cellular structures in situ. However, anisotropic resolution arising from the intrinsic "missing-wedge" problem has presented major challenges in visualization and interpretation of tomograms. Here, we have developed IsoNet, a deep learning-based software package that iteratively reconstructs the missing-wedge information and increases signal-to-noise ratio, using the knowledge learned from raw tomograms. Without the need for sub-tomogram averaging, IsoNet generates tomograms with significantly reduced resolution anisotropy. Applications of IsoNet to three representative types of cryoET data demonstrate greatly improved structural interpretability: resolving lattice defects in immature HIV particles, establishing architecture of the paraflagellar rod in Eukaryotic flagella, and identifying heptagon-containing clathrin cages inside a neuronal synapse of cultured cells. Therefore, by overcoming two fundamental limitations of cryoET, IsoNet enables functional interpretation of cellular tomograms without sub-tomogram averaging. Its application to high-resolution cellular tomograms should also help identify differently oriented complexes of the same kind for sub-tomogram averaging.
Collapse
|
26
|
Özdemir Ö, Sönmez EB. Attention mechanism and mixup data augmentation for classification of COVID-19 Computed Tomography images. JOURNAL OF KING SAUD UNIVERSITY. COMPUTER AND INFORMATION SCIENCES 2022; 34:6199-6207. [PMID: 38620953 PMCID: PMC8280602 DOI: 10.1016/j.jksuci.2021.07.005] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/22/2021] [Revised: 07/01/2021] [Accepted: 07/07/2021] [Indexed: 12/21/2022]
Abstract
The Coronavirus disease is quickly spreading all over the world and the emergency situation is still out of control. Latest achievements of deep learning algorithms suggest the use of deep Convolutional Neural Network to implement a computer-aided diagnostic system for automatic classification of COVID-19 CT images. In this paper, we propose to employ a feature-wise attention layer in order to enhance the discriminative features obtained by convolutional networks. Moreover, the original performance of the network has been improved using the mixup data augmentation technique. This work compares the proposed attention-based model against the stacked attention networks, and traditional versus mixup data augmentation approaches. We deduced that feature-wise attention extension, while outperforming the stacked attention variants, achieves remarkable improvements over the baseline convolutional neural networks. That is, ResNet50 architecture extended with a feature-wise attention layer obtained 95.57% accuracy score, which, to best of our knowledge, fixes the state-of-the-art in the challenging COVID-CT dataset.
Collapse
Affiliation(s)
- Özgür Özdemir
- Computer Engineering Department, Istanbul Bilgi University, Turkey
| | | |
Collapse
|
27
|
Chen Z, Jiang Y, Liu D, Wang Z. CERL: A Unified Optimization Framework for Light Enhancement With Realistic Noise. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2022; PP:4162-4172. [PMID: 35700251 DOI: 10.1109/tip.2022.3180213] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Low-light images captured in the real world are inevitably corrupted by sensor noise. Such noise is spatially variant and highly dependent on the underlying pixel intensity, deviating from the oversimplified assumptions in conventional denoising. Existing light enhancement methods either overlook the important impact of real-world noise during enhancement, or treat noise removal as a separate pre- or post-processing step. We present Coordinated Enhancement for Real-world Low-light Noisy Images (CERL), that seamlessly integrates light enhancement and noise suppression parts into a unified and physics-grounded optimization framework. For the real low-light noise removal part, we customize a self-supervised denoising model that can easily be adapted without referring to clean ground-truth images. For the light enhancement part, we also improve the design of a state-of-the-art backbone. The two parts are then joint formulated into one principled plug-and-play optimization. Our approach is compared against state-of-the-art low-light enhancement methods both qualitatively and quantitatively. Besides standard benchmarks, we further collect and test on a new realistic low-light mobile photography dataset (RLMP), whose mobile-captured photos display heavier realistic noise than those taken by high-quality cameras. CERL consistently produces the most visually pleasing and artifact-free results across all experiments. Our RLMP dataset and codes are available at: https://github.com/VITA-Group/CERL.
Collapse
|
28
|
Huang T, Li S, Jia X, Lu H, Liu J. Neighbor2Neighbor: A Self-Supervised Framework for Deep Image Denoising. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2022; 31:4023-4038. [PMID: 35679376 DOI: 10.1109/tip.2022.3176533] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
In recent years, image denoising has benefited a lot from deep neural networks. However, these models need large amounts of noisy-clean image pairs for supervision. Although there have been attempts in training denoising networks with only noisy images, existing self-supervised algorithms suffer from inefficient network training, heavy computational burden, or dependence on noise modeling. In this paper, we proposed a self-supervised framework named Neighbor2Neighbor for deep image denoising. We develop a theoretical motivation and prove that by designing specific samplers for training image pairs generation from only noisy images, we can train a self-supervised denoising network similar to the network trained with clean images supervision. Besides, we propose a regularizer in the perspective of optimization to narrow the optimization gap between the self-supervised denoiser and the supervised denoiser. We present a very simple yet effective self-supervised training scheme based on the theoretical understandings: training image pairs are generated by random neighbor sub-samplers, and denoising networks are trained with a regularized loss. Moreover, we propose a training strategy named BayerEnsemble to adapt the Neighbor2Neighbor framework in raw image denoising. The proposed Neighbor2Neighbor framework can enjoy the progress of state-of-the-art supervised denoising networks in network architecture design. It also avoids heavy dependence on the assumption of the noise distribution. We evaluate the Neighbor2Neighbor framework through extensive experiments, including synthetic experiments with different noise distributions and real-world experiments under various scenarios. The code is available online: https://github.com/TaoHuang2018/Neighbor2Neighbor.
Collapse
|
29
|
Lee K, Jeong WK. Noise2Kernel: Adaptive Self-Supervised Blind Denoising Using a Dilated Convolutional Kernel Architecture. SENSORS 2022; 22:s22114255. [PMID: 35684882 PMCID: PMC9185435 DOI: 10.3390/s22114255] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/22/2022] [Revised: 05/27/2022] [Accepted: 05/29/2022] [Indexed: 02/04/2023]
Abstract
With the advent of unsupervised learning, efficient training of a deep network for image denoising without pairs of noisy and clean images has become feasible. Most current unsupervised denoising methods are built on self-supervised loss with the assumption of zero-mean noise under the signal-independent condition, which causes brightness-shifting artifacts on unconventional noise statistics (i.e., different from commonly used noise models). Moreover, most blind denoising methods require a random masking scheme for training to ensure the invariance of the denoising process. In this study, we propose a dilated convolutional network that satisfies an invariant property, allowing efficient kernel-based training without random masking. We also propose an adaptive self-supervision loss to increase the tolerance for unconventional noise, which is specifically effective in removing salt-and-pepper or hybrid noise where prior knowledge of noise statistics is not readily available. We demonstrate the efficacy of the proposed method by comparing it with state-of-the-art denoising methods using various examples.
Collapse
|
30
|
Liu S, Schniter P, Ahmad R. MRI RECOVERY WITH A SELF-CALIBRATED DENOISER. PROCEEDINGS OF THE ... IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING. ICASSP (CONFERENCE) 2022; 2022:1351-1355. [PMID: 35645618 PMCID: PMC9134859 DOI: 10.1109/icassp43922.2022.9746785] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
Plug-and-play (PnP) methods that employ application-specific denoisers have been proposed to solve inverse problems, including MRI reconstruction. However, training application-specific denoisers is not feasible for many applications due to the lack of training data. In this work, we propose a PnP-inspired recovery method that does not require data beyond the single, incomplete set of measurements. The proposed self-supervised method, called recovery with a self-calibrated denoiser (ReSiDe), trains the denoiser from the patches of the image being recovered. The denoiser training and a call to the denoising subroutine are performed in each iteration of a PnP algorithm, leading to a progressive refinement of the reconstructed image. For validation, we compare ReSiDe with a compressed sensing-based method and a PnP method with BM3D denoising using single-coil MRI brain data.
Collapse
Affiliation(s)
- Sizhuo Liu
- Department of Biomedical Engineering, Ohio State University, Columbus OH, 43210, USA
| | - Philip Schniter
- Department of Electrical and Computer Engineering, Ohio State University, Columbus OH, 43210, USA
| | - Rizwan Ahmad
- Department of Biomedical Engineering, Ohio State University, Columbus OH, 43210, USA
| |
Collapse
|
31
|
Cao W. Image Semantic Analysis in Visual Media Art Using Machine Learning and Neural Machine Translation. ACM T ASIAN LOW-RESO 2022. [DOI: 10.1145/3522576] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
The current archaeological work in China has the problems of high cost, large material consumption, more attention on human detection and long time-consuming. It is urgent to use modern high-precision detection technology for auxiliary work. This exploration will also use the semantic recognition system based on deep learning and neural network for the recognition of oracle bone inscriptions in archaeology. Therefore, combined with the concept of multimedia semantic recognition and analysis, a unified real-time target detection semantic analysis model named You Only Look Once (YOLOv2) is established based on the deep convolutional neural network under deep learning in the field of machine learning, to test the semantic analysis of oracle bone inscriptions. Moreover, Faster Region-Convolutional Neural Network (Faster R-CNN) and traditional YOLO models are selected to conduct the controlled experiments. A YOLOv2 recognition system is established based on Diffusion-Convolutional Neural Networks (DCNN) under deep learning. First, the concept and performance of DCNN are studied. Next, the basic information of oracle bone inscriptions is analyzed. A recognition system based on DCNN is established. On the premise that the three models can identify different directions of the same oracle bone inscriptions sample, the detection accuracy and detection loss value of YOLOv2 are better than those of the other two models, the detection accuracy is as high as 0.90, and the loss value is less than 0.10. Therefore, it is considered that this YOLOv2 semantic analysis model can be applied in oracle bone inscriptions and other archaeological work, which improves the work efficiency and simplifies the human work items for the domestic archaeological work. This semantic analysis model is of great help to the pattern recognition of cultural relics in archaeological work, and can help professionals analyze the meaning of patterns faster when there are massive similar oracle bone inscriptions.
Collapse
Affiliation(s)
- Weiran Cao
- Zhejiang University City College, Hangzhou, 310000, China
| |
Collapse
|
32
|
Self-Supervised Denoising Image Filter Based on Recursive Deep Neural Network Structure. SENSORS 2021; 21:s21237827. [PMID: 34883829 PMCID: PMC8659654 DOI: 10.3390/s21237827] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/07/2021] [Revised: 11/11/2021] [Accepted: 11/20/2021] [Indexed: 12/02/2022]
Abstract
The purpose of this paper is to propose a novel noise removal method based on deep neural networks that can remove various types of noise without paired noisy and clean data. Because this type of filter generally has relatively poor performance, the proposed noise-to-blur-estimated clean (N2BeC) model introduces a stage-dependent loss function and a recursive learning stage for improved denoised image quality. The proposed loss function regularizes the existing loss function so that the proposed model can better learn image details. Moreover, the recursive learning stage provides the proposed model with an additional opportunity to learn image details. The overall deep neural network consists of three learning stages and three corresponding loss functions. We determine the essential hyperparameters via several simulations. Consequently, the proposed model showed more than 1 dB superior performance compared with the existing noise-to-blur model.
Collapse
|
33
|
Moreno López M, Frederick JM, Ventura J. Evaluation of MRI Denoising Methods Using Unsupervised Learning. Front Artif Intell 2021; 4:642731. [PMID: 34151253 PMCID: PMC8212039 DOI: 10.3389/frai.2021.642731] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2020] [Accepted: 05/17/2021] [Indexed: 11/13/2022] Open
Abstract
In this paper we evaluate two unsupervised approaches to denoise Magnetic Resonance Images (MRI) in the complex image space using the raw information that k-space holds. The first method is based on Stein’s Unbiased Risk Estimator, while the second approach is based on a blindspot network, which limits the network’s receptive field. Both methods are tested on two different datasets, one containing real knee MRI and the other consists of synthetic brain MRI. These datasets contain information about the complex image space which will be used for denoising purposes. Both networks are compared against a state-of-the-art algorithm, Non-Local Means (NLM) using quantitative and qualitative measures. For most given metrics and qualitative measures, both networks outperformed NLM, and they prove to be reliable denoising methods.
Collapse
Affiliation(s)
- Marc Moreno López
- Department of Computer Science, University of Colorado Colorado Springs, Colorado Springs, CO, United States
| | - Joshua M Frederick
- Department of Computer Science and Software Engineering, California Polytechnic State University, San Luis Obispo, CA, United States
| | - Jonathan Ventura
- Department of Computer Science and Software Engineering, California Polytechnic State University, San Luis Obispo, CA, United States
| |
Collapse
|
34
|
Weakly-supervised progressive denoising with unpaired CT images. Med Image Anal 2021; 71:102065. [PMID: 33915472 DOI: 10.1016/j.media.2021.102065] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2021] [Revised: 03/16/2021] [Accepted: 03/30/2021] [Indexed: 12/12/2022]
Abstract
Although low-dose CT imaging has attracted a great interest due to its reduced radiation risk to the patients, it suffers from severe and complex noise. Recent fully-supervised methods have shown impressive performances on CT denoising task. However, they require a huge amount of paired normal-dose and low-dose CT images, which is generally unavailable in real clinical practice. To address this problem, we propose a weakly-supervised denoising framework that generates paired original and noisier CT images from unpaired CT images using a physics-based noise model. Our denoising framework also includes a progressive denoising module that bypasses the challenges of mapping from low-dose to normal-dose CT images directly via progressively compensating the small noise gap. To quantitatively evaluate diagnostic image quality, we present the noise power spectrum and signal detection accuracy, which are well correlated with the visual inspection. The experimental results demonstrate that our method achieves remarkable performances, even superior to fully-supervised CT denoising with respect to the signal detectability. Moreover, our framework increases the flexibility in data collection, allowing us to utilize any unpaired data at any dose levels.
Collapse
|
35
|
Wu YH, Gao SH, Mei J, Xu J, Fan DP, Zhang RG, Cheng MM. JCS: An Explainable COVID-19 Diagnosis System by Joint Classification and Segmentation. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2021; 30:3113-3126. [PMID: 33600316 DOI: 10.1109/tip.2021.3058783] [Citation(s) in RCA: 162] [Impact Index Per Article: 40.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/19/2023]
Abstract
Recently, the coronavirus disease 2019 (COVID-19) has caused a pandemic disease in over 200 countries, influencing billions of humans. To control the infection, identifying and separating the infected people is the most crucial step. The main diagnostic tool is the Reverse Transcription Polymerase Chain Reaction (RT-PCR) test. Still, the sensitivity of the RT-PCR test is not high enough to effectively prevent the pandemic. The chest CT scan test provides a valuable complementary tool to the RT-PCR test, and it can identify the patients in the early-stage with high sensitivity. However, the chest CT scan test is usually time-consuming, requiring about 21.5 minutes per case. This paper develops a novel Joint Classification and Segmentation (JCS) system to perform real-time and explainable COVID- 19 chest CT diagnosis. To train our JCS system, we construct a large scale COVID- 19 Classification and Segmentation (COVID-CS) dataset, with 144,167 chest CT images of 400 COVID- 19 patients and 350 uninfected cases. 3,855 chest CT images of 200 patients are annotated with fine-grained pixel-level labels of opacifications, which are increased attenuation of the lung parenchyma. We also have annotated lesion counts, opacification areas, and locations and thus benefit various diagnosis aspects. Extensive experiments demonstrate that the proposed JCS diagnosis system is very efficient for COVID-19 classification and segmentation. It obtains an average sensitivity of 95.0% and a specificity of 93.0% on the classification test set, and 78.5% Dice score on the segmentation test set of our COVID-CS dataset. The COVID-CS dataset and code are available at https://github.com/yuhuan-wu/JCS.
Collapse
|