1
|
Salimi Y, Mansouri Z, Sun C, Sanaat A, Yazdanpanah M, Shooli H, Nkoulou R, Boudabbous S, Zaidi H. Deep learning-based segmentation of ultra-low-dose CT images using an optimized nnU-Net model. LA RADIOLOGIA MEDICA 2025:10.1007/s11547-025-01989-x. [PMID: 40100539 DOI: 10.1007/s11547-025-01989-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/11/2024] [Accepted: 02/25/2025] [Indexed: 03/20/2025]
Abstract
PURPOSE Low-dose CT protocols are widely used for emergency imaging, follow-ups, and attenuation correction in hybrid PET/CT and SPECT/CT imaging. However, low-dose CT images often suffer from reduced quality depending on acquisition and patient attenuation parameters. Deep learning (DL)-based organ segmentation models are typically trained on high-quality images, with limited dedicated models for noisy CT images. This study aimed to develop a DL pipeline for organ segmentation on ultra-low-dose CT images. MATERIALS AND METHODS 274 CT raw datasets were reconstructed using Siemens ReconCT software with ADMIRE iterative algorithm, generating full-dose (FD-CT) and simulated low-dose (LD-CT) images at 1%, 2%, 5%, and 10% of the original tube current. Existing FD-nnU-Net models segmented 22 organs on FD-CT images, serving as reference masks for training new LD-nnU-Net models using LD-CT images. Three models were trained for bony tissue (6 organs), soft-tissue (15 organs), and body contour segmentation. The segmented masks from LD-CT were compared to FD-CT as standard of reference. External datasets with actual LD-CT images were also segmented and compared. RESULTS FD-nnU-Net performance declined with reduced radiation dose, especially below 10% (5 mAs). LD-nnU-Net achieved average Dice scores of 0.937 ± 0.049 (bony tissues), 0.905 ± 0.117 (soft-tissues), and 0.984 ± 0.023 (body contour). LD models outperformed FD models on external datasets. CONCLUSION Conventional FD-nnU-Net models performed poorly on LD-CT images. Dedicated LD-nnU-Net models demonstrated superior performance across cross-validation and external evaluations, enabling accurate segmentation of ultra-low-dose CT images. The trained models are available on our GitHub page.
Collapse
Affiliation(s)
- Yazdan Salimi
- Division of Nuclear Medicine and Molecular Imaging, Geneva University Hospital, CH-1211, Geneva, Switzerland
| | - Zahra Mansouri
- Division of Nuclear Medicine and Molecular Imaging, Geneva University Hospital, CH-1211, Geneva, Switzerland
| | - Chang Sun
- Division of Nuclear Medicine and Molecular Imaging, Geneva University Hospital, CH-1211, Geneva, Switzerland
- School of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing, 100876, China
| | - Amirhossein Sanaat
- Division of Nuclear Medicine and Molecular Imaging, Geneva University Hospital, CH-1211, Geneva, Switzerland
| | | | - Hossein Shooli
- Department of Radiology, Bushehr University of Medical Sciences, Bushehr, Iran
| | - René Nkoulou
- Division of Nuclear Medicine and Molecular Imaging, Geneva University Hospital, CH-1211, Geneva, Switzerland
| | - Sana Boudabbous
- Division of Radiology, Geneva University Hospital, CH-1211, Geneva, Switzerland
| | - Habib Zaidi
- Division of Nuclear Medicine and Molecular Imaging, Geneva University Hospital, CH-1211, Geneva, Switzerland.
- Department of Nuclear Medicine and Molecular Imaging, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands.
- Department of Nuclear Medicine, University of Southern Denmark, Odense, Denmark.
- University Research and Innovation Center, Óbuda University, Budapest, Hungary.
| |
Collapse
|
2
|
Chu Y, Luo G, Zhou L, Cao S, Ma G, Meng X, Zhou J, Yang C, Xie D, Mu D, Henao R, Setti G, Xiao X, Wu L, Qiu Z, Gao X. Deep learning-driven pulmonary artery and vein segmentation reveals demography-associated vasculature anatomical differences. Nat Commun 2025; 16:2262. [PMID: 40050617 PMCID: PMC11885638 DOI: 10.1038/s41467-025-56505-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2024] [Accepted: 01/20/2025] [Indexed: 03/09/2025] Open
Abstract
Pulmonary artery-vein segmentation is critical for disease diagnosis and surgical planning. Traditional methods rely on Computed Tomography Pulmonary Angiography (CTPA), which requires contrast agents with potential health risks. Non-contrast CT, a safer and more widely available approach, however, has long been considered impossible for this task. Here we propose High-abundant Pulmonary Artery-vein Segmentation (HiPaS), enabling accurate segmentation across both non-contrast CT and CTPA at multiple resolutions. HiPaS integrates spatial normalization with an iterative segmentation strategy, leveraging lower-level vessel segmentations as priors for higher-level segmentations. Trained on a multi-center dataset comprising 1073 CT volumes with manual annotations, HiPaS achieves superior performance (dice score: 91.8%, sensitivity: 98.0%) and demonstrates non-inferiority on non-contrast CT compared to CTPA. Furthermore, HiPaS enables large-scale analysis of 11,784 participants, revealing associations between vessel abundance and sex, age, and diseases, under lung-volume control. HiPaS represents a promising, non-invasive approach for clinical diagnostics and anatomical research.
Collapse
Affiliation(s)
- Yuetan Chu
- Center of Excellence for Smart Health (KCSH), King Abdullah University of Science and Technology (KAUST), Thuwal, Kingdom of Saudi Arabia
- Computer Science Program, Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Kingdom of Saudi Arabia
- Center of Excellence on Generative AI, King Abdullah University of Science and Technology (KAUST), Thuwal, Kingdom of Saudi Arabia
| | - Gongning Luo
- Center of Excellence for Smart Health (KCSH), King Abdullah University of Science and Technology (KAUST), Thuwal, Kingdom of Saudi Arabia.
- Computer Science Program, Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Kingdom of Saudi Arabia.
- Center of Excellence on Generative AI, King Abdullah University of Science and Technology (KAUST), Thuwal, Kingdom of Saudi Arabia.
| | - Longxi Zhou
- Center of Excellence for Smart Health (KCSH), King Abdullah University of Science and Technology (KAUST), Thuwal, Kingdom of Saudi Arabia
- Computer Science Program, Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Kingdom of Saudi Arabia
- Center of Excellence on Generative AI, King Abdullah University of Science and Technology (KAUST), Thuwal, Kingdom of Saudi Arabia
| | - Shaodong Cao
- Department of Radiology, The Fourth Hospital of Harbin Medical University, Harbin, China
| | - Guolin Ma
- Department of Radiology, China-Japan Friendship Hospital, Beijing, China
| | - Xianglin Meng
- Department of Critical Care Medicine, The First Affiliated Hospital of Harbin Medical University, Harbin, China
| | - Juexiao Zhou
- Center of Excellence for Smart Health (KCSH), King Abdullah University of Science and Technology (KAUST), Thuwal, Kingdom of Saudi Arabia
- Computer Science Program, Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Kingdom of Saudi Arabia
- Center of Excellence on Generative AI, King Abdullah University of Science and Technology (KAUST), Thuwal, Kingdom of Saudi Arabia
| | - Changchun Yang
- Center of Excellence for Smart Health (KCSH), King Abdullah University of Science and Technology (KAUST), Thuwal, Kingdom of Saudi Arabia
- Computer Science Program, Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Kingdom of Saudi Arabia
- Center of Excellence on Generative AI, King Abdullah University of Science and Technology (KAUST), Thuwal, Kingdom of Saudi Arabia
| | - Dexuan Xie
- Department of Computer Tomography, The First Affiliated Hospital of Harbin Medical University, Harbin, China
| | - Dan Mu
- Department of Radiology, Affiliated Nanjing Drum Tower Hospital of Nanjing University Medical School, Nanjing, China
| | - Ricardo Henao
- Computer Science Program, Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Kingdom of Saudi Arabia
| | - Gianluca Setti
- Electrical and Computer Engineering Program, Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Kingdom of Saudi Arabia.
| | - Xigang Xiao
- Department of Computer Tomography, The First Affiliated Hospital of Harbin Medical University, Harbin, China.
| | - Lianming Wu
- Department of Radiology, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, China.
| | - Zhaowen Qiu
- College of Computer and Control Engineering, Northeast Forestry University, Harbin, China.
| | - Xin Gao
- Center of Excellence for Smart Health (KCSH), King Abdullah University of Science and Technology (KAUST), Thuwal, Kingdom of Saudi Arabia.
- Computer Science Program, Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Kingdom of Saudi Arabia.
- Center of Excellence on Generative AI, King Abdullah University of Science and Technology (KAUST), Thuwal, Kingdom of Saudi Arabia.
| |
Collapse
|
3
|
Shu Z, Entezari A. A nonlocal prior in iterative CT reconstruction. Med Phys 2025; 52:1436-1453. [PMID: 39620497 DOI: 10.1002/mp.17533] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2024] [Revised: 10/19/2024] [Accepted: 10/30/2024] [Indexed: 03/06/2025] Open
Abstract
BACKGROUND Computed tomography (CT) reconstruction problems are always framed as inverse problems, where the attenuation map of an imaged object is reconstructed from the sinogram measurement. In practice, these inverse problems are often ill-posed, especially under few-view and limited-angle conditions, which makes accurate reconstruction challenging. Existing solutions use regularizations such as total variation to steer reconstruction algorithms to the most plausible result. However, most prevalent regularizations rely on the same priors, such as piecewise constant prior, hindering their ability to collaborate effectively and further boost reconstruction precision. PURPOSE This study aims to overcome the aforementioned challenge a prior previously limited to discrete tomography. This enables more accurate reconstructions when the proposed method is used in conjunction with most existing regularizations as they utilize different priors. The improvements will be demonstrated through experiments conducted under various conditions. METHODS Inspired by the discrete algebraic reconstruction technique (DART) algorithm for discrete tomography, we find out that pixel grayscale values in CT images are not uniformly distributed and are actually highly clustered. Such discovery can be utilized as a powerful prior for CT reconstruction. In this paper, we leverage the collaborative filtering technique to enable the collaboration of the proposed prior and most existing regularizations, significantly enhancing the reconstruction accuracy. RESULTS Our experiments show that the proposed method can work with most existing regularizations and significantly improve the reconstruction quality. Such improvement is most pronounced under limited-angle and few-view conditions. Furthermore, the proposed regularization also has the potential for further improvement and can be utilized in other image reconstruction areas. CONCLUSIONS We propose improving the performance of iterative CT reconstruction algorithms by applying the collaborative filtering technique along with a prior based on the densely clustered distribution of pixel grayscale values in CT images. Our experimental results indicate that the proposed methodology consistently enhances reconstruction accuracy when used in conjunction with most existing regularizations, particularly under few-view and limited-angle conditions.
Collapse
Affiliation(s)
- Ziyu Shu
- Department of Radiation Oncology, Stony Brook University, New York, USA
| | | |
Collapse
|
4
|
Braure T, Lazaro D, Hateau D, Brandon V, Ginsburger K. Conditioning generative latent optimization for sparse-view computed tomography image reconstruction. J Med Imaging (Bellingham) 2025; 12:024004. [PMID: 40177097 PMCID: PMC11961077 DOI: 10.1117/1.jmi.12.2.024004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2024] [Revised: 02/03/2025] [Accepted: 03/09/2025] [Indexed: 04/05/2025] Open
Abstract
Purpose The issue of delivered doses during computed tomography (CT) scans encouraged sparser sets of X-ray projection, severely degrading reconstructions from conventional methods. Although most deep learning approaches benefit from large supervised datasets, they cannot generalize to new acquisition protocols (geometry, source/detector specifications). To address this issue, we developed a method working without training data and independently of experimental setups. In addition, our model may be initialized on small unsupervised datasets to enhance reconstructions. Approach We propose a conditioned generative latent optimization (cGLO) in which a decoder reconstructs multiple slices simultaneously with a shared objective. It is tested on full-dose sparse-view CT for varying projection sets: (a) without training data against Deep Image Prior (DIP) and (b) with training datasets of multiple sizes against state-of-the-art score-based generative models (SGMs). Peak signal-to-noise ratio (PSNR) and structural SIMilarity (SSIM) metrics are used to quantify reconstruction quality. Results cGLO demonstrates better SSIM than SGMs (between + 0.034 and + 0.139 ) and has an increasing advantage for smaller datasets reaching a + 6.06 dB PSNR gain. Our strategy also outperforms DIP with at least a + 1.52 dB PSNR advantage and peaks at + 3.15 dB with fewer angles. Moreover, cGLO does not create artifacts or structural deformations contrary to DIP and SGMs. Conclusions We propose a parsimonious and robust reconstruction technique offering similar to better performances when compared with state-of-the-art methods regarding full-dose sparse-view CT. Our strategy could be readily applied to any imaging reconstruction task without any assumption about the acquisition protocol or the quantity of available data.
Collapse
|
5
|
Zhang R, Szczykutowicz TP, Toia GV. Artificial Intelligence in Computed Tomography Image Reconstruction: A Review of Recent Advances. J Comput Assist Tomogr 2025:00004728-990000000-00429. [PMID: 40008975 DOI: 10.1097/rct.0000000000001734] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2024] [Accepted: 01/07/2025] [Indexed: 02/27/2025]
Abstract
The development of novel image reconstruction algorithms has been pivotal in enhancing image quality and reducing radiation dose in computed tomography (CT) imaging. Traditional techniques like filtered back projection perform well under ideal conditions but fail to generate high-quality images under low-dose, sparse-view, and limited-angle conditions. Iterative reconstruction methods improve upon filtered back projection by incorporating system models and assumptions about the patient, yet they can suffer from patchy image textures. The emergence of artificial intelligence (AI), particularly deep learning, has further advanced CT reconstruction. AI techniques have demonstrated great potential in reducing radiation dose while preserving image quality and noise texture. Moreover, AI has exhibited unprecedented performance in addressing challenging CT reconstruction problems, including low-dose CT, sparse-view CT, limited-angle CT, and interior tomography. This review focuses on the latest advances in AI-based CT reconstruction under these challenging conditions.
Collapse
Affiliation(s)
- Ran Zhang
- Departments of Radiology and Medical Physics, University of Wisconsin, Madison, WI
| | | | | |
Collapse
|
6
|
Du W, Cui H, He L, Chen H, Zhang Y, Yang H. Structure-aware diffusion for low-dose CT imaging. Phys Med Biol 2024; 69:155008. [PMID: 38942004 DOI: 10.1088/1361-6560/ad5d47] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2024] [Accepted: 06/28/2024] [Indexed: 06/30/2024]
Abstract
Reducing the radiation dose leads to the x-ray computed tomography (CT) images suffering from heavy noise and artifacts, which inevitably interferes with the subsequent clinic diagnostic and analysis. Leading works have explored diffusion models for low-dose CT imaging to avoid the structure degeneration and blurring effects of previous deep denoising models. However, most of them always begin their generative processes with Gaussian noise, which has little or no structure priors of the clean data distribution, thereby leading to long-time inference and unpleasant reconstruction quality. To alleviate these problems, this paper presents a Structure-Aware Diffusion model (SAD), an end-to-end self-guided learning framework for high-fidelity CT image reconstruction. First, SAD builds a nonlinear diffusion bridge between clean and degraded data distributions, which could directly learn the implicit physical degradation prior from observed measurements. Second, SAD integrates the prompt learning mechanism and implicit neural representation into the diffusion process, where rich and diverse structure representations extracted by degraded inputs are exploited as prompts, which provides global and local structure priors, to guide CT image reconstruction. Finally, we devise an efficient self-guided diffusion architecture using an iterative updated strategy, which further refines structural prompts during each generative step to drive finer image reconstruction. Extensive experiments on AAPM-Mayo and LoDoPaB-CT datasets demonstrate that our SAD could achieve superior performance in terms of noise removal, structure preservation, and blind-dose generalization, with few generative steps, even one step only.
Collapse
Affiliation(s)
- Wenchao Du
- College of Computer Science, Sichuan University, Chengdu 610065, People's Republic of China
| | - HuanHuan Cui
- West China Hospital of Sichuan University, Chengdu 610041, People's Republic of China
| | - LinChao He
- College of Computer Science, Sichuan University, Chengdu 610065, People's Republic of China
| | - Hu Chen
- College of Computer Science, Sichuan University, Chengdu 610065, People's Republic of China
| | - Yi Zhang
- College of Computer Science, Sichuan University, Chengdu 610065, People's Republic of China
| | - Hongyu Yang
- College of Computer Science, Sichuan University, Chengdu 610065, People's Republic of China
| |
Collapse
|
7
|
Wang F, Wang R, Qiu H. Low-dose CT reconstruction using dataset-free learning. PLoS One 2024; 19:e0304738. [PMID: 38875181 PMCID: PMC11178168 DOI: 10.1371/journal.pone.0304738] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2024] [Accepted: 05/16/2024] [Indexed: 06/16/2024] Open
Abstract
Low-Dose computer tomography (LDCT) is an ideal alternative to reduce radiation risk in clinical applications. Although supervised-deep-learning-based reconstruction methods have demonstrated superior performance compared to conventional model-driven reconstruction algorithms, they require collecting massive pairs of low-dose and norm-dose CT images for neural network training, which limits their practical application in LDCT imaging. In this paper, we propose an unsupervised and training data-free learning reconstruction method for LDCT imaging that avoids the requirement for training data. The proposed method is a post-processing technique that aims to enhance the initial low-quality reconstruction results, and it reconstructs the high-quality images by neural work training that minimizes the ℓ1-norm distance between the CT measurements and their corresponding simulated sinogram data, as well as the total variation (TV) value of the reconstructed image. Moreover, the proposed method does not require to set the weights for both the data fidelity term and the plenty term. Experimental results on the AAPM challenge data and LoDoPab-CT data demonstrate that the proposed method is able to effectively suppress the noise and preserve the tiny structures. Also, these results demonstrate the rapid convergence and low computational cost of the proposed method. The source code is available at https://github.com/linfengyu77/IRLDCT.
Collapse
Affiliation(s)
- Feng Wang
- College of Big Data and Software Engineering, Zhejiang Wanli University, Ningbo, Zhejiang, China
| | - Renfang Wang
- College of Big Data and Software Engineering, Zhejiang Wanli University, Ningbo, Zhejiang, China
| | - Hong Qiu
- College of Big Data and Software Engineering, Zhejiang Wanli University, Ningbo, Zhejiang, China
| |
Collapse
|
8
|
Meng M, Wang Y, Zhu M, Tao X, Mao Z, Liao J, Bian Z, Zeng D, Ma J. DDT-Net: Dose-Agnostic Dual-Task Transfer Network for Simultaneous Low-Dose CT Denoising and Simulation. IEEE J Biomed Health Inform 2024; 28:3613-3625. [PMID: 38478459 DOI: 10.1109/jbhi.2024.3376628] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/20/2024]
Abstract
Deep learning (DL) algorithms have achieved unprecedented success in low-dose CT (LDCT) imaging and are expected to be a new generation of CT reconstruction technology. However, most DL-based denoising models often lack the ability to generalize to unseen dose data. Moreover, most simulation tools for LDCT typically operate on proprietary projection data, which is generally not accessible without an established collaboration with CT manufacturers. To alleviate these issues, in this work, we propose a dose-agnostic dual-task transfer network, termed DDT-Net, for simultaneous LDCT denoising and simulation. Concretely, the dual-task learning module is constructed to integrate the LDCT denoising and simulation tasks into a unified optimization framework by learning the joint distribution of LDCT and NDCT data. We approximate the joint distribution of continuous dose level data by training DDT-Net with discrete dose data, which can be generalized to denoising and simulation of unseen dose data. In particular, the mixed-dose training strategy adopted by DDT-Net can promote the denoising performance of lower-dose data. The paired dataset simulated by DDT-Net can be used for data augmentation to further restore the tissue texture of LDCT images. Experimental results on synthetic data and clinical data show that the proposed DDT-Net outperforms competing methods in terms of denoising and generalization performance at unseen dose data, and it also provides a simulation tool that can quickly simulate realistic LDCT images at arbitrary dose levels.
Collapse
|
9
|
Li X, Jing K, Yang Y, Wang Y, Ma J, Zheng H, Xu Z. Noise-Generating and Imaging Mechanism Inspired Implicit Regularization Learning Network for Low Dose CT Reconstrution. IEEE TRANSACTIONS ON MEDICAL IMAGING 2024; 43:1677-1689. [PMID: 38145543 DOI: 10.1109/tmi.2023.3347258] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/27/2023]
Abstract
Low-dose computed tomography (LDCT) helps to reduce radiation risks in CT scanning while maintaining image quality, which involves a consistent pursuit of lower incident rays and higher reconstruction performance. Although deep learning approaches have achieved encouraging success in LDCT reconstruction, most of them treat the task as a general inverse problem in either the image domain or the dual (sinogram and image) domains. Such frameworks have not considered the original noise generation of the projection data and suffer from limited performance improvement for the LDCT task. In this paper, we propose a novel reconstruction model based on noise-generating and imaging mechanism in full-domain, which fully considers the statistical properties of intrinsic noises in LDCT and prior information in sinogram and image domains. To solve the model, we propose an optimization algorithm based on the proximal gradient technique. Specifically, we derive the approximate solutions of the integer programming problem on the projection data theoretically. Instead of hand-crafting the sinogram and image regularizers, we propose to unroll the optimization algorithm to be a deep network. The network implicitly learns the proximal operators of sinogram and image regularizers with two deep neural networks, providing a more interpretable and effective reconstruction procedure. Numerical results demonstrate our proposed method improvements of > 2.9 dB in peak signal to noise ratio, > 1.4% promotion in structural similarity metric, and > 9 HU decrements in root mean square error over current state-of-the-art LDCT methods.
Collapse
|
10
|
Han Y. Hierarchical decomposed dual-domain deep learning for sparse-view CT reconstruction. Phys Med Biol 2024; 69:085019. [PMID: 38457843 DOI: 10.1088/1361-6560/ad31c7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2023] [Accepted: 03/08/2024] [Indexed: 03/10/2024]
Abstract
Objective. X-ray computed tomography employing sparse projection views has emerged as a contemporary technique to mitigate radiation dose. However, due to the inadequate number of projection views, an analytic reconstruction method utilizing filtered backprojection results in severe streaking artifacts. Recently, deep learning (DL) strategies employing image-domain networks have demonstrated remarkable performance in eliminating the streaking artifact caused by analytic reconstruction methods with sparse projection views. Nevertheless, it is difficult to clarify the theoretical justification for applying DL to sparse view computed tomography (CT) reconstruction, and it has been understood as restoration by removing image artifacts, not reconstruction.Approach. By leveraging the theory of deep convolutional framelets (DCF) and the hierarchical decomposition of measurement, this research reveals the constraints of conventional image and projection-domain DL methodologies, subsequently, the research proposes a novel dual-domain DL framework utilizing hierarchical decomposed measurements. Specifically, the research elucidates how the performance of the projection-domain network can be enhanced through a low-rank property of DCF and a bowtie support of hierarchical decomposed measurement in the Fourier domain.Main results. This study demonstrated performance improvement of the proposed framework based on the low-rank property, resulting in superior reconstruction performance compared to conventional analytic and DL methods.Significance. By providing a theoretically justified DL approach for sparse-view CT reconstruction, this study not only offers a superior alternative to existing methods but also opens new avenues for research in medical imaging. It highlights the potential of dual-domain DL frameworks to achieve high-quality reconstructions with lower radiation doses, thereby advancing the field towards safer and more efficient diagnostic techniques. The code is available athttps://github.com/hanyoseob/HDD-DL-for-SVCT.
Collapse
Affiliation(s)
- Yoseob Han
- Department of Electronic Engineering, Soongsil University, Republic of Korea
- Department of Intelligent Semiconductors, Soongsil University, Republic of Korea
| |
Collapse
|
11
|
An R, Chen K, Li H. Self-supervised dual-domain balanced dropblock-network for low-dose CT denoising. Phys Med Biol 2024; 69:075026. [PMID: 38359449 DOI: 10.1088/1361-6560/ad29ba] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Accepted: 02/15/2024] [Indexed: 02/17/2024]
Abstract
Objective.Self-supervised learning methods have been successfully applied for low-dose computed tomography (LDCT) denoising, with the advantage of not requiring labeled data. Conventional self-supervised methods operate only in the image domain, ignoring valuable priors in the sinogram domain. Recently proposed dual-domain methods address this limitation but encounter issues with blurring artifacts in the reconstructed image due to the inhomogeneous distribution of noise levels in low-dose sinograms.Approach.To tackle this challenge, this paper proposes SDBDNet, an end-to-end dual-domain self-supervised method for LDCT denoising. With the network designed based on the properties of inhomogeneous noise in low-dose sinograms and the principle of moderate sinogram-domain denoising, SDBDNet achieves effective denoising in dual domains without introducing blurring artifacts. Specifically, we split the sinogram into two subsets based on the positions of detector cells to generate paired training data with high similarity and independent noise. These sub-sinograms are then restored to their original size using 1D interpolation and learning-based correction. To achieve adaptive and moderate smoothing in the sinogram domain, we integrate Dropblock, a type of convolution layer with regularization, into SDBDNet, and set a weighted average between the denoised sinograms and their noisy counterparts, leading to a well-balanced dual-domain approach.Main results.Numerical experiments show that our method outperforms popular non-learning and self-supervised learning methods, demonstrating its effectiveness and superior performance.Significance.While introducing a novel high-performance dual-domain self-supervised LDCT denoising method, this paper also emphasizes and verifies the importance of appropriate sinogram-domain denoising in dual-domain methods, which might inspire future work.
Collapse
Affiliation(s)
- Ran An
- School of Mathematical Sciences, Capital Normal University, Beijing, 100048, People's Republic of China
- Centre for Mathematical Imaging Techniques, University of Liverpool, Liverpool, L69 7ZL, United Kingdom
| | - Ke Chen
- Department of Mathematics and Statistics, University of Strathclyde, Glasgow, G1 1XQ, United Kingdom
| | - Hongwei Li
- School of Mathematical Sciences, Capital Normal University, Beijing, 100048, People's Republic of China
| |
Collapse
|
12
|
Song Q, Li X, Zhang M, Zhang X, Thanh DNH. APNet: Adaptive projection network for medical image denoising. JOURNAL OF X-RAY SCIENCE AND TECHNOLOGY 2024; 32:1-15. [PMID: 37927293 DOI: 10.3233/xst-230181] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/07/2023]
Abstract
BACKGROUND In clinical medicine, low-dose radiographic image noise reduces the quality of the detected image features and may have a negative impact on disease diagnosis. OBJECTIVE In this study, Adaptive Projection Network (APNet) is proposed to reduce noise from low-dose medical images. METHODS APNet is developed based on an architecture of the U-shaped network to capture multi-scale data and achieve end-to-end image denoising. To adaptively calibrate important features during information transmission, a residual block of the dual attention method throughout the encoding and decoding phases is integrated. A non-local attention module to separate the noise and texture of the image details by using image adaptive projection during the feature fusion. RESULTS To verify the effectiveness of APNet, experiments on lung CT images with synthetic noise are performed, and the results demonstrate that the proposed approach outperforms recent methods in both quantitative index and visual quality. In addition, the denoising experiment on the dental CT image is also carried out and it verifies that the network has a certain generalization. CONCLUSIONS The proposed APNet is an effective method that can reduce image noise and preserve the required image details in low-dose radiographic images.
Collapse
Affiliation(s)
- Qiyi Song
- Department of Endodontics and Periodontics, College of Stomatology, Dalian Medical University, Dalian, China
| | - Xiang Li
- Dalian Neusoft University of Information, Dalian, China
| | - Mingbao Zhang
- Dalian Neusoft University of Information, Dalian, China
| | - Xiangyi Zhang
- Dalian Neusoft University of Information, Dalian, China
| | - Dang N H Thanh
- College of Technology and Design, University of Economics Ho Chi Minh City, Ho Chi Minh City, Vietnam
| |
Collapse
|
13
|
Ikuta M, Zhang J. A Deep Convolutional Gated Recurrent Unit for CT Image Reconstruction. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2023; 34:10612-10625. [PMID: 35522637 DOI: 10.1109/tnnls.2022.3169569] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Computed tomography (CT) is one of the most important medical imaging technologies in use today. Most commercial CT products use a technique known as the filtered backprojection (FBP) that is fast and can produce decent image quality when an X-ray dose is high. However, the FBP is not good enough on low-dose X-ray CT imaging because the CT image reconstruction problem becomes more stochastic. A more effective reconstruction technique proposed recently and implemented in a limited number of CT commercial products is an iterative reconstruction (IR). The IR technique is based on a Bayesian formulation of the CT image reconstruction problem with an explicit model of the CT scanning, including its stochastic nature, and a prior model that incorporates our knowledge about what a good CT image should look like. However, constructing such prior knowledge is more complicated than it seems. In this article, we propose a novel neural network for CT image reconstruction. The network is based on the IR formulation and constructed with a recurrent neural network (RNN). Specifically, we transform the gated recurrent unit (GRU) into a neural network performing CT image reconstruction. We call it "GRU reconstruction." This neural network conducts concurrent dual-domain learning. Many deep learning (DL)-based methods in medical imaging are single-domain learning, but dual-domain learning performs better because it learns from both the sinogram and the image domain. In addition, we propose backpropagation through stage (BPTS) as a new RNN backpropagation algorithm. It is similar to the backpropagation through time (BPTT) of an RNN; however, it is tailored for iterative optimization. Results from extensive experiments indicate that our proposed method outperforms conventional model-based methods, single-domain DL methods, and state-of-the-art DL techniques in terms of the root mean squared error (RMSE), the peak signal-to-noise ratio (PSNR), and the structure similarity (SSIM) and in terms of visual appearance.
Collapse
|
14
|
Yu J, Zhang H, Zhang P, Zhu Y. Unsupervised learning-based dual-domain method for low-dose CT denoising. Phys Med Biol 2023; 68:185010. [PMID: 37567225 DOI: 10.1088/1361-6560/acefa2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2023] [Accepted: 08/10/2023] [Indexed: 08/13/2023]
Abstract
Objective. Low-dose CT (LDCT) is an important research topic in the field of CT imaging because of its ability to reduce radiation damage in clinical diagnosis. In recent years, deep learning techniques have been widely applied in LDCT imaging and a large number of denoising methods have been proposed. However, One major challenge of supervised deep learning-based methods is the exactly geometric pairing of datasets with different doses. Therefore, the aim of this study is to develop an unsupervised learning-based LDCT imaging method to address the aforementioned challenges.Approach. In this paper, we propose an unsupervised learning-based dual-domain method for LDCT denoising, which consists of two stages: the first stage is projection domain denoising, in which the unsupervised learning method Noise2Self is applied to denoise the projection data with statistically independent and zero-mean noise. The second stage is an iterative enhancement approach, which combines the prior information obtained from the generative model with an iterative reconstruction algorithm to enhance the details of the reconstructed image.Main results. Experimental results show that our proposed method outperforms the comparison method in terms of denoising effect. Particularly, in terms of SSIM, the denoised results obtained using our method achieve the highest SSIM.Significance. In conclusion, our unsupervised learning-based method can be a promising alternative to the traditional supervised methods for LDCT imaging, especially when the availability of the labeled datasets is limited.
Collapse
Affiliation(s)
- Jie Yu
- School of Mathematical Sciences, Capital Normal University, Beijing, 100048, People's Republic of China
| | - Huitao Zhang
- School of Mathematical Sciences, Capital Normal University, Beijing, 100048, People's Republic of China
- Shenzhen National Applied Mathematics Center, Southern University of Science and Technology, Shenzhen, 518055, People's Republic of China
| | - Peng Zhang
- School of Mathematical Sciences, Capital Normal University, Beijing, 100048, People's Republic of China
| | - Yining Zhu
- School of Mathematical Sciences, Capital Normal University, Beijing, 100048, People's Republic of China
- Shenzhen National Applied Mathematics Center, Southern University of Science and Technology, Shenzhen, 518055, People's Republic of China
| |
Collapse
|
15
|
Kiss MB, Coban SB, Batenburg KJ, van Leeuwen T, Lucka F. 2DeteCT - A large 2D expandable, trainable, experimental Computed Tomography dataset for machine learning. Sci Data 2023; 10:576. [PMID: 37666897 PMCID: PMC10477177 DOI: 10.1038/s41597-023-02484-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2023] [Accepted: 08/16/2023] [Indexed: 09/06/2023] Open
Abstract
Recent research in computational imaging largely focuses on developing machine learning (ML) techniques for image reconstruction, which requires large-scale training datasets consisting of measurement data and ground-truth images. However, suitable experimental datasets for X-ray Computed Tomography (CT) are scarce, and methods are often developed and evaluated only on simulated data. We fill this gap by providing the community with a versatile, open 2D fan-beam CT dataset suitable for developing ML techniques for a range of image reconstruction tasks. To acquire it, we designed a sophisticated, semi-automatic scan procedure that utilizes a highly-flexible laboratory X-ray CT setup. A diverse mix of samples with high natural variability in shape and density was scanned slice-by-slice (5,000 slices in total) with high angular and spatial resolution and three different beam characteristics: A high-fidelity, a low-dose and a beam-hardening-inflicted mode. In addition, 750 out-of-distribution slices were scanned with sample and beam variations to accommodate robustness and segmentation tasks. We provide raw projection data, reference reconstructions and segmentations based on an open-source data processing pipeline.
Collapse
Affiliation(s)
- Maximilian B Kiss
- Centrum Wiskunde & Informatica, Computational Imaging group, Amsterdam, 1098 XG, The Netherlands.
| | - Sophia B Coban
- Centrum Wiskunde & Informatica, Computational Imaging group, Amsterdam, 1098 XG, The Netherlands
- Department of Mathematics, University of Manchester, Oxford Road, Manchester, M13 9PL, United Kingdom
| | - K Joost Batenburg
- Centrum Wiskunde & Informatica, Computational Imaging group, Amsterdam, 1098 XG, The Netherlands
- Leiden University, LIACS, Leiden, 2300 RA, The Netherlands
| | - Tristan van Leeuwen
- Centrum Wiskunde & Informatica, Computational Imaging group, Amsterdam, 1098 XG, The Netherlands
- Utrecht University, Mathematical Institute, Utrecht, 3584 CD, The Netherlands
| | - Felix Lucka
- Centrum Wiskunde & Informatica, Computational Imaging group, Amsterdam, 1098 XG, The Netherlands.
| |
Collapse
|
16
|
Huang J, Chen K, Ren Y, Sun J, Wang Y, Tao T, Pu X. CDDnet: Cross-domain denoising network for low-dose CT image via local and global information alignment. Comput Biol Med 2023; 163:107219. [PMID: 37422942 DOI: 10.1016/j.compbiomed.2023.107219] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2023] [Revised: 05/21/2023] [Accepted: 06/25/2023] [Indexed: 07/11/2023]
Abstract
The domain shift problem has emerged as a challenge in cross-domain low-dose CT (LDCT) image denoising task, where the acquisition of a sufficient number of medical images from multiple sources may be constrained by privacy concerns. In this study, we propose a novel cross-domain denoising network (CDDnet) that incorporates both local and global information of CT images. To address the local component, a local information alignment module has been proposed to regularize the similarity between extracted target and source features from selected patches. To align the general information of the semantic structure from a global perspective, an autoencoder is adopted to learn the latent correlation between the source label and the estimated target label generated by the pre-trained denoiser. Experimental results demonstrate that our proposed CDDnet effectively alleviates the domain shift problem, outperforming other deep learning-based and domain adaptation-based methods under cross-domain scenarios.
Collapse
Affiliation(s)
- Jiaxin Huang
- School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, 611731, China
| | - Kecheng Chen
- Department of Electrical Engineering, City University of Hong Kong, 999077, Hong Kong Special Administrative Region of China
| | - Yazhou Ren
- School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, 611731, China; Shenzhen Institute for Advanced Study, University of Electronic Science and Technology of China, Shenzhen, 518110, China
| | - Jiayu Sun
- West China Hospital, Sichuan University, Chengdu, 610044, China
| | - Yanmei Wang
- Institute of Traditional Chinese Medicine, Sichuan College of Traditional Chinese Medicine (Sichuan Second Hospital of TCM), Chengdu, 610075, China
| | - Tao Tao
- Institute of Traditional Chinese Medicine, Sichuan College of Traditional Chinese Medicine (Sichuan Second Hospital of TCM), Chengdu, 610075, China
| | - Xiaorong Pu
- School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, 611731, China; Shenzhen Institute for Advanced Study, University of Electronic Science and Technology of China, Shenzhen, 518110, China; NHC Key Laboratory of Nuclear Technology Medical Transformation, Mianyang Central Hospital, Mianyang, 621000, China.
| |
Collapse
|
17
|
Li X, Huang S, Pan Z, Qin P, Wu W, Qi M, Ma J, Kang S, Chen J, Zhou L, Xu Y, Qin G. Deep learning based de-overlapping correction of projections from a flat-panel micro array X-ray source: Simulation study. Phys Med 2023; 111:102607. [PMID: 37210964 DOI: 10.1016/j.ejmp.2023.102607] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/28/2022] [Revised: 04/26/2023] [Accepted: 05/09/2023] [Indexed: 05/23/2023] Open
Abstract
PURPOSE Flat-panel X-ray source is an experimental X-ray emitter with target application of static computer tomography (CT), which can save imaging space and time. However, the X-ray cone beams emitted by the densely arranged micro-ray sources are overlapped, causing serious structural overlapping and visual blur in the projection results. Traditional deoverlapping methods can hardly solve this problem well. METHOD We converted the overlapping cone beam projections to parallel beam projections through a U-like neural network and selected structural similarity (SSIM) loss as the loss function. In this study, we converted three kinds of overlapping cone beam projections of the Shepp-Logan, line-pairs, and abdominal data with two overlapping levels to corresponding parallel beam projections. Training completed, we tested the model using the test set data that was not used at the training phase, and evaluated the difference between the test set conversion results and their corresponding parallel beams through three indicators: mean squared error (MSE), peak signal-to-noise ratio (PSNR) and SSIM. In addition, projections from head phantoms were applied for generalization test. RESULT In the Shepp-Logan low-overlapping task, we obtained a MSE of 1.624×10-5, a PSNR of 47.892 dB, and a SSIM of 0.998 which are the best results of the six experiments. For the most challenging abdominal task, the MSE, PSNR, and SSIM are 1.563×10-3, 28.0586 dB, and 0.983, respectively. In more generalized data, the model also achieved good results. CONCLUSION This study proves the feasibility of utilizing the end-to-end U-net for deblurring and deoverlapping in the flat-panel X-ray source domain.
Collapse
Affiliation(s)
- Xu Li
- School of Biomedical Engineering, Southern Medical University, Guangzhou 510515, China
| | - Shuang Huang
- School of Biomedical Engineering, Southern Medical University, Guangzhou 510515, China
| | - Zengxiang Pan
- School of Biomedical Engineering, Southern Medical University, Guangzhou 510515, China
| | - Peishan Qin
- School of Biomedical Engineering, Southern Medical University, Guangzhou 510515, China
| | - Wangjiang Wu
- School of Biomedical Engineering, Southern Medical University, Guangzhou 510515, China
| | - Mengke Qi
- School of Biomedical Engineering, Southern Medical University, Guangzhou 510515, China
| | - Jianhui Ma
- School of Biomedical Engineering, Southern Medical University, Guangzhou 510515, China
| | - Song Kang
- State Key Laboratory of Optoelectronic Materials and Technologies, Guangdong Province Key Laboratory of Display Material and Technology, School of Electronics and Information Technology, Sun Yat-sen University, Guangzhou 510515, China
| | - Jun Chen
- State Key Laboratory of Optoelectronic Materials and Technologies, Guangdong Province Key Laboratory of Display Material and Technology, School of Electronics and Information Technology, Sun Yat-sen University, Guangzhou 510515, China
| | - Linghong Zhou
- School of Biomedical Engineering, Southern Medical University, Guangzhou 510515, China.
| | - Yuan Xu
- School of Biomedical Engineering, Southern Medical University, Guangzhou 510515, China.
| | - Genggeng Qin
- Department of Radiology, Nanfang Hospital, Southern Medical University, Guangzhou 510515, China.
| |
Collapse
|
18
|
Li J, Chen J, Tang Y, Wang C, Landman BA, Zhou SK. Transforming medical imaging with Transformers? A comparative review of key properties, current progresses, and future perspectives. Med Image Anal 2023; 85:102762. [PMID: 36738650 PMCID: PMC10010286 DOI: 10.1016/j.media.2023.102762] [Citation(s) in RCA: 54] [Impact Index Per Article: 27.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2022] [Revised: 01/18/2023] [Accepted: 01/27/2023] [Indexed: 02/01/2023]
Abstract
Transformer, one of the latest technological advances of deep learning, has gained prevalence in natural language processing or computer vision. Since medical imaging bear some resemblance to computer vision, it is natural to inquire about the status quo of Transformers in medical imaging and ask the question: can the Transformer models transform medical imaging? In this paper, we attempt to make a response to the inquiry. After a brief introduction of the fundamentals of Transformers, especially in comparison with convolutional neural networks (CNNs), and highlighting key defining properties that characterize the Transformers, we offer a comprehensive review of the state-of-the-art Transformer-based approaches for medical imaging and exhibit current research progresses made in the areas of medical image segmentation, recognition, detection, registration, reconstruction, enhancement, etc. In particular, what distinguishes our review lies in its organization based on the Transformer's key defining properties, which are mostly derived from comparing the Transformer and CNN, and its type of architecture, which specifies the manner in which the Transformer and CNN are combined, all helping the readers to best understand the rationale behind the reviewed approaches. We conclude with discussions of future perspectives.
Collapse
Affiliation(s)
- Jun Li
- Key Lab of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing 100190, China
| | - Junyu Chen
- Russell H. Morgan Department of Radiology and Radiological Science, Johns Hopkins Medical Institutes, Baltimore, MD, USA
| | - Yucheng Tang
- Department of Electrical and Computer Engineering, Vanderbilt University, Nashville, TN, USA
| | - Ce Wang
- Key Lab of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing 100190, China
| | - Bennett A Landman
- Department of Electrical and Computer Engineering, Vanderbilt University, Nashville, TN, USA
| | - S Kevin Zhou
- Key Lab of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing 100190, China; School of Biomedical Engineering & Suzhou Institute for Advanced Research, Center for Medical Imaging, Robotics, and Analytic Computing & Learning (MIRACLE), University of Science and Technology of China, Suzhou 215123, China.
| |
Collapse
|
19
|
Mesh-free technique for enhancement of the lung CT image. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2022.104452] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022]
|
20
|
Yi Z, Wang J, Li M. Deep image and feature prior algorithm based on U-ConformerNet structure. Phys Med 2023; 107:102535. [PMID: 36764130 DOI: 10.1016/j.ejmp.2023.102535] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/11/2022] [Revised: 01/04/2023] [Accepted: 01/25/2023] [Indexed: 02/10/2023] Open
Abstract
PURPOSE The reconstruction performance of the deep image prior (DIP) approach is limited by the conventional convolutional layer structure and it is difficult to enhance its potential. In order to improve the quality of image reconstruction and suppress artifacts, we propose a DIP algorithm with better performance, and verify its superiority in the latest case. METHODS We construct a new U-ConformerNet structure as the DIP algorithm's network, replacing the traditional convolutional layer-based U-net structure, and introduce the 'lpips' deep network based feature distance regularization method. Our algorithm can switch between supervised and unsupervised modes at will to meet different needs. RESULTS The reconstruction was performed on the low dose CT dataset (LoDoPaB). Our algorithm attained a PSNR of more than 35 dB under unsupervised conditions, and the PSNR under the supervised condition is greater than 36 dB. Both of which are better than the performance of the DIP-TV. Furthermore, the accuracy of this method is positively connected with the quality of the a priori image with the help of deep networks. In terms of noise eradication and artifact suppression, the DIP algorithm with U-ConformerNet structure outperforms the standard DIP method based on convolutional structure. CONCLUSIONS It is known by experimental verification that, in unsupervised mode, the algorithm improves the output PSNR by at least 2-3 dB when compared to the DIP-TV algorithm (proposed in 2020). In supervised mode, our algorithm approaches that of the state-of-the-art end-to-end deep learning algorithms.
Collapse
Affiliation(s)
- Zhengming Yi
- The State Key Laboratory of Refractories and Metallurgy, Wuhan University of Science and Technology, Wuhan 430081, Hubei, China; Key Laboratory for Ferrous Metallurgy and Resources Utilization of Metallurgy and Resources Utilization of Education, Wuhan University of Science and Technology, Wuhan 430081, Hubei, China
| | - Junjie Wang
- The State Key Laboratory of Refractories and Metallurgy, Wuhan University of Science and Technology, Wuhan 430081, Hubei, China; Key Laboratory for Ferrous Metallurgy and Resources Utilization of Metallurgy and Resources Utilization of Education, Wuhan University of Science and Technology, Wuhan 430081, Hubei, China
| | - Mingjie Li
- The State Key Laboratory of Refractories and Metallurgy, Wuhan University of Science and Technology, Wuhan 430081, Hubei, China; Key Laboratory for Ferrous Metallurgy and Resources Utilization of Metallurgy and Resources Utilization of Education, Wuhan University of Science and Technology, Wuhan 430081, Hubei, China.
| |
Collapse
|
21
|
Alkabbany I, Ali AM, Mohamed M, Elshazly SM, Farag A. An AI-Based Colonic Polyp Classifier for Colorectal Cancer Screening Using Low-Dose Abdominal CT. SENSORS (BASEL, SWITZERLAND) 2022; 22:9761. [PMID: 36560132 PMCID: PMC9782078 DOI: 10.3390/s22249761] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/01/2022] [Revised: 11/30/2022] [Accepted: 12/02/2022] [Indexed: 06/17/2023]
Abstract
Among the non-invasive Colorectal cancer (CRC) screening approaches, Computed Tomography Colonography (CTC) and Virtual Colonoscopy (VC), are much more accurate. This work proposes an AI-based polyp detection framework for virtual colonoscopy (VC). Two main steps are addressed in this work: automatic segmentation to isolate the colon region from its background, and automatic polyp detection. Moreover, we evaluate the performance of the proposed framework on low-dose Computed Tomography (CT) scans. We build on our visualization approach, Fly-In (FI), which provides "filet"-like projections of the internal surface of the colon. The performance of the Fly-In approach confirms its ability with helping gastroenterologists, and it holds a great promise for combating CRC. In this work, these 2D projections of FI are fused with the 3D colon representation to generate new synthetic images. The synthetic images are used to train a RetinaNet model to detect polyps. The trained model has a 94% f1-score and 97% sensitivity. Furthermore, we study the effect of dose variation in CT scans on the performance of the the FI approach in polyp visualization. A simulation platform is developed for CTC visualization using FI, for regular CTC and low-dose CTC. This is accomplished using a novel AI restoration algorithm that enhances the Low-Dose CT images so that a 3D colon can be successfully reconstructed and visualized using the FI approach. Three senior board-certified radiologists evaluated the framework for the peak voltages of 30 KV, and the average relative sensitivities of the platform were 92%, whereas the 60 KV peak voltage produced average relative sensitivities of 99.5%.
Collapse
Affiliation(s)
- Islam Alkabbany
- Computer Vision and Image Processing Laboratory, University of Louisville, Louisville, KY 40292, USA
| | - Asem M. Ali
- Computer Vision and Image Processing Laboratory, University of Louisville, Louisville, KY 40292, USA
| | - Mostafa Mohamed
- Computer Vision and Image Processing Laboratory, University of Louisville, Louisville, KY 40292, USA
| | | | - Aly Farag
- Computer Vision and Image Processing Laboratory, University of Louisville, Louisville, KY 40292, USA
| |
Collapse
|
22
|
Friot-Giroux L, Peyrin F, Maxim V. Iterative tomographic reconstruction with TV prior for low-dose CBCT dental imaging. Phys Med Biol 2022; 67. [PMID: 36162406 DOI: 10.1088/1361-6560/ac950c] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2022] [Accepted: 09/26/2022] [Indexed: 12/24/2022]
Abstract
Objective.Cone-beam computed tomography is becoming more and more popular in applications such as 3D dental imaging. Iterative methods compared to the standard Feldkamp algorithm have shown improvements in image quality of reconstruction of low-dose acquired data despite their long computing time. An interesting aspect of iterative methods is their ability to include prior information such as sparsity-constraint. While a large panel of optimization algorithms along with their adaptation to tomographic problems are available, they are mainly studied on 2D parallel or fan-beam data. The issues raised by 3D CBCT and moreover by truncated projections are still poorly understood.Approach.We compare different carefully designed optimization schemes in the context of realistic 3D dental imaging. Besides some known algorithms, SIRT-TV and MLEM, we investigate the primal-dual hybrid gradient (PDHG) approach and a newly proposed MLEM-TV optimizer. The last one is alternating EM steps and TV-denoising, combination not yet investigated for CBCT. Experiments are performed on both simulated data from a 3D jaw phantom and data acquired with a dental clinical scanner.Main results.With some adaptations to the specificities of CBCT operators, PDHG and MLEM-TV algorithms provide the best reconstruction quality. These results were obtained by comparing the full-dose image with a low-dose image and an ultra low-dose image.Significance.The convergence speed of the original iterative methods is hampered by the conical geometry and significantly reduced compared to parallel geometries. We promote the pre-conditioned version of PDHG and we propose a pre-conditioned version of the MLEM-TV algorithm. To the best of our knowledge, this is the first time PDHG and convergent MLEM-TV algorithms are evaluated on experimental dental CBCT data, where constraints such as projection truncation and presence of metal have to be jointly overcome.
Collapse
Affiliation(s)
- Louise Friot-Giroux
- Univ Lyon, INSA-Lyon, Université Claude Bernard Lyon 1, CNRS, Inserm, CREATIS UMR 5220, U1294, F-69620, LYON, France
| | - Françoise Peyrin
- Univ Lyon, INSA-Lyon, Université Claude Bernard Lyon 1, CNRS, Inserm, CREATIS UMR 5220, U1294, F-69620, LYON, France
| | - Voichita Maxim
- Univ Lyon, INSA-Lyon, Université Claude Bernard Lyon 1, CNRS, Inserm, CREATIS UMR 5220, U1294, F-69620, LYON, France
| |
Collapse
|
23
|
Barbano R, Kereta Ž, Hauptmann A, Arridge SR, Jin B. Unsupervised knowledge-transfer for learned image reconstruction. INVERSE PROBLEMS 2022; 38:104004. [PMID: 37745782 PMCID: PMC10515400 DOI: 10.1088/1361-6420/ac8a91] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/21/2021] [Revised: 07/12/2022] [Accepted: 08/18/2022] [Indexed: 09/26/2023]
Abstract
Deep learning-based image reconstruction approaches have demonstrated impressive empirical performance in many imaging modalities. These approaches usually require a large amount of high-quality paired training data, which is often not available in medical imaging. To circumvent this issue we develop a novel unsupervised knowledge-transfer paradigm for learned reconstruction within a Bayesian framework. The proposed approach learns a reconstruction network in two phases. The first phase trains a reconstruction network with a set of ordered pairs comprising of ground truth images of ellipses and the corresponding simulated measurement data. The second phase fine-tunes the pretrained network to more realistic measurement data without supervision. By construction, the framework is capable of delivering predictive uncertainty information over the reconstructed image. We present extensive experimental results on low-dose and sparse-view computed tomography showing that the approach is competitive with several state-of-the-art supervised and unsupervised reconstruction techniques. Moreover, for test data distributed differently from the training data, the proposed framework can significantly improve reconstruction quality not only visually, but also quantitatively in terms of PSNR and SSIM, when compared with learned methods trained on the synthetic dataset only.
Collapse
Affiliation(s)
- Riccardo Barbano
- Department of Computer Science, University College London, Gower Street, London WC1E 6BT, United Kingdom
| | - Željko Kereta
- Department of Computer Science, University College London, Gower Street, London WC1E 6BT, United Kingdom
| | - Andreas Hauptmann
- Department of Computer Science, University College London, Gower Street, London WC1E 6BT, United Kingdom
- Research Unit of Mathematical Sciences; University of Oulu, Oulu, Finland
| | - Simon R Arridge
- Department of Computer Science, University College London, Gower Street, London WC1E 6BT, United Kingdom
| | - Bangti Jin
- Department of Mathematics, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong, People’s Republic of China
| |
Collapse
|
24
|
Feature decomposition and enhancement for unsupervised medical ultrasound image denoising and instance segmentation. APPL INTELL 2022. [DOI: 10.1007/s10489-022-03857-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
25
|
Xia X, Zhang R, Yao X, Huang G, Tang T. A novel lung nodule accurate detection of computerized tomography images based on convolutional neural network and probability graph model. Comput Intell 2022. [DOI: 10.1111/coin.12531] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/09/2022]
Affiliation(s)
- Xunpeng Xia
- School of Optical‐Electrical and Computer Engineering University of Shanghai for Science and Technology Shanghai China
| | - Rongfu Zhang
- School of Optical‐Electrical and Computer Engineering University of Shanghai for Science and Technology Shanghai China
| | - Xufeng Yao
- College of Medical Imaging Shanghai University of Medicine and Health Sciences Shanghai China
| | - Gang Huang
- College of Medical Imaging Shanghai University of Medicine and Health Sciences Shanghai China
- Shanghai Key Laboratory of Molecular Imaging Zhoupu Hospital, Shanghai University of Medicine and Health Sciences Shanghai China
- Shanghai Key Laboratory of Molecular Imaging Jiading District Central Hospital Affiliated Shanghai University of Medicine and Health Sciences Shanghai China
| | - Tiequn Tang
- School of Optical‐Electrical and Computer Engineering University of Shanghai for Science and Technology Shanghai China
| |
Collapse
|
26
|
Fu B, Zhang X, Wang L, Ren Y, Thanh DNH. A blind medical image denoising method with noise generation network. JOURNAL OF X-RAY SCIENCE AND TECHNOLOGY 2022; 30:531-547. [PMID: 35253724 DOI: 10.3233/xst-211098] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
BACKGROUND In the process of medical images acquisition, the unknown mixed noise will affect image quality. However, the existing denoising methods usually focus on the known noise distribution. OBJECTIVE In order to remove the unknown real noise in low-dose CT images (LDCT), a two-step deep learning framework is proposed in this study, which is called Noisy Generation-Removal Network (NGRNet). METHODS Firstly, the output results of L0 Gradient Minimization are used as the labels of a dental CT image dataset to form a pseudo-image pair with the real dental CT images, which are used to train the noise generation network to estimate real noise distribution. Then, for the lung CT images of the LIDC/IDRI database, we migrate the real noise to the noise-free lung CT images, to construct a new almost-real noisy images dataset. Since dental images and lung images are all CT images, this migration can be achieved. The denoising network is trained to realize the denoising of real LDCT for dental images by using this dataset but can extend for any low-dose CT images. RESULTS To prove the effectiveness of our NGRNet, we conduct experiments on lung CT images with synthetic noise and tooth CT images with real noise. For synthetic noise image datasets, experimental results show that NGRNet is superior to existing denoising methods in terms of visual effect and exceeds 0.13dB in the peak signal-to-noise ratio (PSNR). For real noisy image datasets, the proposed method can achieve the best visual denoising effect. CONCLUSIONS The proposed method can retain more details and achieve impressive denoising performance.
Collapse
Affiliation(s)
- Bo Fu
- School of Computer and Information Technology, Liaoning Normal University, Dalian, Liaoning, China
| | - Xiangyi Zhang
- School of Computer and Information Technology, Liaoning Normal University, Dalian, Liaoning, China
| | - Liyan Wang
- School of Computer and Information Technology, Liaoning Normal University, Dalian, Liaoning, China
| | - Yonggong Ren
- School of Computer and Information Technology, Liaoning Normal University, Dalian, Liaoning, China
| | - Dang N H Thanh
- Department of Information Technology, School of Business Information Technology, University of Economics Ho Chi Minh City, Ho Chi Minh City, Vietnam
| |
Collapse
|
27
|
Denker A, Schmidt M, Leuschner J, Maass P. Conditional Invertible Neural Networks for Medical Imaging. J Imaging 2021; 7:243. [PMID: 34821874 PMCID: PMC8624162 DOI: 10.3390/jimaging7110243] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2021] [Revised: 11/11/2021] [Accepted: 11/13/2021] [Indexed: 12/02/2022] Open
Abstract
Over recent years, deep learning methods have become an increasingly popular choice for solving tasks from the field of inverse problems. Many of these new data-driven methods have produced impressive results, although most only give point estimates for the reconstruction. However, especially in the analysis of ill-posed inverse problems, the study of uncertainties is essential. In our work, we apply generative flow-based models based on invertible neural networks to two challenging medical imaging tasks, i.e., low-dose computed tomography and accelerated medical resonance imaging. We test different architectures of invertible neural networks and provide extensive ablation studies. In most applications, a standard Gaussian is used as the base distribution for a flow-based model. Our results show that the choice of a radial distribution can improve the quality of reconstructions.
Collapse
Affiliation(s)
- Alexander Denker
- Center for Industrial Mathematics, University of Bremen, Bibliothekstr. 5, 28359 Bremen, Germany; (M.S.); (J.L.); (P.M.)
| | | | | | | |
Collapse
|