1
|
Ren M, Xue P, Ji H, Zhang Z, Dong E. Pulmonary CT Registration Network Based on Deformable Cross Attention. JOURNAL OF IMAGING INFORMATICS IN MEDICINE 2024:10.1007/s10278-024-01324-2. [PMID: 39528889 DOI: 10.1007/s10278-024-01324-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/20/2024] [Revised: 10/26/2024] [Accepted: 10/29/2024] [Indexed: 11/16/2024]
Abstract
Current Transformer structure utilizes the self-attention mechanism to model global contextual relevance within image, which makes an impact on medical image registration. However, the use of Transformer in handling large deformation lung CT registration is relatively straightforwardly. These models only focus on single image feature representation neglecting to employ attention mechanism to capture the across image correspondence. This hinders further improvement in registration performance. To address the above limitations, we propose a novel registration method in a cascaded manner, Cascaded Swin Deformable Cross Attention Transformer based U-shape structure (SD-CATU), to address the challenge of large deformations in lung CT registration. In SD-CATU, we introduce a Cross Attention-based Transformer (CAT) block that incorporates the Shifted Regions Multihead Cross-attention (SR-MCA) mechanism to flexibly exchange feature information and thus reduce the computational complexity. Besides, a consistency constraint in the loss function is used to ensure the preservation of topology and inverse consistency of the transformations. Experiments with public lung datasets demonstrate that the Cascaded SD-CATU outperforms current state-of-the-art registration methods (Dice Similarity Coefficient of 93.19% and Target registration error of 0.98 mm). The results further highlight the potential for obtaining excellent registration accuracy while assuring desirable smoothness and consistency in the deformed images.
Collapse
Affiliation(s)
- Meirong Ren
- Shool of Mechanical, Electrical & Information Engineering, Shandong University, Weihai, 264,209, China
| | - Peng Xue
- Shool of Mechanical, Electrical & Information Engineering, Shandong University, Weihai, 264,209, China
| | - Huizhong Ji
- Shool of Mechanical, Electrical & Information Engineering, Shandong University, Weihai, 264,209, China
| | - Zhili Zhang
- Shool of Mechanical, Electrical & Information Engineering, Shandong University, Weihai, 264,209, China
| | - Enqing Dong
- Shool of Mechanical, Electrical & Information Engineering, Shandong University, Weihai, 264,209, China.
| |
Collapse
|
2
|
Gou F, Liu J, Xiao C, Wu J. Research on Artificial-Intelligence-Assisted Medicine: A Survey on Medical Artificial Intelligence. Diagnostics (Basel) 2024; 14:1472. [PMID: 39061610 PMCID: PMC11275417 DOI: 10.3390/diagnostics14141472] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2024] [Revised: 07/04/2024] [Accepted: 07/05/2024] [Indexed: 07/28/2024] Open
Abstract
With the improvement of economic conditions and the increase in living standards, people's attention in regard to health is also continuously increasing. They are beginning to place their hopes on machines, expecting artificial intelligence (AI) to provide a more humanized medical environment and personalized services, thus greatly expanding the supply and bridging the gap between resource supply and demand. With the development of IoT technology, the arrival of the 5G and 6G communication era, and the enhancement of computing capabilities in particular, the development and application of AI-assisted healthcare have been further promoted. Currently, research on and the application of artificial intelligence in the field of medical assistance are continuously deepening and expanding. AI holds immense economic value and has many potential applications in regard to medical institutions, patients, and healthcare professionals. It has the ability to enhance medical efficiency, reduce healthcare costs, improve the quality of healthcare services, and provide a more intelligent and humanized service experience for healthcare professionals and patients. This study elaborates on AI development history and development timelines in the medical field, types of AI technologies in healthcare informatics, the application of AI in the medical field, and opportunities and challenges of AI in the field of medicine. The combination of healthcare and artificial intelligence has a profound impact on human life, improving human health levels and quality of life and changing human lifestyles.
Collapse
Affiliation(s)
- Fangfang Gou
- State Key Laboratory of Public Big Data, College of Computer Science and Technology, Guizhou University, Guiyang 550025, China
| | - Jun Liu
- The Second People's Hospital of Huaihua, Huaihua 418000, China
| | - Chunwen Xiao
- The Second People's Hospital of Huaihua, Huaihua 418000, China
| | - Jia Wu
- State Key Laboratory of Public Big Data, College of Computer Science and Technology, Guizhou University, Guiyang 550025, China
- Research Center for Artificial Intelligence, Monash University, Melbourne, Clayton, VIC 3800, Australia
| |
Collapse
|
3
|
Xue P, Fu Y, Zhang J, Ma L, Ren M, Zhang Z, Dong E. Effective lung ventilation estimation based on 4D CT image registration and supervoxels. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2022.104074] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/15/2022]
|
4
|
Chen P, Guo Y, Wang D, Chen C. Dlung: Unsupervised Few-Shot Diffeomorphic Respiratory Motion Modeling. JOURNAL OF SHANGHAI JIAOTONG UNIVERSITY (SCIENCE) 2022; 28:1-10. [PMID: 36406811 PMCID: PMC9660014 DOI: 10.1007/s12204-022-2525-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 01/27/2021] [Accepted: 09/17/2021] [Indexed: 11/15/2022]
Abstract
Lung image registration plays an important role in lung analysis applications, such as respiratory motion modeling. Unsupervised learning-based image registration methods that can compute the deformation without the requirement of supervision attract much attention. However, it is noteworthy that they have two drawbacks: they do not handle the problem of limited data and do not guarantee diffeomorphic (topology-preserving) properties, especially when large deformation exists in lung scans. In this paper, we present an unsupervised few-shot learning-based diffeomorphic lung image registration, namely Dlung. We employ fine-tuning techniques to solve the problem of limited data and apply the scaling and squaring method to accomplish the diffeomorphic registration. Furthermore, atlas-based registration on spatio-temporal (4D) images is performed and thoroughly compared with baseline methods. Dlung achieves the highest accuracy with diffeomorphic properties. It constructs accurate and fast respiratory motion models with limited data. This research extends our knowledge of respiratory motion modeling.
Collapse
Affiliation(s)
- Peizhi Chen
- College of Computer and Information Engineering, Xiamen University of Technology, Xiamen, Fujian, 361024 China
- Fujian Key Laboratory of Pattern Recognition and Image Understanding, Xiamen, Fujian, 361024 China
| | - Yifan Guo
- College of Computer and Information Engineering, Xiamen University of Technology, Xiamen, Fujian, 361024 China
| | - Dahan Wang
- College of Computer and Information Engineering, Xiamen University of Technology, Xiamen, Fujian, 361024 China
- Fujian Key Laboratory of Pattern Recognition and Image Understanding, Xiamen, Fujian, 361024 China
| | - Chinling Chen
- College of Computer and Information Engineering, Xiamen University of Technology, Xiamen, Fujian, 361024 China
- School of Information Engineering, Changchun Sci-Tech University, Changchun, 130600 China
- Department of Computer Science and Information Engineering, Chaoyang University of Technology, Taichung, Taiwan, 41349 China
| |
Collapse
|
5
|
Dong B, Fu X, Kang X. SSGNet: semi-supervised multi-path grid network for diagnosing melanoma. Pattern Anal Appl 2022. [DOI: 10.1007/s10044-022-01100-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/14/2022]
|
6
|
GraformerDIR: Graph convolution transformer for deformable image registration. Comput Biol Med 2022; 147:105799. [DOI: 10.1016/j.compbiomed.2022.105799] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2022] [Revised: 05/06/2022] [Accepted: 06/26/2022] [Indexed: 01/02/2023]
|
7
|
Intensity-based nonrigid endomicroscopic image mosaicking incorporating texture relevance for compensation of tissue deformation. Comput Biol Med 2021; 142:105169. [PMID: 34974384 DOI: 10.1016/j.compbiomed.2021.105169] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2021] [Revised: 12/12/2021] [Accepted: 12/20/2021] [Indexed: 12/09/2022]
Abstract
Image mosaicking has emerged as a universal technique to broaden the field-of-view of the probe-based confocal laser endomicroscopy (pCLE) imaging system. However, due to the influence of probe-tissue contact forces and optical components on imaging quality, existing mosaicking methods remain insufficient to deal with practical challenges. In this paper, we present the texture encoded sum of conditional variance (TESCV) as a novel similarity metric, and effectively incorporate it into a sequential mosaicking scheme to simultaneously correct rigid probe shift and nonrigid tissue deformation. TESCV combines both intensity dependency and texture relevance to quantify the differences between pCLE image frames, where a discriminative binary descriptor named fully cross-detected local derivative pattern (FCLDP) is designed to extract more detailed structural textures. Furthermore, we also analytically derive the closed-form gradient of TESCV with respect to the transformation variables. Experiments on the circular dataset highlighted the advantage of the TESCV metric in improving mosaicking performance compared with the other four recently published metrics. The comparison with the other four state-of-the-art mosaicking methods on the spiral and manual datasets indicated that the proposed TESCV-based method not only worked stably at different contact forces, but was also suitable for both low- and high-resolution imaging systems. With more accurate and delicate mosaics, the proposed method holds promises to meet clinical demands for intraoperative optical biopsy.
Collapse
|
8
|
Fu Y, Xue P, Li N, Zhao P, Xu Z, Ji H, Zhang Z, Cui W, Dong E. Fusion of 3D lung CT and serum biomarkers for diagnosis of multiple pathological types on pulmonary nodules. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2021; 210:106381. [PMID: 34496322 DOI: 10.1016/j.cmpb.2021.106381] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/10/2020] [Accepted: 08/24/2021] [Indexed: 06/13/2023]
Abstract
BACKGROUND AND OBJECTIVE Current researches on pulmonary nodules mainly focused on the binary-classification of benign and malignant pulmonary nodules. However, in clinical applications, it is not enough to judge whether pulmonary nodules are benign or malignant. In this paper, we proposed a fusion model based on the Lung Information Dataset Containing 3D CT Images and Serum Biomarkers (LIDCCISB) we constructed to accurately diagnose the types of pulmonary nodules in squamous cell carcinoma, adenocarcinoma, inflammation and other benign diseases. METHODS Using single modal information of lung 3D CT images and single modal information of Lung Tumor Biomarkers (LTBs) in LIDCCISB, a Multi-resolution 3D Multi-classification deep learning model (Mr-Mc) and a Multi-Layer Perceptron machine learning model (MLP) were constructed for diagnosing multiple pathological types of pulmonary nodules, respectively. To comprehensively use the double modal information of CT images and LTBs, we used transfer learning to fuse Mr-Mc and MLP, and constructed a multimodal information fusion model that could classify multiple pathological types of benign and malignant pulmonary nodules. RESULTS Experiments showed that the constructed Mr-Mc model can achieve an average accuracy of 0.805 and MLP model can achieve an average accuracy of 0.887. The fusion model was verified on a dataset containing 64 samples, and achieved an average accuracy of 0.906. CONCLUSIONS This is the first study to simultaneously use CT images and LTBs to diagnose multiple pathological types of benign and malignant pulmonary nodules, and experiments showed that our research was more advanced and more suitable for practical clinical applications.
Collapse
Affiliation(s)
- Yu Fu
- School of Mechanical, Electrical & Information Engineering, Shandong University, Weihai 264209, China
| | - Peng Xue
- School of Mechanical, Electrical & Information Engineering, Shandong University, Weihai 264209, China
| | - Ning Li
- Department of Radiology, Shandong Provincial Hospital Affiliated to Shandong First Medical University, Jinan 250021, China
| | - Peng Zhao
- Department of Radiology, Shandong Provincial Hospital Affiliated to Shandong First Medical University, Jinan 250021, China
| | - Zhuodong Xu
- Department of Radiology, Shandong Provincial Hospital Affiliated to Shandong First Medical University, Jinan 250021, China
| | - Huizhong Ji
- School of Mechanical, Electrical & Information Engineering, Shandong University, Weihai 264209, China
| | - Zhili Zhang
- School of Mechanical, Electrical & Information Engineering, Shandong University, Weihai 264209, China
| | - Wentao Cui
- School of Mechanical, Electrical & Information Engineering, Shandong University, Weihai 264209, China.
| | - Enqing Dong
- School of Mechanical, Electrical & Information Engineering, Shandong University, Weihai 264209, China.
| |
Collapse
|
9
|
Densely connected attention network for diagnosing COVID-19 based on chest CT. Comput Biol Med 2021; 137:104857. [PMID: 34520988 PMCID: PMC8427919 DOI: 10.1016/j.compbiomed.2021.104857] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2021] [Revised: 09/05/2021] [Accepted: 09/06/2021] [Indexed: 12/31/2022]
Abstract
BACKGROUND To fully enhance the feature extraction capabilities of deep learning models, so as to accurately diagnose coronavirus disease 2019 (COVID-19) based on chest CT images, a densely connected attention network (DenseANet) was constructed by utilizing the self-attention mechanism in deep learning. METHODS During the construction of the DenseANet, we not only densely connected attention features within and between the feature extraction blocks with the same scale, but also densely connected attention features with different scales at the end of the deep model, thereby further enhancing the high-order features. In this way, as the depth of the deep model increases, the spatial attention features generated by different layers can be densely connected and gradually transferred to deeper layers. The DenseANet takes CT images of the lung fields segmented by an improved U-Net as inputs and outputs the probability of the patients suffering from COVID-19. RESULTS Compared with exiting attention networks, DenseANet can maximize the utilization of self-attention features at different depths in the model. A five-fold cross-validation experiment was performed on a dataset containing 2993 CT scans of 2121 patients, and experiments showed that the DenseANet can effectively locate the lung lesions of patients infected with SARS-CoV-2, and distinguish COVID-19, common pneumonia and normal controls with an average of 96.06% Acc and 0.989 AUC. CONCLUSIONS The DenseANet we proposed can generate strong attention features and achieve the best diagnosis results. In addition, the proposed method of densely connecting attention features can be easily extended to other advanced deep learning methods to improve their performance in related tasks.
Collapse
|
10
|
Shi Q, Yin S, Wang K, Teng L, Li H. Multichannel convolutional neural network-based fuzzy active contour model for medical image segmentation. EVOLVING SYSTEMS 2021. [DOI: 10.1007/s12530-021-09392-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
|
11
|
Xue P, Fu Y, Ji H, Cui W, Dong E. Lung Respiratory Motion Estimation Based on Fast Kalman Filtering and 4D CT Image Registration. IEEE J Biomed Health Inform 2021; 25:2007-2017. [PMID: 33044936 DOI: 10.1109/jbhi.2020.3030071] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
Respiratory motion estimation is an important part in image-guided radiation therapy and clinical diagnosis. However, most of the respiratory motion estimation methods rely on indirect measurements of external breathing indicators, which will not only introduce great estimation errors, but also bring invasive injury for patients. In this paper, we propose a method of lung respiratory motion estimation based on fast Kalman filtering and 4D CT image registration (LRME-4DCT). In order to perform dynamic motion estimation for continuous phases, a motion estimation model is constructed by combining two kinds of GPU-accelerated 4D CT image registration methods with fast Kalman filtering method. To address the high computational requirements of 4D CT image sequences, a multi-level processing strategy is adopted in the 4D CT image registration methods, and respiratory motion states are predicted from three independent directions. In the DIR-lab dataset and POPI dataset with 4D CT images, the average target registration error (TRE) of the LRME-4DCT method can reach 0.91 mm and 0.85 mm respectively. Compared with traditional estimation methods based on pair-wise image registration, the proposed LRME-4DCT method can estimate the physiological respiratory motion more accurately and quickly. Our proposed LRME-4DCT method fully meets the practical clinical requirements for rapid dynamic estimation of lung respiratory motion.
Collapse
|