1
|
Oulmalme C, Nakouri H, Jaafar F. A systematic review of generative AI approaches for medical image enhancement: Comparing GANs, transformers, and diffusion models. Int J Med Inform 2025; 199:105903. [PMID: 40179622 DOI: 10.1016/j.ijmedinf.2025.105903] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2025] [Revised: 03/23/2025] [Accepted: 03/28/2025] [Indexed: 04/05/2025]
Abstract
BACKGROUND Medical imaging is a vital diagnostic tool that provides detailed insights into human anatomy but faces challenges affecting its accuracy and efficiency. Advanced generative AI models offer promising solutions. Unlike previous reviews with a narrow focus, a comprehensive evaluation across techniques and modalities is necessary. OBJECTIVE This systematic review integrates the three state-of-the-art leading approaches, GANs, Diffusion Models, and Transformers, examining their applicability, methodologies, and clinical implications in improving medical image quality. METHODS Using the PRISMA framework, 63 studies from 989 were selected via Google Scholar and PubMed, focusing on GANs, Transformers, and Diffusion Models. Articles from ACM, IEEE Xplore, and Springer were analyzed. RESULTS Generative AI techniques show promise in improving image resolution, reducing noise, and enhancing fidelity. GANs generate high-quality images, Transformers utilize global context, and Diffusion Models are effective in denoising and reconstruction. Challenges include high computational costs, limited dataset diversity, and issues with generalizability, with a focus on quantitative metrics over clinical applicability. CONCLUSION This review highlights the transformative impact of GANs, Transformers, and Diffusion Models in advancing medical imaging. Future research must address computational and generalization challenges, emphasize open science, and validate these techniques in diverse clinical settings to unlock their full potential. These efforts could enhance diagnostic accuracy, lower costs, and improve patient outcome.
Collapse
Affiliation(s)
| | - Haïfa Nakouri
- University of Quebec at Chicoutimi, Quebec, Canada; Université de Tunis, LARODEC, ISG Tunis, Tunisia.
| | - Fehmi Jaafar
- University of Quebec at Chicoutimi, Quebec, Canada.
| |
Collapse
|
2
|
Chen B, Lv X, Zhao Y, Yu L. TPDC: Point Cloud Completion by Triangular Pyramid Features and Divide-and-Conquer in Complex Environments. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2025; 36:6029-6040. [PMID: 38758619 DOI: 10.1109/tnnls.2024.3397988] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/19/2024]
Abstract
Point cloud completion recovers the complete point clouds from partial ones, providing numerous point cloud information for downstream tasks such as 3-D reconstruction and target detection. However, previous methods usually suffer from unstructured prediction of points in local regions and the discrete nature of the point cloud. To resolve these problems, we propose a point cloud completion network called TPDC. Representing the point cloud as a set of unordered features of points with local geometric information, we devise a Triangular Pyramid Extractor (TPE), using the simplest 3-D structure-a triangular pyramid-to convert the point cloud to a sequence of local geometric information. Our insight of revealing local geometric information in a complex environment is to design a Divide-and-Conquer Splitting Module in a Divide-and-Conquer Splitting Decoder (DCSD) to learn point-splitting patterns that can fit local regions the best. This module employs the Divide-and-Conquer approach to parallelly handle tasks related to fitting ground-truth values to base points and predicting the displacement of split points. This approach aims to make the base points align more closely with the ground-truth values while also forecasting the displacement of split points relative to the base points. Furthermore, we propose a more realistic and challenging benchmark, ShapeNetMask, with more random point cloud input, more complex random item occlusion, and more realistic random environmental perturbations. The results show that our method outperforms both widely used benchmarks as well as the new benchmark.
Collapse
|
3
|
Czobit C, Samavi R. Generative Adversarial Networks for Neuroimage Translation. J Comput Biol 2024. [PMID: 39729343 DOI: 10.1089/cmb.2024.0635] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2024] Open
Abstract
Image-to-image translation has gained popularity in the medical field to transform images from one domain to another. Medical image synthesis via domain transformation is advantageous in its ability to augment an image dataset where images for a given class are limited. From the learning perspective, this process contributes to the data-oriented robustness of the model by inherently broadening the model's exposure to more diverse visual data and enabling it to learn more generalized features. In the case of generating additional neuroimages, it is advantageous to obtain unidentifiable medical data and augment smaller annotated datasets. This study proposes the development of a cycle-consistent generative adversarial network (CycleGAN) model for translating neuroimages from one field strength to another (e.g., 3 Tesla [T] to 1.5 T). This model was compared with a model based on a deep convolutional GAN model architecture. CycleGAN was able to generate the synthetic and reconstructed images with reasonable accuracy. The mapping function from the source (3 T) to the target domain (1.5 T) performed optimally with an average peak signal-to-noise ratio value of 25.69 ± 2.49 dB and a mean absolute error value of 2106.27 ± 1218.37. The codes for this study have been made publicly available in the following GitHub repository.a.
Collapse
Affiliation(s)
- Cassandra Czobit
- Electrical, Computer and Biomedical Engineering, Toronto Metropolitan University, Toronto, Canada
| | - Reza Samavi
- Electrical, Computer and Biomedical Engineering, Toronto Metropolitan University, Toronto, Canada
- Vector Institute for Artificial Intelligence, Toronto, Canada
| |
Collapse
|
4
|
Wang C, Xu R, Xu S, Meng W, Xiao J, Zhang X. Accurate Lung Nodule Segmentation With Detailed Representation Transfer and Soft Mask Supervision. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:18381-18393. [PMID: 37824321 DOI: 10.1109/tnnls.2023.3315271] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/14/2023]
Abstract
Accurate lung lesion segmentation from computed tomography (CT) images is crucial to the analysis and diagnosis of lung diseases, such as COVID-19 and lung cancer. However, the smallness and variety of lung nodules and the lack of high-quality labeling make the accurate lung nodule segmentation difficult. To address these issues, we first introduce a novel segmentation mask named " soft mask," which has richer and more accurate edge details description and better visualization, and develop a universal automatic soft mask annotation pipeline to deal with different datasets correspondingly. Then, a novel network with detailed representation transfer and soft mask supervision (DSNet) is proposed to process the input low-resolution images of lung nodules into high-quality segmentation results. Our DSNet contains a special detailed representation transfer module (DRTM) for reconstructing the detailed representation to alleviate the small size of lung nodules images and an adversarial training framework with soft mask for further improving the accuracy of segmentation. Extensive experiments validate that our DSNet outperforms other state-of-the-art methods for accurate lung nodule segmentation, and has strong generalization ability in other accurate medical segmentation tasks with competitive results. Besides, we provide a new challenging lung nodules segmentation dataset for further studies (https://drive.google.com/file/d/15NNkvDTb_0Ku0IoPsNMHezJRTH1Oi1wm/view?usp=sharing).
Collapse
|
5
|
Zuo Q, Shi Z, Liu B, Ping N, Wang J, Cheng X, Zhang K, Guo J, Wu Y, Hong J. Multi-resolution visual Mamba with multi-directional selective mechanism for retinal disease detection. Front Cell Dev Biol 2024; 12:1484880. [PMID: 39463765 PMCID: PMC11512455 DOI: 10.3389/fcell.2024.1484880] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2024] [Accepted: 09/20/2024] [Indexed: 10/29/2024] Open
Abstract
Introduction Retinal diseases significantly impact patients' quality of life and increase social medical costs. Optical coherence tomography (OCT) offers high-resolution imaging for precise detection and monitoring of these conditions. While deep learning techniques have been employed to extract features from OCT images for classification, convolutional neural networks (CNNs) often fail to capture global context due to their focus on local receptive fields. Transformer-based methods, on the other hand, suffer from quadratic complexity when handling long-range dependencies. Methods To overcome these limitations, we introduce the Multi-Resolution Visual Mamba (MRVM) model, which addresses long-range dependencies with linear computational complexity for OCT image classification. The MRVM model initially employs convolution to extract local features and subsequently utilizes the retinal Mamba to capture global dependencies. By integrating multi-scale global features, the MRVM enhances classification accuracy and overall performance. Additionally, the multi-directional selection mechanism (MSM) within the retinal Mamba improves feature extraction by concentrating on various directions, thereby better capturing complex, orientation-specific retinal patterns. Results Experimental results demonstrate that the MRVM model excels in differentiating retinal images with various lesions, achieving superior detection accuracy compared to traditional methods, with overall accuracies of 98.98\% and 96.21\% on two public datasets, respectively. Discussion This approach offers a novel perspective for accurately identifying retinal diseases and could contribute to the development of more robust artificial intelligence algorithms and recognition systems for medical image-assisted diagnosis.
Collapse
Affiliation(s)
- Qiankun Zuo
- Hubei Key Laboratory of Digital Finance Innovation, Hubei University of Economics, Wuhan, China
- School of Information Engineering, Hubei University of Economics, Wuhan, China
- Hubei Internet Finance Information Engineering Technology Research Center, Hubei University of Economics, Wuhan, China
| | - Zhengkun Shi
- School of Information Engineering, Hubei University of Economics, Wuhan, China
| | - Bo Liu
- School of Mathematics and Computer Science, Nanchang University, Nanchang, China
| | - Na Ping
- School of Information Engineering, Hubei University of Economics, Wuhan, China
| | - Jiangtao Wang
- School of Information Engineering, Hubei University of Economics, Wuhan, China
| | - Xi Cheng
- School of Information Engineering, Hubei University of Economics, Wuhan, China
| | - Kexin Zhang
- School of Information Engineering, Hubei University of Economics, Wuhan, China
| | - Jia Guo
- Hubei Key Laboratory of Digital Finance Innovation, Hubei University of Economics, Wuhan, China
- School of Information Engineering, Hubei University of Economics, Wuhan, China
- Hubei Internet Finance Information Engineering Technology Research Center, Hubei University of Economics, Wuhan, China
| | - Yixian Wu
- School of Mechanical Engineering, Beijing Institute of Petrochemical Technology, Beijing, China
| | - Jin Hong
- School of Information Engineering, Nanchang University, Nanchang, China
| |
Collapse
|
6
|
Hu B, Zhan C, Tang B, Wang B, Lei B, Wang SQ. 3-D Brain Reconstruction by Hierarchical Shape-Perception Network From a Single Incomplete Image. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:13271-13283. [PMID: 37167053 DOI: 10.1109/tnnls.2023.3266819] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]
Abstract
3-D shape reconstruction is essential in the navigation of minimally invasive and auto robot-guided surgeries whose operating environments are indirect and narrow, and there have been some works that focused on reconstructing the 3-D shape of the surgical organ through limited 2-D information available. However, the lack and incompleteness of such information caused by intraoperative emergencies (such as bleeding) and risk control conditions have not been considered. In this article, a novel hierarchical shape-perception network (HSPN) is proposed to reconstruct the 3-D point clouds (PCs) of specific brains from one single incomplete image with low latency. A branching predictor and several hierarchical attention pipelines are constructed to generate PCs that accurately describe the incomplete images and then complete these PCs with high quality. Meanwhile, attention gate blocks (AGBs) are designed to efficiently aggregate geometric local features of incomplete PCs transmitted by hierarchical attention pipelines and internal features of reconstructing PCs. With the proposed HSPN, 3-D shape perception and completion can be achieved spontaneously. Comprehensive results measured by Chamfer distance (CD) and PC-to-PC error demonstrate that the performance of the proposed HSPN outperforms other competitive methods in terms of qualitative displays, quantitative experiment, and classification evaluation.
Collapse
|
7
|
Lyu J, Li G, Wang C, Cai Q, Dou Q, Zhang D, Qin J. Multicontrast MRI Super-Resolution via Transformer-Empowered Multiscale Contextual Matching and Aggregation. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:12004-12014. [PMID: 37028326 DOI: 10.1109/tnnls.2023.3250491] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Magnetic resonance imaging (MRI) possesses the unique versatility to acquire images under a diverse array of distinct tissue contrasts, which makes multicontrast super-resolution (SR) techniques possible and needful. Compared with single-contrast MRI SR, multicontrast SR is expected to produce higher quality images by exploiting a variety of complementary information embedded in different imaging contrasts. However, existing approaches still have two shortcomings: 1) most of them are convolution-based methods and, hence, weak in capturing long-range dependencies, which are essential for MR images with complicated anatomical patterns and 2) they ignore to make full use of the multicontrast features at different scales and lack effective modules to match and aggregate these features for faithful SR. To address these issues, we develop a novel multicontrast MRI SR network via transformer-empowered multiscale feature matching and aggregation, dubbed McMRSR ++ . First, we tame transformers to model long-range dependencies in both reference and target images at different scales. Then, a novel multiscale feature matching and aggregation method is proposed to transfer corresponding contexts from reference features at different scales to the target features and interactively aggregate them. Furthermore, a texture-preserving branch and a contrastive constraint are incorporated into our framework for enhancing the textural details in the SR images. Experimental results on both public and clinical in vivo datasets show that McMRSR ++ outperforms state-of-the-art methods under peak signal to noise ratio (PSNR), structure similarity index measure (SSIM), and root mean square error (RMSE) metrics significantly. Visual results demonstrate the superiority of our method in restoring structures, demonstrating its great potential to improve scan efficiency in clinical practice.
Collapse
|
8
|
Chaudhary MFA, Gerard SE, Christensen GE, Cooper CB, Schroeder JD, Hoffman EA, Reinhardt JM. LungViT: Ensembling Cascade of Texture Sensitive Hierarchical Vision Transformers for Cross-Volume Chest CT Image-to-Image Translation. IEEE TRANSACTIONS ON MEDICAL IMAGING 2024; 43:2448-2465. [PMID: 38373126 PMCID: PMC11227912 DOI: 10.1109/tmi.2024.3367321] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/21/2024]
Abstract
Chest computed tomography (CT) at inspiration is often complemented by an expiratory CT to identify peripheral airways disease. Additionally, co-registered inspiratory-expiratory volumes can be used to derive various markers of lung function. Expiratory CT scans, however, may not be acquired due to dose or scan time considerations or may be inadequate due to motion or insufficient exhale; leading to a missed opportunity to evaluate underlying small airways disease. Here, we propose LungViT- a generative adversarial learning approach using hierarchical vision transformers for translating inspiratory CT intensities to corresponding expiratory CT intensities. LungViT addresses several limitations of the traditional generative models including slicewise discontinuities, limited size of generated volumes, and their inability to model texture transfer at volumetric level. We propose a shifted-window hierarchical vision transformer architecture with squeeze-and-excitation decoder blocks for modeling dependencies between features. We also propose a multiview texture similarity distance metric for texture and style transfer in 3D. To incorporate global information into the training process and refine the output of our model, we use ensemble cascading. LungViT is able to generate large 3D volumes of size 320×320×320 . We train and validate our model using a diverse cohort of 1500 subjects with varying disease severity. To assess model generalizability beyond the development set biases, we evaluate our model on an out-of-distribution external validation set of 200 subjects. Clinical validation on internal and external testing sets shows that synthetic volumes could be reliably adopted for deriving clinical endpoints of chronic obstructive pulmonary disease.
Collapse
|
9
|
Zuo Q, Wu H, Chen CLP, Lei B, Wang S. Prior-Guided Adversarial Learning With Hypergraph for Predicting Abnormal Connections in Alzheimer's Disease. IEEE TRANSACTIONS ON CYBERNETICS 2024; 54:3652-3665. [PMID: 38236677 DOI: 10.1109/tcyb.2023.3344641] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2024]
Abstract
Alzheimer's disease (AD) is characterized by alterations of the brain's structural and functional connectivity during its progressive degenerative processes. Existing auxiliary diagnostic methods have accomplished the classification task, but few of them can accurately evaluate the changing characteristics of brain connectivity. In this work, a prior-guided adversarial learning with hypergraph (PALH) model is proposed to predict abnormal brain connections using triple-modality medical images. Concretely, a prior distribution from anatomical knowledge is estimated to guide multimodal representation learning using an adversarial strategy. Also, the pairwise collaborative discriminator structure is further utilized to narrow the difference in representation distribution. Moreover, the hypergraph perceptual network is developed to effectively fuse the learned representations while establishing high-order relations within and between multimodal images. Experimental results demonstrate that the proposed model outperforms other related methods in analyzing and predicting AD progression. More importantly, the identified abnormal connections are partly consistent with previous neuroscience discoveries. The proposed model can evaluate the characteristics of abnormal brain connections at different stages of AD, which is helpful for cognitive disease study and early treatment.
Collapse
|
10
|
Zuo Q, Li R, Shi B, Hong J, Zhu Y, Chen X, Wu Y, Guo J. U-shaped convolutional transformer GAN with multi-resolution consistency loss for restoring brain functional time-series and dementia diagnosis. Front Comput Neurosci 2024; 18:1387004. [PMID: 38694950 PMCID: PMC11061376 DOI: 10.3389/fncom.2024.1387004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2024] [Accepted: 04/02/2024] [Indexed: 05/04/2024] Open
Abstract
Introduction The blood oxygen level-dependent (BOLD) signal derived from functional neuroimaging is commonly used in brain network analysis and dementia diagnosis. Missing the BOLD signal may lead to bad performance and misinterpretation of findings when analyzing neurological disease. Few studies have focused on the restoration of brain functional time-series data. Methods In this paper, a novel U-shaped convolutional transformer GAN (UCT-GAN) model is proposed to restore the missing brain functional time-series data. The proposed model leverages the power of generative adversarial networks (GANs) while incorporating a U-shaped architecture to effectively capture hierarchical features in the restoration process. Besides, the multi-level temporal-correlated attention and the convolutional sampling in the transformer-based generator are devised to capture the global and local temporal features for the missing time series and associate their long-range relationship with the other brain regions. Furthermore, by introducing multi-resolution consistency loss, the proposed model can promote the learning of diverse temporal patterns and maintain consistency across different temporal resolutions, thus effectively restoring complex brain functional dynamics. Results We theoretically tested our model on the public Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset, and our experiments demonstrate that the proposed model outperforms existing methods in terms of both quantitative metrics and qualitative assessments. The model's ability to preserve the underlying topological structure of the brain functional networks during restoration is a particularly notable achievement. Conclusion Overall, the proposed model offers a promising solution for restoring brain functional time-series and contributes to the advancement of neuroscience research by providing enhanced tools for disease analysis and interpretation.
Collapse
Affiliation(s)
- Qiankun Zuo
- Hubei Key Laboratory of Digital Finance Innovation, Hubei University of Economics, Wuhan, Hubei, China
- School of Information Engineering, Hubei University of Economics, Wuhan, Hubei, China
- Hubei Internet Finance Information Engineering Technology Research Center, Hubei University of Economics, Wuhan, Hubei, China
| | - Ruiheng Li
- Hubei Key Laboratory of Digital Finance Innovation, Hubei University of Economics, Wuhan, Hubei, China
- School of Information Engineering, Hubei University of Economics, Wuhan, Hubei, China
| | - Binghua Shi
- Hubei Key Laboratory of Digital Finance Innovation, Hubei University of Economics, Wuhan, Hubei, China
- School of Information Engineering, Hubei University of Economics, Wuhan, Hubei, China
| | - Jin Hong
- Medical Research Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, China
| | - Yanfei Zhu
- School of Foreign Languages, Sun Yat-sen University, Guangzhou, China
| | - Xuhang Chen
- Faculty of Science and Technology, University of Macau, Taipa, Macao SAR, China
| | - Yixian Wu
- School of Mechanical Engineering, Beijing Institute of Petrochemical Technology, Beijing, China
| | - Jia Guo
- Hubei Key Laboratory of Digital Finance Innovation, Hubei University of Economics, Wuhan, Hubei, China
- School of Information Engineering, Hubei University of Economics, Wuhan, Hubei, China
- Hubei Internet Finance Information Engineering Technology Research Center, Hubei University of Economics, Wuhan, Hubei, China
| |
Collapse
|
11
|
Jomeiri A, Navin AH, Shamsi M. Longitudinal MRI analysis using a hybrid DenseNet-BiLSTM method for Alzheimer's disease prediction. Behav Brain Res 2024; 463:114900. [PMID: 38341100 DOI: 10.1016/j.bbr.2024.114900] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2023] [Revised: 12/16/2023] [Accepted: 02/06/2024] [Indexed: 02/12/2024]
Abstract
Alzheimer's disease is a progressive neurological disorder characterized by brain atrophy and cell death, leading to cognitive decline and impaired functioning. Previous research has primarily focused on using cross-sectional data for Alzheimer's disease identification, but analyzing longitudinal sequential MR images is crucial for improved diagnostic accuracy and understanding disease progression. However, existing deep learning models face challenges in learning spatial and temporal features from such data. To address these challenges, this study presents a novel hybrid DenseNet-BiLSTM method for Alzheimer's disease prediction using longitudinal MRI analysis. The proposed framework combines Convolutional DenseNet for spatial information extraction and joined BiLSTM layers for capturing temporal characteristics and relationships between longitudinal images at different time points. This approach overcomes issues like overfitting, vanishing gradients, and incomplete patient data. We evaluated the model on 684 longitudinal MRI images from the Alzheimer's Disease Neuroimaging Initiative (ADNI) database, including normal controls, individuals with mild cognitive impairment, and Alzheimer's disease patients. The results demonstrate high classification accuracy, with 95.28% for AD/CN, 88.19% for NC/MCI, 83.51% for sMCI/pMCI, and 92.14% for MCI/AD. These findings highlight the substantial improvement in Alzheimer's disease diagnosis achieved through the utilization of longitudinal MRI images. The contributions of this study lie in both the deep learning and medical domains. In the deep learning domain, our hybrid framework effectively learns spatial and temporal features from longitudinal data, addressing the challenges associated with multi-dimensional and sequential time series data. In the medical domain, our study emphasizes the importance of analyzing baseline and longitudinal MR images for accurate diagnosis and understanding disease progression.
Collapse
Affiliation(s)
- Alireza Jomeiri
- Department of Computer Engineering, Qom Branch, Islamic Azad University, Qom, Iran
| | - Ahmad Habibizad Navin
- Department of Computer Engineering, Tabriz Branch, Islamic Azad University, Tabriz, Iran.
| | - Mahboubeh Shamsi
- Department of Engineering, Qom University of Technology, Qom, Iran
| |
Collapse
|
12
|
Jing C, Kuai H, Matsumoto H, Yamaguchi T, Liao IY, Wang S. Addiction-related brain networks identification via Graph Diffusion Reconstruction Network. Brain Inform 2024; 11:1. [PMID: 38190053 PMCID: PMC10774517 DOI: 10.1186/s40708-023-00216-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2023] [Accepted: 12/13/2023] [Indexed: 01/09/2024] Open
Abstract
Functional magnetic resonance imaging (fMRI) provides insights into complex patterns of brain functional changes, making it a valuable tool for exploring addiction-related brain connectivity. However, effectively extracting addiction-related brain connectivity from fMRI data remains challenging due to the intricate and non-linear nature of brain connections. Therefore, this paper proposed the Graph Diffusion Reconstruction Network (GDRN), a novel framework designed to capture addiction-related brain connectivity from fMRI data acquired from addicted rats. The proposed GDRN incorporates a diffusion reconstruction module that effectively maintains the unity of data distribution by reconstructing the training samples, thereby enhancing the model's ability to reconstruct nicotine addiction-related brain networks. Experimental evaluations conducted on a nicotine addiction rat dataset demonstrate that the proposed GDRN effectively explores nicotine addiction-related brain connectivity. The findings suggest that the GDRN holds promise for uncovering and understanding the complex neural mechanisms underlying addiction using fMRI data.
Collapse
Affiliation(s)
- Changhong Jing
- Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
| | - Hongzhi Kuai
- Faculty of Engineering, Maebashi Institute of Technology, Maebashi, 371-0816, Japan
| | - Hiroki Matsumoto
- Faculty of Engineering, Maebashi Institute of Technology, Maebashi, 371-0816, Japan
| | | | - Iman Yi Liao
- University of Nottingham Malaysia Campus, Semenyih, Malaysia
| | - Shuqiang Wang
- Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China.
| |
Collapse
|
13
|
Umirzakova S, Mardieva S, Muksimova S, Ahmad S, Whangbo T. Enhancing the Super-Resolution of Medical Images: Introducing the Deep Residual Feature Distillation Channel Attention Network for Optimized Performance and Efficiency. Bioengineering (Basel) 2023; 10:1332. [PMID: 38002456 PMCID: PMC10669552 DOI: 10.3390/bioengineering10111332] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2023] [Revised: 11/09/2023] [Accepted: 11/16/2023] [Indexed: 11/26/2023] Open
Abstract
In the advancement of medical image super-resolution (SR), the Deep Residual Feature Distillation Channel Attention Network (DRFDCAN) marks a significant step forward. This work presents DRFDCAN, a model that innovates traditional SR approaches by introducing a channel attention block that is tailored for high-frequency features-crucial for the nuanced details in medical diagnostics-while streamlining the network structure for enhanced computational efficiency. DRFDCAN's architecture adopts a residual-within-residual design to facilitate faster inference and reduce memory demands without compromising the integrity of the image reconstruction. This design strategy, combined with an innovative feature extraction method that emphasizes the utility of the initial layer features, allows for improved image clarity and is particularly effective in optimizing the peak signal-to-noise ratio (PSNR). The proposed work redefines efficiency in SR models, outperforming established frameworks like RFDN by improving model compactness and accelerating inference. The meticulous crafting of a feature extractor that effectively captures edge and texture information exemplifies the model's capacity to render detailed images, necessary for accurate medical analysis. The implications of this study are two-fold: it presents a viable solution for deploying SR technology in real-time medical applications, and it sets a precedent for future models that address the delicate balance between computational efficiency and high-fidelity image reconstruction. This balance is paramount in medical applications where the clarity of images can significantly influence diagnostic outcomes. The DRFDCAN model thus stands as a transformative contribution to the field of medical image super-resolution.
Collapse
Affiliation(s)
- Sabina Umirzakova
- Department of Computer Engineering, Gachon University, Sujeong-gu, Seongnam-si 113-120, Gyonggi-do, Republic of Korea; (S.M.)
| | | | | | | | - Taegkeun Whangbo
- Department of Computer Engineering, Gachon University, Sujeong-gu, Seongnam-si 113-120, Gyonggi-do, Republic of Korea; (S.M.)
| |
Collapse
|
14
|
Liu Y, Zhang F, Gao X, Liu T, Dong J. Lesion-aware attention network for diabetic nephropathy diagnosis with optical coherence tomography images. Front Med (Lausanne) 2023; 10:1259478. [PMID: 37964881 PMCID: PMC10641799 DOI: 10.3389/fmed.2023.1259478] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2023] [Accepted: 10/11/2023] [Indexed: 11/16/2023] Open
Abstract
Purpose For early screening of diabetic nephropathy patients, we propose a deep learning algorithm to screen high-risk patients with diabetic nephropathy from retinal images of diabetic patients. Methods We propose the use of attentional mechanisms to improve the model's focus on lesion-prone regions of retinal OCT images. First, the data is trained using the base network and the Grad-CAM algorithm locates image regions that have a large impact on the model output and generates a rough mask localization map. The mask is used as a auxiliary region to realize the auxiliary attention module. We then inserted the region-guided attention module into the baseline model and trained the CNN model to guide the model to better focus on relevant lesion features. The proposed model improves the recognition of the lesion region. Results To evaluate the lesion-aware attention network, we trained and tested it using OCT volumetric data collected from 66 patients with diabetic retinal microangiopathy (89 eyes, male = 43, female = 23). There were 45 patients (60 eyes, male=27, female = 18) in DR group and 21 patients (29 eyes, male = 16, female = 5) in DN group. Our proposed model performs even better in disease classification, specifically, the accuracy of the proposed model was 91.68%, the sensitivity was 89.99%, and the specificity was 92.18%. Conclusion The proposed lesion-aware attention model can provide reliable screening of high-risk patients with diabetic nephropathy.
Collapse
Affiliation(s)
- Yuliang Liu
- School of Information Science and Engineering, University of Jinan, Jinan, China
- Shandong Provincial Key Laboratory of Network-Based Intelligent Computing, Jinan, China
| | - Fenghang Zhang
- School of Information Science and Engineering, University of Jinan, Jinan, China
- Shandong Provincial Key Laboratory of Network-Based Intelligent Computing, Jinan, China
| | - Xizhan Gao
- School of Information Science and Engineering, University of Jinan, Jinan, China
- Shandong Provincial Key Laboratory of Network-Based Intelligent Computing, Jinan, China
| | - Tingting Liu
- Shandong Eye Institute, Shandong First Medical University and Shandong Academy of Medical Sciences, Jinan, China
| | - Jiwen Dong
- School of Information Science and Engineering, University of Jinan, Jinan, China
- Shandong Provincial Key Laboratory of Network-Based Intelligent Computing, Jinan, China
| |
Collapse
|
15
|
Zuo Q, Zhong N, Pan Y, Wu H, Lei B, Wang S. Brain Structure-Function Fusing Representation Learning Using Adversarial Decomposed-VAE for Analyzing MCI. IEEE Trans Neural Syst Rehabil Eng 2023; 31:4017-4028. [PMID: 37815971 DOI: 10.1109/tnsre.2023.3323432] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/12/2023]
Abstract
Integrating the brain structural and functional connectivity features is of great significance in both exploring brain science and analyzing cognitive impairment clinically. However, it remains a challenge to effectively fuse structural and functional features in exploring the complex brain network. In this paper, a novel brain structure-function fusing-representation learning (BSFL) model is proposed to effectively learn fused representation from diffusion tensor imaging (DTI) and resting-state functional magnetic resonance imaging (fMRI) for mild cognitive impairment (MCI) analysis. Specifically, the decomposition-fusion framework is developed to first decompose the feature space into the union of the uniform and unique spaces for each modality, and then adaptively fuse the decomposed features to learn MCI-related representation. Moreover, a knowledge-aware transformer module is designed to automatically capture local and global connectivity features throughout the brain. Also, a uniform-unique contrastive loss is further devised to make the decomposition more effective and enhance the complementarity of structural and functional features. The extensive experiments demonstrate that the proposed model achieves better performance than other competitive methods in predicting and analyzing MCI. More importantly, the proposed model could be a potential tool for reconstructing unified brain networks and predicting abnormal connections during the degenerative processes in MCI.
Collapse
|
16
|
Li W, Yang D, Ma C, Liu L. Identifying novel disease categories through divergence optimization: An approach to prevent misdiagnosis in medical imaging. Comput Biol Med 2023; 165:107403. [PMID: 37688992 DOI: 10.1016/j.compbiomed.2023.107403] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2023] [Revised: 08/09/2023] [Accepted: 08/26/2023] [Indexed: 09/11/2023]
Abstract
Given the significant changes in human lifestyle, the incidence of colon cancer has rapidly increased. The diagnostic process can often be complicated due to symptom similarities between colon cancer and other colon-related diseases. In an effort to minimize misdiagnosis, deep learning-based approaches for colon cancer diagnosis have notably progressed within the field of clinical medicine, offering more precise detection and improved patient outcomes. Despite these advancements, practical application of these techniques continues to encounter two major challenges: 1) due to the need for expert annotation, only a limited number of labels are utilized for diagnosis; and 2) the existence of diverse disease types can lead to misdiagnosis when the model encounters unfamiliar disease categories. To overcome these hurdles, we present a method incorporating Universal Domain Adaptation (UniDA). By optimizing the divergence of samples in the source domain, our method detects noise. Furthermore, to identify categories that are not present in the source domain, we optimize the divergence of unlabeled samples in the target domain. Experimental validation on two gastrointestinal datasets demonstrates that our method surpasses current state-of-the-art domain adaptation techniques in identifying unknown disease classes. It is worth noting that our proposed method is the first work of medical image diagnosis aimed at the identification of unknown categories of diseases.
Collapse
Affiliation(s)
- Wencai Li
- Department of General Surgery, The Second Affiliated Hospital of Shanghai University (Wenzhou Central Hospital), Wenzhou, Zhejiang, 325000, China.
| | - Daqing Yang
- Department of General Surgery, The Second Affiliated Hospital of Shanghai University (Wenzhou Central Hospital), Wenzhou, Zhejiang, 325000, China.
| | - Chao Ma
- School of Digital Media, Shenzhen Institute of Information Technology, Shenzhen, 518172, China.
| | - Lei Liu
- College of Computer Science, Sichuan University, Chengdu, Sichuan, 610065, China.
| |
Collapse
|
17
|
Liu Y, Dwivedi G, Boussaid F, Sanfilippo F, Yamada M, Bennamoun M. Inflating 2D convolution weights for efficient generation of 3D medical images. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2023; 240:107685. [PMID: 37429247 DOI: 10.1016/j.cmpb.2023.107685] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/12/2023] [Revised: 06/13/2023] [Accepted: 06/19/2023] [Indexed: 07/12/2023]
Abstract
BACKGROUND AND OBJECTIVE The generation of three-dimensional (3D) medical images has great application potential since it takes into account the 3D anatomical structure. Two problems prevent effective training of a 3D medical generative model: (1) 3D medical images are expensive to acquire and annotate, resulting in an insufficient number of training images, and (2) a large number of parameters are involved in 3D convolution. METHODS We propose a novel GAN model called 3D Split&Shuffle-GAN. To address the 3D data scarcity issue, we first pre-train a two-dimensional (2D) GAN model using abundant image slices and inflate the 2D convolution weights to improve the initialization of the 3D GAN. Novel 3D network architectures are proposed for both the generator and discriminator of the GAN model to significantly reduce the number of parameters while maintaining the quality of image generation. Several weight inflation strategies and parameter-efficient 3D architectures are investigated. RESULTS Experiments on both heart (Stanford AIMI Coronary Calcium) and brain (Alzheimer's Disease Neuroimaging Initiative) datasets show that our method leads to improved 3D image generation quality (14.7 improvements on Frchet inception distance) with significantly fewer parameters (only 48.5% of the baseline method). CONCLUSIONS We built a parameter-efficient 3D medical image generation model. Due to the efficiency and effectiveness, it has the potential to generate high-quality 3D brain and heart images for real use cases.
Collapse
Affiliation(s)
- Yanbin Liu
- School of Computing, Australian National University, Canberra, ACT, AU
| | - Girish Dwivedi
- Harry Perkins Institute of Medical Research, The University of Western Australia, Perth, WA, AU; Cardiology Department, Fiona Stanley Hospital, Perth, WA, AU
| | - Farid Boussaid
- Department of Electrical, Electronic and Computer Engineering, The University of Western Australia, Perth, WA, AU
| | - Frank Sanfilippo
- School of Population and Global Health, The University of Western Australia, Perth, WA, AU
| | - Makoto Yamada
- Okinawa Institute of Science and Technology, Okinawa, JP
| | - Mohammed Bennamoun
- Department of Computer Science and Software Engineering, The University of Western Australia, Perth, WA, AU.
| |
Collapse
|
18
|
Yu Y, She K, Liu J, Cai X, Shi K, Kwon OM. A super-resolution network for medical imaging via transformation analysis of wavelet multi-resolution. Neural Netw 2023; 166:162-173. [PMID: 37487412 DOI: 10.1016/j.neunet.2023.07.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2023] [Revised: 05/15/2023] [Accepted: 07/04/2023] [Indexed: 07/26/2023]
Abstract
In recent years, deep learning super-resolution models for progressive reconstruction have achieved great success. However, these models which refer to multi-resolution analysis basically ignore the information contained in the lower subspaces and do not explore the correlation between features in the wavelet and spatial domain, resulting in not fully utilizing the auxiliary information brought by multi-resolution analysis with multiple domains. Therefore, we propose a super-resolution network based on the wavelet multi-resolution framework (WMRSR) to capture the auxiliary information contained in multiple subspaces and to be aware of the interdependencies between spatial domain and wavelet domain features. Initially, the wavelet multi-resolution input (WMRI) is generated by combining wavelet sub-bands obtained from each subspace through wavelet multi-resolution analysis and the corresponding spatial domain image content, which serves as input to the network. Then, the WMRSR captures the corresponding features from the WMRI in the wavelet domain and spatial domain, respectively, and fuses them adaptively, thus learning fully explored features in multi-resolution and multi-domain. Finally, the high-resolution images are gradually reconstructed in the wavelet multi-resolution framework by our convolution-based wavelet transform module which is suitable for deep neural networks. Extensive experiments conducted on two public datasets demonstrate that our method outperforms other state-of-the-art methods in terms of objective and visual qualities.
Collapse
Affiliation(s)
- Yue Yu
- School of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu, 610054, Sichuan, China.
| | - Kun She
- School of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu, 610054, Sichuan, China.
| | - Jinhua Liu
- School of Mathematical and Computer Sciences, Shangrao Normal University, Shangrao 334001, Jiangxi, China.
| | - Xiao Cai
- School of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu, 610054, Sichuan, China.
| | - Kaibo Shi
- School of Electronic Information and Electrical Engineering, Chengdu University, Chengdu, 610106, Sichuan, China.
| | - O M Kwon
- School of Electrical Engineering, Chungbuk National University, Chungdae-ro, Seowon-Gu, 28644, Cheongju, South Korea.
| |
Collapse
|
19
|
Zuo Q, Hu J, Zhang Y, Pan J, Jing C, Chen X, Meng X, Hong J. Brain Functional Network Generation Using Distribution-Regularized Adversarial Graph Autoencoder with Transformer for Dementia Diagnosis. COMPUTER MODELING IN ENGINEERING & SCIENCES : CMES 2023; 137:2129-2147. [PMID: 38566839 PMCID: PMC7615791 DOI: 10.32604/cmes.2023.028732] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 04/04/2024]
Abstract
The topological connectivity information derived from the brain functional network can bring new insights for diagnosing and analyzing dementia disorders. The brain functional network is suitable to bridge the correlation between abnormal connectivities and dementia disorders. However, it is challenging to access considerable amounts of brain functional network data, which hinders the widespread application of data-driven models in dementia diagnosis. In this study, a novel distribution-regularized adversarial graph auto-Encoder (DAGAE) with transformer is proposed to generate new fake brain functional networks to augment the brain functional network dataset, improving the dementia diagnosis accuracy of data-driven models. Specifically, the label distribution is estimated to regularize the latent space learned by the graph encoder, which can make the learning process stable and the learned representation robust. Also, the transformer generator is devised to map the node representations into node-to-node connections by exploring the long-term dependence of highly-correlated distant brain regions. The typical topological properties and discriminative features can be preserved entirely. Furthermore, the generated brain functional networks improve the prediction performance using different classifiers, which can be applied to analyze other cognitive diseases. Attempts on the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset demonstrate that the proposed model can generate good brain functional networks. The classification results show adding generated data can achieve the best accuracy value of 85.33%, sensitivity value of 84.00%, specificity value of 86.67%. The proposed model also achieves superior performance compared with other related augmented models. Overall, the proposed model effectively improves cognitive disease diagnosis by generating diverse brain functional networks.
Collapse
Affiliation(s)
- Qiankun Zuo
- School of Information Engineering, Hubei University of Economics, Wuhan, 430205, China
- Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, 518055, China
| | - Junhua Hu
- State Key Laboratory of Simulation and Regulation of Water Cycle in River Basin, China Institute of Water Resources and Hydropower Research, Beijing, 100038, China
| | - Yudong Zhang
- School of Computing and Mathematic Sciences, University of Leicester, Leicester, LE1 7RH, UK
| | - Junren Pan
- Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, 518055, China
| | - Changhong Jing
- Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, 518055, China
| | - Xuhang Chen
- Faculty of Science and Technology, University of Macau, Macau, 999078, China
| | - Xiaobo Meng
- School of Geophysics, Chengdu University of Technology, Chengdu, 610059, China
| | - Jin Hong
- Laboratory of Artificial Intelligence and 3D Technologies for Cardiovascular Diseases, Guangdong Provincial Key Laboratory of South China Structural Heart Disease, Guangdong Provincial People’s Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 519041, China
- Medical Research Institute, Guangdong Provincial People’s Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 519041, China
| |
Collapse
|
20
|
Bhatt P, Sethi A, Tasgaonkar V, Shroff J, Pendharkar I, Desai A, Sinha P, Deshpande A, Joshi G, Rahate A, Jain P, Walambe R, Kotecha K, Jain NK. Machine learning for cognitive behavioral analysis: datasets, methods, paradigms, and research directions. Brain Inform 2023; 10:18. [PMID: 37524933 PMCID: PMC10390406 DOI: 10.1186/s40708-023-00196-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2023] [Accepted: 06/06/2023] [Indexed: 08/02/2023] Open
Abstract
Human behaviour reflects cognitive abilities. Human cognition is fundamentally linked to the different experiences or characteristics of consciousness/emotions, such as joy, grief, anger, etc., which assists in effective communication with others. Detection and differentiation between thoughts, feelings, and behaviours are paramount in learning to control our emotions and respond more effectively in stressful circumstances. The ability to perceive, analyse, process, interpret, remember, and retrieve information while making judgments to respond correctly is referred to as Cognitive Behavior. After making a significant mark in emotion analysis, deception detection is one of the key areas to connect human behaviour, mainly in the forensic domain. Detection of lies, deception, malicious intent, abnormal behaviour, emotions, stress, etc., have significant roles in advanced stages of behavioral science. Artificial Intelligence and Machine learning (AI/ML) has helped a great deal in pattern recognition, data extraction and analysis, and interpretations. The goal of using AI and ML in behavioral sciences is to infer human behaviour, mainly for mental health or forensic investigations. The presented work provides an extensive review of the research on cognitive behaviour analysis. A parametric study is presented based on different physical characteristics, emotional behaviours, data collection sensing mechanisms, unimodal and multimodal datasets, modelling AI/ML methods, challenges, and future research directions.
Collapse
Affiliation(s)
- Priya Bhatt
- Symbiosis Institute of Technology, Symbiosis International Deemed University, Pune, India
| | - Amanrose Sethi
- Symbiosis Institute of Technology, Symbiosis International Deemed University, Pune, India
| | - Vaibhav Tasgaonkar
- Symbiosis Institute of Technology, Symbiosis International Deemed University, Pune, India
| | - Jugal Shroff
- Symbiosis Institute of Technology, Symbiosis International Deemed University, Pune, India
| | - Isha Pendharkar
- Symbiosis Institute of Technology, Symbiosis International Deemed University, Pune, India
| | - Aditya Desai
- Symbiosis Institute of Technology, Symbiosis International Deemed University, Pune, India
| | - Pratyush Sinha
- Symbiosis Institute of Technology, Symbiosis International Deemed University, Pune, India
| | - Aditya Deshpande
- Symbiosis Institute of Technology, Symbiosis International Deemed University, Pune, India
| | - Gargi Joshi
- Symbiosis Institute of Technology, Symbiosis International Deemed University, Pune, India
| | - Anil Rahate
- Symbiosis Institute of Technology, Symbiosis International Deemed University, Pune, India
| | - Priyanka Jain
- Centre for Development of Advanced Computing (C-DAC), Delhi, India
| | - Rahee Walambe
- Symbiosis Institute of Technology, Symbiosis International Deemed University, Pune, India.
- Symbiosis Centre for Applied Artificial Intelligence, Symbiosis International Deemed University, Pune, India.
| | - Ketan Kotecha
- Symbiosis Institute of Technology, Symbiosis International Deemed University, Pune, India.
- Symbiosis Centre for Applied Artificial Intelligence, Symbiosis International Deemed University, Pune, India.
- UCSI University, Kuala Lumpur, Malaysia.
| | - N K Jain
- Centre for Development of Advanced Computing (C-DAC), Delhi, India
| |
Collapse
|
21
|
Gong C, Jing C, Chen X, Pun CM, Huang G, Saha A, Nieuwoudt M, Li HX, Hu Y, Wang S. Generative AI for brain image computing and brain network computing: a review. Front Neurosci 2023; 17:1203104. [PMID: 37383107 PMCID: PMC10293625 DOI: 10.3389/fnins.2023.1203104] [Citation(s) in RCA: 22] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2023] [Accepted: 05/22/2023] [Indexed: 06/30/2023] Open
Abstract
Recent years have witnessed a significant advancement in brain imaging techniques that offer a non-invasive approach to mapping the structure and function of the brain. Concurrently, generative artificial intelligence (AI) has experienced substantial growth, involving using existing data to create new content with a similar underlying pattern to real-world data. The integration of these two domains, generative AI in neuroimaging, presents a promising avenue for exploring various fields of brain imaging and brain network computing, particularly in the areas of extracting spatiotemporal brain features and reconstructing the topological connectivity of brain networks. Therefore, this study reviewed the advanced models, tasks, challenges, and prospects of brain imaging and brain network computing techniques and intends to provide a comprehensive picture of current generative AI techniques in brain imaging. This review is focused on novel methodological approaches and applications of related new methods. It discussed fundamental theories and algorithms of four classic generative models and provided a systematic survey and categorization of tasks, including co-registration, super-resolution, enhancement, classification, segmentation, cross-modality, brain network analysis, and brain decoding. This paper also highlighted the challenges and future directions of the latest work with the expectation that future research can be beneficial.
Collapse
Affiliation(s)
- Changwei Gong
- Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
- Department of Computer Science, University of Chinese Academy of Sciences, Beijing, China
| | - Changhong Jing
- Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
- Department of Computer Science, University of Chinese Academy of Sciences, Beijing, China
| | - Xuhang Chen
- Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
- Department of Computer and Information Science, University of Macau, Macau, China
| | - Chi Man Pun
- Department of Computer and Information Science, University of Macau, Macau, China
| | - Guoli Huang
- Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
| | - Ashirbani Saha
- Department of Oncology and School of Biomedical Engineering, McMaster University, Hamilton, ON, Canada
| | - Martin Nieuwoudt
- Institute for Biomedical Engineering, Stellenbosch University, Stellenbosch, South Africa
| | - Han-Xiong Li
- Department of Systems Engineering, City University of Hong Kong, Hong Kong, China
| | - Yong Hu
- Department of Orthopaedics and Traumatology, The University of Hong Kong, Hong Kong, China
| | - Shuqiang Wang
- Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
- Department of Computer Science, University of Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
22
|
Jin R, Cai Y, Zhang S, Yang T, Feng H, Jiang H, Zhang X, Hu Y, Liu J. Computational approaches for the reconstruction of optic nerve fibers along the visual pathway from medical images: a comprehensive review. Front Neurosci 2023; 17:1191999. [PMID: 37304011 PMCID: PMC10250625 DOI: 10.3389/fnins.2023.1191999] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2023] [Accepted: 05/09/2023] [Indexed: 06/13/2023] Open
Abstract
Optic never fibers in the visual pathway play significant roles in vision formation. Damages of optic nerve fibers are biomarkers for the diagnosis of various ophthalmological and neurological diseases; also, there is a need to prevent the optic nerve fibers from getting damaged in neurosurgery and radiation therapy. Reconstruction of optic nerve fibers from medical images can facilitate all these clinical applications. Although many computational methods are developed for the reconstruction of optic nerve fibers, a comprehensive review of these methods is still lacking. This paper described both the two strategies for optic nerve fiber reconstruction applied in existing studies, i.e., image segmentation and fiber tracking. In comparison to image segmentation, fiber tracking can delineate more detailed structures of optic nerve fibers. For each strategy, both conventional and AI-based approaches were introduced, and the latter usually demonstrates better performance than the former. From the review, we concluded that AI-based methods are the trend for optic nerve fiber reconstruction and some new techniques like generative AI can help address the current challenges in optic nerve fiber reconstruction.
Collapse
Affiliation(s)
- Richu Jin
- Research Institute of Trustworthy Autonomous Systems, Southern University of Science and Technology, Shenzhen, China
- Department of Computer Science and Engineering, Southern University of Science and Technology, Shenzhen, China
| | - Yongning Cai
- Research Institute of Trustworthy Autonomous Systems, Southern University of Science and Technology, Shenzhen, China
| | - Shiyang Zhang
- Department of Computer Science and Engineering, Southern University of Science and Technology, Shenzhen, China
| | - Ting Yang
- Department of Computer Science and Engineering, Southern University of Science and Technology, Shenzhen, China
| | - Haibo Feng
- Department of Computer Science and Engineering, Southern University of Science and Technology, Shenzhen, China
| | - Hongyang Jiang
- Department of Computer Science and Engineering, Southern University of Science and Technology, Shenzhen, China
| | - Xiaoqing Zhang
- Department of Computer Science and Engineering, Southern University of Science and Technology, Shenzhen, China
| | - Yan Hu
- Department of Computer Science and Engineering, Southern University of Science and Technology, Shenzhen, China
| | - Jiang Liu
- Research Institute of Trustworthy Autonomous Systems, Southern University of Science and Technology, Shenzhen, China
- Department of Computer Science and Engineering, Southern University of Science and Technology, Shenzhen, China
- Guangdong Provincial Key Laboratory of Brain-inspired Intelligent Computation, Department of Computer Science and Engineering, Southern University of Science and Technology, Shenzhen, China
| |
Collapse
|
23
|
Xiong X, Feng J, Zhang Y, Wu D, Yi S, Wang C, Liu R, He J. Improved HHT-microstate analysis of EEG in nicotine addicts. Front Neurosci 2023; 17:1174399. [PMID: 37292161 PMCID: PMC10244792 DOI: 10.3389/fnins.2023.1174399] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2023] [Accepted: 05/08/2023] [Indexed: 06/10/2023] Open
Abstract
Background Substance addiction is a chronic disease which causes great harm to modern society and individuals. At present, many studies have applied EEG analysis methods to the substance addiction detection and treatment. As a tool to describe the spatio-temporal dynamic characteristics of large-scale electrophysiological data, EEG microstate analysis has been widely used, which is an effective method to study the relationship between EEG electrodynamics and cognition or disease. Methods To study the difference of EEG microstate parameters of nicotine addicts at each frequency band, we combine an improved Hilbert Huang Transformation (HHT) decomposition with microstate analysis, which is applied to the EEG of nicotine addicts. Results After using improved HHT-Microstate method, we notice that there is significant difference in EEG microstates of nicotine addicts between viewing smoke pictures group (smoke) and viewing neutral pictures group (neutral). Firstly, there is a significant difference in EEG microstates at full-frequency band between smoke and neutral group. Compared with the FIR-Microstate method, the similarity index of microstate topographic maps at alpha and beta bands had significant differences between smoke and neutral group. Secondly, we find significant class × group interactions for microstate parameters at delta, alpha and beta bands. Finally, the microstate parameters at delta, alpha and beta bands obtained by the improved HHT-microstate analysis method are selected as features for classification and detection under the Gaussian kernel support vector machine. The highest accuracy is 92% sensitivity is 94% and specificity is 91%, which can more effectively detect and identify addiction diseases than FIR-Microstate and FIR-Riemann methods. Conclusion Thus, the improved HHT-Microstate analysis method can effectively identify substance addiction diseases and provide new ideas and insights for the brain research of nicotine addiction.
Collapse
Affiliation(s)
- Xin Xiong
- Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming, China
| | - Jiannan Feng
- Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming, China
| | - Yaru Zhang
- Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming, China
| | - Di Wu
- Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming, China
| | - Sanli Yi
- Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming, China
| | - Chunwu Wang
- College of Physics and Electronic Engineering, Hanshan Normal University, Chaozhou, China
| | - Ruixiang Liu
- Department of Clinical Psychology, Second People's Hospital of Yunnan, Kunming, China
| | - Jianfeng He
- Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming, China
| |
Collapse
|
24
|
Xu M, Ouyang Y, Yuan Z. Deep Learning Aided Neuroimaging and Brain Regulation. SENSORS (BASEL, SWITZERLAND) 2023; 23:s23114993. [PMID: 37299724 DOI: 10.3390/s23114993] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/24/2023] [Revised: 05/15/2023] [Accepted: 05/22/2023] [Indexed: 06/12/2023]
Abstract
Currently, deep learning aided medical imaging is becoming the hot spot of AI frontier application and the future development trend of precision neuroscience. This review aimed to render comprehensive and informative insights into the recent progress of deep learning and its applications in medical imaging for brain monitoring and regulation. The article starts by providing an overview of the current methods for brain imaging, highlighting their limitations and introducing the potential benefits of using deep learning techniques to overcome these limitations. Then, we further delve into the details of deep learning, explaining the basic concepts and providing examples of how it can be used in medical imaging. One of the key strengths is its thorough discussion of the different types of deep learning models that can be used in medical imaging including convolutional neural networks (CNNs), recurrent neural networks (RNNs), and generative adversarial network (GAN) assisted magnetic resonance imaging (MRI), positron emission tomography (PET)/computed tomography (CT), electroencephalography (EEG)/magnetoencephalography (MEG), optical imaging, and other imaging modalities. Overall, our review on deep learning aided medical imaging for brain monitoring and regulation provides a referrable glance for the intersection of deep learning aided neuroimaging and brain regulation.
Collapse
Affiliation(s)
- Mengze Xu
- Center for Cognition and Neuroergonomics, State Key Laboratory of Cognitive Neuroscience and Learning, Beijing Normal University, Zhuhai 519087, China
- Centre for Cognitive and Brain Sciences, Institute of Collaborative Innovation, University of Macau, Macau SAR 999078, China
| | - Yuanyuan Ouyang
- Nanomicro Sino-Europe Technology Company Limited, Zhuhai 519031, China
- Jiangfeng China-Portugal Technology Co., Ltd., Macau SAR 999078, China
| | - Zhen Yuan
- Centre for Cognitive and Brain Sciences, Institute of Collaborative Innovation, University of Macau, Macau SAR 999078, China
| |
Collapse
|
25
|
Wiem T, Ali D. Deep second generation wavelet autoencoders based on curvelet pooling for brain pathology classification. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2023.104675] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/12/2023]
|
26
|
A novel medical text classification model with Kalman filter for clinical decision making. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2022.104503] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
|
27
|
Aetesam H, Maji SK. Perceptually Motivated Generative Model for Magnetic Resonance Image Denoising. J Digit Imaging 2023; 36:725-738. [PMID: 36474088 PMCID: PMC10039195 DOI: 10.1007/s10278-022-00744-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2022] [Revised: 11/01/2022] [Accepted: 11/17/2022] [Indexed: 12/12/2022] Open
Abstract
Image denoising is an important preprocessing step in low-level vision problems involving biomedical images. Noise removal techniques can greatly benefit raw corrupted magnetic resonance images (MRI). It has been discovered that the MR data is corrupted by a mixture of Gaussian-impulse noise caused by detector flaws and transmission errors. This paper proposes a deep generative model (GenMRIDenoiser) for dealing with this mixed noise scenario. This work makes four contributions. To begin, Wasserstein generative adversarial network (WGAN) is used in model training to mitigate the problem of vanishing gradient, mode collapse, and convergence issues encountered while training a vanilla GAN. Second, a perceptually motivated loss function is used to guide the training process in order to preserve the low-level details in the form of high-frequency components in the image. Third, batch renormalization is used between the convolutional and activation layers to prevent performance degradation under the assumption of non-independent and identically distributed (non-iid) data. Fourth, global feature attention module (GFAM) is appended at the beginning and end of the parallel ensemble blocks to capture the long-range dependencies that are often lost due to the small receptive field of convolutional filters. The experimental results over synthetic data and MRI stack obtained from real MR scanners indicate the potential utility of the proposed technique across a wide range of degradation scenarios.
Collapse
Affiliation(s)
- Hazique Aetesam
- Department of Computer Science and Engineering, Indian Institute of Technology Patna, Patna, 801106 India
| | - Suman Kumar Maji
- Department of Computer Science and Engineering, Indian Institute of Technology Patna, Patna, 801106 India
| |
Collapse
|
28
|
Zhao X, Liu L, Heidari AA, Chen Y, Ma BJ, Chen H, Quan S. An enhanced ant colony optimizer with Cauchy-Gaussian fusion and novel movement strategy for multi-threshold COVID-19 X-ray image segmentation. Front Neuroinform 2023; 17:1126783. [PMID: 37006638 PMCID: PMC10064065 DOI: 10.3389/fninf.2023.1126783] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2022] [Accepted: 02/16/2023] [Indexed: 03/19/2023] Open
Abstract
The novel coronavirus pneumonia (COVID-19) is a respiratory disease of great concern in terms of its dissemination and severity, for which X-ray imaging-based diagnosis is one of the effective complementary diagnostic methods. It is essential to be able to separate and identify lesions from their pathology images regardless of the computer-aided diagnosis techniques. Therefore, image segmentation in the pre-processing stage of COVID-19 pathology images would be more helpful for effective analysis. In this paper, to achieve highly effective pre-processing of COVID-19 pathological images by using multi-threshold image segmentation (MIS), an enhanced version of ant colony optimization for continuous domains (MGACO) is first proposed. In MGACO, not only a new move strategy is introduced, but also the Cauchy-Gaussian fusion strategy is incorporated. It has been accelerated in terms of convergence speed and has significantly enhanced its ability to jump out of the local optimum. Furthermore, an MIS method (MGACO-MIS) based on MGACO is developed, where it applies the non-local means, 2D histogram as the basis, and employs 2D Kapur’s entropy as the fitness function. To demonstrate the performance of MGACO, we qualitatively analyze it in detail and compare it with other peers on 30 benchmark functions from IEEE CEC2014, which proves that it has a stronger capability of solving problems over the original ant colony optimization for continuous domains. To verify the segmentation effect of MGACO-MIS, we conducted a comparison experiment with eight other similar segmentation methods based on real pathology images of COVID-19 at different threshold levels. The final evaluation and analysis results fully demonstrate that the developed MGACO-MIS is sufficient to obtain high-quality segmentation results in the COVID-19 image segmentation and has stronger adaptability to different threshold levels than other methods. Therefore, it has been well-proven that MGACO is an excellent swarm intelligence optimization algorithm, and MGACO-MIS is also an excellent segmentation method.
Collapse
Affiliation(s)
- Xiuzhi Zhao
- College of Artificial Intelligence, Zhejiang Industry & Trade Vocational College, Wenzhou, Zhejiang, China
| | - Lei Liu
- College of Computer Science, Sichuan University, Chengdu, Sichuan, China
- *Correspondence: Lei Liu,
| | - Ali Asghar Heidari
- School of Surveying and Geospatial Engineering, College of Engineering, University of Tehran, Tehran, Iran
| | - Yi Chen
- Institute of Big Data and Information Technology, Wenzhou University, Wenzhou, China
| | - Benedict Jun Ma
- Department of Industrial and Manufacturing Systems Engineering, The University of Hong Kong, Hong Kong, Hong Kong SAR, China
| | - Huiling Chen
- Institute of Big Data and Information Technology, Wenzhou University, Wenzhou, China
- Huiling Chen,
| | - Shichao Quan
- Department of Big Data in Health Science, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, China
- Key Laboratory of Intelligent Treatment and Life Support for Critical Diseases of Zhejiang Province, Wenzhou, China
- Zhejiang Engineering Research Center for Hospital Emergency and Process Digitization, Wenzhou, China
- Shichao Quan,
| |
Collapse
|
29
|
Feature similarity rank-based information distillation network for lightweight image superresolution. Knowl Based Syst 2023. [DOI: 10.1016/j.knosys.2023.110437] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/05/2023]
|
30
|
Wang C, Yu J, Zhong J, Han S, Qi Y, Fang B, Li X. Prior knowledge-based precise diagnosis of blend sign from head computed tomography. Front Neurosci 2023; 17:1112355. [PMID: 36845414 PMCID: PMC9950259 DOI: 10.3389/fnins.2023.1112355] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2022] [Accepted: 01/17/2023] [Indexed: 02/12/2023] Open
Abstract
Introduction Automated diagnosis of intracranial hemorrhage on head computed tomography (CT) plays a decisive role in clinical management. This paper presents a prior knowledge-based precise diagnosis of blend sign network from head CT scans. Method We employ the object detection task as an auxiliary task in addition to the classification task, which could incorporate the hemorrhage location as prior knowledge into the detection framework. The auxiliary task could help the model pay more attention to the regions with hemorrhage, which is beneficial for distinguishing the blend sign. Furthermore, we propose a self-knowledge distillation strategy to deal with inaccuracy annotations. Results In the experiment, we retrospectively collected 1749 anonymous non-contrast head CT scans from the First Affiliated Hospital of China Medical University. The dataset contains three categories: no intracranial hemorrhage (non-ICH), normal intracranial hemorrhage (normal ICH), and blend sign. The experimental results demonstrate that our method performs better than other methods. Discussion Our method has the potential to assist less-experienced head CT interpreters, reduce radiologists' workload, and improve efficiency in natural clinical settings.
Collapse
Affiliation(s)
- Chen Wang
- College of Computer Science, Chongqing University, Chongqing, China
| | - Jiefu Yu
- Department of Neurosurgery, The First Hospital of China Medical University, Shenyang, China
| | - Jiang Zhong
- College of Computer Science, Chongqing University, Chongqing, China,*Correspondence: Jiang Zhong ✉
| | - Shuai Han
- Department of Neurosurgery, Shengjing Hospital of China Medical University, Shenyang, China,Shuai Han ✉
| | - Yafei Qi
- College of Computer Science and Engineering, Central South University, Changsha, China
| | - Bin Fang
- College of Computer Science, Chongqing University, Chongqing, China
| | - Xue Li
- School of Information Technology and Electrical Engineering, The University of Queensland, Brisbane, QLD, Australia
| |
Collapse
|
31
|
Data-Driven Phenotyping of Alzheimer's Disease under Epigenetic Conditions Using Partial Volume Correction of PET Studies and Manifold Learning. Biomedicines 2023; 11:biomedicines11020273. [PMID: 36830810 PMCID: PMC9953610 DOI: 10.3390/biomedicines11020273] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2022] [Revised: 01/10/2023] [Accepted: 01/16/2023] [Indexed: 01/20/2023] Open
Abstract
Alzheimer's disease (AD) is the most common form of dementia. An increasing number of studies have confirmed epigenetic changes in AD. Consequently, a robust phenotyping mechanism must take into consideration the environmental effects on the patient in the generation of phenotypes. Positron Emission Tomography (PET) is employed for the quantification of pathological amyloid deposition in brain tissues. The objective is to develop a new methodology for the hyperparametric analysis of changes in cognitive scores and PET features to test for there being multiple AD phenotypes. We used a computational method to identify phenotypes in a retrospective cohort study (532 subjects), using PET and Magnetic Resonance Imaging (MRI) images and neuropsychological assessments, to develop a novel computational phenotyping method that uses Partial Volume Correction (PVC) and subsets of neuropsychological assessments in a non-biased fashion. Our pipeline is based on a Regional Spread Function (RSF) method for PVC and a t-distributed Stochastic Neighbor Embedding (t-SNE) manifold. The results presented demonstrate that (1) the approach to data-driven phenotyping is valid, (2) the different techniques involved in the pipelines produce different results, and (3) they permit us to identify the best phenotyping pipeline. The method identifies three phenotypes and permits us to analyze them under epigenetic conditions.
Collapse
|
32
|
Gong C, Chen X, Mughal B, Wang S. Addictive brain-network identification by spatial attention recurrent network with feature selection. Brain Inform 2023; 10:2. [PMID: 36625937 PMCID: PMC9832209 DOI: 10.1186/s40708-022-00182-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2022] [Accepted: 12/01/2022] [Indexed: 01/11/2023] Open
Abstract
Addiction in the brain is associated with adaptive changes that reshape addiction-related brain regions and lead to functional abnormalities that cause a range of behavioral changes, and functional magnetic resonance imaging (fMRI) studies can reveal complex dynamic patterns of brain functional change. However, it is still a challenge to identify functional brain networks and discover region-level biomarkers between nicotine addiction (NA) and healthy control (HC) groups. To tackle it, we transform the fMRI of the rat brain into a network with biological attributes and propose a novel feature-selected framework to extract and select the features of addictive brain regions and identify these graph-level networks. In this framework, spatial attention recurrent network (SARN) is designed to capture the features with spatial and time-sequential information. And the Bayesian feature selection(BFS) strategy is adopted to optimize the model and improve classification tasks by restricting features. Our experiments on the addiction brain imaging dataset obtain superior identification performance and interpretable biomarkers associated with addiction-relevant brain regions.
Collapse
Affiliation(s)
- Changwei Gong
- grid.9227.e0000000119573309Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, 518055 China ,grid.410726.60000 0004 1797 8419University of Chinese Academy of Sciences, Beijing, 100049 China
| | - Xinyi Chen
- grid.9227.e0000000119573309Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, 518055 China ,grid.263817.90000 0004 1773 1790Southern University of Science and Technology, Shenzhen, 518055 China
| | - Bushra Mughal
- grid.5808.50000 0001 1503 7226Faculty of Engineering, University of Porto, Porto, 0035122 Portugal
| | - Shuqiang Wang
- grid.9227.e0000000119573309Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, 518055 China
| |
Collapse
|
33
|
TA-GAN: transformer-driven addiction-perception generative adversarial network. Neural Comput Appl 2022. [DOI: 10.1007/s00521-022-08187-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
|
34
|
Zuo Q, Lu L, Wang L, Zuo J, Ouyang T. Constructing brain functional network by Adversarial Temporal-Spatial Aligned Transformer for early AD analysis. Front Neurosci 2022; 16:1087176. [PMID: 36518529 PMCID: PMC9742604 DOI: 10.3389/fnins.2022.1087176] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2022] [Accepted: 11/10/2022] [Indexed: 09/19/2023] Open
Abstract
Introduction The brain functional network can describe the spontaneous activity of nerve cells and reveal the subtle abnormal changes associated with brain disease. It has been widely used for analyzing early Alzheimer's disease (AD) and exploring pathological mechanisms. However, the current methods of constructing functional connectivity networks from functional magnetic resonance imaging (fMRI) heavily depend on the software toolboxes, which may lead to errors in connection strength estimation and bad performance in disease analysis because of many subjective settings. Methods To solve this problem, in this paper, a novel Adversarial Temporal-Spatial Aligned Transformer (ATAT) model is proposed to automatically map 4D fMRI into functional connectivity network for early AD analysis. By incorporating the volume and location of anatomical brain regions, the region-guided feature learning network can roughly focus on local features for each brain region. Also, the spatial-temporal aligned transformer network is developed to adaptively adjust boundary features of adjacent regions and capture global functional connectivity patterns of distant regions. Furthermore, a multi-channel temporal discriminator is devised to distinguish the joint distributions of the multi-region time series from the generator and the real sample. Results Experimental results on the Alzheimer's Disease Neuroimaging Initiative (ADNI) proved the effectiveness and superior performance of the proposed model in early AD prediction and progression analysis. Discussion To verify the reliability of the proposed model, the detected important ROIs are compared with clinical studies and show partial consistency. Furthermore, the most significant altered connectivity reflects the main characteristics associated with AD. Conclusion Generally, the proposed ATAT provides a new perspective in constructing functional connectivity networks and is able to evaluate the disease-related changing characteristics at different stages for neuroscience exploration and clinical disease analysis.
Collapse
Affiliation(s)
- Qiankun Zuo
- School of Information Engineering, Hubei University of Economics, Wuhan, China
- CAS Key Laboratory of Human-Machine Intelligence-Synergy Systems, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, and the SIAT Branch, Shenzhen Institute of Artificial Intelligence and Robotics for Society, Shenzhen, China
| | - Libin Lu
- School of Mathematics and Computer Science, Wuhan Polytechnic University, Wuhan, China
| | - Lin Wang
- CAS Key Laboratory of Human-Machine Intelligence-Synergy Systems, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, and the SIAT Branch, Shenzhen Institute of Artificial Intelligence and Robotics for Society, Shenzhen, China
- Guangdong-Hong Kong-Macau Joint Laboratory of Human-Machine Intelligence-Synergy Systems, Shenzhen, China
| | - Jiahui Zuo
- State Key Laboratory of Petroleum Resource and Prospecting, and Unconventional Petroleum Research Institute, China University of Petroleum, Beijing, China
| | - Tao Ouyang
- State Key Laboratory of Geomechanics and Geotechnical Engineering, Institute of Rock and Soil Mechanics, Chinese Academy of Sciences, Wuhan, China
| |
Collapse
|
35
|
A Systematic Literature Review on Applications of GAN-Synthesized Images for Brain MRI. FUTURE INTERNET 2022. [DOI: 10.3390/fi14120351] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
With the advances in brain imaging, magnetic resonance imaging (MRI) is evolving as a popular radiological tool in clinical diagnosis. Deep learning (DL) methods can detect abnormalities in brain images without an extensive manual feature extraction process. Generative adversarial network (GAN)-synthesized images have many applications in this field besides augmentation, such as image translation, registration, super-resolution, denoising, motion correction, segmentation, reconstruction, and contrast enhancement. The existing literature was reviewed systematically to understand the role of GAN-synthesized dummy images in brain disease diagnosis. Web of Science and Scopus databases were extensively searched to find relevant studies from the last 6 years to write this systematic literature review (SLR). Predefined inclusion and exclusion criteria helped in filtering the search results. Data extraction is based on related research questions (RQ). This SLR identifies various loss functions used in the above applications and software to process brain MRIs. A comparative study of existing evaluation metrics for GAN-synthesized images helps choose the proper metric for an application. GAN-synthesized images will have a crucial role in the clinical sector in the coming years, and this paper gives a baseline for other researchers in the field.
Collapse
|
36
|
Yang X, Ye X, Zhao D, Heidari AA, Xu Z, Chen H, Li Y. Multi-threshold image segmentation for melanoma based on Kapur’s entropy using enhanced ant colony optimization. Front Neuroinform 2022; 16:1041799. [PMID: 36387585 PMCID: PMC9663822 DOI: 10.3389/fninf.2022.1041799] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2022] [Accepted: 10/10/2022] [Indexed: 11/07/2022] Open
Abstract
Melanoma is a malignant tumor formed by the cancerous transformation of melanocytes, and its medical images contain much information. However, the percentage of the critical information in the image is small, and the noise is non-uniformly distributed. We propose a new multi-threshold image segmentation model based on the two-dimensional histogram approach to the above problem. We present an enhanced ant colony optimization for continuous domains (EACOR) in the proposed model based on the soft besiege and chase strategies. Further, EACOR is combined with two-dimensional Kapur’s entropy to search for the optimal thresholds. An experiment on the IEEE CEC2014 benchmark function was conducted to measure the reliable global search capability of the EACOR algorithm in the proposed model. Moreover, we have also conducted several sets of experiments to test the validity of the image segmentation model proposed in this paper. The experimental results show that the segmented images from the proposed model outperform the comparison method in several evaluation metrics. Ultimately, the model proposed in this paper can provide high-quality samples for subsequent analysis of melanoma pathology images.
Collapse
Affiliation(s)
- Xiao Yang
- School of Computer Science and Technology, Changchun University of Science and Technology, Changchun, China
| | - Xiaojia Ye
- School of Statistics and Mathematics, Shanghai Lixin University of Accounting and Finance, Shanghai, China
- *Correspondence: Xiaojia Ye,
| | - Dong Zhao
- College of Computer Science and Technology, Changchun Normal University, Changchun, China
- Dong Zhao,
| | - Ali Asghar Heidari
- School of Surveying and Geospatial Engineering, College of Engineering, University of Tehran, Tehran, Iran
| | - Zhangze Xu
- College of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou, China
| | - Huiling Chen
- College of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou, China
| | - Yangyang Li
- Department of Pathology, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, China
- Yangyang Li,
| |
Collapse
|
37
|
Khairuddin MZF, Lu Hui P, Hasikin K, Abd Razak NA, Lai KW, Mohd Saudi AS, Ibrahim SS. Occupational Injury Risk Mitigation: Machine Learning Approach and Feature Optimization for Smart Workplace Surveillance. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022; 19:13962. [PMID: 36360843 PMCID: PMC9653932 DOI: 10.3390/ijerph192113962] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/20/2022] [Revised: 10/09/2022] [Accepted: 10/25/2022] [Indexed: 06/16/2023]
Abstract
Forecasting the severity of occupational injuries shall be all industries' top priority. The use of machine learning is theoretically valuable to assist the predictive analysis, thus, this study attempts to propose a feature-optimized predictive model for anticipating occupational injury severity. A public database of 66,405 occupational injury records from OSHA is analyzed using five sets of machine learning models: Support Vector Machine, K-Nearest Neighbors, Naïve Bayes, Decision Tree, and Random Forest. For model comparison, Random Forest outperformed other models with higher accuracy and F1-score. Therefore, it highlighted the potential of ensemble learning as a more accurate prediction model in the field of occupational injury. In constructing the model, this study also proposed the feature optimization technique that revealed the three most important features; 'nature of injury', 'type of event', and 'affected body part' in developing model. The accuracy of the Random Forest model was improved by 0.5% or 0.895 and 0.954 for the prediction of hospitalization and amputation, respectively by redeveloping and optimizing the model with hyperparameter tuning. The feature optimization is essential in providing insight knowledge to the Safety and Health Practitioners for future injury corrective and preventive strategies. This study has shown promising potential for smart workplace surveillance.
Collapse
Affiliation(s)
- Mohamed Zul Fadhli Khairuddin
- Department of Biomedical Engineering, Faculty of Engineering, Universiti Malaya, Kuala Lumpur 50603, Malaysia
- Environmental Healthcare Section, Institute of Medical Science Technology, Universiti Kuala Lumpur, Kajang 40300, Selangor, Malaysia
| | - Puat Lu Hui
- Department of Biomedical Engineering, Faculty of Engineering, Universiti Malaya, Kuala Lumpur 50603, Malaysia
| | - Khairunnisa Hasikin
- Department of Biomedical Engineering, Faculty of Engineering, Universiti Malaya, Kuala Lumpur 50603, Malaysia
- Centre of Intelligent Systems for Emerging Technology (CISET), Faculty of Engineering, Universiti Malaya, Kuala Lumpur 50603, Malaysia
| | - Nasrul Anuar Abd Razak
- Department of Biomedical Engineering, Faculty of Engineering, Universiti Malaya, Kuala Lumpur 50603, Malaysia
| | - Khin Wee Lai
- Department of Biomedical Engineering, Faculty of Engineering, Universiti Malaya, Kuala Lumpur 50603, Malaysia
| | - Ahmad Shakir Mohd Saudi
- Centre of Water Engineering Technology, Water Energy Section, Malaysia France Institute, Universiti Kuala Lumpur, Bangi 43650, Selangor, Malaysia
| | - Siti Salwa Ibrahim
- Negeri Sembilan State Health Department, Seremban 70300, Negeri Sembilan, Malaysia
| |
Collapse
|
38
|
Gong C, Xue B, Jing C, He CH, Wu GC, Lei B, Wang S. Time-sequential graph adversarial learning for brain modularity community detection. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2022; 19:13276-13293. [PMID: 36654046 DOI: 10.3934/mbe.2022621] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
Brain community detection is an efficient method to represent the communities of brain networks. However, time-variable functions of the brain and the intricate brain community structure impose a great challenge on it. In this paper, a time-sequential graph adversarial learning (TGAL) framework is proposed to detect brain communities and characterize the structure of communities from brain networks. In the framework, a novel time-sequential graph neural network is designed as an encoder to extract efficient graph representations by spatio-temporal attention mechanism. Since it is difficult to capture the community structure, the measurable modularity loss is used to optimize by maximizing the modularity of the community. In addition, the framework employs an adversarial scheme to guide the learning of representation. The effectiveness of our model is shown through experiments on the real-world brain network datasets, and the great performance of brain community detection demonstrates the advantage of the proposed framework.
Collapse
Affiliation(s)
- Changwei Gong
- Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518060, China
- Department of Computer Science, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Bing Xue
- Faculty of Computer science, University of Malaya, Malaya
| | - Changhong Jing
- Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518060, China
- Department of Computer Science, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Chun-Hui He
- School of Civil Engineering, Xi'an University of Architecture and Technology, Xi'an 710055, China
| | - Guo-Cheng Wu
- Data Recovery Key Laboratory of Sichuan Province, College of Mathematics and Information Science, Neijiang Normal University, Neijiang 641100, China
| | - Baiying Lei
- School of Biomedical Engineering, Shenzhen University, Shenzhen 518060, China
| | - Shuqiang Wang
- Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518060, China
| |
Collapse
|
39
|
Bi-MGAN: Bidirectional T1-to-T2 MRI images prediction using multi-generative multi-adversarial nets. Biomed Signal Process Control 2022. [DOI: 10.1016/j.bspc.2022.103994] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|
40
|
Feature-Selected Graph Spatial Attention Network for Addictive Brain-Networks Identification. Brain Inform 2022. [DOI: 10.1007/978-3-031-15037-1_26] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/15/2022] Open
|