2
|
Xu J, Hua Q, Jia X, Zheng Y, Hu Q, Bai B, Miao J, Zhu L, Zhang M, Tao R, Li Y, Luo T, Xie J, Zheng X, Gu P, Xing F, He C, Song Y, Dong Y, Xia S, Zhou J. Synthetic Breast Ultrasound Images: A Study to Overcome Medical Data Sharing Barriers. RESEARCH (WASHINGTON, D.C.) 2024; 7:0532. [PMID: 39628833 PMCID: PMC11612121 DOI: 10.34133/research.0532] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/11/2024] [Revised: 10/02/2024] [Accepted: 10/24/2024] [Indexed: 12/06/2024]
Abstract
The vast potential of medical big data to enhance healthcare outcomes remains underutilized due to privacy concerns, which restrict cross-center data sharing and the construction of diverse, large-scale datasets. To address this challenge, we developed a deep generative model aimed at synthesizing medical data to overcome data sharing barriers, with a focus on breast ultrasound (US) image synthesis. Specifically, we introduce CoLDiT, a conditional latent diffusion model with a transformer backbone, to generate US images of breast lesions across various Breast Imaging Reporting and Data System (BI-RADS) categories. Using a training dataset of 9,705 US images from 5,243 patients across 202 hospitals with diverse US systems, CoLDiT generated breast US images without duplicating private information, as confirmed through nearest-neighbor analysis. Blinded reader studies further validated the realism of these images, with area under the receiver operating characteristic curve (AUC) scores ranging from 0.53 to 0.77. Additionally, synthetic breast US images effectively augmented the training dataset for BI-RADS classification, achieving performance comparable to that using an equal-sized training set comprising solely real images (P = 0.81 for AUC). Our findings suggest that synthetic data, such as CoLDiT-generated images, offer a viable, privacy-preserving solution to facilitate secure medical data sharing and advance the utilization of medical big data.
Collapse
Affiliation(s)
- JiaLe Xu
- Department of Ultrasound, Ruijin Hospital,
Shanghai Jiao Tong University School of Medicine, 200025 Shanghai, China
- College of Health Science and Technology,
Shanghai Jiao Tong University School of Medicine, 200025 Shanghai, China
| | - Qing Hua
- Department of Ultrasound, Ruijin Hospital,
Shanghai Jiao Tong University School of Medicine, 200025 Shanghai, China
- College of Health Science and Technology,
Shanghai Jiao Tong University School of Medicine, 200025 Shanghai, China
| | - XiaoHong Jia
- Department of Ultrasound, Ruijin Hospital,
Shanghai Jiao Tong University School of Medicine, 200025 Shanghai, China
- College of Health Science and Technology,
Shanghai Jiao Tong University School of Medicine, 200025 Shanghai, China
| | - YuHang Zheng
- Department of Ultrasound, Ruijin Hospital,
Shanghai Jiao Tong University School of Medicine, 200025 Shanghai, China
- College of Health Science and Technology,
Shanghai Jiao Tong University School of Medicine, 200025 Shanghai, China
| | - Qiao Hu
- Department of Ultrasound,
The People’s Hospital of Guangxi Zhuang Autonomous Region, Nanning, 530021 Guangxi, China
| | - BaoYan Bai
- Department of Ultrasound,
Yan’an University Affiliated Hospital, Yan’an, 716000 Shaanxi, China
| | - Juan Miao
- Department of Ultrasound,
Zigong Fourth People’s Hospital, Zigong, 643000 Sichuan, China
| | - LiSha Zhu
- Department of Ultrasound,
Yichun City People’s Hospital, Yichun, 336000 Jiangxi, China
| | - MeiXiang Zhang
- Department of Ultrasound, Ruijin Hospital,
Shanghai Jiao Tong University School of Medicine, 200025 Shanghai, China
- College of Health Science and Technology,
Shanghai Jiao Tong University School of Medicine, 200025 Shanghai, China
| | - RuoLin Tao
- Department of Ultrasound, Ruijin Hospital,
Shanghai Jiao Tong University School of Medicine, 200025 Shanghai, China
- College of Health Science and Technology,
Shanghai Jiao Tong University School of Medicine, 200025 Shanghai, China
| | - YuHeng Li
- Department of Ultrasound, Ruijin Hospital,
Shanghai Jiao Tong University School of Medicine, 200025 Shanghai, China
- College of Health Science and Technology,
Shanghai Jiao Tong University School of Medicine, 200025 Shanghai, China
| | - Ting Luo
- Department of Ultrasound, Ruijin Hospital,
Shanghai Jiao Tong University School of Medicine, 200025 Shanghai, China
- College of Health Science and Technology,
Shanghai Jiao Tong University School of Medicine, 200025 Shanghai, China
| | - Jun Xie
- Shanghai Aitrox Technology Corporation Limited, 200050 Shanghai, China
| | - XueBin Zheng
- Shanghai Aitrox Technology Corporation Limited, 200050 Shanghai, China
| | - PengChen Gu
- Shanghai Aitrox Technology Corporation Limited, 200050 Shanghai, China
| | - FengYuan Xing
- Shanghai Aitrox Technology Corporation Limited, 200050 Shanghai, China
| | - Chuan He
- Shanghai Aitrox Technology Corporation Limited, 200050 Shanghai, China
| | - YanYan Song
- Department of Biostatistics, Institute of Medical Sciences,
Shanghai Jiao Tong University School of Medicine, 200025 Shanghai, China
| | - YiJie Dong
- Department of Ultrasound, Ruijin Hospital,
Shanghai Jiao Tong University School of Medicine, 200025 Shanghai, China
- College of Health Science and Technology,
Shanghai Jiao Tong University School of Medicine, 200025 Shanghai, China
| | - ShuJun Xia
- Department of Ultrasound, Ruijin Hospital,
Shanghai Jiao Tong University School of Medicine, 200025 Shanghai, China
- College of Health Science and Technology,
Shanghai Jiao Tong University School of Medicine, 200025 Shanghai, China
| | - JianQiao Zhou
- Department of Ultrasound, Ruijin Hospital,
Shanghai Jiao Tong University School of Medicine, 200025 Shanghai, China
- College of Health Science and Technology,
Shanghai Jiao Tong University School of Medicine, 200025 Shanghai, China
| |
Collapse
|
3
|
Rajaraman S, Liang Z, Xue Z, Antani S. Addressing Class Imbalance with Latent Diffusion-based Data Augmentation for Improving Disease Classification in Pediatric Chest X-rays. PROCEEDINGS. IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE 2024; 2024:5059-5066. [PMID: 40134830 PMCID: PMC11936509 DOI: 10.1109/bibm62325.2024.10822172] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/27/2025]
Abstract
Deep learning (DL) has transformed medical image classification; however, its efficacy is often limited by significant data imbalance due to far fewer cases (minority class) compared to controls (majority class). It has been shown that synthetic image augmentation techniques can simulate clinical variability, leading to enhanced model performance. We hypothesize that they could also mitigate the challenge of data imbalance, thereby addressing overfitting to the majority class and enhancing generalization. Recently, latent diffusion models (LDMs) have shown promise in synthesizing high-quality medical images. This study evaluates the effectiveness of a text-guided image-to-image LDM in synthesizing disease-positive chest X-rays (CXRs) and augmenting a pediatric CXR dataset to improve classification performance. We first establish baseline performance by fine-tuning an ImageNet-pretrained Inception-V3 model on class-imbalanced data for two tasks-normal vs. pneumonia and normal vs. bronchopneumonia. Next, we fine-tune individual text-guided image-to-image LDMs to generate CXRs showing signs of pneumonia and bronchopneumonia. The Inception-V3 model is retrained on an updated data set that includes these synthesized images as part of augmented training and validation sets. Classification performance is compared using balanced accuracy, sensitivity, specificity, F-score, Matthews correlation coefficient (MCC), Kappa, and Youden's index against the baseline performance. Results show that the augmentation significantly improves Youden's index (p<0.05) and markedly enhances other metrics, indicating that data augmentation using LDM-synthesized images is an effective strategy for addressing class imbalance in medical image classification.
Collapse
Affiliation(s)
- Sivaramakrishnan Rajaraman
- Division of Intramural Research, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Zhaohui Liang
- Division of Intramural Research, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Zhiyun Xue
- Division of Intramural Research, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Sameer Antani
- Division of Intramural Research, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| |
Collapse
|
4
|
Hardy R, Klepich J, Mitchell R, Hall S, Villareal J, Ilin C. Improving nonalcoholic fatty liver disease classification performance with latent diffusion models. Sci Rep 2023; 13:21619. [PMID: 38062049 PMCID: PMC10703886 DOI: 10.1038/s41598-023-48062-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Accepted: 11/22/2023] [Indexed: 12/18/2023] Open
Abstract
Integrating deep learning with clinical expertise holds great potential for addressing healthcare challenges and empowering medical professionals with improved diagnostic tools. However, the need for annotated medical images is often an obstacle to leveraging the full power of machine learning models. Our research demonstrates that by combining synthetic images, generated using diffusion models, with real images, we can enhance nonalcoholic fatty liver disease (NAFLD) classification performance even in low-data regime settings. We evaluate the quality of the synthetic images by comparing two metrics: Inception Score (IS) and Fréchet Inception Distance (FID), computed on diffusion- and generative adversarial network (GAN)-generated images. Our results show superior performance for the diffusion-generated images, with a maximum IS score of 1.90 compared to 1.67 for GANs, and a minimum FID score of 69.45 compared to 100.05 for GANs. Utilizing a partially frozen CNN backbone (EfficientNet v1), our synthetic augmentation method achieves a maximum image-level ROC AUC of 0.904 on a NAFLD prediction task.
Collapse
Affiliation(s)
- Romain Hardy
- School of Information, U.C. Berkeley, Berkeley, CA, USA
| | - Joe Klepich
- School of Information, U.C. Berkeley, Berkeley, CA, USA
| | - Ryan Mitchell
- School of Information, U.C. Berkeley, Berkeley, CA, USA
| | - Steve Hall
- School of Information, U.C. Berkeley, Berkeley, CA, USA
| | | | - Cornelia Ilin
- School of Information, U.C. Berkeley, Berkeley, CA, USA.
| |
Collapse
|