1. Li Y, Chen Q, Li H, Wang S, Chen N, Han T, Wang K, Yu Q, Cao Z, Tang J. MFNet: Meta-learning based on frequency-space mix for MRI segmentation in nasopharyngeal carcinoma. J Cell Mol Med 2024; 28:e18355. [PMID: 38685683; PMCID: PMC11058331; DOI: 10.1111/jcmm.18355]
Abstract
Deep learning techniques have been applied to medical image segmentation and demonstrated expert-level performance. Due to the poor generalization ability of models deployed in different centres, common solutions, such as transfer learning and domain adaptation techniques, have been proposed to mitigate this issue. However, these solutions necessitate retraining the models with target-domain data and annotations, which limits their deployment in clinical settings in unseen domains. We evaluated the performance of domain generalization methods on the task of MRI segmentation of nasopharyngeal carcinoma (NPC) by collecting a new dataset of 321 patients with manually annotated MRIs from two hospitals. We transformed the MRI modalities, including T1WI, T2WI and CE-T1WI, from the spatial domain to the frequency domain using the Fourier transform. To address the bottleneck of domain generalization in MRI segmentation of NPC, we propose a meta-learning approach based on frequency-domain feature mixing. We evaluated the performance of MFNet against existing techniques for generalizing NPC segmentation in terms of Dice and MIoU. Our method clearly outperforms the baseline in handling the generalization of NPC segmentation, and MFNet demonstrates its effectiveness for generalizing NPC MRI segmentation to unseen domains (Dice = 67.59%, MIoU = 75.74% on T1WI). MFNet enhances the model's generalization capabilities by incorporating mixed-feature meta-learning. Our approach offers a novel perspective for tackling the domain generalization problem in medical imaging by effectively exploiting the unique characteristics of medical images.
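The frequency-space mixing step can be illustrated with a minimal sketch: interpolate the Fourier amplitude spectra of two scans while keeping the phase of the first. This is an assumed amplitude-interpolation variant of the idea; MFNet's actual mixing and meta-learning loop are more involved.

```python
import numpy as np

def amplitude_mix(img_a, img_b, lam=0.5):
    """Mix the Fourier amplitude spectra of two images while keeping
    the phase of the first; a common frequency-space augmentation."""
    fa = np.fft.fft2(img_a)
    fb = np.fft.fft2(img_b)
    amp = (1 - lam) * np.abs(fa) + lam * np.abs(fb)  # interpolate amplitudes
    phase = np.angle(fa)                             # keep content (phase) of img_a
    mixed = amp * np.exp(1j * phase)
    return np.real(np.fft.ifft2(mixed))

# lam=0 recovers the original image
x = np.random.rand(8, 8)
y = np.random.rand(8, 8)
assert np.allclose(amplitude_mix(x, y, lam=0.0), x)
```

Because phase carries most of an image's structural content, the mixed sample keeps the anatomy of `img_a` under the low-level appearance statistics of `img_b`.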
Affiliation(s)
- Yin Li
- Department of Otorhinolaryngology, The First People's Hospital of Foshan, Foshan, China
- Qi Chen
- Department of Radiology, The Second Affiliated Hospital of Anhui Medical University, Hefei, China
- Hao Li
- Department of Infectious Diseases, The First People's Hospital of Changde City, Xiangya School of Medicine, Central South University, Changde, China
- Song Wang
- University of Electronic Science and Technology of China, Chengdu, China
- Nutan Chen
- Machine Learning Research Lab, Volkswagen Group, Munich, Germany
- Ting Han
- Department of Radiology, The First People's Hospital of Foshan, Foshan, China
- Kai Wang
- Department of Otorhinolaryngology, The First People's Hospital of Foshan, Foshan, China
- Qingqing Yu
- Department of Otorhinolaryngology, The First People's Hospital of Foshan, Foshan, China
- Zhantao Cao
- Department of Research, CETC Cyberspace Security Technology Co., Ltd., Chengdu, China
- Jun Tang
- Department of Otorhinolaryngology, The First People's Hospital of Foshan, Foshan, China
2. Loewinger G, Nunez RA, Mazumder R, Parmigiani G. Optimal ensemble construction for multistudy prediction with applications to mortality estimation. Stat Med 2024; 43:1774-1789. [PMID: 38396313; DOI: 10.1002/sim.10006]
Abstract
It is increasingly common to encounter prediction tasks in the biomedical sciences for which multiple datasets are available for model training. Common approaches such as pooling datasets before model fitting can produce poor out-of-study prediction performance when datasets are heterogeneous. Theoretical and applied work has shown multistudy ensembling to be a viable alternative that leverages the variability across datasets in a manner that promotes model generalizability. Multistudy ensembling uses a two-stage stacking strategy which fits study-specific models and estimates ensemble weights separately. This approach ignores, however, the ensemble properties at the model-fitting stage, potentially resulting in performance losses. Motivated by challenges in the estimation of COVID-attributable mortality, we propose optimal ensemble construction, an approach to multistudy stacking whereby we jointly estimate ensemble weights and parameters associated with study-specific models. We prove that limiting cases of our approach yield existing methods such as multistudy stacking and pooling datasets before model fitting. We propose an efficient block coordinate descent algorithm to optimize the loss function. We use our method to perform multicountry COVID-19 baseline mortality prediction. We show that when little data is available for a country before the onset of the pandemic, leveraging data from other countries can substantially improve prediction accuracy. We further compare and characterize the method's performance in data-driven simulations and other numerical experiments. Our method remains competitive with or outperforms multistudy stacking and other earlier methods in the COVID-19 data application and in a range of simulation settings.
Affiliation(s)
- Gabriel Loewinger
- Machine Learning Team, National Institute on Mental Health, Bethesda, Maryland, USA
- Rolando Acosta Nunez
- Department of Biostatistics, Harvard School of Public Health, Boston, Massachusetts, USA
- Regeneron Pharmaceuticals Inc., Tarrytown, New York, USA
- Rahul Mazumder
- Operations Research Center and MIT Center for Statistics, MIT Sloan School of Management, Cambridge, Massachusetts, USA
- Giovanni Parmigiani
- Department of Biostatistics, Harvard School of Public Health, Boston, Massachusetts, USA
- Department of Data Science, Dana Farber Cancer Institute, Boston, Massachusetts, USA
3. Papadakis A, Spyrou E. A Multi-Modal Egocentric Activity Recognition Approach towards Video Domain Generalization. Sensors (Basel) 2024; 24:2491. [PMID: 38676108; PMCID: PMC11054491; DOI: 10.3390/s24082491]
Abstract
Egocentric activity recognition is a prominent computer vision task that is based on the use of wearable cameras. Since egocentric videos are captured through the perspective of the person wearing the camera, her/his body motions severely complicate the video content, imposing several challenges. In this work we propose a novel approach for domain-generalized egocentric human activity recognition. Typical approaches use a large amount of training data, aiming to cover all possible variants of each action. Moreover, several recent approaches have attempted to handle discrepancies between domains with a variety of costly and mostly unsupervised domain adaptation methods. In our approach we show that through simple manipulation of available source domain data and with minor involvement from the target domain, we are able to produce robust models, able to adequately predict human activity in egocentric video sequences. To this end, we introduce a novel three-stream deep neural network architecture combining elements of vision transformers and residual neural networks which are trained using multi-modal data. We evaluate the proposed approach using a challenging, egocentric video dataset and demonstrate its superiority over recent, state-of-the-art research works.
Affiliation(s)
- Antonios Papadakis
- Department of Informatics and Telecommunications, National and Kapodistrian University of Athens, 15772 Athens, Greece
- Evaggelos Spyrou
- Department of Informatics and Telecommunications, University of Thessaly, 35100 Lamia, Greece
4. Liu X, Vafay Eslahi S, Marin T, Tiss A, Chemli Y, Huang Y, Johnson KA, El Fakhri G, Ouyang J. Cross noise level PET denoising with continuous adversarial domain generalization. Phys Med Biol 2024; 69:085001. [PMID: 38484401; DOI: 10.1088/1361-6560/ad341a]
Abstract
Objective. Performing positron emission tomography (PET) denoising within the image space proves effective in reducing the variance in PET images. In recent years, deep learning has demonstrated superior denoising performance, but models trained on a specific noise level typically fail to generalize well to different noise levels, due to inherent distribution shifts between inputs. The distribution shift usually results in bias in the denoised images. Our goal is to tackle this problem using a domain generalization technique. Approach. We propose to utilize a domain generalization technique with a novel feature-space continuous discriminator (CD) for adversarial training, using the fraction of events as a continuous domain label. The core idea is to enforce the extraction of noise-level-invariant features, thus minimizing the distribution divergence of latent feature representations across continuous noise levels and making the model general for arbitrary noise levels. We created three sets of 10%, 13%-22% (uniformly randomly selected), or 25% fractions of events from 97 18F-MK6240 tau PET studies of 60 subjects. For each set, we generated 20 noise realizations. Training, validation, and testing were implemented using 1400, 120, and 420 pairs of 3D image volumes from the same or different sets. We used 3D UNet as the baseline and applied the CD to the continuous-noise-level training data of the 13%-22% set. Main results. The proposed CD improves the denoising performance of our model trained on the 13%-22% fraction set when tested on both the 10% and 25% fraction sets, measured by bias and standard deviation using full-count images as references. In addition, our CD method consistently improves SSIM and PSNR for Alzheimer-related regions and the whole brain. Significance. To our knowledge, this is the first attempt to alleviate the performance degradation in cross-noise-level denoising from the perspective of domain generalization. Our study is also pioneering work on continuous domain generalization, utilizing continuously changing source domains.
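The continuous-discriminator idea can be caricatured in a few lines: a discriminator regresses the continuous fraction-of-events label from latent features, and the denoiser is trained adversarially to increase that regression error so features become noise-level invariant. The linear discriminator and the shapes/names below are illustrative assumptions, not the paper's architecture.

```python
import numpy as np

def continuous_adversarial_losses(feats, noise_frac, W):
    """One adversarial step of the continuous-domain idea:
    - the discriminator minimizes d_loss (predict noise level from features);
    - the feature extractor maximizes it (via a gradient-reversal layer),
      pushing latent features toward noise-level invariance."""
    pred = feats @ W                                  # discriminator prediction
    d_loss = float(np.mean((pred - noise_frac) ** 2))  # discriminator objective
    g_loss = -d_loss                                   # encoder's adversarial objective
    return d_loss, g_loss
```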
Affiliation(s)
- Xiaofeng Liu
- Gordon Center for Medical Imaging, Department of Radiology, Massachusetts General Hospital, Boston, MA 02114, United States of America
- Department of Radiology, Harvard Medical School, Boston, MA 02115, United States of America
- Department of Radiology and Biomedical Imaging, Yale University, New Haven, CT 06520, United States of America
- Samira Vafay Eslahi
- Gordon Center for Medical Imaging, Department of Radiology, Massachusetts General Hospital, Boston, MA 02114, United States of America
- Department of Radiology, Harvard Medical School, Boston, MA 02115, United States of America
- Thibault Marin
- Gordon Center for Medical Imaging, Department of Radiology, Massachusetts General Hospital, Boston, MA 02114, United States of America
- Department of Radiology, Harvard Medical School, Boston, MA 02115, United States of America
- Department of Radiology and Biomedical Imaging, Yale University, New Haven, CT 06520, United States of America
- Amal Tiss
- Gordon Center for Medical Imaging, Department of Radiology, Massachusetts General Hospital, Boston, MA 02114, United States of America
- Department of Radiology, Harvard Medical School, Boston, MA 02115, United States of America
- Yanis Chemli
- Gordon Center for Medical Imaging, Department of Radiology, Massachusetts General Hospital, Boston, MA 02114, United States of America
- Department of Radiology, Harvard Medical School, Boston, MA 02115, United States of America
- Department of Radiology and Biomedical Imaging, Yale University, New Haven, CT 06520, United States of America
- Yongsong Huang
- Gordon Center for Medical Imaging, Department of Radiology, Massachusetts General Hospital, Boston, MA 02114, United States of America
- Keith A Johnson
- Gordon Center for Medical Imaging, Department of Radiology, Massachusetts General Hospital, Boston, MA 02114, United States of America
- Department of Radiology, Harvard Medical School, Boston, MA 02115, United States of America
- Georges El Fakhri
- Gordon Center for Medical Imaging, Department of Radiology, Massachusetts General Hospital, Boston, MA 02114, United States of America
- Department of Radiology, Harvard Medical School, Boston, MA 02115, United States of America
- Department of Radiology and Biomedical Imaging, Yale University, New Haven, CT 06520, United States of America
- Jinsong Ouyang
- Gordon Center for Medical Imaging, Department of Radiology, Massachusetts General Hospital, Boston, MA 02114, United States of America
- Department of Radiology, Harvard Medical School, Boston, MA 02115, United States of America
- Department of Radiology and Biomedical Imaging, Yale University, New Haven, CT 06520, United States of America
5. Sahay R, Thomas G, Jahan CS, Manjrekar M, Popp D, Savakis A. On the Importance of Attention and Augmentations for Hypothesis Transfer in Domain Adaptation and Generalization. Sensors (Basel) 2023; 23:8409. [PMID: 37896503; PMCID: PMC10611075; DOI: 10.3390/s23208409]
Abstract
Unsupervised domain adaptation (UDA) aims to mitigate the performance drop due to the distribution shift between the training and testing datasets. UDA methods have achieved performance gains for models trained on a source domain with labeled data to a target domain with only unlabeled data. The standard feature extraction method in domain adaptation has been convolutional neural networks (CNNs). Recently, attention-based transformer models have emerged as effective alternatives for computer vision tasks. In this paper, we benchmark three attention-based architectures, specifically vision transformer (ViT), shifted window transformer (SWIN), and dual attention vision transformer (DAViT), against convolutional architectures ResNet, HRNet and attention-based ConvNext, to assess the performance of different backbones for domain generalization and adaptation. We incorporate these backbone architectures as feature extractors in the source hypothesis transfer (SHOT) framework for UDA. SHOT leverages the knowledge learned in the source domain to align the image features of unlabeled target data in the absence of source domain data, using self-supervised deep feature clustering and self-training. We analyze the generalization and adaptation performance of these models on standard UDA datasets and aerial UDA datasets. In addition, we modernize the training procedure commonly seen in UDA tasks by adding image augmentation techniques to help models generate richer features. Our results show that ConvNext and SWIN offer the best performance, indicating that the attention mechanism is very beneficial for domain generalization and adaptation with both transformer and convolutional architectures. Our ablation study shows that our modernized training recipe, within the SHOT framework, significantly boosts performance on aerial datasets.
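SHOT's self-supervised clustering step, which each benchmarked backbone plugs into, can be sketched as centroid-based pseudo-labeling: build class centroids from soft predictions on the unlabeled target data, then relabel each sample by its nearest centroid. This is a simplified, single-pass numpy version of that idea.

```python
import numpy as np

def shot_pseudo_labels(feats, probs, n_iter=2):
    """Centroid-based pseudo-labeling in the spirit of SHOT: build
    probability-weighted class centroids in feature space, then relabel
    each target sample by its nearest centroid (cosine similarity)."""
    f = feats / (np.linalg.norm(feats, axis=1, keepdims=True) + 1e-8)
    labels = probs.argmax(1)
    for _ in range(n_iter):
        cent = (probs.T @ f) / (probs.sum(0)[:, None] + 1e-8)  # weighted centroids
        cent /= np.linalg.norm(cent, axis=1, keepdims=True) + 1e-8
        labels = (f @ cent.T).argmax(1)          # nearest centroid by cosine
        probs = np.eye(probs.shape[1])[labels]   # harden labels for next round
    return labels
```

In SHOT the resulting pseudo-labels drive self-training of the target feature extractor while the source classifier head stays frozen.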
Affiliation(s)
- Andreas Savakis
- Rochester Institute of Technology, Rochester, NY 14623, USA
6. Gordon SM, McDaniel JR, King KW, Lawhern VJ, Touryan J. Decoding neural activity to assess individual latent state in ecologically valid contexts. J Neural Eng 2023; 20:046033. [PMID: 37552980; DOI: 10.1088/1741-2552/acee20]
Abstract
Objective. Currently, there exist very few ways to isolate cognitive processes, historically defined via highly controlled laboratory studies, in more ecologically valid contexts. Specifically, it remains unclear to what extent patterns of neural activity observed under such constraints actually manifest outside the laboratory in a manner that can be used to make accurate inferences about latent states, associated cognitive processes, or proximal behavior. Improving our understanding of when and how specific patterns of neural activity manifest in ecologically valid scenarios would provide validation for laboratory-based approaches that study similar neural phenomena in isolation, and meaningful insight into the latent states that occur during complex tasks. Approach. Domain generalization methods, borrowed from the work of the brain-computer interface community, have the potential to capture high-dimensional patterns of neural activity in a way that can be reliably applied across experimental datasets in order to address this specific challenge. We previously used such an approach to decode phasic neural responses associated with visual target discrimination. Here, we extend that work to more tonic phenomena such as internal latent states. We use data from two highly controlled laboratory paradigms to train two separate domain-generalized models. We apply the trained models to an ecologically valid paradigm in which participants performed multiple, concurrent driving-related tasks while perched atop a six-degrees-of-freedom ride-motion simulator. Main results. Using the pretrained models, we estimate latent state and the associated patterns of neural activity. As the patterns of neural activity become more similar to those observed in the training data, we find changes in behavior and task performance that are consistent with the observations from the original, laboratory-based paradigms. Significance. These results lend ecological validity to the original, highly controlled experimental designs and provide a methodology for understanding the relationship between neural activity and behavior during complex tasks.
Affiliation(s)
- Kevin W King
- DCS Corporation, Alexandria, VA, United States of America
- Vernon J Lawhern
- DEVCOM Army Research Laboratory, Aberdeen Proving Ground, MD, United States of America
- Jonathan Touryan
- DEVCOM Army Research Laboratory, Aberdeen Proving Ground, MD, United States of America
7. Wozniak P, Ozog D. Cross-Domain Indoor Visual Place Recognition for Mobile Robot via Generalization Using Style Augmentation. Sensors (Basel) 2023; 23:6134. [PMID: 37447982; PMCID: PMC10346347; DOI: 10.3390/s23136134]
Abstract
The article presents an algorithm for multi-domain visual recognition of an indoor place, based on a convolutional neural network and style randomization. The authors propose a scene classification mechanism and improve the performance of models trained on synthetic and real data from various domains. In the proposed dataset, a domain change was defined as a change of camera model. A dataset of images collected from several rooms was used to cover different scenarios, human actions, equipment changes, and lighting conditions. The proposed method was tested on a scene classification problem using multi-domain data. The basis was a transfer learning approach with a style extension applied to various combinations of source and target data, with a focus on improving the unknown-domain score and multi-domain support. The results of the experiments were analyzed in the context of data collected on a humanoid robot. The article shows that the average score was highest when multi-domain data and style-based data enhancement were used, with the proposed method reaching an average of 92.08%. The authors also corrected a result previously reported by another research team.
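A minimal stand-in for the style randomization described above is to resample per-channel image statistics while keeping spatial content, in the spirit of AdaIN-style augmentation. The style ranges below are illustrative assumptions, not the paper's values.

```python
import numpy as np

def style_randomize(img, rng=None):
    """Per-channel style randomization: replace each channel's mean/std
    with randomly drawn 'style' statistics, keeping spatial content."""
    rng = rng if rng is not None else np.random.default_rng()
    out = np.empty_like(img, dtype=float)
    for c in range(img.shape[2]):
        ch = img[:, :, c].astype(float)
        mu, sigma = ch.mean(), ch.std() + 1e-8
        new_mu = rng.uniform(0.3, 0.7)      # assumed style mean range
        new_sigma = rng.uniform(0.1, 0.3)   # assumed style std range
        out[:, :, c] = (ch - mu) / sigma * new_sigma + new_mu
    return out
```

Training on many such randomized variants of each image encourages the classifier to rely on layout rather than camera-specific color and contrast statistics, which is the point of the camera-model domain shift studied here.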
Affiliation(s)
- Piotr Wozniak
- Department of Computer and Control Engineering, Faculty of Electrical and Computer Engineering, Rzeszow University of Technology, Al. Powstańców Warszawy 12, 35-959 Rzeszow, Poland
8. Lin N, Zhao W, Liang S, Zhong M. Real-Time Segmentation of Unstructured Environments by Combining Domain Generalization and Attention Mechanisms. Sensors (Basel) 2023; 23:6008. [PMID: 37447855; DOI: 10.3390/s23136008]
Abstract
This paper presents a focused investigation into real-time segmentation in unstructured environments, a crucial aspect for enabling autonomous navigation in off-road robots. To address this challenge, an improved variant of the DDRNet23-slim model is proposed, which includes a lightweight network architecture and reclassifies ten different categories, including drivable roads, trees, high vegetation, obstacles, and buildings, based on the RUGD dataset. The model's design includes the integration of the semantic-aware normalization and semantic-aware whitening (SAN-SAW) module into the main network to improve generalization ability beyond the visible domain. The model's segmentation accuracy is improved through the fusion of channel attention and spatial attention mechanisms in the low-resolution branch to enhance its ability to capture fine details in complex scenes. Additionally, to tackle the issue of category imbalance in unstructured scene datasets, a rare class sampling strategy (RCS) is employed to mitigate the negative impact of low segmentation accuracy for rare classes on the overall performance of the model. Experimental results demonstrate that the improved model achieves a significant 14% increase in mIoU in the unseen domain, indicating its strong generalization ability. With a parameter count of only 5.79M, the model achieves an mAcc of 85.21% and an mIoU of 77.75%. The model has been successfully deployed on a Jetson Xavier NX ROS robot and tested in both real and simulated orchard environments. Speed optimization using TensorRT increased the segmentation speed to 30.17 FPS. The proposed model strikes a desirable balance between inference speed and accuracy and has good domain migration ability, making it applicable in various domains such as forestry rescue and intelligent agricultural orchard harvesting.
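The rare class sampling idea can be sketched as frequency-dependent sampling weights: crops containing rare classes are drawn more often. The `exp((1-f)/T)` form below follows the RCS weighting popularized by DAFormer; the paper's exact formulation may differ.

```python
import numpy as np

def rcs_weights(class_pixel_counts, temperature=0.01):
    """Rare class sampling weights: sampling probability decays with
    class pixel frequency, so rare classes are seen more often."""
    counts = np.asarray(class_pixel_counts, dtype=float)
    freq = counts / counts.sum()
    w = np.exp((1.0 - freq) / temperature)  # rarer class -> larger weight
    return w / w.sum()                      # normalize to a distribution
```

A lower temperature sharpens the distribution toward the rarest classes; `temperature -> inf` recovers uniform sampling.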
Affiliation(s)
- Nuanchen Lin
- College of Electronic Engineering (College of Artificial Intelligence), South China Agricultural University, Guangzhou 510642, China
- Wenfeng Zhao
- College of Electronic Engineering (College of Artificial Intelligence), South China Agricultural University, Guangzhou 510642, China
- Shenghao Liang
- College of Electronic Engineering (College of Artificial Intelligence), South China Agricultural University, Guangzhou 510642, China
- Minyue Zhong
- College of Electronic Engineering (College of Artificial Intelligence), South China Agricultural University, Guangzhou 510642, China
9. Xiao L, Xu J, Zhao D, Shang E, Zhu Q, Dai B. Adversarial and Random Transformations for Robust Domain Adaptation and Generalization. Sensors (Basel) 2023; 23:5273. [PMID: 37300000; DOI: 10.3390/s23115273]
Abstract
Data augmentation has been widely used to improve generalization in training deep neural networks. Recent works show that using worst-case transformations or adversarial augmentation strategies can significantly improve accuracy and robustness. However, due to the non-differentiable properties of image transformations, searching algorithms such as reinforcement learning or evolution strategy have to be applied, which are not computationally practical for large-scale problems. In this work, we show that by simply applying consistency training with random data augmentation, state-of-the-art results on domain adaptation (DA) and generalization (DG) can be obtained. To further improve the accuracy and robustness with adversarial examples, we propose a differentiable adversarial data augmentation method based on spatial transformer networks (STNs). The combined adversarial and random-transformation-based method outperforms the state-of-the-art on multiple DA and DG benchmark datasets. Furthermore, the proposed method shows desirable robustness to corruption, which is also validated on commonly used datasets.
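The consistency-training objective with random augmentation can be written as a KL divergence between the model's predictions on a clean input and an augmented view. A minimal sketch:

```python
import numpy as np

def softmax(z):
    e = np.exp(z - z.max(axis=-1, keepdims=True))  # numerically stable
    return e / e.sum(axis=-1, keepdims=True)

def consistency_loss(logits_clean, logits_aug):
    """KL(p_clean || p_aug): penalizes prediction changes under random
    augmentation, the consistency-training objective discussed above."""
    p = softmax(logits_clean)
    q = softmax(logits_aug)
    kl = (p * (np.log(p + 1e-12) - np.log(q + 1e-12))).sum(axis=-1)
    return float(kl.mean())
```

Minimizing this term alongside the supervised loss drives the network toward augmentation-invariant predictions; the paper's adversarial STN variant then searches for the augmentation that maximizes it.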
Affiliation(s)
- Liang Xiao
- Unmanned Systems Technology Research Center, Defense Innovation Institute, Beijing 100071, China
- Jiaolong Xu
- Unmanned Systems Technology Research Center, Defense Innovation Institute, Beijing 100071, China
- Dawei Zhao
- Unmanned Systems Technology Research Center, Defense Innovation Institute, Beijing 100071, China
- Erke Shang
- Unmanned Systems Technology Research Center, Defense Innovation Institute, Beijing 100071, China
- Qi Zhu
- Unmanned Systems Technology Research Center, Defense Innovation Institute, Beijing 100071, China
- Bin Dai
- Unmanned Systems Technology Research Center, Defense Innovation Institute, Beijing 100071, China
10. Zhang S, Nie W. Multi-Domain Feature Alignment for Face Anti-Spoofing. Sensors (Basel) 2023; 23:4077. [PMID: 37112418; PMCID: PMC10144369; DOI: 10.3390/s23084077]
Abstract
Face anti-spoofing is critical for enhancing the robustness of face recognition systems against presentation attacks. Existing methods predominantly rely on binary classification tasks. Recently, methods based on domain generalization have yielded promising results. However, due to distribution discrepancies between various domains, the differences in the feature space related to the domain considerably hinder the generalization of features from unfamiliar domains. In this work, we propose a multi-domain feature alignment framework (MADG) that addresses poor generalization when multiple source domains are distributed in the scattered feature space. Specifically, an adversarial learning process is designed to narrow the differences between domains, achieving the effect of aligning the features of multiple sources, thus resulting in multi-domain alignment. Moreover, to further improve the effectiveness of our proposed framework, we incorporate multi-directional triplet loss to achieve a higher degree of separation in the feature space between fake and real faces. To evaluate the performance of our method, we conducted extensive experiments on several public datasets. The results demonstrate that our proposed approach outperforms current state-of-the-art methods, thereby validating its effectiveness in face anti-spoofing.
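The triplet-loss component can be sketched in its standard single-direction form; the multi-directional variant described above adds further anchor/positive/negative pairings to separate real and fake faces more strongly.

```python
import numpy as np

def triplet_loss(anchor, positive, negative, margin=0.3):
    """Margin-based triplet loss on feature batches: pull same-class
    (e.g. real-face) features toward the anchor and push opposite-class
    (e.g. spoof) features at least `margin` farther away."""
    d_pos = np.linalg.norm(anchor - positive, axis=1)  # anchor-positive distance
    d_neg = np.linalg.norm(anchor - negative, axis=1)  # anchor-negative distance
    return float(np.maximum(d_pos - d_neg + margin, 0.0).mean())
```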
11. Luo X, Meratnia N. A Codeword-Independent Localization Technique for Reconfigurable Intelligent Surface Enhanced Environments Using Adversarial Learning. Sensors (Basel) 2023; 23:984. [PMID: 36679782; PMCID: PMC9865069; DOI: 10.3390/s23020984]
Abstract
Reconfigurable Intelligent Surfaces (RISs) not only enable software-defined radio in modern wireless communication networks but also have the potential to be utilized for localization. Most previous works used channel matrices to calculate locations, requiring extensive field measurements, which leads to rapidly growing complexity. Although a few studies have designed fingerprint-based systems, they are only feasible under an unrealistic assumption that the RIS will be deployed only for localization purposes. Additionally, all these methods utilize RIS codewords for location inference, inducing considerable communication burdens. In this paper, we propose a new localization technique for RIS-enhanced environments that does not require RIS codewords for online location inference. Our proposed approach extracts codeword-independent representations of fingerprints using a domain adversarial neural network. We evaluated our solution using the DeepMIMO dataset. Due to the lack of results from other studies, for fair comparisons, we define oracle and baseline cases, which are the theoretical upper and lower bounds of our system, respectively. In all experiments, our proposed solution performed much more similarly to the oracle cases than the baseline cases, demonstrating the effectiveness and robustness of our method.
12. Bento N, Rebelo J, Barandas M, Carreiro AV, Campagner A, Cabitza F, Gamboa H. Comparing Handcrafted Features and Deep Neural Representations for Domain Generalization in Human Activity Recognition. Sensors (Basel) 2022; 22:7324. [PMID: 36236427; PMCID: PMC9572241; DOI: 10.3390/s22197324]
Abstract
Human Activity Recognition (HAR) has been studied extensively, yet current approaches are not capable of generalizing across different domains (i.e., subjects, devices, or datasets) with acceptable performance. This lack of generalization hinders the applicability of these models in real-world environments. As deep neural networks are becoming increasingly popular in recent work, there is a need for an explicit comparison between handcrafted and deep representations in Out-of-Distribution (OOD) settings. This paper compares both approaches in multiple domains using homogenized public datasets. First, we compare several metrics to validate three different OOD settings. In our main experiments, we then verify that even though deep learning initially outperforms models with handcrafted features, the situation is reversed as the distance from the training distribution increases. These findings support the hypothesis that handcrafted features may generalize better across specific domains.
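The handcrafted side of such a comparison typically uses simple time-domain statistics plus spectral descriptors per sensor window. The feature set below is illustrative, not the paper's exact one.

```python
import numpy as np

def handcrafted_features(window):
    """Typical handcrafted features for a 1-D accelerometer window:
    mean, std, mean absolute difference, RMS, and dominant frequency bin."""
    fft_mag = np.abs(np.fft.rfft(window - window.mean()))
    return np.array([
        window.mean(),
        window.std(),
        np.abs(np.diff(window)).mean(),   # mean absolute difference
        np.sqrt(np.mean(window ** 2)),    # root mean square
        float(fft_mag.argmax()),          # dominant frequency bin
    ])
```

Because these descriptors are fixed rather than learned, they cannot latch onto dataset-specific artifacts, which is one candidate explanation for the better far-from-distribution behavior reported above.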
Affiliation(s)
- Nuno Bento
  - Associação Fraunhofer Portugal Research, Rua Alfredo Allen 455/461, 4200-135 Porto, Portugal
- Joana Rebelo
  - Associação Fraunhofer Portugal Research, Rua Alfredo Allen 455/461, 4200-135 Porto, Portugal
- Marília Barandas
  - Associação Fraunhofer Portugal Research, Rua Alfredo Allen 455/461, 4200-135 Porto, Portugal
  - Laboratório de Instrumentação, Engenharia Biomédica e Física da Radiação (LIBPhys–UNL), Departamento de Física, Faculdade de Ciências e Tecnologia (FCT), Universidade Nova de Lisboa, 2829-516 Caparica, Portugal
- André V. Carreiro
  - Associação Fraunhofer Portugal Research, Rua Alfredo Allen 455/461, 4200-135 Porto, Portugal
- Andrea Campagner
  - Dipartimento di Informatica, Sistemistica e Comunicazione, Università degli Studi di Milano-Bicocca, 20126 Milan, Italy
- Federico Cabitza
  - Dipartimento di Informatica, Sistemistica e Comunicazione, Università degli Studi di Milano-Bicocca, 20126 Milan, Italy
  - IRCCS Istituto Ortopedico Galeazzi, 20161 Milan, Italy
- Hugo Gamboa
  - Associação Fraunhofer Portugal Research, Rua Alfredo Allen 455/461, 4200-135 Porto, Portugal
  - Laboratório de Instrumentação, Engenharia Biomédica e Física da Radiação (LIBPhys–UNL), Departamento de Física, Faculdade de Ciências e Tecnologia (FCT), Universidade Nova de Lisboa, 2829-516 Caparica, Portugal
13
Zakia U, Menon C. Force Myography-Based Human Robot Interactions via Deep Domain Adaptation and Generalization. Sensors (Basel) 2021; 22:211. [PMID: 35009752] [PMCID: PMC8749939] [DOI: 10.3390/s22010211]
Abstract
Estimating applied force with the force myography (FMG) technique can be effective in human-robot interaction (HRI) using data-driven models. A model predicts well when it is trained and evaluated within the same session, which is sometimes time-consuming and impractical. In real scenarios, a pretrained transfer-learning model that predicts forces quickly once fine-tuned to the target distribution would be a favorable choice and hence needs to be examined. Therefore, in this study a unified supervised FMG-based deep transfer learner (SFMG-DTL) model using a CNN architecture was pretrained with multi-session FMG source data (Ds, Ts) and evaluated on force estimation in separate target domains (Dt, Tt) via supervised domain adaptation (SDA) and supervised domain generalization (SDG). For SDA, case (i) intra-subject evaluation (Ds ≠ Dt-SDA, Ts ≈ Tt-SDA) was examined, while for SDG, case (ii) cross-subject evaluation (Ds ≠ Dt-SDG, Ts ≠ Tt-SDG) was examined. Fine-tuning with a small amount of "target training data" effectively calibrated the model towards the target domain. The proposed SFMG-DTL model performed better, with higher estimation accuracies and lower errors (R2 ≥ 88%, NRMSE ≤ 0.6), in both cases. These results suggest that interactive force estimation via transfer learning can improve daily HRI experiences where "target training data" are limited or faster adaptation is required.
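The pretrain-then-fine-tune recipe evaluated here can be illustrated in miniature with a one-feature regressor: train on plentiful source-session data, then calibrate with only a few target-session samples. The toy force mappings (y = 2x for the source session, y = 2x + 1 for the drifted target session) are ours, purely for illustration:

```python
def sgd_fit(w, b, xs, ys, lr=0.05, epochs=200):
    """Plain SGD for a 1-D linear force estimator y ≈ w*x + b."""
    for _ in range(epochs):
        for x, y in zip(xs, ys):
            err = (w * x + b) - y
            w -= lr * err * x
            b -= lr * err
    return w, b

# "Pretrain" on abundant source-session data (true mapping y = 2x).
w, b = sgd_fit(0.0, 0.0, [0.0, 0.5, 1.0, 1.5, 2.0],
               [0.0, 1.0, 2.0, 3.0, 4.0])

# Fine-tune on a handful of drifted target-session samples (y = 2x + 1);
# starting from pretrained weights, few updates suffice.
w, b = sgd_fit(w, b, [0.0, 1.0, 2.0], [1.0, 3.0, 5.0], epochs=50)
```

The same logic scales up to the CNN setting in the paper: freeze or reuse pretrained weights, then run a short fine-tuning pass on the limited target data.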
Affiliation(s)
- Umme Zakia
  - Menrva Research Group, Schools of Mechatronic Systems Engineering and Engineering Science, Simon Fraser University, Metro Vancouver, BC V5A 1S6, Canada
- Carlo Menon
  - Menrva Research Group, Schools of Mechatronic Systems Engineering and Engineering Science, Simon Fraser University, Metro Vancouver, BC V5A 1S6, Canada
  - Biomedical and Mobile Health Technology Laboratory, ETH Zurich, Lengghalde 5, 8008 Zurich, Switzerland
  - Correspondence: ; Tel.: +1-778-782-9338; Fax: +1-778-782-7514
14
Lee K, Dobbins NJ, McInnes B, Yetisgen M, Uzuner Ö. Transferability of neural network clinical deidentification systems. J Am Med Inform Assoc 2021; 28:2661-2669. [PMID: 34586386] [DOI: 10.1093/jamia/ocab207]
Abstract
OBJECTIVE Neural network deidentification studies have focused on individual datasets. These studies assume the availability of a sufficient amount of human-annotated data to train models that can generalize to corresponding test data. In real-world situations, however, researchers often have limited or no in-house training data. Existing systems and external data can help jump-start deidentification on in-house data; however, the most efficient way of utilizing existing systems and external data is unclear. This article investigates the transferability of a state-of-the-art neural clinical deidentification system, NeuroNER, across a variety of datasets, when it is modified architecturally for domain generalization and when it is trained strategically for domain transfer. MATERIALS AND METHODS We conducted a comparative study of the transferability of NeuroNER using 4 clinical note corpora with multiple note types from 2 institutions. We modified NeuroNER architecturally to integrate 2 types of domain generalization approaches. We evaluated each architecture using 3 training strategies. We measured transferability from external sources; transferability across note types; the contribution of external source data when in-domain training data are available; and transferability across institutions. RESULTS AND CONCLUSIONS Transferability from a single external source gave inconsistent results. Using additional external sources consistently yielded an F1-score of approximately 80%. Fine-tuning emerged as a dominant transfer strategy, with or without domain generalization. We also found that external sources were useful even in cases where in-domain training data were available. Transferability across institutions differed by note type and annotation label but resulted in improved performance.
Affiliation(s)
- Kahyun Lee
  - Department of Information Science and Technology, George Mason University, Fairfax, Virginia, USA
- Nicholas J Dobbins
  - Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, Washington, USA
- Bridget McInnes
  - Department of Computer Science, Virginia Commonwealth University, Richmond, Virginia, USA
- Meliha Yetisgen
  - Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, Washington, USA
- Özlem Uzuner
  - Department of Information Science and Technology, George Mason University, Fairfax, Virginia, USA
15
Bian W, Chen Y, Ye X, Zhang Q. An Optimization-Based Meta-Learning Model for MRI Reconstruction with Diverse Dataset. J Imaging 2021; 7:231. [PMID: 34821862] [PMCID: PMC8621471] [DOI: 10.3390/jimaging7110231]
Abstract
This work aims at developing a generalizable Magnetic Resonance Imaging (MRI) reconstruction method in the meta-learning framework. Specifically, we develop a deep reconstruction network induced by a learnable optimization algorithm (LOA) to solve the nonconvex nonsmooth variational model of MRI image reconstruction. In this model, the nonconvex nonsmooth regularization term is parameterized as a structured deep network where the network parameters can be learned from data. We partition these network parameters into two parts: a task-invariant part for the common feature encoder component of the regularization, and a task-specific part to account for the variations in the heterogeneous training and testing data. We train the regularization parameters in a bilevel optimization framework which significantly improves the robustness of the training process and the generalization ability of the network. We conduct a series of numerical experiments using heterogeneous MRI data sets with various undersampling patterns, ratios, and acquisition settings. The experimental results show that our network yields greatly improved reconstruction quality over existing methods and can generalize well to new reconstruction problems whose undersampling patterns/trajectories are not present during training.
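The task-invariant/task-specific split trained in a bilevel framework can be summarized generically as follows (notation ours: \(\theta\) denotes the shared, task-invariant regularization parameters and \(w_\tau\) the task-specific ones; this is the standard bilevel form, not necessarily the paper's exact objective):

```latex
\min_{\theta}\; \sum_{\tau} \mathcal{L}^{\tau}_{\mathrm{val}}\!\left(\theta,\, w^{*}_{\tau}(\theta)\right)
\qquad \text{s.t.} \qquad
w^{*}_{\tau}(\theta) \in \operatorname*{arg\,min}_{w}\; \mathcal{L}^{\tau}_{\mathrm{train}}(\theta,\, w)
```

The lower level fits each task's specific parameters on its own training split, while the upper level updates the shared parameters against held-out validation losses, which is what drives generalization to unseen undersampling patterns and acquisition settings.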
Affiliation(s)
- Wanyu Bian
  - Department of Mathematics, University of Florida, Gainesville, FL 32611, USA
- Yunmei Chen
  - Department of Mathematics, University of Florida, Gainesville, FL 32611, USA
- Xiaojing Ye
  - Department of Mathematics and Statistics, Georgia State University, Atlanta, GA 30303, USA
- Qingchao Zhang
  - Department of Mathematics, University of Florida, Gainesville, FL 32611, USA
16
Gideon J, McInnis MG, Provost EM. Improving Cross-Corpus Speech Emotion Recognition with Adversarial Discriminative Domain Generalization (ADDoG). IEEE Trans Affect Comput 2021; 12:1055-1068. [PMID: 35695825] [PMCID: PMC9173710] [DOI: 10.1109/taffc.2019.2916092]
Abstract
Automatic speech emotion recognition provides computers with critical context to enable user understanding. While methods trained and tested within the same dataset have been shown successful, they often fail when applied to unseen datasets. To address this, recent work has focused on adversarial methods to find more generalized representations of emotional speech. However, many of these methods have issues converging, and only involve datasets collected in laboratory conditions. In this paper, we introduce Adversarial Discriminative Domain Generalization (ADDoG), which follows an easier to train "meet in the middle" approach. The model iteratively moves representations learned for each dataset closer to one another, improving cross-dataset generalization. We also introduce Multiclass ADDoG, or MADDoG, which is able to extend the proposed method to more than two datasets, simultaneously. Our results show consistent convergence for the introduced methods, with significantly improved results when not using labels from the target dataset. We also show how, in most cases, ADDoG and MADDoG can be used to improve upon baseline state-of-the-art methods when target dataset labels are added and in-the-wild data are considered. Even though our experiments focus on cross-corpus speech emotion, these methods could be used to remove unwanted factors of variation in other settings.
17
Hagad JL, Kimura T, Fukui KI, Numao M. Learning Subject-Generalized Topographical EEG Embeddings Using Deep Variational Autoencoders and Domain-Adversarial Regularization. Sensors (Basel) 2021; 21:1792. [PMID: 33806712] [PMCID: PMC7961341] [DOI: 10.3390/s21051792]
Abstract
Two of the biggest challenges in building models for detecting emotions from electroencephalography (EEG) devices are the relatively small amount of labeled samples and the strong variability of signal feature distributions between different subjects. In this study, we propose a context-generalized model that tackles the data constraints and subject variability simultaneously using a deep neural network architecture optimized for normally distributed subject-independent feature embeddings. Variational autoencoders (VAEs) at the input level allow the lower feature layers of the model to be trained on both labeled and unlabeled samples, maximizing the use of the limited data resources. Meanwhile, variational regularization encourages the model to learn Gaussian-distributed feature embeddings, resulting in robustness to small dataset imbalances. Subject-adversarial regularization applied to the bi-lateral features further enforces subject-independence on the final feature embedding used for emotion classification. The results from subject-independent performance experiments on the SEED and DEAP EEG-emotion datasets show that our model generalizes better across subjects than other state-of-the-art feature embeddings when paired with deep learning classifiers. Furthermore, qualitative analysis of the embedding space reveals that our proposed subject-invariant bi-lateral variational domain adversarial neural network (BiVDANN) architecture may improve the subject-independent performance by discovering normally distributed features.
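The variational regularization referred to above is, in the Gaussian case, a closed-form KL penalty pulling each embedding's posterior toward N(0, 1); a small self-contained sketch (dimensions and values illustrative):

```python
import math

def kl_to_standard_normal(mu, log_var):
    """KL( N(mu, exp(log_var)) || N(0, I) ), summed over dimensions.
    This is the closed-form regularizer used in Gaussian VAEs to keep
    feature embeddings normally distributed."""
    return sum(0.5 * (math.exp(lv) + m * m - 1.0 - lv)
               for m, lv in zip(mu, log_var))

# A posterior already matching N(0, 1) incurs no penalty,
# while drifting means are penalized quadratically.
zero_penalty = kl_to_standard_normal([0.0, 0.0], [0.0, 0.0])
drift_penalty = kl_to_standard_normal([1.0, -1.0], [0.0, 0.0])
```

Adding this term to the reconstruction loss is what encourages the Gaussian-distributed embeddings that the paper credits for robustness to small dataset imbalances.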
Affiliation(s)
- Juan Lorenzo Hagad
  - Graduate School of Information Science and Technology, Osaka University, Suita, Osaka 565-0871, Japan
  - Institute of Scientific and Industrial Research, Osaka University, Ibaraki, Osaka 567-0047, Japan
- Tsukasa Kimura
  - Institute of Scientific and Industrial Research, Osaka University, Ibaraki, Osaka 567-0047, Japan
- Ken-ichi Fukui
  - Institute of Scientific and Industrial Research, Osaka University, Ibaraki, Osaka 567-0047, Japan
- Masayuki Numao
  - Institute of Scientific and Industrial Research, Osaka University, Ibaraki, Osaka 567-0047, Japan
18
Ma J, Wang Y, An X, Ge C, Yu Z, Chen J, Zhu Q, Dong G, He J, He Z, Cao T, Zhu Y, Nie Z, Yang X. Toward data-efficient learning: A benchmark for COVID-19 CT lung and infection segmentation. Med Phys 2021; 48:1197-1210. [PMID: 33354790] [DOI: 10.1002/mp.14676]
Abstract
PURPOSE Accurate segmentation of lung and infection in COVID-19 computed tomography (CT) scans plays an important role in the quantitative management of patients. Most of the existing studies are based on large and private annotated datasets that are impractical to obtain from a single institution, especially when radiologists are busy fighting the coronavirus disease. Furthermore, it is hard to compare current COVID-19 CT segmentation methods as they are developed on different datasets, trained in different settings, and evaluated with different metrics. METHODS To promote the development of data-efficient deep learning methods, in this paper, we built three benchmarks for lung and infection segmentation based on 70 annotated COVID-19 cases, which cover current active research areas, for example, few-shot learning, domain generalization, and knowledge transfer. For a fair comparison among different segmentation methods, we also provide standard training, validation, and testing splits, evaluation metrics, and the corresponding code. RESULTS Based on the state-of-the-art network, we provide more than 40 pretrained baseline models, which not only serve as out-of-the-box segmentation tools but also save computational time for researchers who are interested in COVID-19 lung and infection segmentation. We achieve average dice similarity coefficient (DSC) scores of 97.3%, 97.7%, and 67.3% and average normalized surface dice (NSD) scores of 90.6%, 91.4%, and 70.0% for left lung, right lung, and infection, respectively. CONCLUSIONS To the best of our knowledge, this work presents the first data-efficient learning benchmark for medical image segmentation and the largest number of pretrained models to date. All these resources are publicly available, and our work lays the foundation for promoting the development of deep learning methods for efficient COVID-19 CT segmentation with limited data.
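The reported DSC values follow the standard overlap definition; a minimal sketch over binary masks represented as sets of foreground voxel indices (this set representation is ours, chosen for brevity):

```python
def dice(pred, gt):
    """Dice similarity coefficient: 2|A ∩ B| / (|A| + |B|)."""
    if not pred and not gt:
        return 1.0  # common convention: two empty masks agree perfectly
    return 2 * len(pred & gt) / (len(pred) + len(gt))

# 3 voxels shared between a 4-voxel prediction and a 4-voxel ground truth
score = dice({1, 2, 3, 4}, {2, 3, 4, 5})  # 0.75
```

The normalized surface dice (NSD) also quoted above is a boundary-based variant that scores agreement of mask surfaces within a tolerance, rather than volumetric overlap.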
Affiliation(s)
- Jun Ma
  - Department of Mathematics, Nanjing University of Science and Technology, Nanjing, 210094, P. R. China
- Yixin Wang
  - Institute of Computing Technology, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Beijing, 100190, P. R. China
- Xingle An
  - China Electronics Cloud Brain (Tianjin) Technology CO., Ltd, Tianjin, 300309, P. R. China
- Cheng Ge
  - Institute of Bioinformatics and Medical Engineering, Jiangsu University of Technology, Changzhou, 213001, P. R. China
- Ziqi Yu
  - Institute of Science and Technology for Brain-inspired Intelligence, Fudan University, Shanghai, 200433, P. R. China
- Jianan Chen
  - Department of Medical Biophysics, University of Toronto, Toronto, ON, M5G 1L7, Canada
- Qiongjie Zhu
  - Department of Radiology, Nanjing Drum Tower Hospital, the Affiliated Hospital of Nanjing University Medical School, Nanjing, 210008, P. R. China
- Guoqiang Dong
  - Department of Radiology, Nanjing Drum Tower Hospital, the Affiliated Hospital of Nanjing University Medical School, Nanjing, 210008, P. R. China
- Jian He
  - Department of Radiology, Nanjing Drum Tower Hospital, the Affiliated Hospital of Nanjing University Medical School, Nanjing, 210008, P. R. China
- Tianjia Cao
  - China Electronics Cloud Brain (Tianjin) Technology CO., Ltd, Tianjin, 300309, P. R. China
- Yuntao Zhu
  - Department of Mathematics, Nanjing University, Nanjing, 210093, P. R. China
- Ziwei Nie
  - Department of Mathematics, Nanjing University, Nanjing, 210093, P. R. China
- Xiaoping Yang
  - Department of Mathematics, Nanjing University, Nanjing, 210093, P. R. China
19
Zhang L, Wang X, Yang D, Sanford T, Harmon S, Turkbey B, Wood BJ, Roth H, Myronenko A, Xu D, Xu Z. Generalizing Deep Learning for Medical Image Segmentation to Unseen Domains via Deep Stacked Transformation. IEEE Trans Med Imaging 2020; 39:2531-2540. [PMID: 32070947] [PMCID: PMC7393676] [DOI: 10.1109/tmi.2020.2973595]
Abstract
Recent advances in deep learning for medical image segmentation demonstrate expert-level accuracy. However, application of these models in clinically realistic environments can result in poor generalization and decreased accuracy, mainly due to the domain shift across different hospitals, scanner vendors, imaging protocols, and patient populations. Common transfer learning and domain adaptation techniques are proposed to address this bottleneck. However, these solutions require data (and annotations) from the target domain to retrain the model, and are therefore restrictive in practice for widespread model deployment. Ideally, we wish to have a trained (locked) model that can work uniformly well across unseen domains without further training. In this paper, we propose a deep stacked transformation approach for domain generalization. Specifically, a series of n stacked transformations are applied to each image during network training. The underlying assumption is that the "expected" domain shift for a specific medical imaging modality could be simulated by applying extensive data augmentation on a single source domain, and consequently, a deep model trained on the augmented "big" data (BigAug) could generalize well on unseen domains. We exploit four surprisingly effective, but previously understudied, image-based characteristics for data augmentation to overcome the domain generalization problem. We train and evaluate the BigAug model (with n=9 transformations) on three different 3D segmentation tasks (prostate gland, left atrial, left ventricle) covering two medical imaging modalities (MRI and ultrasound) involving eight publicly available challenge datasets. The results show that when training on a relatively small dataset (n = 10~32 volumes, depending on the size of the available datasets) from a single source domain: (i) BigAug models degrade an average of 11% (Dice score change) from source to unseen domain, substantially better than conventional augmentation (degrading 39%) and a CycleGAN-based domain adaptation method (degrading 25%); (ii) BigAug is better than "shallower" stacked transforms (i.e. those with fewer transforms) on unseen domains and demonstrates modest improvement over conventional augmentation on the source domain; (iii) after training with BigAug on one source domain, performance on an unseen domain is similar to training a model from scratch on that domain when using the same number of training samples. When training on large datasets (n = 465 volumes) with BigAug, (iv) application to unseen domains reaches the performance of state-of-the-art fully supervised models that are trained and tested on their source domains. These findings establish a strong benchmark for the study of domain generalization in medical imaging and can be generalized to the design of highly robust deep segmentation models for clinical deployment.
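The stacked-transformation idea can be sketched as a pipeline that applies a sequence of randomized image-level perturbations, each with some probability. The three transforms and all parameter ranges below are illustrative stand-ins; BigAug itself stacks nine transforms spanning image quality, appearance, and spatial configuration:

```python
import random

def adjust_brightness(img, rng):
    shift = rng.uniform(-0.1, 0.1)
    return [min(1.0, max(0.0, v + shift)) for v in img]

def adjust_contrast(img, rng):
    gamma = rng.uniform(0.8, 1.2)          # gamma-style contrast change
    return [v ** gamma for v in img]

def add_noise(img, rng):
    return [min(1.0, max(0.0, v + rng.gauss(0.0, 0.02))) for v in img]

def big_aug(img, transforms, rng, p=0.5):
    """Apply each transform in the stack with probability p, simulating
    an 'expected' domain shift from a single source domain."""
    for t in transforms:
        if rng.random() < p:
            img = t(img, rng)
    return img

rng = random.Random(0)
stack = [adjust_brightness, adjust_contrast, add_noise]
augmented = big_aug([0.2, 0.5, 0.8], stack, rng)  # one tiny "image" in [0, 1]
```

During training, each sampled image passes through the whole stack anew, so the network sees a much wider distribution than the single source domain provides.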