Real-world analysis of manual editing of deep learning contouring in the thorax region.
Phys Imaging Radiat Oncol 2022;
22:104-110. [PMID:
35602549 PMCID:
PMC9115320 DOI:
10.1016/j.phro.2022.04.008]
[Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2022] [Revised: 04/13/2022] [Accepted: 04/27/2022] [Indexed: 01/18/2023] Open
Abstract
Deep-learning contouring for radiotherapy is evaluated in the thorax region.
Specific regions-of-adjustment and variability in editing per organ were found.
The consistency and accuracy of training data remains crucial to model performance.
Separate models for specific indications or acquisition protocols may be necessary.
Subsampling and post-processing make the clinical workflow more efficient.
Background and purpose
User-adjustments after deep-learning (DL) contouring in radiotherapy were evaluated to get insight in real-world editing during clinical practice. This study assessed the amount, type and spatial regions of editing of auto-contouring for organs-at-risk (OARs) in routine clinical workflow for patients in the thorax region.
Materials and methods
A total of 350 lung cancer and 362 breast cancer patients, contoured between March 2020 and March 2021 using a commercial DL-contouring method followed by manual adjustments were retrospectively analyzed. Subsampling was performed for some OARs, using an inter-slice gap of 1–3 slices. Commonly-used whole-organ contouring assessment measures were calculated, and all cases were registered to a common reference shape per OAR to identify regions of manual adjustment. Results were expressed as the median, 10th-90th percentile of adjustment and visualized using 3D renderings.
Results
Per OAR, the median amount of editing was below 1 mm. However, large adjustments were found in some locations for most OARs. In general, enlarging of the auto-contours was needed. Subsampling DL-contours showed less adjustments were made in the interpolated slices compared to simulated no-subsampling for these OARs.
Conclusion
The real-world performance of automatic DL-contouring software was evaluated and proven useful in clinical practice. Specific regions-of-adjustment were identified per OAR in the thorax region, and separate models were found to be necessary for specific clinical indications different from training data. This analysis showed the need to perform routine clinical analysis especially when procedures or acquisition protocols change to have the best configuration of the workflow.
Collapse