1
|
van Herten RLM, Lagogiannis I, Wolterink JM, Bruns S, Meulendijks ER, Dey D, de Groot JR, Henriques JP, Planken RN, Saitta S, Išgum I. World of Forms: Deformable geometric templates for one-shot surface meshing in coronary CT angiography. Med Image Anal 2025; 103:103582. [PMID: 40318517 DOI: 10.1016/j.media.2025.103582] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2024] [Revised: 04/01/2025] [Accepted: 04/02/2025] [Indexed: 05/07/2025]
Abstract
Deep learning-based medical image segmentation and surface mesh generation typically involve a sequential pipeline from image to segmentation to meshes, often requiring large training datasets while making limited use of prior geometric knowledge. This may lead to topological inconsistencies and suboptimal performance in low-data regimes. To address these challenges, we propose a data-efficient deep learning method for direct 3D anatomical object surface meshing using geometric priors. Our approach employs a multi-resolution graph neural network that operates on a prior geometric template which is deformed to fit object boundaries of interest. We show how different templates may be used for the different surface meshing targets, and introduce a novel masked autoencoder pretraining strategy for 3D spherical data. The proposed method outperforms nnUNet in a one-shot setting for segmentation of the pericardium, left ventricle (LV) cavity and the LV myocardium. Similarly, the method outperforms other lumen segmentation operating on multi-planar reformatted images. Results further indicate that mesh quality is on par with or improves upon marching cubes post-processing of voxel mask predictions, while remaining flexible in the choice of mesh triangulation prior, thus paving the way for more accurate and topologically consistent 3D medical object surface meshing.
Collapse
Affiliation(s)
- Rudolf L M van Herten
- Department of Biomedical Engineering and Physics, Amsterdam UMC, Meibergdreef 9, Amsterdam, 1105 AZ, The Netherlands; Informatics Institute, University of Amsterdam, Amsterdam, The Netherlands; Amsterdam Cardiovascular Sciences, Amsterdam, The Netherlands.
| | - Ioannis Lagogiannis
- Department of Biomedical Engineering and Physics, Amsterdam UMC, Meibergdreef 9, Amsterdam, 1105 AZ, The Netherlands; Informatics Institute, University of Amsterdam, Amsterdam, The Netherlands; Amsterdam Cardiovascular Sciences, Amsterdam, The Netherlands
| | - Jelmer M Wolterink
- Department of Applied Mathematics, Technical Medical Centre, University of Twente, Drienerlolaan 5, Enschede, 7522 NB, The Netherlands
| | - Steffen Bruns
- Department of Biomedical Engineering and Physics, Amsterdam UMC, Meibergdreef 9, Amsterdam, 1105 AZ, The Netherlands
| | - Eva R Meulendijks
- Department of Radiology and Nuclear Medicine, Amsterdam UMC, Meibergdreef 9, Amsterdam, 1105 AZ, The Netherlands
| | - Damini Dey
- Biomedical Imaging Research Institute, Cedars-Sinai Medical Center, Los Angeles, USA
| | | | | | - R Nils Planken
- Department of Radiology and Nuclear Medicine, Amsterdam UMC, Meibergdreef 9, Amsterdam, 1105 AZ, The Netherlands; Department of Radiology, Mayo Clinic, Rochester, USA
| | - Simone Saitta
- Department of Biomedical Engineering and Physics, Amsterdam UMC, Meibergdreef 9, Amsterdam, 1105 AZ, The Netherlands; Informatics Institute, University of Amsterdam, Amsterdam, The Netherlands; Amsterdam Cardiovascular Sciences, Amsterdam, The Netherlands
| | - Ivana Išgum
- Department of Biomedical Engineering and Physics, Amsterdam UMC, Meibergdreef 9, Amsterdam, 1105 AZ, The Netherlands; Informatics Institute, University of Amsterdam, Amsterdam, The Netherlands; Amsterdam Cardiovascular Sciences, Amsterdam, The Netherlands; Department of Radiology and Nuclear Medicine, Amsterdam UMC, Meibergdreef 9, Amsterdam, 1105 AZ, The Netherlands
| |
Collapse
|
2
|
Gonca M, Bayrakdar İŞ, Çelik Ö. Does the FARNet neural network algorithm accurately identify Posteroanterior cephalometric landmarks? BMC Med Imaging 2024; 24:294. [PMID: 39478475 PMCID: PMC11526671 DOI: 10.1186/s12880-024-01478-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2024] [Accepted: 10/23/2024] [Indexed: 11/02/2024] Open
Abstract
BACKGROUND We explored whether the feature aggregation and refinement network (FARNet) algorithm accurately identified posteroanterior (PA) cephalometric landmarks. METHODS We identified 47 landmarks on 1,431 PA cephalograms of which 1,177 were used for training, 117 for validation, and 137 for testing. A FARNet-based artificial intelligence (AI) algorithm automatically detected the landmarks. Model effectiveness was calculated by deriving the mean radial error (MRE) and the successful detection rates (SDRs) within 2, 2.5, 3, and 4 mm. The Mann-Whitney U test was performed on the Euclidean differences between repeated manual identifications and AI trials. The direction in differences was analyzed, and whether differences moved in the same or opposite directions relative to ground truth on both the x and y-axis. RESULTS The AI system (web-based CranioCatch annotation software (Eskişehir, Turkey)) identified 47 anatomical landmarks in PA cephalograms. The right gonion SDRs were the highest, thus 96.4, 97.8, 100, and 100% within 2, 2.5, 3, and 4 mm, respectively. The right gonion MRE was 0.94 ± 0.53 mm. The right condylon SDRs were the lowest, thus 32.8, 45.3, 54.0, and 67.9% within the same thresholds. The right condylon MRE was 3.31 ± 2.25 mm. The AI model's reliability and accuracy were similar to a human expert's. AI was better at four skeleton points than the expert, whereas the expert was better at one skeletal and seven dental points (P < 0.05). Most of the points exhibited significant deviations along the y-axis. Compared to ground truth, most of the points in AI and the second trial showed opposite movement on the x-axis and the same on the y-axis. CONCLUSIONS The FARNet algorithm streamlined orthodontic diagnosis.
Collapse
Affiliation(s)
- Merve Gonca
- Department of Orthodontics, Faculty of Dentistry, Eskisehir Osmangazi University, Eskişehir, Turkey.
- Department of Orthodontics, Faculty of Dentistry, Recep Tayyip Erdoğan University, Rize, Turkey.
| | - İbrahim Şevki Bayrakdar
- Department of Oral and Maxillofacial Radiology, Faculty of Dentistry, Eskisehir Osmangazi University, Eskişehir, Turkey
- Eskisehir Osmangazi University Center of Research and Application for Computer Aided Diagnosis and Treatment in Health, Eskişehir, Turkey
| | - Özer Çelik
- Eskisehir Osmangazi University Center of Research and Application for Computer Aided Diagnosis and Treatment in Health, Eskişehir, Turkey
- Department of Mathematics-Computer, Eskisehir Osmangazi University Faculty of Science, Eskişehir, Turkey
| |
Collapse
|
3
|
Joham SJ, Hadzic A, Urschler M. Implicit Is Not Enough: Explicitly Enforcing Anatomical Priors inside Landmark Localization Models. Bioengineering (Basel) 2024; 11:932. [PMID: 39329674 PMCID: PMC11428392 DOI: 10.3390/bioengineering11090932] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2024] [Revised: 09/12/2024] [Accepted: 09/13/2024] [Indexed: 09/28/2024] Open
Abstract
The task of localizing distinct anatomical structures in medical image data is an essential prerequisite for several medical applications, such as treatment planning in orthodontics, bone-age estimation, or initialization of segmentation methods in automated image analysis tools. Currently, Anatomical Landmark Localization (ALL) is mainly solved by deep-learning methods, which cannot guarantee robust ALL predictions; there may always be outlier predictions that are far from their ground truth locations due to out-of-distribution inputs. However, these localization outliers are detrimental to the performance of subsequent medical applications that rely on ALL results. The current ALL literature relies heavily on implicit anatomical constraints built into the loss function and network architecture to reduce the risk of anatomically infeasible predictions. However, we argue that in medical imaging, where images are generally acquired in a controlled environment, we should use stronger explicit anatomical constraints to reduce the number of outliers as much as possible. Therefore, we propose the end-to-end trainable Global Anatomical Feasibility Filter and Analysis (GAFFA) method, which uses prior anatomical knowledge estimated from data to explicitly enforce anatomical constraints. GAFFA refines the initial localization results of a U-Net by approximately solving a Markov Random Field (MRF) with a single iteration of the sum-product algorithm in a differentiable manner. Our experiments demonstrate that GAFFA outperforms all other landmark refinement methods investigated in our framework. Moreover, we show that GAFFA is more robust to large outliers than state-of-the-art methods on the studied X-ray hand dataset. We further motivate this claim by visualizing the anatomical constraints used in GAFFA as spatial energy heatmaps, which allowed us to find an annotation error in the hand dataset not previously discussed in the literature.
Collapse
Affiliation(s)
- Simon Johannes Joham
- Institute for Medical Informatics, Statistics and Documentation, Medical University of Graz, 8036 Graz, Austria
- Institute of Computer Graphics and Vision, Graz University of Technology, 8010 Graz, Austria
| | - Arnela Hadzic
- Institute for Medical Informatics, Statistics and Documentation, Medical University of Graz, 8036 Graz, Austria
| | - Martin Urschler
- Institute for Medical Informatics, Statistics and Documentation, Medical University of Graz, 8036 Graz, Austria
- BioTechMed-Graz, 8010 Graz, Austria
| |
Collapse
|
4
|
Huang T, Shi J, Li J, Wang J, Du J, Shi J. Involution Transformer Based U-Net for Landmark Detection in Ultrasound Images for Diagnosis of Infantile DDH. IEEE J Biomed Health Inform 2024; 28:4797-4809. [PMID: 38630567 DOI: 10.1109/jbhi.2024.3390241] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/19/2024]
Abstract
The B-mode ultrasound based computer-aided diagnosis (CAD) has demonstrated its effectiveness for diagnosis of Developmental Dysplasia of the Hip (DDH) in infants, which can conduct the Graf's method by detecting landmarks in hip ultrasound images. However, it is still necessary to explore more valuable information around these landmarks to enhance feature representation for improving detection performance in the detection model. To this end, a novel Involution Transformer based U-Net (IT-UNet) network is proposed for hip landmark detection. The IT-UNet integrates the efficient involution operation into Transformer to develop an Involution Transformer module (ITM), which consists of an involution attention block and a squeeze-and-excitation involution block. The ITM can capture both the spatial-related information and long-range dependencies from hip ultrasound images to effectively improve feature representation. Moreover, an Involution Downsampling block (IDB) is developed to alleviate the issue of feature loss in the encoder modules, which combines involution and convolution for the purpose of downsampling. The experimental results on two DDH ultrasound datasets indicate that the proposed IT-UNet achieves the best landmark detection performance, indicating its potential applications.
Collapse
|
5
|
Tan Z, Feng J, Lu W, Yin Y, Yang G, Zhou J. Multi-task global optimization-based method for vascular landmark detection. Comput Med Imaging Graph 2024; 114:102364. [PMID: 38432060 DOI: 10.1016/j.compmedimag.2024.102364] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2023] [Revised: 12/04/2023] [Accepted: 02/22/2024] [Indexed: 03/05/2024]
Abstract
Vascular landmark detection plays an important role in medical analysis and clinical treatment. However, due to the complex topology and similar local appearance around landmarks, the popular heatmap regression based methods always suffer from the landmark confusion problem. Vascular landmarks are connected by vascular segments and have special spatial correlations, which can be utilized for performance improvement. In this paper, we propose a multi-task global optimization-based framework for accurate and automatic vascular landmark detection. A multi-task deep learning network is exploited to accomplish landmark heatmap regression, vascular semantic segmentation, and orientation field regression simultaneously. The two auxiliary objectives are highly correlated with the heatmap regression task and help the network incorporate the structural prior knowledge. During inference, instead of performing a max-voting strategy, we propose a global optimization-based post-processing method for final landmark decision. The spatial relationships between neighboring landmarks are utilized explicitly to tackle the landmark confusion problem. We evaluated our method on a cerebral MRA dataset with 564 volumes, a cerebral CTA dataset with 510 volumes, and an aorta CTA dataset with 50 volumes. The experiments demonstrate that the proposed method is effective for vascular landmark localization and achieves state-of-the-art performance.
Collapse
Affiliation(s)
- Zimeng Tan
- Department of Automation, Tsinghua University, Beijing, China
| | - Jianjiang Feng
- Department of Automation, Tsinghua University, Beijing, China.
| | - Wangsheng Lu
- UnionStrong (Beijing) Technology Co.Ltd, Beijing, China
| | - Yin Yin
- UnionStrong (Beijing) Technology Co.Ltd, Beijing, China
| | | | - Jie Zhou
- Department of Automation, Tsinghua University, Beijing, China
| |
Collapse
|