1
|
Rabe M, Kurz C, Thummerer A, Landry G. Artificial intelligence for treatment delivery: image-guided radiotherapy. Strahlenther Onkol 2025; 201:283-297. [PMID: 39138806 DOI: 10.1007/s00066-024-02277-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2024] [Accepted: 07/07/2024] [Indexed: 08/15/2024]
Abstract
Radiation therapy (RT) is a highly digitized field relying heavily on computational methods and, as such, has a high affinity for the automation potential afforded by modern artificial intelligence (AI). This is particularly relevant where imaging is concerned and is especially so during image-guided RT (IGRT). With the advent of online adaptive RT (ART) workflows at magnetic resonance (MR) linear accelerators (linacs) and at cone-beam computed tomography (CBCT) linacs, the need for automation is further increased. AI as applied to modern IGRT is thus one area of RT where we can expect important developments in the near future. In this review article, after outlining modern IGRT and online ART workflows, we cover the role of AI in CBCT and MRI correction for dose calculation, auto-segmentation on IGRT imaging, motion management, and response assessment based on in-room imaging.
Collapse
Affiliation(s)
- Moritz Rabe
- Department of Radiation Oncology, LMU University Hospital, LMU Munich, Marchioninistraße 15, 81377, Munich, Bavaria, Germany
| | - Christopher Kurz
- Department of Radiation Oncology, LMU University Hospital, LMU Munich, Marchioninistraße 15, 81377, Munich, Bavaria, Germany
| | - Adrian Thummerer
- Department of Radiation Oncology, LMU University Hospital, LMU Munich, Marchioninistraße 15, 81377, Munich, Bavaria, Germany
| | - Guillaume Landry
- Department of Radiation Oncology, LMU University Hospital, LMU Munich, Marchioninistraße 15, 81377, Munich, Bavaria, Germany.
- German Cancer Consortium (DKTK), partner site Munich, a partnership between the DKFZ and the LMU University Hospital Munich, Marchioninistraße 15, 81377, Munich, Bavaria, Germany.
- Bavarian Cancer Research Center (BZKF), Marchioninistraße 15, 81377, Munich, Bavaria, Germany.
| |
Collapse
|
2
|
Brignol A, Cheriet F, Aubin-Fournier JF, Fortin C, Laporte C. Robust unsupervised texture segmentation for motion analysis in ultrasound images. Int J Comput Assist Radiol Surg 2025; 20:97-106. [PMID: 39289317 DOI: 10.1007/s11548-024-03249-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2024] [Accepted: 07/29/2024] [Indexed: 09/19/2024]
Abstract
PURPOSE Ultrasound imaging has emerged as a promising cost-effective and portable non-irradiant modality for the diagnosis and follow-up of diseases. Motion analysis can be performed by segmenting anatomical structures of interest before tracking them over time. However, doing so in a robust way is challenging as ultrasound images often display a low contrast and blurry boundaries. METHODS In this paper, a robust descriptor inspired from the fractal dimension is presented to locally characterize the gray-level variations of an image. This descriptor is an adaptive grid pattern whose scale locally varies as the gray-level variations of the image. Robust features are then located based on the gray-level variations, which are more likely to be consistently tracked over time despite the presence of noise. RESULTS The method was validated on three datasets: segmentation of the left ventricle on simulated echocardiography (Dice coefficient, DC), accuracy of diaphragm motion tracking for healthy subjects (mean sum of distances, MSD) and for a scoliosis patient (root mean square error, RMSE). Results show that the method segments the left ventricle accurately ( DC = 0.84 ) and robustly tracks the diaphragm motion for healthy subjects ( MSD = 1.10 mm) and for the scoliosis patient ( RMSE = 1.22 mm). CONCLUSIONS This method has the potential to segment structures of interest according to their texture in an unsupervised fashion, as well as to help analyze the deformation of tissues. Possible applications are not limited to US image. The same principle could also be applied to other medical imaging modalities such as MRI or CT scans.
Collapse
Affiliation(s)
- Arnaud Brignol
- Department of Electrical Engineering, École de technologie supérieure, 1100, Rue Notre-Dame Ouest, Montreal, QC, H3C 1K3, Canada.
| | - Farida Cheriet
- Department of Computer Engineering and Software Engineering, Polytechnique Montréal, 2900, boul. Édouard-Montpetit, Montreal, QC, H3T 1J4, Canada
| | - Jean-François Aubin-Fournier
- Centre de réadaptation Marie-Enfant du CHU Sainte-Justine, 5200, rue Bélanger Est, Montreal, QC, H1T 1C9, Canada
| | - Carole Fortin
- Faculté de médecine, École de réadaptation, 6128, succursale Centre-ville, Montreal, QC, H3C 3J7, Canada
| | - Catherine Laporte
- Department of Electrical Engineering, École de technologie supérieure, 1100, Rue Notre-Dame Ouest, Montreal, QC, H3C 1K3, Canada
| |
Collapse
|
3
|
Salari E, Wang J, Wynne JF, Chang C, Wu Y, Yang X. Artificial intelligence-based motion tracking in cancer radiotherapy: A review. J Appl Clin Med Phys 2024; 25:e14500. [PMID: 39194360 PMCID: PMC11540048 DOI: 10.1002/acm2.14500] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 07/13/2024] [Accepted: 07/27/2024] [Indexed: 08/29/2024] Open
Abstract
Radiotherapy aims to deliver a prescribed dose to the tumor while sparing neighboring organs at risk (OARs). Increasingly complex treatment techniques such as volumetric modulated arc therapy (VMAT), stereotactic radiosurgery (SRS), stereotactic body radiotherapy (SBRT), and proton therapy have been developed to deliver doses more precisely to the target. While such technologies have improved dose delivery, the implementation of intra-fraction motion management to verify tumor position at the time of treatment has become increasingly relevant. Artificial intelligence (AI) has recently demonstrated great potential for real-time tracking of tumors during treatment. However, AI-based motion management faces several challenges, including bias in training data, poor transparency, difficult data collection, complex workflows and quality assurance, and limited sample sizes. This review presents the AI algorithms used for chest, abdomen, and pelvic tumor motion management/tracking for radiotherapy and provides a literature summary on the topic. We will also discuss the limitations of these AI-based studies and propose potential improvements.
Collapse
Affiliation(s)
- Elahheh Salari
- Department of Radiation OncologyEmory UniversityAtlantaGeorgiaUSA
| | - Jing Wang
- Radiation OncologyIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
| | | | - Chih‐Wei Chang
- Department of Radiation OncologyEmory UniversityAtlantaGeorgiaUSA
| | - Yizhou Wu
- School of Electrical and Computer EngineeringGeorgia Institute of TechnologyAtlantaGeorgiaUSA
| | - Xiaofeng Yang
- Department of Radiation OncologyEmory UniversityAtlantaGeorgiaUSA
| |
Collapse
|
4
|
Ma L, Wang J, Gong S, Lan L, Geng L, Wang S, Feng X. Self-supervised context-aware correlation filter for robust landmark tracking in liver ultrasound sequences. BIOMED ENG-BIOMED TE 2024; 69:383-394. [PMID: 38353097 DOI: 10.1515/bmt-2022-0489] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Accepted: 01/05/2024] [Indexed: 08/03/2024]
Abstract
OBJECTIVES Respiratory motion-induced displacement of internal organs poses a significant challenge in image-guided radiation therapy, particularly affecting liver landmark tracking accuracy. METHODS Addressing this concern, we propose a self-supervised method for robust landmark tracking in long liver ultrasound sequences. Our approach leverages a Siamese-based context-aware correlation filter network, trained by using the consistency loss between forward tracking and back verification. By effectively utilizing both labeled and unlabeled liver ultrasound images, our model, Siam-CCF , mitigates the impact of speckle noise and artifacts on ultrasonic image tracking by a context-aware correlation filter. Additionally, a fusion strategy for template patch feature helps the tracker to obtain rich appearance information around the point-landmark. RESULTS Siam-CCF achieves a mean tracking error of 0.79 ± 0.83 mm at a frame rate of 118.6 fps, exhibiting a superior speed-accuracy trade-off on the public MICCAI 2015 Challenge on Liver Ultrasound Tracking (CLUST2015) 2D dataset. This performance won the 5th place on the CLUST2015 2D point-landmark tracking task. CONCLUSIONS Extensive experiments validate the effectiveness of our proposed approach, establishing it as one of the top-performing techniques on the CLUST2015 online leaderboard at the time of this submission.
Collapse
Affiliation(s)
- Lin Ma
- College of Computer Science and Engineering, Chongqing University of Technology, Chongqing, China
| | - Junjie Wang
- College of Computer Science and Engineering, Chongqing University of Technology, Chongqing, China
| | - Shu Gong
- Department of Gastroenterology, Children's Hospital of Chongqing Medical University, Chongqing, China
| | - Libin Lan
- College of Computer Science and Engineering, Chongqing University of Technology, Chongqing, China
| | - Li Geng
- City University of New York NYCCT, New York, USA
| | - Siping Wang
- College of Computer Science and Engineering, Chongqing University of Technology, Chongqing, China
| | - Xin Feng
- College of Computer Science and Engineering, Chongqing University of Technology, Chongqing, China
| |
Collapse
|
5
|
Liu Z, Yang B, Shen Y, Ni X, Tsaftaris SA, Zhou H. Long-short diffeomorphism memory network for weakly-supervised ultrasound landmark tracking. Med Image Anal 2024; 94:103138. [PMID: 38479152 DOI: 10.1016/j.media.2024.103138] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2023] [Revised: 01/26/2024] [Accepted: 03/05/2024] [Indexed: 04/16/2024]
Abstract
Ultrasound is a promising medical imaging modality benefiting from low-cost and real-time acquisition. Accurate tracking of an anatomical landmark has been of high interest for various clinical workflows such as minimally invasive surgery and ultrasound-guided radiation therapy. However, tracking an anatomical landmark accurately in ultrasound video is very challenging, due to landmark deformation, visual ambiguity and partial observation. In this paper, we propose a long-short diffeomorphism memory network (LSDM), which is a multi-task framework with an auxiliary learnable deformation prior to supporting accurate landmark tracking. Specifically, we design a novel diffeomorphic representation, which contains both long and short temporal information stored in separate memory banks for delineating motion margins and reducing cumulative errors. We further propose an expectation maximization memory alignment (EMMA) algorithm to iteratively optimize both the long and short deformation memory, updating the memory queue for mitigating local anatomical ambiguity. The proposed multi-task system can be trained in a weakly-supervised manner, which only requires few landmark annotations for tracking and zero annotation for deformation learning. We conduct extensive experiments on both public and private ultrasound landmark tracking datasets. Experimental results show that LSDM can achieve better or competitive landmark tracking performance with a strong generalization capability across different scanner types and different ultrasound modalities, compared with other state-of-the-art methods.
Collapse
Affiliation(s)
- Zhihua Liu
- School of Computing and Mathematical Sciences, University of Leicester, Leicester, LE1 7RH, UK
| | - Bin Yang
- Department of Cardiovascular Sciences, University Hospitals of Leicester NHS Trust, Leicester, LE1 9HN, UK; Nantong-Leicester Joint Institute of Kidney Science, Department of Nephrology, Affiliated Hospital of Nantong University, Nantong, 226001, China
| | - Yan Shen
- Department of Emergency Medicine, Affiliated Hospital of Nantong University, Nantong, 226001, China
| | - Xuejun Ni
- Department of Emergency Medicine, Affiliated Hospital of Nantong University, Nantong, 226001, China
| | - Sotirios A Tsaftaris
- School of Engineering, The University of Edinburgh, Edinburgh EH9 3FG, UK; The Alan Turing Institute, London NW1 2DB, UK
| | - Huiyu Zhou
- School of Computing and Mathematical Sciences, University of Leicester, Leicester, LE1 7RH, UK.
| |
Collapse
|
6
|
Sun M, Huang W, Zhang H, Shi Y, Wang J, Gong Q, Wang X. Temporal contexts for motion tracking in ultrasound sequences with information bottleneck. Med Phys 2023; 50:5553-5567. [PMID: 36866782 DOI: 10.1002/mp.16339] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Revised: 02/13/2023] [Accepted: 02/18/2023] [Indexed: 03/04/2023] Open
Abstract
BACKGROUND Recently, deep convolutional neural networks (CNNs) have been widely adopted for ultrasound sequence tracking and shown to perform satisfactorily. However, existing trackers ignore the rich temporal contexts that exists between consecutive frames, making it difficult for these trackers to perceive information about the motion of the target. PURPOSE In this paper, we propose a sophisticated method to fully utilize temporal contexts for ultrasound sequences tracking with information bottleneck. This method determines the temporal contexts between consecutive frames to perform both feature extraction and similarity graph refinement, and information bottleneck is integrated into the feature refinement process. METHODS The proposed tracker combined three models. First, online temporal adaptive convolutional neural network (TAdaCNN) is proposed to focus on feature extraction and enhance spatial features using temporal information. Second, information bottleneck (IB) is incorporated to achieve more accurate target tracking by maximally limiting the amount of information in the network and discarding irrelevant information. Finally, we propose temporal adaptive transformer (TA-Trans) that efficiently encodes temporal knowledge by decoding it for similarity graph refinement. The tracker was trained on 2015 MICCAI Challenge on Liver Ultrasound Tracking (CLUST) dataset to evaluate the performance of the proposed method by calculating the tracking error (TE) between the predicted landmarks and the ground truth landmarks for each frame. The experimental results are compared with 13 state-of-the-art methods, and ablation studies are conducted. RESULTS On CLUST 2015 dataset, our proposed model achieves a mean TE of 0.81 ± 0.74 mm and a maximum TE of 1.93 mm for 85 point-landmarks across 39 ultrasound sequences in the 2D sequences. Tracking speed ranged from 41 to 63 frames per second (fps). CONCLUSIONS This study demonstrates a new integrated workflow for ultrasound sequences motion tracking. The results show that the model has excellent accuracy and robustness. Reliable and accurate motion estimation is provided for applications requiring real-time motion estimation in the context of ultrasound-guided radiation therapy.
Collapse
Affiliation(s)
- Mengxue Sun
- School of Information Science and Engineering, Shandong Normal University, Jinan, China
| | - Wenhui Huang
- School of Information Science and Engineering, Shandong Normal University, Jinan, China
| | - Huili Zhang
- Shandong Innovation and Development Research Institute, Jinan, China
| | - Yunfeng Shi
- School of Information Science and Engineering, Shandong Normal University, Jinan, China
| | - Jiale Wang
- School of Information Science and Engineering, Shandong Normal University, Jinan, China
| | - Qingtao Gong
- Ulsan Ship and Ocean College, Ludong University, Yantai, China
| | - Xiaoyan Wang
- Department of Urology, Beijing Tiantan Hospital, Capital Medical University, Beijing, China
| |
Collapse
|
7
|
Huang Y, Jiao J, Yu J, Zheng Y, Wang Y. Si-MSPDNet: A multiscale Siamese network with parallel partial decoders for the 3-D measurement of spines in 3D ultrasonic images. Comput Med Imaging Graph 2023; 108:102262. [PMID: 37385048 DOI: 10.1016/j.compmedimag.2023.102262] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2022] [Revised: 05/26/2023] [Accepted: 06/09/2023] [Indexed: 07/01/2023]
Abstract
Early screening and frequent monitoring effectively decrease the risk of severe scoliosis, but radiation exposure is a consequence of traditional radiograph examinations. Additionally, traditional X-ray images on the coronal or sagittal plane have difficulty providing three-dimensional (3-D) information on spinal deformities. The Scolioscan system provides an innovative 3-D spine imaging approach via ultrasonic scanning, and its feasibility has been demonstrated in numerous studies. In this paper, to further examine the potential of spinal ultrasonic data for describing 3-D spinal deformities, we propose a novel deep-learning tracker named Si-MSPDNet for extracting widely employed landmarks (spinous process (SP)) in ultrasonic images of spines and establish a 3-D spinal profile to measure 3-D spinal deformities. Si-MSPDNet has a Siamese architecture. First, we employ two efficient two-stage encoders to extract features from the uncropped ultrasonic image and the patch centered on the SP cut from the image. Then, a fusion block is designed to strengthen the communication between encoded features and further refine them from channel and spatial perspectives. The SP is a very small target in ultrasonic images, so its representation is weak in the highest-level feature maps. To overcome this, we ignore the highest-level feature maps and introduce parallel partial decoders to localize the SP. The correlation evaluation in the traditional Siamese network is also expanded to multiple scales to enhance cooperation. Furthermore, we propose a binary guided mask based on vertebral anatomical prior knowledge, which can further improve the performance of our tracker by highlighting the potential region with SP. The binary-guided mask is also utilized for fully automatic initialization in tracking. We collected spinal ultrasonic data and corresponding radiographs on the coronal and sagittal planes from 150 patients to evaluate the tracking precision of Si-MSPDNet and the performance of the generated 3-D spinal profile. Experimental results revealed that our tracker achieved a tracking success rate of 100% and a mean IoU of 0.882, outperforming some commonly used tracking and real-time detection models. Furthermore, a high correlation existed on both the coronal and sagittal planes between our projected spinal curve and that extracted from the spinal annotation in X-ray images. The correlation between the tracking results of the SP and their ground truths on other projected planes was also satisfactory. More importantly, the difference in mean curvatures was slight on all projected planes between tracking results and ground truths. Thus, this study effectively demonstrates the promising potential of our 3-D spinal profile extraction method for the 3-D measurement of spinal deformities using 3-D ultrasound data.
Collapse
Affiliation(s)
- Yi Huang
- Biomedical Engineering Center, Fudan University, Shanghai 200433, China
| | - Jing Jiao
- Biomedical Engineering Center, Fudan University, Shanghai 200433, China
| | - Jinhua Yu
- Biomedical Engineering Center, Fudan University, Shanghai 200433, China; Key Laboratory of Medical Imaging Computing and Computer Assisted Intervention of Shanghai, Fudan University, 200433, China
| | - Yongping Zheng
- Department of Biomedical Engineering, The Hong Kong Polytechnic University, Hong Kong Special Administrative Region of China; Research Institute for Smart Ageing, The Hong Kong Polytechnic University, Hong Kong Special Administrative Region of China.
| | - Yuanyuan Wang
- Biomedical Engineering Center, Fudan University, Shanghai 200433, China; Key Laboratory of Medical Imaging Computing and Computer Assisted Intervention of Shanghai, Fudan University, 200433, China.
| |
Collapse
|
8
|
Levin AA, Klimov DD, Nechunaev AA, Prokhorenko LS, Mishchenkov DS, Nosova AG, Astakhov DA, Poduraev YV, Panchenkov DN. Assessment of experimental OpenCV tracking algorithms for ultrasound videos. Sci Rep 2023; 13:6765. [PMID: 37185281 PMCID: PMC10130022 DOI: 10.1038/s41598-023-30930-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2022] [Accepted: 03/03/2023] [Indexed: 05/17/2023] Open
Abstract
This study aims to compare the tracking algorithms provided by the OpenCV library to use on ultrasound video. Despite the widespread application of this computer vision library, few works describe the attempts to use it to track the movement of liver tumors on ultrasound video. Movements of the neoplasms caused by the patient`s breath interfere with the positioning of the instruments during the process of biopsy and radio-frequency ablation. The main hypothesis of the experiment was that tracking neoplasms and correcting the position of the manipulator in case of using robotic-assisted surgery will allow positioning the instruments more precisely. Another goal of the experiment was to check if it is possible to ensure real-time tracking with at least 25 processed frames per second for standard definition video. OpenCV version 4.5.0 was used with 7 tracking algorithms from the extra modules package. They are: Boosting, CSRT, KCF, MedianFlow, MIL, MOSSE, TLD. More than 5600 frames of standard definition were processed during the experiment. Analysis of the results shows that two algorithms-CSRT and KCF-could solve the problem of tumor tracking. They lead the test with 70% and more of Intersection over Union and more than 85% successful searches. They could also be used in real-time processing with an average processing speed of up to frames per second in CSRT and 100 + frames per second for KCF. Tracking results reach the average deviation between centers of neoplasms to 2 mm and maximum deviation less than 5 mm. This experiment also shows that no frames made CSRT and KCF algorithms fail simultaneously. So, the hypothesis for future work is combining these algorithms to work together, with one of them-CSRT-as support for the KCF tracker on the rarely failed frames.
Collapse
Affiliation(s)
- A A Levin
- Moscow State University of Medicine and Dentistry Named After A.I. Evdokimov, 20/1 Delegatskaya ul., Moscow, Russian Federation, 127473.
| | - D D Klimov
- Moscow State University of Medicine and Dentistry Named After A.I. Evdokimov, 20/1 Delegatskaya ul., Moscow, Russian Federation, 127473
| | - A A Nechunaev
- Moscow State University of Medicine and Dentistry Named After A.I. Evdokimov, 20/1 Delegatskaya ul., Moscow, Russian Federation, 127473
| | - L S Prokhorenko
- Moscow State University of Medicine and Dentistry Named After A.I. Evdokimov, 20/1 Delegatskaya ul., Moscow, Russian Federation, 127473
| | - D S Mishchenkov
- Moscow State University of Medicine and Dentistry Named After A.I. Evdokimov, 20/1 Delegatskaya ul., Moscow, Russian Federation, 127473
| | - A G Nosova
- Moscow State University of Medicine and Dentistry Named After A.I. Evdokimov, 20/1 Delegatskaya ul., Moscow, Russian Federation, 127473
| | - D A Astakhov
- Moscow State University of Medicine and Dentistry Named After A.I. Evdokimov, 20/1 Delegatskaya ul., Moscow, Russian Federation, 127473
| | - Y V Poduraev
- Moscow State University of Technology "STANKIN", 1 Vadkovsky per., Moscow, Russian Federation, 127055
| | - D N Panchenkov
- Moscow State University of Medicine and Dentistry Named After A.I. Evdokimov, 20/1 Delegatskaya ul., Moscow, Russian Federation, 127473
| |
Collapse
|
9
|
Du Y, Xie F, Wu G, Chen P, Yang Y, Yang L, Yin L, Wang S. A classification model for detection of ductal carcinoma in situ by Fourier transform infrared spectroscopy based on deep structured semantic model. Anal Chim Acta 2023; 1251:340991. [PMID: 36925283 DOI: 10.1016/j.aca.2023.340991] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2022] [Revised: 01/26/2023] [Accepted: 02/16/2023] [Indexed: 02/19/2023]
Abstract
At present, deep learning is widely used in spectral data processing. Deep learning requires a large amount of data for training, while the collection of biological serum spectra is limited by sample numbers and labor costs, so it is impractical to obtain a large amount of serum spectral data for disease detection. In this study, we propose a spectral classification model based on the deep structured semantic model (DSSM) and successfully apply it to Fourier Transform Infrared (FT-IR) spectroscopy for ductal carcinoma in situ (DCIS) detection. Compared with the traditional deep learning model, we match the spectral data into positive and negative pairs according to whether the spectra are from the same category. The DSSM structure is constructed by extracting features according to the spectral similarity of spectra pairs. This new construction model increases the data amount used for model training and reduces the dimension of spectral data. Firstly, the FT-IR spectra are paired. The spectra pairs are labeled as positive pairs if they come from the same category, and the spectra pairs are labeled as negative pairs if they come from different categories. Secondly, two spectra in each spectra pair are put into two deep neural networks of the DSSM structure separately. Then the spectral similarity between the output feature maps of two deep neural networks is calculated. The DSSM structure is trained by maximizing the conditional likelihood of the spectra pairs from the same category. Thirdly, after the training of DSSM is done, the training set and testing set are input into two deep neural networks separately. The output feature maps of the training set are put into the reference library. Lastly, the k-nearest neighbor (KNN) model is used for classification according to Euclidean distances between the output feature map of each unknown sample to the reference library. The category of the unknown sample is judged according to the categories of k nearest samples. We also use principal component analysis (PCA) to reduce dimension for comparison. The accuracies of the KNN model, principal component analysis-k nearest neighbor (PCA-KNN) model, and deep structured semantic model-k nearest neighbor (DSSM-KNN) model are 78.8%, 72.7%, and 97.0%, which proves that our proposed model has higher accuracy.
Collapse
Affiliation(s)
- Yu Du
- School of Electronic Engineering, Beijing University of Posts and Telecommunications, Beijing, 100876, China
| | - Fei Xie
- Department of Breast Center, Peking University People's Hospital, Beijing, 100044, China
| | - Guohua Wu
- School of Electronic Engineering, Beijing University of Posts and Telecommunications, Beijing, 100876, China.
| | - Peng Chen
- School of Electronic Engineering, Beijing University of Posts and Telecommunications, Beijing, 100876, China
| | - Yang Yang
- Department of Breast Center, Peking University People's Hospital, Beijing, 100044, China
| | - Liu Yang
- Department of Breast Center, Peking University People's Hospital, Beijing, 100044, China
| | - Longfei Yin
- School of Electronic Engineering, Beijing University of Posts and Telecommunications, Beijing, 100876, China
| | - Shu Wang
- Department of Breast Center, Peking University People's Hospital, Beijing, 100044, China.
| |
Collapse
|
10
|
Seo J, Nguon LS, Park S. Vascular wall motion detection models based on long short-term memory in plane-wave-based ultrasound imaging. Phys Med Biol 2023; 68:075005. [PMID: 36881926 DOI: 10.1088/1361-6560/acc238] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Accepted: 03/07/2023] [Indexed: 03/09/2023]
Abstract
Objective.Vascular wall motion can be used to diagnose cardiovascular diseases. In this study, long short-term memory (LSTM) neural networks were used to track vascular wall motion in plane-wave-based ultrasound imaging.Approach.The proposed LSTM and convolutional LSTM (ConvLSTM) models were trained using ultrasound data from simulations and tested experimentally using a tissue-mimicking vascular phantom and anin vivostudy using a carotid artery. The performance of the models in the simulation was evaluated using the mean square error from axial and lateral motions and compared with the cross-correlation (XCorr) method. Statistical analysis was performed using the Bland-Altman plot, Pearson correlation coefficient, and linear regression in comparison with the manually annotated ground truth.Main results.For thein vivodata, the median error and 95% limit of agreement from the Bland-Altman analysis were (0.01, 0.13), (0.02, 0.19), and (0.03, 0.18), the Pearson correlation coefficients were 0.97, 0.94, and 0.94, respectively, and the linear equations were 0.89x+ 0.02, 0.84x+ 0.03, and 0.88x+ 0.03 from linear regression for the ConvLSTM model, LSTM model, and XCorr method, respectively. In the longitudinal and transverse views of the carotid artery, the LSTM-based models outperformed the XCorr method. Overall, the ConvLSTM model was superior to the LSTM model and XCorr method.Significance.This study demonstrated that vascular wall motion can be tracked accurately and precisely using plane-wave-based ultrasound imaging and the proposed LSTM-based models.
Collapse
Affiliation(s)
- Jeongwung Seo
- Department of Electronic and Electrical Engineering, Ewha Womans University, Seoul 03760, Republic of Korea
| | - Leang Sim Nguon
- Department of Electronic and Electrical Engineering, Ewha Womans University, Seoul 03760, Republic of Korea
| | - Suhyun Park
- Department of Electronic and Electrical Engineering, Ewha Womans University, Seoul 03760, Republic of Korea
| |
Collapse
|
11
|
Bechar MEA, Guyader JM, El Bouz M, Douet-Guilbert N, Al Falou A, Troadec MB. Highly Performing Automatic Detection of Structural Chromosomal Abnormalities Using Siamese Architecture. J Mol Biol 2023; 435:168045. [PMID: 36906061 DOI: 10.1016/j.jmb.2023.168045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2022] [Revised: 03/03/2023] [Accepted: 03/06/2023] [Indexed: 03/11/2023]
Abstract
The detection of structural chromosomal abnormalities (SCA) is crucial for diagnosis, prognosis and management of many genetic diseases and cancers. This detection, done by highly qualified medical experts, is tedious and time-consuming. We propose a highly performing and intelligent method to assist cytogeneticists to screen for SCA. Each chromosome is present in two copies that make up a pair of chromosomes. Usually, SCA are present in only one copy of the pair. Convolutional neural networks (CNN) with Siamese architecture are particularly relevant for evaluating similarities between two images, which is why we used this method to detect abnormalities between both chromosomes of a given pair. As a proof-of-concept, we first focused on a deletion occurring on chromosome 5 (del(5q)) observed in hematological malignancies. Using our dataset, we conducted several experiments without and with data augmentation on seven popular CNN models. Overall, performances obtained were very relevant for detecting deletions, particularly with Xception and InceptionResNetV2 models achieving 97.50% and 97.01% of F1-score, respectively. We additionally demonstrated that these models successfully recognized another SCA, inversion inv(3), which is one of the most difficult SCA to detect. The performance improved when the training was applied on inversion inv(3) dataset, achieving 94.82% of F1-score. The technique that we propose in this paper is the first highly performing method based on Siamese architecture that allows the detection of SCA. Our code is publicly available at: https://github.com/MEABECHAR/ChromosomeSiameseAD.
Collapse
Affiliation(s)
| | | | | | - Nathalie Douet-Guilbert
- University of Brest, Inserm, EFS, UMR 1078, GGB, 29200 Brest, France; CHRU Brest, Service de génétique, Laboratoire de génétique chromosomique, 29200 Brest, France; Centre de ressources biologiques, Site cytogénétique, CHRU Brest, 29200 Brest, France
| | | | - Marie-Bérengère Troadec
- University of Brest, Inserm, EFS, UMR 1078, GGB, 29200 Brest, France; CHRU Brest, Service de génétique, Laboratoire de génétique chromosomique, 29200 Brest, France; Centre de ressources biologiques, Site cytogénétique, CHRU Brest, 29200 Brest, France.
| |
Collapse
|
12
|
Heinrich MP, Siebert H, Graf L, Mischkewitz S, Hansen L. Robust and Realtime Large Deformation Ultrasound Registration Using End-to-End Differentiable Displacement Optimisation. SENSORS (BASEL, SWITZERLAND) 2023; 23:2876. [PMID: 36991588 PMCID: PMC10056872 DOI: 10.3390/s23062876] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/30/2022] [Revised: 02/19/2023] [Accepted: 02/22/2023] [Indexed: 06/19/2023]
Abstract
Image registration for temporal ultrasound sequences can be very beneficial for image-guided diagnostics and interventions. Cooperative human-machine systems that enable seamless assistance for both inexperienced and expert users during ultrasound examinations rely on robust, realtime motion estimation. Yet rapid and irregular motion patterns, varying image contrast and domain shifts in imaging devices pose a severe challenge to conventional realtime registration approaches. While learning-based registration networks have the promise of abstracting relevant features and delivering very fast inference times, they come at the potential risk of limited generalisation and robustness for unseen data; in particular, when trained with limited supervision. In this work, we demonstrate that these issues can be overcome by using end-to-end differentiable displacement optimisation. Our method involves a trainable feature backbone, a correlation layer that evaluates a large range of displacement options simultaneously and a differentiable regularisation module that ensures smooth and plausible deformation. In extensive experiments on public and private ultrasound datasets with very sparse ground truth annotation the method showed better generalisation abilities and overall accuracy than a VoxelMorph network with the same feature backbone, while being two times faster at inference.
Collapse
Affiliation(s)
- Mattias P. Heinrich
- Institute of Medical Informatics, Universität zu Lübeck, 23562 Lübeck, Germany
| | - Hanna Siebert
- Institute of Medical Informatics, Universität zu Lübeck, 23562 Lübeck, Germany
| | - Laura Graf
- Institute of Medical Informatics, Universität zu Lübeck, 23562 Lübeck, Germany
| | | | | |
Collapse
|
13
|
Wang Y, Fu T, Wang Y, Xiao D, Lin Y, Fan J, Song H, Liu F, Yang J. Multi 3: multi-templates siamese network with multi-peaks detection and multi-features refinement for target tracking in ultrasound image sequences. Phys Med Biol 2022; 67. [DOI: 10.1088/1361-6560/ac9032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2022] [Accepted: 09/07/2022] [Indexed: 11/11/2022]
Abstract
Abstract
Objective. Radiation therapy requires a precise target location. However, respiratory motion increases the uncertainties of the target location. Accurate and robust tracking is significant for improving operation accuracy. Approach. In this work, we propose a tracking framework Multi3, including a multi-templates Siamese network, multi-peaks detection and multi-features refinement, for target tracking in ultrasound sequences. Specifically, we use two templates to provide the location and deformation of the target for robust tracking. Multi-peaks detection is applied to extend the set of potential target locations, and multi-features refinement is designed to select an appropriate location as the tracking result by quality assessment. Main results. The proposed Multi3 is evaluated on a public dataset, i.e. MICCAI 2015 challenge on liver ultrasound tracking (CLUST), and our clinical dataset provided by the Chinese People’s Liberation Army General Hospital. Experimental results show that Multi3 achieves accurate and robust tracking in ultrasound sequences (0.75 ± 0.62 mm and 0.51 ± 0.32 mm tracking errors in two datasets). Significance. The proposed Multi3 is the most robust method on the CLUST 2D benchmark set, exhibiting potential in clinical practice.
Collapse
|
14
|
Wasih M, Almekkawy M. Analysis of Advanced Siamese Neural Networks for Motion Tracking of Sonography of Carotid Arteries. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2022; 2022:2173-2176. [PMID: 36086192 DOI: 10.1109/embc48229.2022.9871782] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
The Siamese Tracker (ST) for tracking objects of interest in Ultrasound (US) images does not incorporate video specific cues and assumes a fixed template of the reference block. Recently, a more advanced version of ST, Correlation Filter Network (CFNet), which overcomes the problems of ST, has been used for tracking in US images. In this study, we demonstrate how the basic CFNet can be made computationally more efficient by reducing the number of layers in its feature extraction network. We further show that due to the unique architecture of the CFNet, this strategy does not affect the performance of the baseline CFNet considerably. Our methodology was evaluated on 10 random sequences from the publicly available carotid artery dataset. CFNet obtained a 35.7% improvement in the average localization error over the basic ST, thus demonstrating that it is a practical and robust tracking algorithm for tracking objects in US images.
Collapse
|
15
|
Wu C, Fu T, Wang Y, Lin Y, Wang Y, Ai D, Fan J, Song H, Yang J. Fusion Siamese network with drift correction for target tracking in ultrasound sequences. Phys Med Biol 2022; 67. [DOI: 10.1088/1361-6560/ac4fa1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2022] [Accepted: 01/27/2022] [Indexed: 12/25/2022]
Abstract
Abstract
Motion tracking techniques can revise the bias arising from respiration-caused motion in radiation therapy. Tracking key structures accurately and at a real-time speed is necessary for effective motion tracking. In this work, we propose a fusion Siamese network with drift correction for target tracking in ultrasound sequences. Specifically, the network fuses four response maps generated by the cross-correlation between convolution layers at different resolutions to reduce up-sampling error. A correction strategy combining local structural similarity and target trajectory is proposed to revise the target drift predicted by the network. Moreover, a coarse-to-fine strategy is proposed to train the network with a limited number of annotated images, in which an augmented dataset is generated by corner points to learn network features with high generalizability. The proposed method is evaluated on the basis of the public dataset of the MICCAI 2015 Challenge on Liver UltraSound Tracking (CLUST) and our ultrasound image dataset, which is provided by the Chinese People’s Liberation Army General Hospital (CPLAGH). A tracking error of 0.80 ± 1.16 mm is observed for 85 targets across 39 ultrasound sequences in the CLUST dataset. A tracking error of 0.61 ± 0.36 mm is observed for 20 targets across 10 ultrasound sequences in the CPLAGH dataset. The effectiveness of the proposed fusion and correction strategies is verified via two ablation experiments. Overall, the experimental results demonstrate the effectiveness of the proposed fusion Siamese network with drift correction and reveal its potential in clinical practice.
Collapse
|
16
|
Wasih M, Almekkawy M. A Novel Deep Learning Approach for Tracking Regions of Interest in Ultrasound Images . ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2021; 2021:4095-4098. [PMID: 34892128 DOI: 10.1109/embc46164.2021.9631026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Due to their great success in learning a universal object similarity metric, Siamese Trackers have been adopted for motion tracking a Region of Interest (ROI) in Ultrasound (US) image sequences. However, these Fully Convolutional Siamese networks (SiamFC) offer no online adaptation of the network and fail to take cues from the input sequence. The more recent Correlation Filter Networks (CFNet) solve this problem by learning the reference template online using a Correlation Filter layer. In this work, we use the CFNet as our backbone model and propose an advanced tracking algorithm (Seq-CFNet) for tracking an ROI in US sequences by constructing a sequential cascade of two identical CFNet. The cascade with CFNet is novel and offers practical benefits in tracking accuracy. Our method is evaluated on 10 different sequences of a Carotid Artery (CA) dataset to track the transverse section of the carotid artery. Results show that Seq-CFNet obtains better Root Mean Square Error (RMSE) values than the baseline CFNet as well as SiamFC, without significantly compromising the speed.
Collapse
|