1
|
Kumar V, Sharma NM, Mahapatra PK, Dogra N, Maurya L, Ahmad F, Dahiya N, Panda P. Enhancing Left Ventricular Segmentation in Echocardiograms Through GAN-Based Synthetic Data Augmentation and MultiResUNet Architecture. Diagnostics (Basel) 2025; 15:663. [PMID: 40150006 PMCID: PMC11940873 DOI: 10.3390/diagnostics15060663] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2025] [Revised: 02/23/2025] [Accepted: 02/26/2025] [Indexed: 03/29/2025] Open
Abstract
Background: Accurate segmentation of the left ventricle in echocardiograms is crucial for the diagnosis and monitoring of cardiovascular diseases. However, this process is hindered by the limited availability of high-quality annotated datasets and the inherent complexities of echocardiogram images. Traditional methods often struggle to generalize across varying image qualities and conditions, necessitating a more robust solution. Objectives: This study aims to enhance left ventricular segmentation in echocardiograms by developing a framework that integrates Generative Adversarial Networks (GANs) for synthetic data augmentation with a MultiResUNet architecture, providing a more accurate and reliable segmentation method. Methods: We propose a GAN-based framework that generates synthetic echocardiogram images and their corresponding segmentation masks, augmenting the available training data. The synthetic data, along with real echocardiograms from the EchoNet-Dynamic dataset, were used to train the MultiResUNet architecture. MultiResUNet incorporates multi-resolution blocks, residual connections, and attention mechanisms to effectively capture fine details at multiple scales. Additional enhancements include atrous spatial pyramid pooling (ASPP) and scaled exponential linear units (SELUs) to further improve segmentation accuracy. Results: The proposed approach significantly outperforms existing methods, achieving a Dice Similarity Coefficient of 95.68% and an Intersection over Union (IoU) of 91.62%. This represents improvements of 2.58% in Dice and 4.84% in IoU over previous segmentation techniques, demonstrating the effectiveness of GAN-based augmentation in overcoming data scarcity and improving segmentation performance. Conclusions: The integration of GAN-generated synthetic data and the MultiResUNet architecture provides a robust and accurate solution for left ventricular segmentation in echocardiograms. This approach has the potential to enhance clinical decision-making in cardiovascular medicine by improving the accuracy of automated diagnostic tools, even in the presence of limited and complex training data.
Collapse
Affiliation(s)
- Vikas Kumar
- CSIR-Central Scientific Instruments Organisation (CSIR-CSIO), Chandigarh 160030, India; (V.K.); (N.M.S.)
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India
| | - Nitin Mohan Sharma
- CSIR-Central Scientific Instruments Organisation (CSIR-CSIO), Chandigarh 160030, India; (V.K.); (N.M.S.)
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India
| | - Prasant K. Mahapatra
- CSIR-Central Scientific Instruments Organisation (CSIR-CSIO), Chandigarh 160030, India; (V.K.); (N.M.S.)
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India
| | - Neeti Dogra
- Anaesthesia and Intensive Care, Postgraduate Institute of Medical Education and Research, Chandigarh 160012, India; (N.D.); (N.D.); (P.P.)
| | - Lalit Maurya
- School of Computing, University of Portsmouth, Portsmouth PO1 3HE, UK;
- Portsmouth Artificial Intelligence and Data Science Centre (PAIDS), University of Portsmouth, Portsmouth PO1 3HE, UK
| | - Fahad Ahmad
- School of Computing, University of Portsmouth, Portsmouth PO1 3HE, UK;
- Portsmouth Artificial Intelligence and Data Science Centre (PAIDS), University of Portsmouth, Portsmouth PO1 3HE, UK
| | - Neelam Dahiya
- Anaesthesia and Intensive Care, Postgraduate Institute of Medical Education and Research, Chandigarh 160012, India; (N.D.); (N.D.); (P.P.)
| | - Prashant Panda
- Anaesthesia and Intensive Care, Postgraduate Institute of Medical Education and Research, Chandigarh 160012, India; (N.D.); (N.D.); (P.P.)
| |
Collapse
|
2
|
Germain P, Labani A, Vardazaryan A, Padoy N, Roy C, El Ghannudi S. Segmentation-Free Estimation of Left Ventricular Ejection Fraction Using 3D CNN Is Reliable and Improves as Multiple Cardiac MRI Cine Orientations Are Combined. Biomedicines 2024; 12:2324. [PMID: 39457634 PMCID: PMC11505352 DOI: 10.3390/biomedicines12102324] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2024] [Revised: 09/24/2024] [Accepted: 10/10/2024] [Indexed: 10/28/2024] Open
Abstract
OBJECTIVES We aimed to study classical, publicly available convolutional neural networks (3D-CNNs) using a combination of several cine-MR orientation planes for the estimation of left ventricular ejection fraction (LVEF) without contour tracing. METHODS Cine-MR examinations carried out on 1082 patients from our institution were analysed by comparing the LVEF provided by the CVI42 software (V5.9.3) with the estimation resulting from different 3D-CNN models and various combinations of long- and short-axis orientation planes. RESULTS The 3D-Resnet18 architecture appeared to be the most favourable, and the results gradually and significantly improved as several long-axis and short-axis planes were combined. Simply pasting multiple orientation views into composite frames increased performance. Optimal results were obtained by pasting two long-axis views and six short-axis views. The best configuration provided an R2 = 0.83, a mean absolute error (MAE) = 4.97, and a root mean square error (RMSE) = 6.29; the area under the ROC curve (AUC) for the classification of LVEF < 40% was 0.99, and for the classification of LVEF > 60%, the AUC was 0.97. Internal validation performed on 149 additional patients after model training provided very similar results (MAE 4.98). External validation carried out on 62 patients from another institution showed an MAE of 6.59. Our results in this area are among the most promising obtained to date using CNNs with cardiac magnetic resonance. CONCLUSION (1) The use of traditional 3D-CNNs and a combination of multiple orientation planes is capable of estimating LVEF from cine-MRI data without segmenting ventricular contours, with a reliability similar to that of traditional methods. (2) Performance significantly improves as the number of orientation planes increases, providing a more complete view of the left ventricle.
Collapse
Affiliation(s)
- Philippe Germain
- Department of Radiology, Nouvel Hopital Civil, University Hospital, 67091 Strasbourg, France; (A.L.); (C.R.); (S.E.G.)
| | - Aissam Labani
- Department of Radiology, Nouvel Hopital Civil, University Hospital, 67091 Strasbourg, France; (A.L.); (C.R.); (S.E.G.)
| | - Armine Vardazaryan
- ICube, University of Strasbourg, CNRS, 67000 Strasbourg, France; (A.V.); (N.P.)
| | - Nicolas Padoy
- ICube, University of Strasbourg, CNRS, 67000 Strasbourg, France; (A.V.); (N.P.)
| | - Catherine Roy
- Department of Radiology, Nouvel Hopital Civil, University Hospital, 67091 Strasbourg, France; (A.L.); (C.R.); (S.E.G.)
| | - Soraya El Ghannudi
- Department of Radiology, Nouvel Hopital Civil, University Hospital, 67091 Strasbourg, France; (A.L.); (C.R.); (S.E.G.)
- Department of Nuclear Medicine, Nouvel Hopital Civil, University Hospital, 67091 Strasbourg, France
| |
Collapse
|
3
|
Wu H, Qu G, Xiao Z, Chunyu F. Enhancing left ventricular segmentation in echocardiography with a modified mixed attention mechanism in SegFormer architecture. Heliyon 2024; 10:e34845. [PMID: 39170227 PMCID: PMC11336270 DOI: 10.1016/j.heliyon.2024.e34845] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2024] [Revised: 07/16/2024] [Accepted: 07/17/2024] [Indexed: 08/23/2024] Open
Abstract
Echocardiography is a key tool for the diagnosis of cardiac diseases, and accurate left ventricular (LV) segmentation in echocardiographic videos is crucial for the assessment of cardiac function. However, since semantic segmentation of video needs to take into account the temporal correlation between frames, this makes the task very challenging. This article introduces an innovative method that incorporates a modified mixed attention mechanism into the SegFormer architecture, enabling it to effectively grasp the temporal correlation present in video data. The proposed method processes each time series by encoding the image input into the encoder to obtain the current time feature map. This map, along with the historical time feature map, is then fed into a time-sensitive mixed attention mechanism type of convolution block attention module (TCBAM). Its output can serve as the historical time feature map for the subsequent sequence, and a combination of the current time feature map and historical time feature map for the current sequence. The processed feature map is then input into the Multilayer Perceptron (MLP) and subsequent networks to generate the final segmented image. Through extensive experiments conducted on two different datasets: Hamad Medical Corporation, Tampere University, and Qatar University (HMC-QU), Cardiac Acquisitions for Multi-structure Ultrasound Segmentation (CAMUS) and Sunnybrook Cardiac Data (SCD), achieving a Dice coefficient of 97.92 % on the SCD dataset and an F1 score of 0.9263 on the CAMUS dataset, outperforming all other models. This research provides a promising solution to the temporal modeling challenge in video semantic segmentation tasks using transformer-based models and points out a promising direction for future research in this field.
Collapse
Affiliation(s)
- Hanqiong Wu
- Internal Medicine, The First Hospital of Jinzhou Medical University, Jinzhou, 121001, China
| | - Gangrong Qu
- Cardiovascular Medicine, Chongqing General Hospital of the Armed Police Force, Chongqing, 400061, China
| | - Zhifeng Xiao
- China Nanhu Academy of Electronics and Information Technology, Jiaxing, 314050, China
| | - Fan Chunyu
- Department of Cardiovascular Medicine, The People's Hospital of Liaoning Province, Shengyang, 110067, China
| |
Collapse
|
4
|
G S, Gopalakrishnan U, Parthinarupothi RK, Madathil T. Deep learning supported echocardiogram analysis: A comprehensive review. Artif Intell Med 2024; 151:102866. [PMID: 38593684 DOI: 10.1016/j.artmed.2024.102866] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2023] [Revised: 03/20/2024] [Accepted: 03/30/2024] [Indexed: 04/11/2024]
Abstract
An echocardiogram is a sophisticated ultrasound imaging technique employed to diagnose heart conditions. The transthoracic echocardiogram, one of the most prevalent types, is instrumental in evaluating significant cardiac diseases. However, interpreting its results heavily relies on the clinician's expertise. In this context, artificial intelligence has emerged as a vital tool for helping clinicians. This study critically analyzes key state-of-the-art research that uses deep learning techniques to automate transthoracic echocardiogram analysis and support clinical judgments. We have systematically organized and categorized articles that proffer solutions for view classification, enhancement of image quality and dataset, segmentation and identification of cardiac structures, detection of cardiac function abnormalities, and quantification of cardiac functions. We compared the performance of various deep learning approaches within each category, identifying the most promising methods. Additionally, we highlight limitations in current research and explore promising avenues for future exploration. These include addressing generalizability issues, incorporating novel AI approaches, and tackling the analysis of rare cardiac diseases.
Collapse
Affiliation(s)
- Sanjeevi G
- Center for Wireless Networks & Applications (WNA), Amrita Vishwa Vidyapeetham, Amritapuri, India
| | - Uma Gopalakrishnan
- Center for Wireless Networks & Applications (WNA), Amrita Vishwa Vidyapeetham, Amritapuri, India.
| | | | - Thushara Madathil
- Department of Cardiac Anesthesiology, Amrita Institute of Medical Sciences and Research Center, Kochi, India
| |
Collapse
|
5
|
Vafaeezadeh M, Behnam H, Gifani P. Ultrasound Image Analysis with Vision Transformers-Review. Diagnostics (Basel) 2024; 14:542. [PMID: 38473014 DOI: 10.3390/diagnostics14050542] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2023] [Revised: 02/22/2024] [Accepted: 02/29/2024] [Indexed: 03/14/2024] Open
Abstract
Ultrasound (US) has become a widely used imaging modality in clinical practice, characterized by its rapidly evolving technology, advantages, and unique challenges, such as a low imaging quality and high variability. There is a need to develop advanced automatic US image analysis methods to enhance its diagnostic accuracy and objectivity. Vision transformers, a recent innovation in machine learning, have demonstrated significant potential in various research fields, including general image analysis and computer vision, due to their capacity to process large datasets and learn complex patterns. Their suitability for automatic US image analysis tasks, such as classification, detection, and segmentation, has been recognized. This review provides an introduction to vision transformers and discusses their applications in specific US image analysis tasks, while also addressing the open challenges and potential future trends in their application in medical US image analysis. Vision transformers have shown promise in enhancing the accuracy and efficiency of ultrasound image analysis and are expected to play an increasingly important role in the diagnosis and treatment of medical conditions using ultrasound imaging as technology progresses.
Collapse
Affiliation(s)
- Majid Vafaeezadeh
- Biomedical Engineering Department, School of Electrical Engineering, Iran University of Science and Technology, Tehran 1311416846, Iran
| | - Hamid Behnam
- Biomedical Engineering Department, School of Electrical Engineering, Iran University of Science and Technology, Tehran 1311416846, Iran
| | - Parisa Gifani
- Medical Sciences and Technologies Department, Science and Research Branch, Islamic Azad University, Tehran 1477893855, Iran
| |
Collapse
|