1
|
Li J, Wang K, Jiang X. Robust Multi-Subtype Identification of Breast Cancer Pathological Images Based on a Dual-Branch Frequency Domain Fusion Network. SENSORS (BASEL, SWITZERLAND) 2025; 25:240. [PMID: 39797031 PMCID: PMC11723249 DOI: 10.3390/s25010240] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/12/2024] [Revised: 12/27/2024] [Accepted: 01/01/2025] [Indexed: 01/13/2025]
Abstract
Breast cancer (BC) is one of the most lethal cancers worldwide, and its early diagnosis is critical for improving patient survival rates. However, the extraction of key information from complex medical images and the attainment of high-precision classification present a significant challenge. In the field of signal processing, texture-rich images typically exhibit periodic patterns and structures, which are manifested as significant energy concentrations at specific frequencies in the frequency domain. Given the above considerations, this study is designed to explore the application of frequency domain analysis in BC histopathological classification. This study proposes the dual-branch adaptive frequency domain fusion network (AFFNet), designed to enable each branch to specialize in distinct frequency domain features of pathological images. Additionally, two different frequency domain approaches, namely Multi-Spectral Channel Attention (MSCA) and Fourier Filtering Enhancement Operator (FFEO), are employed to enhance the texture features of pathological images and minimize information loss. Moreover, the contributions of the two branches at different stages are dynamically adjusted by a frequency-domain-adaptive fusion strategy to accommodate the complexity and multi-scale features of pathological images. The experimental results, based on two public BC histopathological image datasets, corroborate the idea that AFFNet outperforms 10 state-of-the-art image classification methods, underscoring its effectiveness and superiority in this domain.
Collapse
Affiliation(s)
- Jianjun Li
- School of Mechanical and Electrical Engineering, China Jiliang University, Hangzhou 310018, China
| | | | | |
Collapse
|
2
|
Książek W. Explainable Thyroid Cancer Diagnosis Through Two-Level Machine Learning Optimization with an Improved Naked Mole-Rat Algorithm. Cancers (Basel) 2024; 16:4128. [PMID: 39766028 PMCID: PMC11674737 DOI: 10.3390/cancers16244128] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2024] [Revised: 12/06/2024] [Accepted: 12/06/2024] [Indexed: 01/11/2025] Open
Abstract
Modern technologies, particularly artificial intelligence methods such as machine learning, hold immense potential for supporting doctors with cancer diagnostics. This study explores the enhancement of popular machine learning methods using a bio-inspired algorithm-the naked mole-rat algorithm (NMRA)-to assess the malignancy of thyroid tumors. The study utilized a novel dataset released in 2022, containing data collected at Shengjing Hospital of China Medical University. The dataset comprises 1232 records described by 19 features. In this research, 10 well-known classifiers, including XGBoost, LightGBM, and random forest, were employed to evaluate the malignancy of thyroid tumors. A key innovation of this study is the application of the naked mole-rat algorithm for parameter optimization and feature selection within the individual classifiers. Among the models tested, the LightGBM classifier demonstrated the highest performance, achieving a classification accuracy of 81.82% and an F1-score of 86.62%, following two-level parameter optimization and feature selection using the naked mole-rat algorithm. Additionally, explainability analysis of the LightGBM model was conducted using SHAP values, providing insights into the decision-making process of the model.
Collapse
Affiliation(s)
- Wojciech Książek
- Department of Computer Science, Faculty of Computer Science and Telecommunications, Cracow University of Technology, Warszawska 24, 31-155 Cracow, Poland
| |
Collapse
|
3
|
Shah AA, Daud A, Bukhari A, Alshemaimri B, Ahsan M, Younis R. DEL-Thyroid: deep ensemble learning framework for detection of thyroid cancer progression through genomic mutation. BMC Med Inform Decis Mak 2024; 24:198. [PMID: 39039464 PMCID: PMC11533268 DOI: 10.1186/s12911-024-02604-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2024] [Accepted: 07/10/2024] [Indexed: 07/24/2024] Open
Abstract
Genes, expressed as sequences of nucleotides, are susceptible to mutations, some of which can lead to cancer. Machine learning and deep learning methods have emerged as vital tools in identifying mutations associated with cancer. Thyroid cancer ranks as the 5th most prevalent cancer in the USA, with thousands diagnosed annually. This paper presents an ensemble learning model leveraging deep learning techniques such as Long Short-Term Memory (LSTM), Gated Recurrent Units (GRUs), and Bi-directional LSTM (Bi-LSTM) to detect thyroid cancer mutations early. The model is trained on a dataset sourced from asia.ensembl.org and IntOGen.org, consisting of 633 samples with 969 mutations across 41 genes, collected from individuals of various demographics. Feature extraction encompasses techniques including Hahn moments, central moments, raw moments, and various matrix-based methods. Evaluation employs three testing methods: self-consistency test (SCT), independent set test (IST), and 10-fold cross-validation test (10-FCVT). The proposed ensemble learning model demonstrates promising performance, achieving 96% accuracy in the independent set test (IST). Statistical measures such as training accuracy, testing accuracy, recall, sensitivity, specificity, Mathew's Correlation Coefficient (MCC), loss, training accuracy, F1 Score, and Cohen's kappa are utilized for comprehensive evaluation.
Collapse
Affiliation(s)
- Asghar Ali Shah
- Center of Excellence in Artificial Intelligence (CoE-AI), Department of Computer Science, Bahria University, Islamabad, 04408, Pakistan
| | - Ali Daud
- Faculty of Resilience, Rabdan Academy, Abu Dhabi, United Arab Emirates.
| | - Amal Bukhari
- Department of Information Systems and Technology, Collage of Computer Science and Engineering, University of Jeddah, Jeddah, Saudi Arabia
| | - Bader Alshemaimri
- Software Engineering Department, College of Computing and Information Sciences, King Saud University, Riyadh, Saudi Arabia
| | - Muhammad Ahsan
- Department of Computer Science, University of Alabama at Birmingham, 1402 10th Avenue S, Birmingham, AL, 35294, USA
| | - Rehmana Younis
- College of Letters and Sciences, Graduate Student of Robotics Engineering, Columbus State University, Columbus, USA
| |
Collapse
|
4
|
Sánchez-Morales ME, Guillen-Bonilla JT, Guillen-Bonilla H, Guillen-Bonilla A, Aguilar-Santiago J, Jiménez-Rodríguez M. Vectorial Image Representation for Image Classification. J Imaging 2024; 10:48. [PMID: 38392096 PMCID: PMC10889765 DOI: 10.3390/jimaging10020048] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2023] [Revised: 01/30/2024] [Accepted: 02/06/2024] [Indexed: 02/24/2024] Open
Abstract
This paper proposes the transformation S→C→, where S is a digital gray-level image and C→ is a vector expressed through the textural space. The proposed transformation is denominated Vectorial Image Representation on the Texture Space (VIR-TS), given that the digital image S is represented by the textural vector C→. This vector C→ contains all of the local texture characteristics in the image of interest, and the texture unit T→ entertains a vectorial character, since it is defined through the resolution of a homogeneous equation system. For the application of this transformation, a new classifier for multiple classes is proposed in the texture space, where the vector C→ is employed as a characteristics vector. To verify its efficiency, it was experimentally deployed for the recognition of digital images of tree barks, obtaining an effective performance. In these experiments, the parametric value λ employed to solve the homogeneous equation system does not affect the results of the image classification. The VIR-TS transform possesses potential applications in specific tasks, such as locating missing persons, and the analysis and classification of diagnostic and medical images.
Collapse
Affiliation(s)
- Maria-Eugenia Sánchez-Morales
- Departamento de Ciencias Tecnológicas, Centro Universitario de la Ciénega, Universidad de Guadalajara, Av. Universidad No. 1115, Lindavista, Ocotlán 47810, Jalisco, Mexico
| | - José-Trinidad Guillen-Bonilla
- Departamento de Electro-Fotónica, Centro Universitario de Ciencias Exactas e Ingenierías, Universidad de Guadalajara, Blvd. M. García Barragán 1421, Guadalajara 44410, Jalisco, Mexico
| | - Héctor Guillen-Bonilla
- Departamento de Ingeniería de Proyectos, Centro Universitario de Ciencias Exactas e Ingenierías, Universidad de Guadalajara, Blvd-M. García Barragán 1421, Guadalajara 44410, Jalisco, Mexico
| | - Alex Guillen-Bonilla
- Departamento de Ciencias Computacionales e Ingenierías, Centro Universitario de los Valles, Universidad de Guadalajara, Carretera Guadalajara-Ameca Km. 45.5, Ameca 46600, Jalisco, Mexico
| | - Jorge Aguilar-Santiago
- Departamento de Ciencias Básicas, Centro Universitario de la Ciénega, Universidad de Guadalajara, Av. Universidad No. 1115, Lindavista, Ocotlán 47810, Jalisco, Mexico
| | - Maricela Jiménez-Rodríguez
- Departamento de Ciencias Básicas, Centro Universitario de la Ciénega, Universidad de Guadalajara, Av. Universidad No. 1115, Lindavista, Ocotlán 47810, Jalisco, Mexico
| |
Collapse
|
5
|
Sharma R, Mahanti GK, Panda G, Rath A, Dash S, Mallik S, Zhao Z. Comparative performance analysis of binary variants of FOX optimization algorithm with half-quadratic ensemble ranking method for thyroid cancer detection. Sci Rep 2023; 13:19598. [PMID: 37950041 PMCID: PMC10638362 DOI: 10.1038/s41598-023-46865-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2023] [Accepted: 11/06/2023] [Indexed: 11/12/2023] Open
Abstract
Thyroid cancer is a life-threatening condition that arises from the cells of the thyroid gland located in the neck's frontal region just below the adam's apple. While it is not as prevalent as other types of cancer, it ranks prominently among the commonly observed cancers affecting the endocrine system. Machine learning has emerged as a valuable medical diagnostics tool specifically for detecting thyroid abnormalities. Feature selection is of vital importance in the field of machine learning as it serves to decrease the data dimensionality and concentrate on the most pertinent features. This process improves model performance, reduces training time, and enhances interpretability. This study examined binary variants of FOX-optimization algorithms for feature selection. The study employed eight transfer functions (S and V shape) to convert the FOX-optimization algorithms into their binary versions. The vision transformer-based pre-trained models (DeiT and Swin Transformer) are used for feature extraction. The extracted features are transformed using locally linear embedding, and binary FOX-optimization algorithms are applied for feature selection in conjunction with the Naïve Bayes classifier. The study utilized two datasets (ultrasound and histopathological) related to thyroid cancer images. The benchmarking is performed using the half-quadratic theory-based ensemble ranking technique. Two TOPSIS-based methods (H-TOPSIS and A-TOPSIS) are employed for initial model ranking, followed by an ensemble technique for final ranking. The problem is treated as multi-objective optimization task with accuracy, F2-score, AUC-ROC and feature space size as optimization goals. The binary FOX-optimization algorithm based on the [Formula: see text] transfer function achieved superior performance compared to other variants using both datasets as well as feature extraction techniques. The proposed framework comprised a Swin transformer to extract features, a Fox optimization algorithm with a V1 transfer function for feature selection, and a Naïve Bayes classifier and obtained the best performance for both datasets. The best model achieved an accuracy of 94.75%, an AUC-ROC value of 0.9848, an F2-Score of 0.9365, an inference time of 0.0353 seconds, and selected 5 features for the ultrasound dataset. For the histopathological dataset, the diagnosis model achieved an overall accuracy of 89.71%, an AUC-ROC score of 0.9329, an F2-Score of 0.8760, an inference time of 0.05141 seconds, and selected 12 features. The proposed model achieved results comparable to existing research with small features space.
Collapse
Affiliation(s)
- Rohit Sharma
- Department of Electronics and Communication Engineering, NIT, Durgapur, 713209, India
| | - Gautam Kumar Mahanti
- Department of Electronics and Communication Engineering, NIT, Durgapur, 713209, India
| | - Ganapati Panda
- Department of Electronics and Communication Engineering, C.V. Raman Global University, Bhubaneswar, 752054, India
| | - Adyasha Rath
- Department of Computer Science and Engineering, C.V. Raman Global University, Bhubaneswar, 752054, India
| | - Sujata Dash
- Department of Information Technology, Nagaland University, Dimapur, India
| | - Saurav Mallik
- Department of Environmental Health, Harvard T. H. Chan School of Public Health, Boston, MA, USA.
| | - Zhongming Zhao
- Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, 77030, USA.
| |
Collapse
|