1
|
Lu H, Qi G, Wu D, Lin C, Ma S, Shi Y, Xue H. A novel feature extraction method based on dynamic handwriting for Parkinson's disease detection. PLoS One 2025; 20:e0318021. [PMID: 39854412 PMCID: PMC11760584 DOI: 10.1371/journal.pone.0318021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2024] [Accepted: 01/08/2025] [Indexed: 01/26/2025] Open
Abstract
Parkinson's disease (PD) is a common disease of the elderly. Given the easy accessibility of handwriting samples, many researchers have proposed handwriting-based detection methods for Parkinson's disease. Extracting more discriminative features from handwriting is an important step. Although many features have been proposed in previous researches, the insight analysis of the combination of handwriting's kinematic, pressure, and angle dynamic features is lacking. Moreover, most existing feature is incompletely represented, with feature information lost. Therefore, to solve the above problems, a new feature extraction approach for PD detection is proposed using handwriting. First, built on the kinematic, pressure, and angle dynamic features, we propose a moment feature by composed these three types of features, an overall representation of these three types of features information. Then, we proposed a feature extraction method to extract time-frequency-based statistical (TF-ST) features from dynamic handwriting features in terms of their temporal and frequency characteristics. Finally, we proposed an escape Coati Optimization Algorithm (eCOA) for global optimization to enhance classification performance. Self-constructed and public datasets are used to verify the proposed method's effectiveness respectively. The experimental results showed an accuracy of 97.95% and 98.67%, a sensitivity of 98.15% (average) and 97.78%, a specificity of 99.17% (average) and 100%, and an AUC of 98.66% (average) and 98.89%. The code is available at https://github.com/dreamhcy/MLforPD.
Collapse
Affiliation(s)
- Huimin Lu
- School of Computer Science and Engineering, Changchun University of Technology, Changchun, Jilin, China
- Jilin Provincial Smart Health Joint Innovation Laboratory for the New Generation of AI, Changchun Univerity of Technology, Changchun, Jilin, China
| | - Guolian Qi
- School of Computer Science and Engineering, Changchun University of Technology, Changchun, Jilin, China
- Jilin Provincial Smart Health Joint Innovation Laboratory for the New Generation of AI, Changchun Univerity of Technology, Changchun, Jilin, China
| | - Dalong Wu
- Affiliated Hospital to Changchun University of Chinese Medicine, Changchun, Jilin, China
| | - Chenglin Lin
- School of Computer Science and Engineering, Changchun University of Technology, Changchun, Jilin, China
- Jilin Provincial Smart Health Joint Innovation Laboratory for the New Generation of AI, Changchun Univerity of Technology, Changchun, Jilin, China
| | - Songzhe Ma
- School of Computer Science and Engineering, Changchun University of Technology, Changchun, Jilin, China
- Jilin Provincial Smart Health Joint Innovation Laboratory for the New Generation of AI, Changchun Univerity of Technology, Changchun, Jilin, China
| | - Yingqi Shi
- Changchun University of Chinese Medicine, Changchun, Jilin, China
| | - Han Xue
- School of Computer Science and Engineering, Changchun University of Technology, Changchun, Jilin, China
- Jilin Provincial Smart Health Joint Innovation Laboratory for the New Generation of AI, Changchun Univerity of Technology, Changchun, Jilin, China
| |
Collapse
|
2
|
Gürkan Kuntalp D, Özcan N, Düzyel O, Kababulut FY, Kuntalp M. A Comparative Study of Metaheuristic Feature Selection Algorithms for Respiratory Disease Classification. Diagnostics (Basel) 2024; 14:2244. [PMID: 39410648 PMCID: PMC11475976 DOI: 10.3390/diagnostics14192244] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2024] [Revised: 10/02/2024] [Accepted: 10/07/2024] [Indexed: 10/20/2024] Open
Abstract
The correct diagnosis and early treatment of respiratory diseases can significantly improve the health status of patients, reduce healthcare expenses, and enhance quality of life. Therefore, there has been extensive interest in developing automatic respiratory disease detection systems. Most recent methods for detecting respiratory disease use machine and deep learning algorithms. The success of these machine learning methods depends heavily on the selection of proper features to be used in the classifier. Although metaheuristic-based feature selection methods have been successful in addressing difficulties presented by high-dimensional medical data in various biomedical classification tasks, there is not much research on the utilization of metaheuristic methods in respiratory disease classification. This paper aims to conduct a detailed and comparative analysis of six widely used metaheuristic optimization methods using eight different transfer functions in respiratory disease classification. For this purpose, two different classification cases were examined: binary and multi-class. The findings demonstrate that metaheuristic algorithms using correct transfer functions could effectively reduce data dimensionality while enhancing classification accuracy.
Collapse
Affiliation(s)
- Damla Gürkan Kuntalp
- Department of Electrical and Electronics Engineering, Dokuz Eylül University, İzmir 35160, Türkiye;
| | - Nermin Özcan
- Department of Biomedical Engineering, İskenderun Technical University, İskenderun 31200, Türkiye;
| | - Okan Düzyel
- Department of Electrical and Electronics Engineering, İzmir Institute of Technology, İzmir 35433, Türkiye;
| | | | - Mehmet Kuntalp
- Department of Electrical and Electronics Engineering, Dokuz Eylül University, İzmir 35160, Türkiye;
| |
Collapse
|
3
|
Zhang L, Chen X. Social coevolution and Sine chaotic opposition learning Chimp Optimization Algorithm for feature selection. Sci Rep 2024; 14:15413. [PMID: 38965341 PMCID: PMC11224333 DOI: 10.1038/s41598-024-66285-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2024] [Accepted: 07/01/2024] [Indexed: 07/06/2024] Open
Abstract
Feature selection is a hot problem in machine learning. Swarm intelligence algorithms play an essential role in feature selection due to their excellent optimisation ability. The Chimp Optimisation Algorithm (CHoA) is a new type of swarm intelligence algorithm. It has quickly won widespread attention in the academic community due to its fast convergence speed and easy implementation. However, CHoA has specific challenges in balancing local and global search, limiting its optimisation accuracy and leading to premature convergence, thus affecting the algorithm's performance on feature selection tasks. This study proposes Social coevolution and Sine chaotic opposition learning Chimp Optimization Algorithm (SOSCHoA). SOSCHoA enhances inter-population interaction through social coevolution, improving local search. Additionally, it introduces sine chaotic opposition learning to increase population diversity and prevent local optima. Extensive experiments on 12 high-dimensional classification datasets demonstrate that SOSCHoA outperforms existing algorithms in classification accuracy, convergence, and stability. Although SOSCHoA shows advantages in handling high-dimensional datasets, there is room for future research and optimization, particularly concerning feature dimensionality reduction.
Collapse
Affiliation(s)
- Li Zhang
- College of Computer Engineering, Jiangsu University of Technology, Changzhou, 213001, People's Republic of China.
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, 130012, People's Republic of China.
| | - XiaoBo Chen
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, 130012, People's Republic of China
- People's Bank of China Changzhou City Center Branch, Jiangsu, 213001, Changzhou, People's Republic of China
| |
Collapse
|
4
|
Li M, Luo Q, Zhou Y. BGOA-TVG: Binary Grasshopper Optimization Algorithm with Time-Varying Gaussian Transfer Functions for Feature Selection. Biomimetics (Basel) 2024; 9:187. [PMID: 38534872 DOI: 10.3390/biomimetics9030187] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Revised: 03/09/2024] [Accepted: 03/11/2024] [Indexed: 03/28/2024] Open
Abstract
Feature selection aims to select crucial features to improve classification accuracy in machine learning and data mining. In this paper, a new binary grasshopper optimization algorithm using time-varying Gaussian transfer functions (BGOA-TVG) is proposed for feature selection. Compared with the traditional S-shaped and V-shaped transfer functions, the proposed Gaussian time-varying transfer functions have the characteristics of a fast convergence speed and a strong global search capability to convert a continuous search space to a binary one. The BGOA-TVG is tested and compared to S-shaped and V-shaped binary grasshopper optimization algorithms and five state-of-the-art swarm intelligence algorithms for feature selection. The experimental results show that the BGOA-TVG has better performance in UCI, DEAP, and EPILEPSY datasets for feature selection.
Collapse
Affiliation(s)
- Mengjun Li
- College of Artificial Intelligence, Guangxi Minzu University, Nanning 530006, China
| | - Qifang Luo
- College of Artificial Intelligence, Guangxi Minzu University, Nanning 530006, China
- Guangxi Key Laboratories of Hybrid Computation and IC Design Analysis, Nanning 530006, China
| | - Yongquan Zhou
- College of Artificial Intelligence, Guangxi Minzu University, Nanning 530006, China
- Guangxi Key Laboratories of Hybrid Computation and IC Design Analysis, Nanning 530006, China
- Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia, Bangi 43600, Selangor, Malaysia
| |
Collapse
|
5
|
Lee H, Lee Y, Jo M, Nam S, Jo J, Lee C. Enhancing Diagnosis of Rotating Elements in Roll-to-Roll Manufacturing Systems through Feature Selection Approach Considering Overlapping Data Density and Distance Analysis. SENSORS (BASEL, SWITZERLAND) 2023; 23:7857. [PMID: 37765913 PMCID: PMC10534779 DOI: 10.3390/s23187857] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 09/01/2023] [Accepted: 09/11/2023] [Indexed: 09/29/2023]
Abstract
Roll-to-roll manufacturing systems have been widely adopted for their cost-effectiveness, eco-friendliness, and mass-production capabilities, utilizing thin and flexible substrates. However, in these systems, defects in the rotating components such as the rollers and bearings can result in severe defects in the functional layers. Therefore, the development of an intelligent diagnostic model is crucial for effectively identifying these rotating component defects. In this study, a quantitative feature-selection method, feature partial density, to develop high-efficiency diagnostic models was proposed. The feature combinations extracted from the measured signals were evaluated based on the partial density, which is the density of the remaining data excluding the highest class in overlapping regions and the Mahalanobis distance by class to assess the classification performance of the models. The validity of the proposed algorithm was verified through the construction of ranked model groups and comparison with existing feature-selection methods. The high-ranking group selected by the algorithm outperformed the other groups in terms of training time, accuracy, and positive predictive value. Moreover, the top feature combination demonstrated superior performance across all indicators compared to existing methods.
Collapse
Affiliation(s)
- Haemi Lee
- Department of Mechanical Design and Production Engineering, Konkuk University, 120 Neungdong-ro, Gwangjin-gu, Seoul 05030, Republic of Korea
| | - Yoonjae Lee
- Department of Mechanical Design and Production Engineering, Konkuk University, 120 Neungdong-ro, Gwangjin-gu, Seoul 05030, Republic of Korea
| | - Minho Jo
- Department of Mechanical Design and Production Engineering, Konkuk University, 120 Neungdong-ro, Gwangjin-gu, Seoul 05030, Republic of Korea
| | - Sanghoon Nam
- Department of Mechanical Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Jeongdai Jo
- Department of Printed Electronics, Korea Institute of Machinery and Materials, 156, Gajeongbuk-ro, Yuseong-gu, Daejeon 34103, Republic of Korea
| | - Changwoo Lee
- Department of Mechanical and Aerospace Engineering, Konkuk University, 120 Neungdong-ro, Gwangjin-gu, Seoul 05030, Republic of Korea
| |
Collapse
|
6
|
Seyyedabbasi A. Binary Sand Cat Swarm Optimization Algorithm for Wrapper Feature Selection on Biological Data. Biomimetics (Basel) 2023; 8:310. [PMID: 37504198 PMCID: PMC10807367 DOI: 10.3390/biomimetics8030310] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2023] [Revised: 06/02/2023] [Accepted: 06/04/2023] [Indexed: 07/29/2023] Open
Abstract
In large datasets, irrelevant, redundant, and noisy attributes are often present. These attributes can have a negative impact on the classification model accuracy. Therefore, feature selection is an effective pre-processing step intended to enhance the classification performance by choosing a small number of relevant or significant features. It is important to note that due to the NP-hard characteristics of feature selection, the search agent can become trapped in the local optima, which is extremely costly in terms of time and complexity. To solve these problems, an efficient and effective global search method is needed. Sand cat swarm optimization (SCSO) is a newly introduced metaheuristic algorithm that solves global optimization algorithms. Nevertheless, the SCSO algorithm is recommended for continuous problems. bSCSO is a binary version of the SCSO algorithm proposed here for the analysis and solution of discrete problems such as wrapper feature selection in biological data. It was evaluated on ten well-known biological datasets to determine the effectiveness of the bSCSO algorithm. Moreover, the proposed algorithm was compared to four recent binary optimization algorithms to determine which algorithm had better efficiency. A number of findings demonstrated the superiority of the proposed approach both in terms of high prediction accuracy and small feature sizes.
Collapse
Affiliation(s)
- Amir Seyyedabbasi
- Software Engineering Department, Faculty of Engineering and Natural Science, Istinye University, 34396 Istanbul, Turkey
| |
Collapse
|
7
|
Dokeroglu T. A new parallel multi-objective Harris hawk algorithm for predicting the mortality of COVID-19 patients. PeerJ Comput Sci 2023; 9:e1430. [PMID: 37346714 PMCID: PMC10280461 DOI: 10.7717/peerj-cs.1430] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2023] [Accepted: 05/18/2023] [Indexed: 06/23/2023]
Abstract
Harris' Hawk Optimization (HHO) is a novel metaheuristic inspired by the collective hunting behaviors of hawks. This technique employs the flight patterns of hawks to produce (near)-optimal solutions, enhanced with feature selection, for challenging classification problems. In this study, we propose a new parallel multi-objective HHO algorithm for predicting the mortality risk of COVID-19 patients based on their symptoms. There are two objectives in this optimization problem: to reduce the number of features while increasing the accuracy of the predictions. We conduct comprehensive experiments on a recent real-world COVID-19 dataset from Kaggle. An augmented version of the COVID-19 dataset is also generated and experimentally shown to improve the quality of the solutions. Significant improvements are observed compared to existing state-of-the-art metaheuristic wrapper algorithms. We report better classification results with feature selection than when using the entire set of features. During experiments, a 98.15% prediction accuracy with a 45% reduction is achieved in the number of features. We successfully obtained new best solutions for this COVID-19 dataset.
Collapse
Affiliation(s)
- Tansel Dokeroglu
- Cankaya University, Software Engineering Department, Ankara, Turkey
| |
Collapse
|
8
|
Doumari SA, Berahmand K, Ebadi MJ. Early and High-Accuracy Diagnosis of Parkinson's Disease: Outcomes of a New Model. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2023; 2023:1493676. [PMID: 37304324 PMCID: PMC10256450 DOI: 10.1155/2023/1493676] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/03/2022] [Revised: 03/02/2023] [Accepted: 03/06/2023] [Indexed: 06/13/2023]
Abstract
Parkinson's disease (PD) is one of the significant common neurological disorders of the current age that causes uncontrollable movements like shaking, stiffness, and difficulty. The early clinical diagnosis of this disease is essential for preventing the progression of PD. Hence, an innovative method is proposed here based on combining the crow search algorithm and decision tree (CSADT) for the early PD diagnosis. This approach is used on four crucial Parkinson's datasets, including meander, spiral, voice, and speech-Sakar. Using the presented method, PD is effectively diagnosed by evaluating each dataset's critical features and extracting the primary practical outcomes. The used algorithm was compared with other machine learning algorithms of k-nearest neighbor (KNN), support vector machine (SVM), naive Baye (NB), multilayer perceptron (MLP), decision tree (DT), random tree, logistic regression, support vector machine of radial base functions (SVM of RBFs), and combined classifier in terms of accuracy, recall, and combination measure F1. The analytical results emphasize the used algorithm's superiority over the other selected ones. The proposed model yields nearly 100% accuracy through various trials on the datasets. Notably, a high detection speed achieved the lowest detection time of 2.6 seconds. The main novelty of this paper is attributed to the accuracy of the presented PD diagnosis method, which is much higher than its counterparts.
Collapse
Affiliation(s)
- Sajjad Amiri Doumari
- Department of Mathematics and Computer Science, Sirjan University of Technology, Sirjan, Iran
| | - Kamal Berahmand
- Department of Information Technology and Communications, Azarbaijan Shahid Madani University, Tabriz, Iran
| | - M. J. Ebadi
- Department of Mathematics, Chabahar Maritime University, Chabahar, Iran
| |
Collapse
|
9
|
Low-Carbon and Low-Energy-Consumption Gear Processing Route Optimization Based on Gray Wolf Algorithm. Processes (Basel) 2022. [DOI: 10.3390/pr10122585] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
The process of gear machining consumes a large amount of energy and causes serious pollution to the environment. Developing a proper process route of gear machining is the key to conserving energy and reducing emissions. Nowadays, the proper process route of gear machining is based on experience and is difficult to keep up with the development of modern times. In this article, a calculation model of low-carbon and low-energy consumption in gear machining processes was established based on an analysis of the machining process. With processing parameters as independent variables, the grey wolf algorithm was used to solve the problem. The effectiveness of the method was proven by an example of the machining process of an automobile transmission shaft.
Collapse
|
10
|
Al-qaness MAA, Helmi AM, Dahou A, Elaziz MA. The Applications of Metaheuristics for Human Activity Recognition and Fall Detection Using Wearable Sensors: A Comprehensive Analysis. BIOSENSORS 2022; 12:821. [PMID: 36290958 PMCID: PMC9599938 DOI: 10.3390/bios12100821] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/31/2022] [Revised: 09/25/2022] [Accepted: 09/28/2022] [Indexed: 06/16/2023]
Abstract
In this paper, we study the applications of metaheuristics (MH) optimization algorithms in human activity recognition (HAR) and fall detection based on sensor data. It is known that MH algorithms have been utilized in complex engineering and optimization problems, including feature selection (FS). Thus, in this regard, this paper used nine MH algorithms as FS methods to boost the classification accuracy of the HAR and fall detection applications. The applied MH were the Aquila optimizer (AO), arithmetic optimization algorithm (AOA), marine predators algorithm (MPA), artificial bee colony (ABC) algorithm, genetic algorithm (GA), slime mold algorithm (SMA), grey wolf optimizer (GWO), whale optimization algorithm (WOA), and particle swarm optimization algorithm (PSO). First, we applied efficient prepossessing and segmentation methods to reveal the motion patterns and reduce the time complexities. Second, we developed a light feature extraction technique using advanced deep learning approaches. The developed model was ResRNN and was composed of several building blocks from deep learning networks including convolution neural networks (CNN), residual networks, and bidirectional recurrent neural networks (BiRNN). Third, we applied the mentioned MH algorithms to select the optimal features and boost classification accuracy. Finally, the support vector machine and random forest classifiers were employed to classify each activity in the case of multi-classification and to detect fall and non-fall actions in the case of binary classification. We used seven different and complex datasets for the multi-classification case: the PAMMP2, Sis-Fall, UniMiB SHAR, OPPORTUNITY, WISDM, UCI-HAR, and KU-HAR datasets. In addition, we used the Sis-Fall dataset for the binary classification (fall detection). We compared the results of the nine MH optimization methods using different performance indicators. We concluded that MH optimization algorithms had promising performance in HAR and fall detection applications.
Collapse
Affiliation(s)
- Mohammed A. A. Al-qaness
- College of Physics and Electronic Information Engineering, Zhejiang Normal University, Jinhua 321004, China
| | - Ahmed M. Helmi
- College of Engineering and Information Technology, Buraydah Private Colleges, Buraydah 51418, Saudi Arabia
- Computer and Systems Engineering Department, Faculty of Engineering, Zagazig University, Zagazig 44519, Egypt
| | - Abdelghani Dahou
- Mathematics and Computer Science Department, University of Ahmed DRAIA, Adrar 01000, Algeria
- LDDI Laboratory, Faculty of Science and Technology, University of Ahmed DRAIA, Adrar 01000, Algeria
| | - Mohamed Abd Elaziz
- Department of Mathematics, Faculty of Science, Zagazig University, Zagazig 44519, Egypt
- Artificial Intelligence Research Center (AIRC), College of Engineering and Information Technology, Ajman University, Ajman 346, United Arab Emirates
- Faculty of Computer Science and Engineering, Galala University, Suez 435611, Egypt
- Department of Electrical and Computer Engineering, Lebanese American University, Byblos 13-5053, Lebanon
| |
Collapse
|