1
|
Alweshah M, Aldabbas Y, Abu-Salih B, Oqeil S, Hasan HS, Alkhalaileh S, Kassaymeh S. Hybrid black widow optimization with iterated greedy algorithm for gene selection problems. Heliyon 2023; 9:e20133. [PMID: 37809602 PMCID: PMC10559925 DOI: 10.1016/j.heliyon.2023.e20133] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2023] [Revised: 09/03/2023] [Accepted: 09/12/2023] [Indexed: 10/10/2023] Open
Abstract
Gene Selection (GS) is a strategy method targeted at reducing redundancy, limited expressiveness, and low informativeness in gene expression datasets obtained by DNA Microarray technology. These datasets contain a plethora of diverse and high-dimensional samples and genes, with a significant discrepancy in the number of samples and genes present. The complexities of GS are especially noticeable in the context of microarray expression data analysis, owing to the inherent data imbalance. The main goal of this study is to offer a simplified and computationally effective approach to dealing with the conundrum of attribute selection in microarray gene expression data. We use the Black Widow Optimization algorithm (BWO) in the context of GS to achieve this, using two unique methodologies: the unaltered BWO variation and the hybridized BWO variant combined with the Iterated Greedy algorithm (BWO-IG). By improving the local search capabilities of BWO, this hybridization attempts to promote more efficient gene selection. A series of tests was carried out using nine benchmark datasets that were obtained from the gene expression data repository in the pursuit of empirical validation. The results of these tests conclusively show that the BWO-IG technique performs better than the traditional BWO algorithm. Notably, the hybridized BWO-IG technique excels in the efficiency of local searches, making it easier to identify relevant genes and producing findings with higher levels of reliability in terms of accuracy and the degree of gene pruning. Additionally, a comparison analysis is done against five modern wrapper Feature Selection (FS) methodologies, namely BIMFOHHO, BMFO, BHHO, BCS, and BBA, in order to put the suggested BWO-IG method's effectiveness into context. The comparison that follows highlights BWO-IG's obvious superiority in reducing the number of selected genes while also obtaining remarkably high classification accuracy. The key findings were an average classification accuracy of 94.426, average fitness values of 0.061, and an average number of selected genes of 2933.767.
Collapse
Affiliation(s)
- Mohammed Alweshah
- Prince Abdullah Bin Ghazi Faculty of Information and Communication Technology, Al-Balqa Applied University, Al-Salt, Jordan
| | - Yasmeen Aldabbas
- Prince Abdullah Bin Ghazi Faculty of Information and Communication Technology, Al-Balqa Applied University, Al-Salt, Jordan
| | - Bilal Abu-Salih
- Department of Computer Science, King Abdullah II School of Information Technology, The University of Jordan, Amman, Jordan
| | - Saleh Oqeil
- Prince Abdullah Bin Ghazi Faculty of Information and Communication Technology, Al-Balqa Applied University, Al-Salt, Jordan
| | - Hazem S. Hasan
- Department of Plant Production and Protection, Faculty of Agricultural Technology, Al-Balqa Applied University, Al-Salt, Jordan
| | - Saleh Alkhalaileh
- Prince Abdullah Bin Ghazi Faculty of Information and Communication Technology, Al-Balqa Applied University, Al-Salt, Jordan
| | - Sofian Kassaymeh
- Software Engineering Department, Faculty of Information Technology, Aqaba University of Technology, Aqaba, Jordan
| |
Collapse
|
2
|
Pan JS, Hu P, Snášel V, Chu SC. A survey on binary metaheuristic algorithms and their engineering applications. Artif Intell Rev 2022; 56:6101-6167. [PMID: 36466763 PMCID: PMC9684803 DOI: 10.1007/s10462-022-10328-9] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
This article presents a comprehensively state-of-the-art investigation of the engineering applications utilized by binary metaheuristic algorithms. Surveyed work is categorized based on application scenarios and solution encoding, and describes these algorithms in detail to help researchers choose appropriate methods to solve related applications. It is seen that transfer function is the main binary coding of metaheuristic algorithms, which usually adopts Sigmoid function. Among the contributions presented, there were different implementations and applications of metaheuristic algorithms, or the study of engineering applications by different objective functions such as the single- and multi-objective problems of feature selection, scheduling, layout and engineering structure optimization. The article identifies current troubles and challenges by the conducted review, and discusses that novel binary algorithm, transfer function, benchmark function, time-consuming problem and application integration are need to be resolved in future.
Collapse
Affiliation(s)
- Jeng-Shyang Pan
- College of Computer Science and Engineering, Shandong University of Science and Technology, Qingdao, 266590 Shandong China
- Department of Information Management, Chaoyang University of Technology, Taichung, 413310 Taiwan
| | - Pei Hu
- Department of Information Management, Chaoyang University of Technology, Taichung, 413310 Taiwan
- School of Computer Science and Software Engineering, Nanyang Institute of Technology, Nanyang, 473004 Henan China
| | - Václav Snášel
- Faculty of Electrical Engineering and Computer Science, VŠB—Technical University of Ostrava, Ostrava, 70032 Moravskoslezský kraj Czech Republic
| | - Shu-Chuan Chu
- College of Computer Science and Engineering, Shandong University of Science and Technology, Qingdao, 266590 Shandong China
| |
Collapse
|
3
|
Dokeroglu T, Deniz A, Kiziloz HE. A comprehensive survey on recent metaheuristics for feature selection. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2022.04.083] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
|
4
|
Liang S, Fang Z, Sun G, Qu G. Biogeography-based optimization with adaptive migration and adaptive mutation with its application in sidelobe reduction of antenna arrays. Appl Soft Comput 2022. [DOI: 10.1016/j.asoc.2022.108772] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
|
5
|
Constructing Features Using a Hybrid Genetic Algorithm. SIGNALS 2022. [DOI: 10.3390/signals3020012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
A hybrid procedure that incorporates grammatical evolution and a weight decaying technique is proposed here for various classification and regression problems. The proposed method has two main phases: the creation of features and the evaluation of these features. During the first phase, using grammatical evolution, new features are created as non-linear combinations of the original features of the datasets. In the second phase, based on the characteristics of the first phase, the original dataset is modified and a neural network trained with a genetic algorithm is applied to this dataset. The proposed method was applied to an extremely wide set of datasets from the relevant literature and the experimental results were compared with four other techniques.
Collapse
|
6
|
Na X, Han M, Ren W, Zhong K. Modified BBO-Based Multivariate Time-Series Prediction System With Feature Subset Selection and Model Parameter Optimization. IEEE TRANSACTIONS ON CYBERNETICS 2022; 52:2163-2173. [PMID: 32639932 DOI: 10.1109/tcyb.2020.2977375] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Multivariate time-series prediction is a challenging research topic in the field of time-series analysis and modeling, and is continually under research. The echo state network (ESN), a type of efficient recurrent neural network, has been widely used in time-series prediction, but when using ESN, two crucial problems have to be confronted: 1) how to select the optimal subset of input features and 2) how to set the suitable parameters of the model. To solve this problem, the modified biogeography-based optimization ESN (MBBO-ESN) system is proposed for system modeling and multivariate time-series prediction, which can simultaneously achieve feature subset selection and model parameter optimization. The proposed MBBO algorithm is an improved evolutionary algorithm based on biogeography-based optimization (BBO), which utilizes an S -type population migration rate model, a covariance matrix migration strategy, and a Lévy distribution mutation strategy to enhance the rotation invariance and exploration ability. Furthermore, the MBBO algorithm cannot only optimize the key parameters of the ESN model but also uses a hybrid-metric feature selection method to remove the redundancies and distinguish the importance of the input features. Compared with the traditional methods, the proposed MBBO-ESN system can discover the relationship between the input features and the model parameters automatically and make the prediction more accurate. The experimental results on the benchmark and real-world datasets demonstrate that MBBO outperforms the other traditional evolutionary algorithms, and the MBBO-ESN system is more competitive in multivariate time-series prediction than other classic machine-learning models.
Collapse
|
7
|
Pashaei E, Pashaei E. An efficient binary chimp optimization algorithm for feature selection in biomedical data classification. Neural Comput Appl 2022. [DOI: 10.1007/s00521-021-06775-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
|
8
|
Wang Y, Ma Z, Wong KC, Li X. Evolving Multiobjective Cancer Subtype Diagnosis From Cancer Gene Expression Data. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021; 18:2431-2444. [PMID: 32086219 DOI: 10.1109/tcbb.2020.2974953] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Detection and diagnosis of cancer are especially essential for early prevention and effective treatments. Many studies have been proposed to tackle the subtype diagnosis problems with those data, which often suffer from low diagnostic ability and bad generalization. This article studies a multiobjective PSO-based hybrid algorithm (MOPSOHA) to optimize four objectives including the number of features, the accuracy, and two entropy-based measures: the relevance and the redundancy simultaneously, diagnosing the cancer data with high classification power and robustness. First, we propose a novel binary encoding strategy to choose informative gene subsets to optimize those objective functions. Second, a mutation operator is designed to enhance the exploration capability of the swarm. Finally, a local search method based on the "best/1" mutation operator of differential evolutionary algorithm (DE) is employed to exploit the neighborhood area with sparse high-quality solutions since the base vector always approaches to some good promising areas. In order to demonstrate the effectiveness of MOPSOHA, it is tested on 41 cancer datasets including thirty-five cancer gene expression datasets and six independent disease datasets. Compared MOPSOHA with other state-of-the-art algorithms, the performance of MOPSOHA is superior to other algorithms in most of the benchmark datasets.
Collapse
|
9
|
Abstract
The problems of gene regulatory network (GRN) reconstruction and the creation of disease diagnostic effective systems based on genes expression data are some of the current directions of modern bioinformatics. In this manuscript, we present the results of the research focused on the evaluation of the effectiveness of the most used metrics to estimate the gene expression profiles’ proximity, which can be used to extract the groups of informative gene expression profiles while taking into account the states of the investigated samples. Symmetry is very important in the field of both genes’ and/or proteins’ interaction since it undergirds essentially all interactions between molecular components in the GRN and extraction of gene expression profiles, which allows us to identify how the investigated biological objects (disease, state of patients, etc.) contribute to the further reconstruction of GRN in terms of both the symmetry and understanding the mechanism of molecular element interaction in a biological organism. Within the framework of our research, we have investigated the following metrics: Mutual information maximization (MIM) using various methods of Shannon entropy calculation, Pearson’s χ2 test and correlation distance. The accuracy of the investigated samples classification was used as the main quality criterion to evaluate the appropriate metric effectiveness. The random forest classifier (RF) was used during the simulation process. The research results have shown that results of the use of various methods of Shannon entropy within the framework of the MIM metric disagree with each other. As a result, we have proposed the modified mutual information maximization (MMIM) proximity metric based on the joint use of various methods of Shannon entropy calculation and the Harrington desirability function. The results of the simulation have also shown that the correlation proximity metric is less effective in comparison to both the MMIM metric and Pearson’s χ2 test. Finally, we propose the hybrid proximity metric (HPM) that considers both the MMIM metric and Pearson’s χ2 test. The proposed metric was investigated within the framework of one-cluster structure effectiveness evaluation. To our mind, the main benefit of the proposed HPM is in increasing the objectivity of mutually similar gene expression profiles extraction due to the joint use of the various effective proximity metrics that can contradict with each other when they are used alone.
Collapse
|
10
|
Xiong G, Yuan X, Mohamed AW, Zhang J. Fault section diagnosis of power systems with logical operation binary gaining‐sharing knowledge‐based algorithm. INT J INTELL SYST 2021. [DOI: 10.1002/int.22659] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Affiliation(s)
- Guojiang Xiong
- Guizhou Key Laboratory of Intelligent Technology in Power System College of Electrical Engineering, Guizhou University Guiyang China
| | - Xufeng Yuan
- Guizhou Key Laboratory of Intelligent Technology in Power System College of Electrical Engineering, Guizhou University Guiyang China
| | - Ali Wagdy Mohamed
- Operations Research Department, Faculty of Graduate Studies for Statistical Research Cairo University Giza Egypt
- Department of Mathematics and Actuarial Science The American University in Cairo New Cairo Egypt
| | - Jing Zhang
- Guizhou Key Laboratory of Intelligent Technology in Power System College of Electrical Engineering, Guizhou University Guiyang China
| |
Collapse
|
11
|
A novel bio-inspired hybrid multi-filter wrapper gene selection method with ensemble classifier for microarray data. Neural Comput Appl 2021; 35:11531-11561. [PMID: 34539088 PMCID: PMC8435304 DOI: 10.1007/s00521-021-06459-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2020] [Accepted: 08/26/2021] [Indexed: 01/04/2023]
Abstract
Microarray technology is known as one of the most important tools for collecting DNA expression data. This technology allows researchers to investigate and examine types of diseases and their origins. However, microarray data are often associated with a small sample size, a significant number of genes, imbalanced data, etc., making classification models inefficient. Thus, a new hybrid solution based on a multi-filter and adaptive chaotic multi-objective forest optimization algorithm (AC-MOFOA) is presented to solve the gene selection problem and construct the Ensemble Classifier. In the proposed solution, a multi-filter model (i.e., ensemble filter) is proposed as preprocessing step to reduce the dataset's dimensions, using a combination of five filter methods to remove redundant and irrelevant genes. Accordingly, the results of the five filter methods are combined using a voting-based function. Additionally, the results of the proposed multi-filter indicate that it has good capability in reducing the gene subset size and selecting relevant genes. Then, an AC-MOFOA based on the concepts of non-dominated sorting, crowding distance, chaos theory, and adaptive operators is presented. AC-MOFOA as a wrapper method aimed at reducing dataset dimensions, optimizing KELM, and increasing the accuracy of the classification, simultaneously. Next, in this method, an ensemble classifier model is presented using AC-MOFOA results to classify microarray data. The performance of the proposed algorithm was evaluated on nine public microarray datasets, and its results were compared in terms of the number of selected genes, classification efficiency, execution time, time complexity, hypervolume indicator, and spacing metric with five hybrid multi-objective methods, and three hybrid single-objective methods. According to the results, the proposed hybrid method could increase the accuracy of the KELM in most datasets by reducing the dataset's dimensions and achieve similar or superior performance compared to other multi-objective methods. Furthermore, the proposed Ensemble Classifier model could provide better classification accuracy and generalizability in the seven of nine microarray datasets compared to conventional ensemble methods. Moreover, the comparison results of the Ensemble Classifier model with three state-of-the-art ensemble generation methods indicate its competitive performance in which the proposed ensemble model achieved better results in the five of nine datasets.
Collapse
|
12
|
Ma S, Wang Y, Zhang S. Modified chemical reaction optimization and its application in engineering problems. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2021; 18:7143-7160. [PMID: 34814243 DOI: 10.3934/mbe.2021354] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Chemical Reaction Optimization (CRO) is a simple and efficient evolutionary optimization algorithm by simulating chemical reactions. As far as the current research is concerned, the algorithm has been successfully used for solving a number of real-world optimization tasks. In our paper, a new real encoded chemical reaction optimization algorithm is proposed to boost the efficiency of the optimization operations in standard chemical reactions optimization algorithm. Inspired by the evolutionary operation of the differential evolution algorithm, an improved search operation mechanism is proposed based on the underlying operation. It is modeled to further explore the search space of the algorithm under the best individuals. Afterwards, to control the perturbation frequency of the search strategy, the modification rate is increased to balance between the exploration ability and mining ability of the algorithm. Meanwhile, we also propose a new population initialization method that incorporates several models to produce high-quality initialized populations. To validate the effectiveness of the algorithm, nine unconstrained optimization algorithms are used as benchmark functions. As observed from the experimental results, it is evident that the proposed algorithm is significantly better than the standard chemical reaction algorithm and other evolutionary optimization algorithms. Then, we also apply the proposed model to address the synthesis problem of two antenna array synthesis. The results also reveal that the proposed algorithm is superior to other approaches from different perspectives.
Collapse
Affiliation(s)
- Shijing Ma
- School of Physical Education, Northeast Normal University, Changchun 130022, China
| | - Yunhe Wang
- School of Artificial Intelligence, Hebei University of Technology, Tianjin 300401, China
| | - Shouwei Zhang
- School of Physical Education, Northeast Normal University, Changchun 130022, China
| |
Collapse
|
13
|
Zheng Y, Wu J, Wang B. CLGBO: An Algorithm for Constructing Highly Robust Coding Sets for DNA Storage. Front Genet 2021; 12:644945. [PMID: 34017354 PMCID: PMC8129200 DOI: 10.3389/fgene.2021.644945] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2020] [Accepted: 04/08/2021] [Indexed: 11/22/2022] Open
Abstract
In the era of big data, new storage media are urgently needed because the storage capacity for global data cannot meet the exponential growth of information. Deoxyribonucleic acid (DNA) storage, where primer and address sequences play a crucial role, is one of the most promising storage media because of its high density, large capacity and durability. In this study, we describe an enhanced gradient-based optimizer that includes the Cauchy and Levy mutation strategy (CLGBO) to construct DNA coding sets, which are used as primer and address libraries. Our experimental results show that the lower bounds of DNA storage coding sets obtained using the CLGBO algorithm are increased by 4.3–13.5% compared with previous work. The non-adjacent subsequence constraint was introduced to reduce the error rate in the storage process. This helps to resolve the problem that arises when consecutive repetitive subsequences in the sequence cause errors in DNA storage. We made use of the CLGBO algorithm and the non-adjacent subsequence constraint to construct larger and more highly robust coding sets.
Collapse
Affiliation(s)
- Yanfen Zheng
- The Key Laboratory of Advanced Design and Intelligent Computing, Ministry of Education, School of Software Engineering, Dalian University, Dalian, China
| | - Jieqiong Wu
- The Key Laboratory of Advanced Design and Intelligent Computing, Ministry of Education, School of Software Engineering, Dalian University, Dalian, China
| | - Bin Wang
- The Key Laboratory of Advanced Design and Intelligent Computing, Ministry of Education, School of Software Engineering, Dalian University, Dalian, China
| |
Collapse
|
14
|
Li X, Zhang S, Wong KC. Deep embedded clustering with multiple objectives on scRNA-seq data. Brief Bioinform 2021; 22:6209682. [PMID: 33822877 DOI: 10.1093/bib/bbab090] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2020] [Revised: 02/17/2021] [Accepted: 02/25/2021] [Indexed: 12/19/2022] Open
Abstract
In recent years, single-cell RNA sequencing (scRNA-seq) technologies have been widely adopted to interrogate gene expression of individual cells; it brings opportunities to understand the underlying processes in a high-throughput manner. Deep embedded clustering (DEC) was demonstrated successful in high-dimensional sparse scRNA-seq data by joint feature learning and cluster assignment for identifying cell types simultaneously. However, the deep network architecture for embedding clustering is not trivial to optimize. Therefore, we propose an evolutionary multiobjective DEC by synergizing the multiobjective evolutionary optimization to simultaneously evolve the hyperparameters and architectures of DEC in an automatic manner. Firstly, a denoising autoencoder is integrated into the DEC to project the high-dimensional sparse scRNA-seq data into a low-dimensional space. After that, to guide the evolution, three objective functions are formulated to balance the model's generality and clustering performance for robustness. Meanwhile, migration and mutation operators are proposed to optimize the objective functions to select the suitable hyperparameters and architectures of DEC in the multiobjective framework. Multiple comparison analyses are conducted on twenty synthetic data and eight real data from different representative single-cell sequencing platforms to validate the effectiveness. The experimental results reveal that the proposed algorithm outperforms other state-of-the-art clustering methods under different metrics. Meanwhile, marker genes identification, gene ontology enrichment and pathology analysis are conducted to reveal novel insights into the cell type identification and characterization mechanisms.
Collapse
Affiliation(s)
- Xiangtao Li
- School of Artificial Intelligence Jilin University, Jilin, China
| | - Shixiong Zhang
- Department of Computer science City University of Hong Kong, Hong Kong SAR
| | - Ka-Chun Wong
- Department of Computer science City University of Hong Kong, Hong Kong SAR
| |
Collapse
|
15
|
Gao XZ, Nalluri MSR, Kannan K, Sinharoy D. Multi-objective optimization of feature selection using hybrid cat swarm optimization. SCIENCE CHINA TECHNOLOGICAL SCIENCES 2021; 64:508-520. [DOI: 10.1007/s11431-019-1607-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/16/2019] [Accepted: 04/17/2020] [Indexed: 01/04/2025]
|
16
|
Albashish D, Hammouri AI, Braik M, Atwan J, Sahran S. Binary biogeography-based optimization based SVM-RFE for feature selection. Appl Soft Comput 2021. [DOI: 10.1016/j.asoc.2020.107026] [Citation(s) in RCA: 40] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
|
17
|
Lai CM, Huang HP. A gene selection algorithm using simplified swarm optimization with multi-filter ensemble technique. Appl Soft Comput 2021. [DOI: 10.1016/j.asoc.2020.106994] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
|
18
|
Multi-variant differential evolution algorithm for feature selection. Sci Rep 2020; 10:17261. [PMID: 33057120 PMCID: PMC7560894 DOI: 10.1038/s41598-020-74228-0] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2019] [Accepted: 09/28/2020] [Indexed: 11/29/2022] Open
Abstract
This work introduces a new population-based stochastic search technique, named multi-variant differential evolution (MVDE) algorithm for solving fifteen well-known real world problems from UCI repository and compared to four popular optimization methods. The MVDE proposes a new self-adaptive scaling factor based on cosine and logistic distributions as an almost factor-free optimization technique. For more updated chances, this factor is binary-mapped by incorporating an adaptive crossover operator. During the evolution, both greedy and less-greedy variants are managed by adjusting and incorporating the binary scaling factor and elite identification mechanism into a new multi-mutation crossover process through a number of sequentially evolutionary phases. Feature selection decreases the number of features by eliminating irrelevant or misleading, noisy and redundant data which can accelerate the process of classification. In this paper, a new feature selection algorithm based on the MVDE method and artificial neural network is presented which enabled MVDE to get a combination features’ set, accelerate the accuracy of the classification, and optimize both the structure and weights of Artificial Neural Network (ANN) simultaneously. The experimental results show the encouraging behavior of the proposed algorithm in terms of the classification accuracies and optimal number of feature selection.
Collapse
|
19
|
Al-Betar MA, Alomari OA, Abu-Romman SM. A TRIZ-inspired bat algorithm for gene selection in cancer classification. Genomics 2020; 112:114-126. [DOI: 10.1016/j.ygeno.2019.09.015] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2019] [Revised: 09/05/2019] [Accepted: 09/17/2019] [Indexed: 10/25/2022]
|
20
|
|
21
|
Brankovic A, Hosseini M, Piroddi L. A Distributed Feature Selection Algorithm Based on Distance Correlation with an Application to Microarrays. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2019; 16:1802-1815. [PMID: 29993889 DOI: 10.1109/tcbb.2018.2833482] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
DNA microarray datasets are characterized by a large number of features with very few samples, which is a typical cause of overfitting and poor generalization in the classification task. Here, we introduce a novel feature selection (FS) approach which employs the distance correlation (dCor) as a criterion for evaluating the dependence of the class on a given feature subset. The dCor index provides a reliable dependence measure among random vectors of arbitrary dimension, without any assumption on their distribution. Moreover, it is sensitive to the presence of redundant terms. The proposed FS method is based on a probabilistic representation of the feature subset model, which is progressively refined by a repeated process of model extraction and evaluation. A key element of the approach is a distributed optimization scheme based on a vertical partitioning of the dataset, which alleviates the negative effects of its unbalanced dimensions. The proposed method has been tested on several microarray datasets, resulting in quite compact and accurate models obtained at a reasonable computational cost.
Collapse
|
22
|
Shukla AK, Singh P, Vardhan M. A New Hybrid Feature Subset Selection Framework Based on Binary Genetic Algorithm and Information Theory. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS 2019. [DOI: 10.1142/s1469026819500202] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
The explosion of the high-dimensional dataset in the scientific repository has been encouraging interdisciplinary research on data mining, pattern recognition and bioinformatics. The fundamental problem of the individual Feature Selection (FS) method is extracting informative features for classification model and to seek for the malignant disease at low computational cost. In addition, existing FS approaches overlook the fact that for a given cardinality, there can be several subsets with similar information. This paper introduces a novel hybrid FS algorithm, called Filter-Wrapper Feature Selection (FWFS) for a classification problem and also addresses the limitations of existing methods. In the proposed model, the front-end filter ranking method as Conditional Mutual Information Maximization (CMIM) selects the high ranked feature subset while the succeeding method as Binary Genetic Algorithm (BGA) accelerates the search in identifying the significant feature subsets. One of the merits of the proposed method is that, unlike an exhaustive method, it speeds up the FS procedure without lancing of classification accuracy on reduced dataset when a learning model is applied to the selected subsets of features. The efficacy of the proposed (FWFS) method is examined by Naive Bayes (NB) classifier which works as a fitness function. The effectiveness of the selected feature subset is evaluated using numerous classifiers on five biological datasets and five UCI datasets of a varied dimensionality and number of instances. The experimental results emphasize that the proposed method provides additional support to the significant reduction of the features and outperforms the existing methods. For microarray data-sets, we found the lowest classification accuracy is 61.24% on SRBCT dataset and highest accuracy is 99.32% on Diffuse large B-cell lymphoma (DLBCL). In UCI datasets, the lowest classification accuracy is 40.04% on the Lymphography using k-nearest neighbor (k-NN) and highest classification accuracy is 99.05% on the ionosphere using support vector machine (SVM).
Collapse
Affiliation(s)
| | - Pradeep Singh
- Department of Computer Science & Engineering, Raipur, India
| | - Manu Vardhan
- Department of Computer Science & Engineering, Raipur, India
| |
Collapse
|
23
|
A novel filter–wrapper hybrid greedy ensemble approach optimized using the genetic algorithm to reduce the dimensionality of high-dimensional biomedical datasets. Appl Soft Comput 2019. [DOI: 10.1016/j.asoc.2019.105538] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
24
|
Ramya J, Somasundareswari D, Vijayalakshmi P. Gas chimney and hydrocarbon detection using combined BBO and artificial neural network with hybrid seismic attributes. Soft comput 2019. [DOI: 10.1007/s00500-019-04064-6] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
25
|
Classification of high dimensional biomedical data based on feature selection using redundant removal. PLoS One 2019; 14:e0214406. [PMID: 30964868 PMCID: PMC6456288 DOI: 10.1371/journal.pone.0214406] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2018] [Accepted: 03/12/2019] [Indexed: 11/26/2022] Open
Abstract
High dimensional biomedical data contain tens of thousands of features, accurate and effective identification of the core features in these data can be used to assist diagnose related diseases. However, there are often a large number of irrelevant or redundant features in biomedical data, which seriously affect subsequent classification accuracy and machine learning efficiency. To solve this problem, a novel filter feature selection algorithm based on redundant removal (FSBRR) is proposed to classify high dimensional biomedical data in this paper. First of all, two redundant criteria are determined by vertical relevance (the relationship between feature and class attribute) and horizontal relevance (the relationship between feature and feature). Secondly, to quantify redundant criteria, an approximate redundancy feature framework based on mutual information (MI) is defined to remove redundant and irrelevant features. To evaluate the effectiveness of our proposed algorithm, controlled trials based on typical feature selection algorithm are conducted using three different classifiers, and the experimental results indicate that the FSBRR algorithm can effectively reduce the feature dimension and improve the classification accuracy. In addition, an experiment of small sample dataset is designed and conducted in the section of discussion and analysis to clarify the specific implementation process of FSBRR algorithm more clearly.
Collapse
|
26
|
Wang Y, Liu B, Ma Z, Wong KC, Li X. Nature-Inspired Multiobjective Cancer Subtype Diagnosis. IEEE JOURNAL OF TRANSLATIONAL ENGINEERING IN HEALTH AND MEDICINE-JTEHM 2019; 7:4300112. [PMID: 30915261 PMCID: PMC6433215 DOI: 10.1109/jtehm.2019.2891746] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/26/2018] [Revised: 12/03/2018] [Accepted: 01/03/2019] [Indexed: 11/18/2022]
Abstract
Cancer gene expression data is of great importance in cancer subtype diagnosis and drug discovery. Many computational methods have been proposed to classify subtypes using those data. However, most of the previous computational methods suffer from poor interpretability, experimental noises, and low diagnostic quality. To address those problems, multiobjective ensemble cuckoo search based on decomposition (MOECSA) is proposed to optimize those four objectives simultaneously including the number of features, the accuracy, and two entropy-based measures: the relevance and the redundancy, classifying the cancer gene expression data with high predictive power for different cardinality levels under multiple objectives. A novel binary encoding is proposed to choose gene subsets from the cancer gene expression data for calculating four objective functions. Furthermore, an effective ensemble mechanism blended in the cuckoo search algorithm framework is applied to balance the convergence speed and population diversity in MOECSA. To demonstrate the effectiveness and efficiency of the proposed algorithm, experiments on thirty-five benchmark cancer gene expression datasets, four independent disease datasets, and one sequencing-based dataset are carried out to compare MOECSA with the six state-of-the-art multiobjective evolutionary algorithms and seven traditional classification algorithms. The experimental results in different perspectives demonstrate that MOECSA has better diagnosis performance than others at multiple levels.
Collapse
Affiliation(s)
- Yunhe Wang
- School of Information Science and TechnologyNortheast Normal UniversityChangchun130117China
| | - Bo Liu
- School of Physical EducationNortheast Normal UniversityChangchun130117China
| | - Zhiqiang Ma
- School of Information Science and TechnologyNortheast Normal UniversityChangchun130117China
| | - Ka-Chun Wong
- Department of Computer ScienceCity University of Hong KongHong Kong
| | - Xiangtao Li
- School of Information Science and TechnologyNortheast Normal UniversityChangchun130117China
| |
Collapse
|
27
|
Shukla AK, Singh P, Vardhan M. A hybrid framework for optimal feature subset selection. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2019. [DOI: 10.3233/jifs-169936] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Affiliation(s)
- Alok Kumar Shukla
- Department of Computer Science & Engineering, NIT Raipur, Chhattisgarh (C.G), India
| | - Pradeep Singh
- Department of Computer Science & Engineering, NIT Raipur, Chhattisgarh (C.G), India
| | - Manu Vardhan
- Department of Computer Science & Engineering, NIT Raipur, Chhattisgarh (C.G), India
| |
Collapse
|
28
|
Alomari OA, Khader AT, Al-Betar MA, Awadallah MA. A novel gene selection method using modified MRMR and hybrid bat-inspired algorithm with β-hill climbing. APPL INTELL 2018. [DOI: 10.1007/s10489-018-1207-1] [Citation(s) in RCA: 44] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
29
|
Lai CM. Multi-objective simplified swarm optimization with weighting scheme for gene selection. Appl Soft Comput 2018. [DOI: 10.1016/j.asoc.2017.12.049] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
30
|
Gene selection from large-scale gene expression data based on fuzzy interactive multi-objective binary optimization for medical diagnosis. Biocybern Biomed Eng 2018. [DOI: 10.1016/j.bbe.2018.02.002] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
31
|
Zhao XS, Bao LL, Ning Q, Ji JC, Zhao XW. An Improved Binary Differential Evolution Algorithm for Feature Selection in Molecular Signatures. Mol Inform 2017; 37:e1700081. [PMID: 29106044 DOI: 10.1002/minf.201700081] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2017] [Accepted: 10/18/2017] [Indexed: 11/08/2022]
Abstract
The discovery of biomarkers from high-dimensional data is a very challenging task in cancer diagnoses. On the one hand, biomarker discovery is the so-called high-dimensional small-sample problem. On the other hand, these data are redundant and noisy. In recent years, biomarker discovery from high-throughput biological data has become an increasingly important emerging topic in the field of bioinformatics. In this study, we propose a binary differential evolution algorithm for feature selection. Firstly, we suggest using a two-stage approach, where three filter methods including the Fisher score, T-statistics, and Information gain are used to generate the feature pool for input to differential evolution (DE). Secondly, in order to improve the performance of differential evolution algorithm for feature selection, a new variant of binary DE called BDE is proposed. Three optimization strategies are incorporated into the BDE. The first strategy is the heuristic method in initial stage, the second one is the self-adaptive parameter control, and the third one is the minimum change value to improve the exploration behaviour thus enhance the diversity. Finally, Support vector machine (SVM) is used as the classifier in 10 fold cross-validation method. The experimental results of our proposed algorithm on some benchmark datasets demonstrate the effectiveness of our algorithm. In addition, the BDE forged in this study will be of great potential in feature selection problems.
Collapse
Affiliation(s)
- X S Zhao
- School of Computer Science and Information Technology, Northeast Normal University, Changchun, 130000, P.R.China
| | - L L Bao
- School of Computer Science and Information Technology, Northeast Normal University, Changchun, 130000, P.R.China
| | - Q Ning
- School of Computer Science and Information Technology, Northeast Normal University, Changchun, 130000, P.R.China
| | - J C Ji
- School of Computer Science and Information Technology, Northeast Normal University, Changchun, 130000, P.R.China
| | - X W Zhao
- School of Computer Science and Information Technology, Northeast Normal University, Changchun, 130000, P.R.China.,Key Laboratory of symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, 130012, China
| |
Collapse
|
32
|
Ma H, Simon D, Siarry P, Yang Z, Fei M. Biogeography-Based Optimization: A 10-Year Review. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE 2017. [DOI: 10.1109/tetci.2017.2739124] [Citation(s) in RCA: 76] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
33
|
Wang H, Jing X, Niu B. A discrete bacterial algorithm for feature selection in classification of microarray gene expression cancer data. Knowl Based Syst 2017. [DOI: 10.1016/j.knosys.2017.04.004] [Citation(s) in RCA: 35] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
34
|
Ahmadigorji M, Amjady N, Dehghan S. A novel two-stage evolutionary optimization method for multiyear expansion planning of distribution systems in presence of distributed generation. Appl Soft Comput 2017. [DOI: 10.1016/j.asoc.2016.09.020] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
35
|
Wang H, Niu B. A novel bacterial algorithm with randomness control for feature selection in classification. Neurocomputing 2017. [DOI: 10.1016/j.neucom.2016.09.078] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
36
|
Classification of Gene Expression Data Using Multiobjective Differential Evolution. ENERGIES 2016. [DOI: 10.3390/en9121061] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
37
|
Lai CM, Yeh WC, Chang CY. Gene selection using information gain and improved simplified swarm optimization. Neurocomputing 2016. [DOI: 10.1016/j.neucom.2016.08.089] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
38
|
Li R, Hu S, Zhang H, Yin M. An efficient local search framework for the minimum weighted vertex cover problem. Inf Sci (N Y) 2016. [DOI: 10.1016/j.ins.2016.08.053] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
39
|
Ang JC, Mirzal A, Haron H, Hamed HNA. Supervised, Unsupervised, and Semi-Supervised Feature Selection: A Review on Gene Selection. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2016; 13:971-989. [PMID: 26390495 DOI: 10.1109/tcbb.2015.2478454] [Citation(s) in RCA: 196] [Impact Index Per Article: 21.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Recently, feature selection and dimensionality reduction have become fundamental tools for many data mining tasks, especially for processing high-dimensional data such as gene expression microarray data. Gene expression microarray data comprises up to hundreds of thousands of features with relatively small sample size. Because learning algorithms usually do not work well with this kind of data, a challenge to reduce the data dimensionality arises. A huge number of gene selection are applied to select a subset of relevant features for model construction and to seek for better cancer classification performance. This paper presents the basic taxonomy of feature selection, and also reviews the state-of-the-art gene selection methods by grouping the literatures into three categories: supervised, unsupervised, and semi-supervised. The comparison of experimental results on top 5 representative gene expression datasets indicates that the classification accuracy of unsupervised and semi-supervised feature selection is competitive with supervised feature selection.
Collapse
|
40
|
Wang L, Wang Y, Chang Q. Feature selection methods for big data bioinformatics: A survey from the search perspective. Methods 2016; 111:21-31. [PMID: 27592382 DOI: 10.1016/j.ymeth.2016.08.014] [Citation(s) in RCA: 110] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2016] [Revised: 08/25/2016] [Accepted: 08/30/2016] [Indexed: 11/26/2022] Open
Abstract
This paper surveys main principles of feature selection and their recent applications in big data bioinformatics. Instead of the commonly used categorization into filter, wrapper, and embedded approaches to feature selection, we formulate feature selection as a combinatorial optimization or search problem and categorize feature selection methods into exhaustive search, heuristic search, and hybrid methods, where heuristic search methods may further be categorized into those with or without data-distilled feature ranking measures.
Collapse
Affiliation(s)
- Lipo Wang
- School of Electrical and Electronic Engineering, Nanyang Technological University, Singapore.
| | - Yaoli Wang
- College of Information Engineering, Taiyuan University of Technology, Taiyuan, China.
| | - Qing Chang
- College of Information Engineering, Taiyuan University of Technology, Taiyuan, China.
| |
Collapse
|
41
|
Jiang W, Shi Y, Zhao W, Wang X. Parameters Identification of Fluxgate Magnetic Core Adopting the Biogeography-Based Optimization Algorithm. SENSORS 2016; 16:s16070979. [PMID: 27347974 PMCID: PMC4970030 DOI: 10.3390/s16070979] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/15/2016] [Revised: 06/14/2016] [Accepted: 06/17/2016] [Indexed: 11/16/2022]
Abstract
The main part of the magnetic fluxgate sensor is the magnetic core, the hysteresis characteristic of which affects the performance of the sensor. When the fluxgate sensors are modelled for design purposes, an accurate model of hysteresis characteristic of the cores is necessary to achieve good agreement between modelled and experimental data. The Jiles-Atherton model is simple and can reflect the hysteresis properties of the magnetic material precisely, which makes it widely used in hysteresis modelling and simulation of ferromagnetic materials. However, in practice, it is difficult to determine the parameters accurately owing to the sensitivity of the parameters. In this paper, the Biogeography-Based Optimization (BBO) algorithm is applied to identify the Jiles-Atherton model parameters. To enhance the performances of the BBO algorithm such as global search capability, search accuracy and convergence rate, an improved Biogeography-Based Optimization (IBBO) algorithm is put forward by using Arnold map and mutation strategy of Differential Evolution (DE) algorithm. Simulation results show that IBBO algorithm is superior to Genetic Algorithm (GA), Particle Swarm Optimization (PSO) algorithm, Differential Evolution algorithm and BBO algorithm in identification accuracy and convergence rate. The IBBO algorithm is applied to identify Jiles-Atherton model parameters of selected permalloy. The simulation hysteresis loop is in high agreement with experimental data. Using permalloy as core of fluxgate probe, the simulation output is consistent with experimental output. The IBBO algorithm can identify the parameters of Jiles-Atherton model accurately, which provides a basis for the precise analysis and design of instruments and equipment with magnetic core.
Collapse
Affiliation(s)
- Wenjuan Jiang
- The Higher Educational Key Laboratory for Measuring & Control Technology and Instrumentations of Heilongjiang Province, School of Measurement-Control Technology & Communications Engineering, Harbin University of Science and Technology, Harbin 150080, China.
- School of Automation Engineering, Northeast Dianli University, Jilin 132012, China.
| | - Yunbo Shi
- The Higher Educational Key Laboratory for Measuring & Control Technology and Instrumentations of Heilongjiang Province, School of Measurement-Control Technology & Communications Engineering, Harbin University of Science and Technology, Harbin 150080, China.
| | - Wenjie Zhao
- The Higher Educational Key Laboratory for Measuring & Control Technology and Instrumentations of Heilongjiang Province, School of Measurement-Control Technology & Communications Engineering, Harbin University of Science and Technology, Harbin 150080, China.
| | - Xiangxin Wang
- The Higher Educational Key Laboratory for Measuring & Control Technology and Instrumentations of Heilongjiang Province, School of Measurement-Control Technology & Communications Engineering, Harbin University of Science and Technology, Harbin 150080, China.
| |
Collapse
|
42
|
Wang GG, Gandomi AH, Alavi AH, Deb S. A Multi-Stage Krill Herd Algorithm for Global Numerical Optimization. INT J ARTIF INTELL T 2016. [DOI: 10.1142/s021821301550030x] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
A multi-stage krill herd (MSKH) algorithm is presented to fully exploit the global and local search abilities of the standard krill herd (KH) optimization method. The proposed method involves exploration and exploitation stages. The exploration stage uses the basic KH algorithm to select a good candidate solution set. This phase is followed by fine-tuning a good candidate solution in the exploitation stage with a focused local mutation and crossover (LMC) operator in order to enhance the reliability of the method for solving global numerical optimization problems. Moreover, the elitism scheme is introduced into the MSKH method to guarantee the best solution. The performance of MSKH is verified using twenty-five standard and rotated and shifted benchmark problems. The results show the superiority of the proposed algorithm to the standard KH and other well-known optimization methods.
Collapse
Affiliation(s)
- Gai-Ge Wang
- School of Computer Science and Technology, Jiangsu Normal University ,Xuzhou, Jiangsu, 221116, China
- Institute of Algorithm and Big Data Analysis, Northeast Normal University ,Changchun, 130117, China
- School of Computer Science and Information Technology, Northeast Normal University ,Changchun, 130117, China
| | - Amir H. Gandomi
- BEACON Center for the Study of Evolution in Action, Michigan State University, East Lansing, MI 48824, USA
| | - Amir H. Alavi
- Department of Civil and Environmental Engineering, Michigan State University, East Lansing, MI 48824, USA
| | - Suash Deb
- Department of Computer Science & Engineering Cambridge Institute of Technology, Ranchi 835103, Jharkhand, India
| |
Collapse
|
43
|
Wang GG, Deb S, Gandomi AH, Alavi AH. Opposition-based krill herd algorithm with Cauchy mutation and position clamping. Neurocomputing 2016. [DOI: 10.1016/j.neucom.2015.11.018] [Citation(s) in RCA: 123] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
44
|
A local search algorithm with tabu strategy and perturbation mechanism for generalized vertex cover problem. Neural Comput Appl 2016. [DOI: 10.1007/s00521-015-2172-9] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
45
|
Guo W, Wang L, Wu Q. Numerical comparisons of migration models for Multi-objective Biogeography-Based Optimization. Inf Sci (N Y) 2016. [DOI: 10.1016/j.ins.2015.07.059] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
46
|
Abstract
This study proposes a new firefly-inspired krill herd (FKH) optimization method based on integration of firefly and krill herd algorithms. FKH introduces an attractiveness and light intensity updating (ALIU) operator originally used in firefly algorithm into the krill herd method. This is basically done to improve local search technique and promote the diversity of the population to avoid a premature convergence. Moreover, an elitism strategy is adopted to maintain the optimal krill with the best fitness when updating the krill. The performance of the FKH method is verified using fifteen different benchmark functions. The results indicate that FKH performs more accurate and effective than the basic krill herd and other optimization algorithms.
Collapse
|
47
|
Feng Y, Wang GG, Gao XZ. A Novel Hybrid Cuckoo Search Algorithm with Global Harmony Search for 0–1 Knapsack Problems. INT J COMPUT INT SYS 2016. [DOI: 10.1080/18756891.2016.1256577] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022] Open
|
48
|
A New Swarm Intelligence Approach for Clustering Based on Krill Herd with Elitism Strategy. ALGORITHMS 2015. [DOI: 10.3390/a8040951] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
49
|
|
50
|
|