Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For:	Verikas A, Bacauskiene M. Feature selection with neural networks. Pattern Recognit Lett 2002. [DOI: 10.1016/s0167-8655(02)00081-8] [Citation(s) in RCA: 89] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]

Number

Cited by Other Article(s)

Seo B, Lin L, Li J. Mixture of Linear Models Co-supervised by Deep Neural Networks. J Comput Graph Stat 2022. [DOI: 10.1080/10618600.2022.2107533] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/16/2022]

Neural network input feature selection using structured l2 − norm penalization. APPL INTELL 2022. [DOI: 10.1007/s10489-022-03539-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]

Sildir H, Sarrafi S, Aydin E. Optimal artificial neural network architecture design for modeling an industrial ethylene oxide plant. Comput Chem Eng 2022. [DOI: 10.1016/j.compchemeng.2022.107850] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]

Badaoui M, Buigues PJ, Berta D, Mandana GM, Gu H, Földes T, Dickson CJ, Hornak V, Kato M, Molteni C, Parsons S, Rosta E. Combined Free-Energy Calculation and Machine Learning Methods for Understanding Ligand Unbinding Kinetics. J Chem Theory Comput 2022;18:2543-2555. [PMID: 35195418 PMCID: PMC9097281 DOI: 10.1021/acs.jctc.1c00924] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]

Abstract

The determination of drug residence times, which define the time an inhibitor is in complex with its target, is a fundamental part of the drug discovery process. Synthesis and experimental measurements of kinetic rate constants are, however, expensive and time consuming. In this work, we aimed to obtain drug residence times computationally. Furthermore, we propose a novel algorithm to identify molecular design objectives based on ligand unbinding kinetics. We designed an enhanced sampling technique to accurately predict the free-energy profiles of the ligand unbinding process, focusing on the free-energy barrier for unbinding. Our method first identifies unbinding paths determining a corresponding set of internal coordinates (ICs) that form contacts between the protein and the ligand; it then iteratively updates these interactions during a series of biased molecular dynamics (MD) simulations to reveal the ICs that are important for the whole of the unbinding process. Subsequently, we performed finite-temperature string simulations to obtain the free-energy barrier for unbinding using the set of ICs as a complex reaction coordinate. Importantly, we also aimed to enable the further design of drugs focusing on improved residence times. To this end, we developed a supervised machine learning (ML) approach with inputs from unbiased “downhill” trajectories initiated near the transition state (TS) ensemble of the string unbinding path. We demonstrate that our ML method can identify key ligand–protein interactions driving the system through the TS. Some of the most important drugs for cancer treatment are kinase inhibitors. One of these kinase targets is cyclin-dependent kinase 2 (CDK2), an appealing target for anticancer drug development. Here, we tested our method using two different CDK2 inhibitors for the potential further development of these compounds. We compared the free-energy barriers obtained from our calculations with those observed in available experimental data. We highlighted important interactions at the distal ends of the ligands that can be targeted for improved residence times. Our method provides a new tool to determine unbinding rates and to identify key structural features of the inhibitors that can be used as starting points for novel design strategies in drug discovery.

Collapse

Punitha S, Stephan T, Gandomi AH. A Novel Breast Cancer Diagnosis Scheme With Intelligent Feature and Parameter Selections. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2022;214:106432. [PMID: 34844767 DOI: 10.1016/j.cmpb.2021.106432] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/08/2020] [Accepted: 09/15/2021] [Indexed: 06/13/2023]

Abstract

BACKGROUND AND OBJECTIVE

Breast cancer is the most commonly occurring cancer among women, which contributes to the global death rate. The key to increasing the survival rate of affected patients is early diagnosis along with appropriate treatments. Manual methods for breast cancer diagnosis fail due to human errors, inaccurate diagnoses, and are time-consuming when demands are high. Intelligent systems based on Artificial Neural Network (ANN) for automated breast cancer diagnosis are powerful due to their strong decision-making capabilities in complicated cases. Artificial Bee Colony, Artificial Immune System, and Bacterial Foraging Optimization are swarm intelligence algorithms that solve combinatorial optimization problems. This paper proposes two novel hybrid Artificial Bee Colony (ABC) optimization algorithms that overcome the demerits of standard ABC algorithms. First, this paper proposes a hybrid ABC approach called HABC, in which the standard ABC optimization is hybridized with a modified clonal selection algorithm of the Artificial Immune System that eliminates the poor exploration capabilities of standard ABC optimization. Further, this paper proposes a novel hybrid Artificial Bee Colony (Hybrid ABC) optimization where the strong explorative capabilities of the chemotaxis phase of the bacterial foraging optimization are integrated with a spiral model-based exploitative phase of the ABC by which the proposed Hybrid ABC overcomes the demerits of poor exploration and exploitation of the standard ABC algorithm.

METHODS

In this work, the two proposed hybrid approaches were used in concurrent feature selection and parameter optimization of an ANN model. The proposed algorithm is implemented using various back-propagation algorithms, including resilient back-propagation (HABC-RP and Hybrid ABC-RP), Levenberg Marquart (HABC-LM and Hybrid ABC-LM), and momentum-based gradient descent (HABC-MGD and Hybrid ABC-GD) for parameter tuning of ANN. The Wisconsin breast cancer dataset was used to evaluate the performance of the proposed algorithms in terms of accuracy, complexity, and computational time.

RESULTS

The mean accuracy of the proposed HABC-RP was 99.14% and 99.54% for Hybrid ABC which is better than the results found in the existing literature. HABC-RP attained a sensitivity of 98.32%, a specificity of 99.63%, and a precision of 99.38% whereas Hybrid ABC attained sensitivity of 99.08% and Specificity of 99.81%.

CONCLUSIONS

HABC-RP and Hybrid ABC-RP yielded high accuracy with a low complexity ANN structure compared to other variants. After evaluation, interestingly it is found that the Hybrid ABC-RP has achieved the highest mean accuracy of 99.54% with low complexity of 10.25 mean connections when compared to other variants proposed in this paper. It can be concluded that the concurrent selection of input features and tuning of parameters of ANN plays a vital role in increasing the accuracy of a breast cancer diagnosis. The proposed HABC-RP and Hybrid ABC-RP showed better results when compared to the existing breast cancer diagnosis systems taken for comparison. In the future, the proposed two-hybrid approaches can be used to generate optimal thresholds for the segmentation of tumors in abnormal images. HABC and Hybrid ABC can be used for tuning the parameters of various classifiers.

Collapse

Chia C, Sesia M, Ho CS, Jeffrey SS, Dionne J, Candes EJ, Howe RT. Interpretable Classification of Bacterial Raman Spectra with Knockoff Wavelets. IEEE J Biomed Health Inform 2021;26:740-748. [PMID: 34232897 DOI: 10.1109/jbhi.2021.3094873] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]

Stephan P, Stephan T, Kannan R, Abraham A. A hybrid artificial bee colony with whale optimization algorithm for improved breast cancer diagnosis. Neural Comput Appl 2021. [DOI: 10.1007/s00521-021-05997-6] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]

Wang MWH, Goodman JM, Allen TEH. Machine Learning in Predictive Toxicology: Recent Applications and Future Directions for Classification Models. Chem Res Toxicol 2020;34:217-239. [PMID: 33356168 DOI: 10.1021/acs.chemrestox.0c00316] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]

An Effective Multi-Label Feature Selection Model Towards Eliminating Noisy Features. APPLIED SCIENCES-BASEL 2020. [DOI: 10.3390/app10228093] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Binary Whale Optimization Algorithm for Dimensionality Reduction. MATHEMATICS 2020. [DOI: 10.3390/math8101821] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Abstract Feature selection (FS) was regarded as a global combinatorial optimization problem. FS is used to simplify and enhance the quality of high-dimensional datasets by selecting prominent features and removing irrelevant and redundant data to provide good classification results. FS aims to reduce the dimensionality and improve the classification accuracy that is generally utilized with great importance in different fields such as pattern classification, data analysis, and data mining applications. The main problem is to find the best subset that contains the representative information of all the data. In order to overcome this problem, two binary variants of the whale optimization algorithm (WOA) are proposed, called bWOA-S and bWOA-V. They are used to decrease the complexity and increase the performance of a system by selecting significant features for classification purposes. The first bWOA-S version uses the Sigmoid transfer function to convert WOA values to binary ones, whereas the second bWOA-V version uses a hyperbolic tangent transfer function. Furthermore, the two binary variants introduced here were compared with three famous and well-known optimization algorithms in this domain, such as Particle Swarm Optimizer (PSO), three variants of binary ant lion (bALO1, bALO2, and bALO3), binary Dragonfly Algorithm (bDA) as well as the original WOA, over 24 benchmark datasets from the UCI repository. Eventually, a non-parametric test called Wilcoxon’s rank-sum was carried out at 5% significance to prove the powerfulness and effectiveness of the two proposed algorithms when compared with other algorithms statistically. The qualitative and quantitative results showed that the two introduced variants in the FS domain are able to minimize the selected feature number as well as maximize the accuracy of the classification within an appropriate time. Collapse

Fogliatto FS, Anzanello MJ, Soares F, Brust-Renck PG. Decision Support for Breast Cancer Detection: Classification Improvement Through Feature Selection. Cancer Control 2020;26:1073274819876598. [PMID: 31538497 PMCID: PMC6755645 DOI: 10.1177/1073274819876598] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open

Cui S, Luo Y, Tseng HH, Ten Haken RK, El Naqa I. Combining handcrafted features with latent variables in machine learning for prediction of radiation-induced lung damage. Med Phys 2019;46:2497-2511. [PMID: 30891794 DOI: 10.1002/mp.13497] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2018] [Revised: 02/18/2019] [Accepted: 03/08/2019] [Indexed: 12/23/2022] Open

Abstract

PURPOSE

There has been burgeoning interest in applying machine learning methods for predicting radiotherapy outcomes. However, the imbalanced ratio of a large number of variables to a limited sample size in radiation oncology constitutes a major challenge. Therefore, dimensionality reduction methods can be a key to success. The study investigates and contrasts the application of traditional machine learning methods and deep learning approaches for outcome modeling in radiotherapy. In particular, new joint architectures based on variational autoencoder (VAE) for dimensionality reduction are presented and their application is demonstrated for the prediction of lung radiation pneumonitis (RP) from a large-scale heterogeneous dataset.

METHODS

A large-scale heterogeneous dataset containing a pool of 230 variables including clinical factors (e.g., dose, KPS, stage) and biomarkers (e.g., single nucleotide polymorphisms (SNPs), cytokines, and micro-RNAs) in a population of 106 nonsmall cell lung cancer (NSCLC) patients who received radiotherapy was used for modeling RP. Twenty-two patients had grade 2 or higher RP. Four methods were investigated, including feature selection (case A) and feature extraction (case B) with traditional machine learning methods, a VAE-MLP joint architecture (case C) with deep learning and lastly, the combination of feature selection and joint architecture (case D). For feature selection, Random forest (RF), Support Vector Machine (SVM), and multilayer perceptron (MLP) were implemented to select relevant features. Specifically, each method was run for multiple times to rank features within several cross-validated (CV) resampled sets. A collection of ranking lists were then aggregated by top 5% and Kemeny graph methods to identify the final ranking for prediction. A synthetic minority oversampling technique was applied to correct for class imbalance during this process. For deep learning, a VAE-MLP joint architecture where a VAE aimed for dimensionality reduction and an MLP aimed for classification was developed. In this architecture, reconstruction loss and prediction loss were combined into a single loss function to realize simultaneous training and weights were assigned to different classes to mitigate class imbalance. To evaluate the prediction performance and conduct comparisons, the area under receiver operating characteristic curves (AUCs) were performed for nested CVs for both handcrafted feature selections and the deep learning approach. The significance of differences in AUCs was assessed using the DeLong test of U-statistics.

RESULTS

An MLP-based method using weight pruning (WP) feature selection yielded the best performance among the different hand-crafted feature selection methods (case A), reaching an AUC of 0.804 (95% CI: 0.761-0.823) with 29 top features. A VAE-MLP joint architecture (case C) achieved a comparable but slightly lower AUC of 0.781 (95% CI: 0.737-0.808) with the size of latent dimension being 2. The combination of handcrafted features (case A) and latent representation (case D) achieved a significant AUC improvement of 0.831 (95% CI: 0.805-0.863) with 22 features (P-value = 0.000642 compared with handcrafted features only (Case A) and P-value = 0.000453 compared to VAE alone (Case C)) with an MLP classifier.

CONCLUSION

The potential for combination of traditional machine learning methods and deep learning VAE techniques has been demonstrated for dealing with limited datasets in modeling radiotherapy toxicities. Specifically, latent variables from a VAE-MLP joint architecture are able to complement handcrafted features for the prediction of RP and improve prediction over either method alone.

Collapse

Padfield N, Zabalza J, Zhao H, Masero V, Ren J. EEG-Based Brain-Computer Interfaces Using Motor-Imagery: Techniques and Challenges. SENSORS 2019;19:s19061423. [PMID: 30909489 PMCID: PMC6471241 DOI: 10.3390/s19061423] [Citation(s) in RCA: 153] [Impact Index Per Article: 30.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/30/2019] [Revised: 03/10/2019] [Accepted: 03/19/2019] [Indexed: 12/11/2022]

Wan Y, Wang M, Ye Z, Lai X. A feature selection method based on modified binary coded ant colony optimization algorithm. Appl Soft Comput 2016. [DOI: 10.1016/j.asoc.2016.08.011] [Citation(s) in RCA: 67] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]

Explaining Support Vector Machines: A Color Based Nomogram. PLoS One 2016;11:e0164568. [PMID: 27723811 PMCID: PMC5056733 DOI: 10.1371/journal.pone.0164568] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2016] [Accepted: 09/27/2016] [Indexed: 02/05/2023] Open

Abstract

Problem setting

Support vector machines (SVMs) are very popular tools for classification, regression and other problems. Due to the large choice of kernels they can be applied with, a large variety of data can be analysed using these tools. Machine learning thanks its popularity to the good performance of the resulting models. However, interpreting the models is far from obvious, especially when non-linear kernels are used. Hence, the methods are used as black boxes. As a consequence, the use of SVMs is less supported in areas where interpretability is important and where people are held responsible for the decisions made by models.

Objective

In this work, we investigate whether SVMs using linear, polynomial and RBF kernels can be explained such that interpretations for model-based decisions can be provided. We further indicate when SVMs can be explained and in which situations interpretation of SVMs is (hitherto) not possible. Here, explainability is defined as the ability to produce the final decision based on a sum of contributions which depend on one single or at most two input variables.

Results

Our experiments on simulated and real-life data show that explainability of an SVM depends on the chosen parameter values (degree of polynomial kernel, width of RBF kernel and regularization constant). When several combinations of parameter values yield the same cross-validation performance, combinations with a lower polynomial degree or a larger kernel width have a higher chance of being explainable.

Conclusions

This work summarizes SVM classifiers obtained with linear, polynomial and RBF kernels in a single plot. Linear and polynomial kernels up to the second degree are represented exactly. For other kernels an indication of the reliability of the approximation is presented. The complete methodology is available as an R package and two apps and a movie are provided to illustrate the possibilities offered by the method.

Collapse

Does Feature Selection Improve Classification? A Large Scale Experiment in OpenML. ACTA ACUST UNITED AC 2016. [DOI: 10.1007/978-3-319-46349-0_14] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/16/2023]

Vukicevic AM, Stojadinovic M, Radovic M, Djordjevic M, Cirkovic BA, Pejovic T, Jovicic G, Filipovic N. Automated development of artificial neural networks for clinical purposes: Application for predicting the outcome of choledocholithiasis surgery. Comput Biol Med 2016;75:80-9. [DOI: 10.1016/j.compbiomed.2016.05.016] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2015] [Revised: 05/23/2016] [Accepted: 05/24/2016] [Indexed: 02/07/2023]

Moradi P, Gholampour M. A hybrid particle swarm optimization for feature subset selection by integrating a novel local search strategy. Appl Soft Comput 2016. [DOI: 10.1016/j.asoc.2016.01.044] [Citation(s) in RCA: 231] [Impact Index Per Article: 28.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]

Holsbach N, Fogliatto FS, Anzanello MJ. [A data mining method for breast cancer identification based on a selection of variables]. CIENCIA & SAUDE COLETIVA 2015;19:1295-304. [PMID: 24820612 DOI: 10.1590/1413-81232014194.01722013] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2013] [Accepted: 04/29/2013] [Indexed: 11/21/2022] Open

Messay T, Hardie RC, Tuinstra TR. Segmentation of pulmonary nodules in computed tomography using a regression neural network approach and its application to the Lung Image Database Consortium and Image Database Resource Initiative dataset. Med Image Anal 2015;22:48-62. [PMID: 25791434 DOI: 10.1016/j.media.2015.02.002] [Citation(s) in RCA: 98] [Impact Index Per Article: 10.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2014] [Revised: 02/06/2015] [Accepted: 02/12/2015] [Indexed: 11/26/2022]

Ahmad F, Mat Isa NA, Hussain Z, Osman MK, Sulaiman SN. A GA-based feature selection and parameter optimization of an ANN in diagnosing breast cancer. Pattern Anal Appl 2014. [DOI: 10.1007/s10044-014-0375-9] [Citation(s) in RCA: 73] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Ganivada A, Ray SS, Pal SK. Fuzzy rough sets, and a granular neural network for unsupervised feature selection. Neural Netw 2013;48:91-108. [DOI: 10.1016/j.neunet.2013.07.008] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2012] [Revised: 05/04/2013] [Accepted: 07/30/2013] [Indexed: 10/26/2022]

Jing SY. A hybrid genetic algorithm for feature subset selection in rough set theory. Soft comput 2013. [DOI: 10.1007/s00500-013-1150-3] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]

Artificial neural networks in medical diagnosis. J Appl Biomed 2013. [DOI: 10.2478/v10136-012-0031-x] [Citation(s) in RCA: 462] [Impact Index Per Article: 42.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open

Vera V, Corchado E, Redondo R, Sedano J, García ÁE. Applying soft computing techniques to optimise a dental milling process. Neurocomputing 2013. [DOI: 10.1016/j.neucom.2012.04.033] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]

A hybrid feature selection scheme for mixed attributes data. ACTA ACUST UNITED AC 2013. [DOI: 10.1007/s40314-013-0019-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]

ELLA HASSANIEN ABOUL, ABRAHAM AJITH. ROUGH MORPHOLOGY HYBRID APPROACH FOR MAMMOGRAPHY IMAGE CLASSIFICATION AND PREDICTION. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS 2011. [DOI: 10.1142/s1469026808002181] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]

Categorizing Normal and Pathological Voices: Automated and Perceptual Categorization. J Voice 2011;25:700-8. [DOI: 10.1016/j.jvoice.2010.04.009] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2009] [Accepted: 04/28/2010] [Indexed: 11/16/2022]

Kabir MM, Shahjahan M, Murase K. A new local search based hybrid genetic algorithm for feature selection. Neurocomputing 2011. [DOI: 10.1016/j.neucom.2011.03.034] [Citation(s) in RCA: 106] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]

Bhooshan N, Giger M, Edwards D, Yuan Y, Jansen S, Li H, Lan L, Sattar H, Newstead G. Computerized three-class classification of MRI-based prognostic markers for breast cancer. Phys Med Biol 2011;56:5995-6008. [PMID: 21860079 PMCID: PMC4134441 DOI: 10.1088/0031-9155/56/18/014] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

Kabir MM, Shahjahan M, Murase K. Ant Colony Optimization for Feature Selection Involving Effective Local Search. JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS 2011. [DOI: 10.20965/jaciii.2011.p0671] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Monirul Kabir M, Monirul Islam M, Murase K. A new wrapper feature selection approach using neural network. Neurocomputing 2010. [DOI: 10.1016/j.neucom.2010.04.003] [Citation(s) in RCA: 76] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]

Verikas A, Guzaitis J, Gelzinis A, Bacauskiene M. A general framework for designing a fuzzy rule-based classifier. Knowl Inf Syst 2010. [DOI: 10.1007/s10115-010-0340-x] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Riul A, Dantas CAR, Miyazaki CM, Oliveira ON. Recent advances in electronic tongues. Analyst 2010;135:2481-95. [PMID: 20730141 DOI: 10.1039/c0an00292e] [Citation(s) in RCA: 197] [Impact Index Per Article: 14.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]

Verikas A, Gelzinis A, Bacauskiene M, Hållander M, Uloza V, Kaseta M. Combining image, voice, and the patient’s questionnaire data to categorize laryngeal disorders. Artif Intell Med 2010;49:43-50. [DOI: 10.1016/j.artmed.2010.02.002] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2008] [Revised: 01/19/2010] [Accepted: 02/16/2010] [Indexed: 11/28/2022]

Hybrid and ensemble-based soft computing techniques in bankruptcy prediction: a survey. Soft comput 2009. [DOI: 10.1007/s00500-009-0490-5] [Citation(s) in RCA: 98] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]

Parameter determination and feature selection for back-propagation network by particle swarm optimization. Knowl Inf Syst 2009. [DOI: 10.1007/s10115-009-0242-y] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]

Baraldi P, Pedroni N, Zio E. Application of a niched Pareto genetic algorithm for selecting features for nuclear transients classification. INT J INTELL SYST 2009. [DOI: 10.1002/int.20328] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]

Tsai CY, Chou SY, Lin SW, Wang WH. Location determination of mobile devices for an indoor WLAN application using a neural network. Knowl Inf Syst 2008. [DOI: 10.1007/s10115-008-0154-2] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]

Advances in evolutionary feature selection neural networks with co-evolution learning. Neural Comput Appl 2008. [DOI: 10.1007/s00521-007-0114-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]

Feature Selection for Classificatory Analysis Based on Information-theoretic Criteria. ACTA ACUST UNITED AC 2008. [DOI: 10.3724/sp.j.1004.2008.00383] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]

Romero E, Sopena J. Performing Feature Selection With Multilayer Perceptrons. ACTA ACUST UNITED AC 2008;19:431-41. [DOI: 10.1109/tnn.2007.909535] [Citation(s) in RCA: 79] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]

Huang JJ, Cai YZ, Xu XM. A parameterless feature ranking algorithm based on MI. Neurocomputing 2008. [DOI: 10.1016/j.neucom.2007.04.012] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]

Eickhoff R, Rückert U. Robustness of radial basis functions. Neurocomputing 2007. [DOI: 10.1016/j.neucom.2006.04.012] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]

François D, Rossi F, Wertz V, Verleysen M. Resampling methods for parameter-free and robust feature selection with mutual information. Neurocomputing 2007. [DOI: 10.1016/j.neucom.2006.11.019] [Citation(s) in RCA: 57] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]

Su CT, Chen LS, Chiang TL. A neural network based information granulation approach to shorten the cellular phone test process. COMPUT IND 2006. [DOI: 10.1016/j.compind.2006.01.001] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]

Yen GG, Leong WF. Fault classification on vibration data with wavelet based feature selection scheme. ISA TRANSACTIONS 2006;45:141-51. [PMID: 16649561 DOI: 10.1016/s0019-0578(07)60185-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2023]

Muni DP, Pal NR, Das J. Genetic programming for simultaneous feature selection and classifier design. ACTA ACUST UNITED AC 2006;36:106-17. [PMID: 16468570 DOI: 10.1109/tsmcb.2005.854499] [Citation(s) in RCA: 230] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Abdel-Aal RE. GMDH-based feature ranking and selection for improved classification of medical data. J Biomed Inform 2005;38:456-68. [PMID: 16337569 DOI: 10.1016/j.jbi.2005.03.003] [Citation(s) in RCA: 53] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2005] [Revised: 03/29/2005] [Accepted: 03/30/2005] [Indexed: 11/17/2022]

Abstract

Medical applications are often characterized by a large number of disease markers and a relatively small number of data records. We demonstrate that complete feature ranking followed by selection can lead to appreciable reductions in data dimensionality, with significant improvements in the implementation and performance of classifiers for medical diagnosis. We describe a novel approach for ranking all features according to their predictive quality using properties unique to learning algorithms based on the group method of data handling (GMDH). An abductive network training algorithm is repeatedly used to select groups of optimum predictors from the feature set at gradually increasing levels of model complexity specified by the user. Groups selected earlier are better predictors. The process is then repeated to rank features within individual groups. The resulting full feature ranking can be used to determine the optimum feature subset by starting at the top of the list and progressively including more features until the classification error rate on an out-of-sample evaluation set starts to increase due to overfitting. The approach is demonstrated on two medical diagnosis datasets (breast cancer and heart disease) and comparisons are made with other feature ranking and selection methods. Receiver operating characteristics (ROC) analysis is used to compare classifier performance. At default model complexity, dimensionality reduction of 22 and 54% could be achieved for the breast cancer and heart disease data, respectively, leading to improvements in the overall classification performance. For both datasets, considerable dimensionality reduction introduced no significant reduction in the area under the ROC curve. GMDH-based feature selection results have also proved effective with neural network classifiers.

Collapse

An extended classifiability index for feature selection in nuclear transients. ANN NUCL ENERGY 2005. [DOI: 10.1016/j.anucene.2005.06.003] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]