1
|
Ding X, Li Y, Chen S. Maximum margin and global criterion based-recursive feature selection. Neural Netw 2024; 169:597-606. [PMID: 37956576 DOI: 10.1016/j.neunet.2023.10.037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2023] [Revised: 06/19/2023] [Accepted: 10/22/2023] [Indexed: 11/15/2023]
Abstract
In this research paper, we aim to investigate and address the limitations of recursive feature elimination (RFE) and its variants in high-dimensional feature selection tasks. We identify two main challenges associated with these methods. Firstly, the feature ranking criterion utilized in these approaches is inconsistent with the maximum-margin theory. Secondly, the computation of the criterion is performed locally, lacking the ability to measure the importance of features globally. To overcome these challenges, we propose a novel feature ranking criterion called Maximum Margin and Global (MMG) criterion. This criterion utilizes the classification margin to determine the importance of features and computes it globally, enabling a more accurate assessment of feature importance. Moreover, we introduce an optimal feature subset evaluation algorithm that leverages the MMG criterion to determine the best subset of features. To enhance the efficiency of the proposed algorithms, we provide two alpha seeding strategies that significantly reduce computational costs while maintaining high accuracy. These strategies offer a practical means to expedite the feature selection process. Through extensive experiments conducted on ten benchmark datasets, we demonstrate that our proposed algorithms outperform current state-of-the-art methods. Additionally, the alpha seeding strategies yield significant speedups, further enhancing the efficiency of the feature selection process.
Collapse
Affiliation(s)
- Xiaojian Ding
- College of Information Engineering, Nanjing University of Finance and Economics, Nanjing 210023, China.
| | - Yi Li
- College of Economics and Management, Nanjing Agricultural University, Nanjing 210095, China
| | - Shilin Chen
- Thoracic Surgery, Nanjing Medical University Affiliated Cancer Hospital, Jiangsu Cancer Hospital, Jiangsu Institute of Cancer Research, Nanjing 221005, China
| |
Collapse
|
2
|
Research on Industry Data Analytics on Processing Procedure of Named 3-4-8-2 Components Combination for the Application Identification in New Chain Convenience Store. Processes (Basel) 2023. [DOI: 10.3390/pr11010180] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
With the rapid economic boom of Asian countries, the president of Country-A has made great efforts to reform in recent years. The prospect of economic development is promising, and business opportunities are emerging gradually, depicting a prosperous scene; accordingly, people’s livelihood consumption also has changed significantly. The original main point of consumption for urban and rural people was the old and traditional grocery store with poor sanitation, but due to the economic improvement, the quality of consumption has also improved, and convenience stores are gradually replacing grocery store. However, convenience store management involves performance, logistic, competition, and personnel costs. Both whether the store can create a net profit and evaluate and select a new store will be important keys that significantly influence business performance. Therefore, this study attempts to use the industry data analysis method for highlighting a concept of processing an experience procedure of named 3-4-8-2 components combination in two stages. First, in the data preprocessing stage, this research considers 22 condition attributes and two types of decision factors, that include net profit and new store selection, and use both techniques of attribute selection and data discretization through the analysis and prediction of data mining tools. Next, in the experiment execution stage, three well-known classifiers (Bayes net, logistic regression, and J48 decision tree) with past good performance and four models (without preprocessing, with attribute selection, with data discretization, and with attribute selection and data discretization) are used for eight different experiments through two data verification methods (percentage split and cross-validation). Conclusively, three key results are identified from empirical analysis: (1) It is found that the prediction accuracy of the J48 decision tree classifier is relatively high and stable among the three classifiers in this study; at the same time, the J48 decision tree can yield comprehensible knowledge-based rules to instruct interested parties. (2) The results of this study show that the important attributes for the net profit decision attribute include the store type, POS number, and cashier number, while the important attributes for the new store selection include the store type and cashier number. (3) There is a difference in the selection of important attributes. Furthermore, four key valuable contributions are addressed from the empirical results, including academic contributions, enterprise contributions, application contributions, and management contributions. It is expected that the direction of store layout expansion can be found and identified through this study, but there are still many risks hidden behind the considerable business opportunities that need to be carefully managed.
Collapse
|
3
|
Feature selection for label distribution learning using dual-similarity based neighborhood fuzzy entropy. Inf Sci (N Y) 2022. [DOI: 10.1016/j.ins.2022.10.054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
4
|
Akinola OO, Ezugwu AE, Agushaka JO, Zitar RA, Abualigah L. Multiclass feature selection with metaheuristic optimization algorithms: a review. Neural Comput Appl 2022; 34:19751-19790. [PMID: 36060097 PMCID: PMC9424068 DOI: 10.1007/s00521-022-07705-4] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2022] [Accepted: 08/02/2022] [Indexed: 11/24/2022]
Abstract
Selecting relevant feature subsets is vital in machine learning, and multiclass feature selection is harder to perform since most classifications are binary. The feature selection problem aims at reducing the feature set dimension while maintaining the performance model accuracy. Datasets can be classified using various methods. Nevertheless, metaheuristic algorithms attract substantial attention to solving different problems in optimization. For this reason, this paper presents a systematic survey of literature for solving multiclass feature selection problems utilizing metaheuristic algorithms that can assist classifiers selects optima or near optima features faster and more accurately. Metaheuristic algorithms have also been presented in four primary behavior-based categories, i.e., evolutionary-based, swarm-intelligence-based, physics-based, and human-based, even though some literature works presented more categorization. Further, lists of metaheuristic algorithms were introduced in the categories mentioned. In finding the solution to issues related to multiclass feature selection, only articles on metaheuristic algorithms used for multiclass feature selection problems from the year 2000 to 2022 were reviewed about their different categories and detailed descriptions. We considered some application areas for some of the metaheuristic algorithms applied for multiclass feature selection with their variations. Popular multiclass classifiers for feature selection were also examined. Moreover, we also presented the challenges of metaheuristic algorithms for feature selection, and we identified gaps for further research studies.
Collapse
Affiliation(s)
- Olatunji O. Akinola
- School of Mathematics, Statistics, and Computer Science, University of KwaZulu-Natal, King Edward Avenue, Pietermaritzburg Campus, Pietermaritzburg, 3201 KwaZulu-Natal South Africa
| | - Absalom E. Ezugwu
- School of Mathematics, Statistics, and Computer Science, University of KwaZulu-Natal, King Edward Avenue, Pietermaritzburg Campus, Pietermaritzburg, 3201 KwaZulu-Natal South Africa
| | - Jeffrey O. Agushaka
- School of Mathematics, Statistics, and Computer Science, University of KwaZulu-Natal, King Edward Avenue, Pietermaritzburg Campus, Pietermaritzburg, 3201 KwaZulu-Natal South Africa
| | - Raed Abu Zitar
- Sorbonne Center of Artificial Intelligence, Sorbonne University-Abu Dhabi, 38044 Abu Dhabi, United Arab Emirates
| | - Laith Abualigah
- Hourani Center for Applied Scientific Research, Al-Ahliyya Amman University, Amman, 19328 Jordan
- Faculty of Inforsmation Technology, Middle East University, Amman, 11831 Jordan
| |
Collapse
|
5
|
Dogra V, Verma S, Kavita, Chatterjee P, Shafi J, Choi J, Ijaz MF. A Complete Process of Text Classification System Using State-of-the-Art NLP Models. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:1883698. [PMID: 35720939 PMCID: PMC9203176 DOI: 10.1155/2022/1883698] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Revised: 04/20/2022] [Accepted: 05/09/2022] [Indexed: 11/30/2022]
Abstract
With the rapid advancement of information technology, online information has been exponentially growing day by day, especially in the form of text documents such as news events, company reports, reviews on products, stocks-related reports, medical reports, tweets, and so on. Due to this, online monitoring and text mining has become a prominent task. During the past decade, significant efforts have been made on mining text documents using machine and deep learning models such as supervised, semisupervised, and unsupervised. Our area of the discussion covers state-of-the-art learning models for text mining or solving various challenging NLP (natural language processing) problems using the classification of texts. This paper summarizes several machine learning and deep learning algorithms used in text classification with their advantages and shortcomings. This paper would also help the readers understand various subtasks, along with old and recent literature, required during the process of text classification. We believe that readers would be able to find scope for further improvements in the area of text classification or to propose new techniques of text classification applicable in any domain of their interest.
Collapse
Affiliation(s)
- Varun Dogra
- School of Computer Science and Engineering, Lovely Professional University, Phagwara, Punjab, India
| | - Sahil Verma
- Department of Computer Science and Engineering, Chandigarh University, Mohali 140413, India
- Bio and Health Informatics Research Lab, Chandigarh University, Mohali 140413, India
| | - Kavita
- Department of Computer Science and Engineering, Chandigarh University, Mohali 140413, India
- Machine Learning and Data Science Research Lab, Chandigarh University, Mohali 140413, India
| | | | - Jana Shafi
- Department of Computer Science, College of Arts and Science, Prince Sattam Bin Abdul Aziz University, Wadi Ad-Dwasir 11991, Saudi Arabia
| | - Jaeyoung Choi
- School of Computing, Gachon University, Seongnam-si 13120, Republic of Korea
| | - Muhammad Fazal Ijaz
- Department of Intelligent Mechatronics Engineering, Sejong University, Seoul 05006, Republic of Korea
| |
Collapse
|
6
|
Multi-classification for high-dimensional data using probabilistic neural networks. JOURNAL OF RADIATION RESEARCH AND APPLIED SCIENCES 2022. [DOI: 10.1016/j.jrras.2022.05.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
7
|
Huang H, Wu N, Liang Y, Peng X, Jun S. SLNL: A novel method for gene selection and phenotype classification. INT J INTELL SYST 2022. [DOI: 10.1002/int.22844] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Affiliation(s)
- HaiHui Huang
- School of Information Engineering Shaoguan University Shaoguan China
| | - NaiQi Wu
- Macau Institute of Systems Engineering and Collaborative Laboratory of Intelligent Science and Systems Macau University of Science and Technology Macau China
| | - Yong Liang
- The Peng Cheng Laboratory Shenzhen China
| | - XinDong Peng
- School of Information Engineering Shaoguan University Shaoguan China
| | - Shu Jun
- School of Mathematics and Statistics Xi'an Jiaotong University Xi'an China
| |
Collapse
|
8
|
Shu L, Huang K, Jiang W, Wu W, Liu H. Feature selection using autoencoders with Bayesian methods to high-dimensional data. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2021. [DOI: 10.3233/jifs-211348] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
It is easy to lead to poor generalization in machine learning tasks using real-world data directly, since such data is usually high-dimensional dimensionality and limited. Through learning the low dimensional representations of high-dimensional data, feature selection can retain useful features for machine learning tasks. Using these useful features effectively trains machine learning models. Hence, it is a challenge for feature selection from high-dimensional data. To address this issue, in this paper, a hybrid approach consisted of an autoencoder and Bayesian methods is proposed for a novel feature selection. Firstly, Bayesian methods are embedded in the proposed autoencoder as a special hidden layer. This of doing is to increase the precision during selecting non-redundant features. Then, the other hidden layers of the autoencoder are used for non-redundant feature selection. Finally, compared with the mainstream approaches for feature selection, the proposed method outperforms them. We find that the way consisted of autoencoders and probabilistic correction methods is more meaningful than that of stacking architectures or adding constraints to autoencoders as regards feature selection. We also demonstrate that stacked autoencoders are more suitable for large-scale feature selection, however, sparse autoencoders are beneficial for a smaller number of feature selection. We indicate that the value of the proposed method provides a theoretical reference to analyze the optimality of feature selection.
Collapse
Affiliation(s)
- Lei Shu
- Chongqing Aerospace Polytechnic, Chongqing, China
| | - Kun Huang
- Urban Vocational College of Sichuan, P.R. China
| | - Wenhao Jiang
- Chongqing Aerospace Polytechnic, Chongqing, China
| | - Wenming Wu
- Chongqing Aerospace Polytechnic, Chongqing, China
| | - Hongling Liu
- Chongqing Aerospace Polytechnic, Chongqing, China
| |
Collapse
|
9
|
|
10
|
Komeili M, Armanfard N, Hatzinakos D. Multiview Feature Selection for Single-View Classification. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2021; 43:3573-3586. [PMID: 32305902 DOI: 10.1109/tpami.2020.2987013] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
In many real-world scenarios, data from multiple modalities (sources) are collected during a development phase. Such data are referred to as multiview data. While additional information from multiple views often improves the performance, collecting data from such additional views during the testing phase may not be desired due to the high costs associated with measuring such views or, unavailability of such additional views. Therefore, in many applications, despite having a multiview training data set, it is desired to do performance testing using data from only one view. In this paper, we present a multiview feature selection method that leverages the knowledge of all views and use it to guide the feature selection process in an individual view. We realize this via a multiview feature weighting scheme such that the local margins of samples in each view are maximized and similarities of samples to some reference points in different views are preserved. Also, the proposed formulation can be used for cross-view matching when the view-specific feature weights are pre-computed on an auxiliary data set. Promising results have been achieved on nine real-world data sets as well as three biometric recognition applications. On average, the proposed feature selection method has improved the classification error rate by 31 percent of the error rate of the state-of-the-art.
Collapse
|
11
|
Wang F, Wang X. A novel feature selection algorithm based on damping oscillation theory. PLoS One 2021; 16:e0255307. [PMID: 34358234 PMCID: PMC8345869 DOI: 10.1371/journal.pone.0255307] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2021] [Accepted: 07/13/2021] [Indexed: 11/18/2022] Open
Abstract
Feature selection is an important task in big data analysis and information retrieval processing. It reduces the number of features by removing noise, extraneous data. In this paper, one feature subset selection algorithm based on damping oscillation theory and support vector machine classifier is proposed. This algorithm is called the Maximum Kendall coefficient Maximum Euclidean Distance Improved Gray Wolf Optimization algorithm (MKMDIGWO). In MKMDIGWO, first, a filter model based on Kendall coefficient and Euclidean distance is proposed, which is used to measure the correlation and redundancy of the candidate feature subset. Second, the wrapper model is an improved grey wolf optimization algorithm, in which its position update formula has been improved in order to achieve optimal results. Third, the filter model and the wrapper model are dynamically adjusted by the damping oscillation theory to achieve the effect of finding an optimal feature subset. Therefore, MKMDIGWO achieves both the efficiency of the filter model and the high precision of the wrapper model. Experimental results on five UCI public data sets and two microarray data sets have demonstrated the higher classification accuracy of the MKMDIGWO algorithm than that of other four state-of-the-art algorithms. The maximum ACC value of the MKMDIGWO algorithm is at least 0.5% higher than other algorithms on 10 data sets.
Collapse
Affiliation(s)
- Fujun Wang
- School of Electronic and Information Engineering, Liaoning Technical University, Huludao, People’s Republic of China
- Key Laboratory of Preparation and Application of Environmentally Friendly Materials, Chinese Ministry of Education, Jilin Normal University, Changchun, People’s Republic of China
| | - Xing Wang
- School of Electronic and Information Engineering, Liaoning Technical University, Huludao, People’s Republic of China
| |
Collapse
|
12
|
Zhang S, Dang X, Nguyen D, Wilkins D, Chen Y. Estimating Feature-Label Dependence Using Gini Distance Statistics. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2021; 43:1947-1963. [PMID: 31869782 DOI: 10.1109/tpami.2019.2960358] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Identifying statistical dependence between the features and the label is a fundamental problem in supervised learning. This paper presents a framework for estimating dependence between numerical features and a categorical label using generalized Gini distance, an energy distance in reproducing kernel Hilbert spaces (RKHS). Two Gini distance based dependence measures are explored: Gini distance covariance and Gini distance correlation. Unlike Pearson covariance and correlation, which do not characterize independence, the above Gini distance based measures define dependence as well as independence of random variables. The test statistics are simple to calculate and do not require probability density estimation. Uniform convergence bounds and asymptotic bounds are derived for the test statistics. Comparisons with distance covariance statistics are provided. It is shown that Gini distance statistics converge faster than distance covariance statistics in the uniform convergence bounds, hence tighter upper bounds on both Type I and Type II errors. Moreover, the probability of Gini distance covariance statistic under-performing the distance covariance statistic in Type II error decreases to 0 exponentially with the increase of the sample size. Extensive experimental results are presented to demonstrate the performance of the proposed method.
Collapse
|
13
|
Masoumi PM, Sadjedi H. Trial-Specific Feature Performance on Single-Channel Auditory Mismatch Negativity Detection. IEEE J Biomed Health Inform 2021; 25:1062-1069. [PMID: 33108302 DOI: 10.1109/jbhi.2020.3034295] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Successful detection of uncommon events is vital in the survival of an organism. Specifically, the study of neuro-sensory detection lends itself widely to understanding the human brain. Mismatch Negativity (MMN) is an important Event-Related Potential (ERP) response to an oddball stimulus which is preceded by repeated homogeneous stimulation. MMN is associated with perceptual learning and medical diagnostics among other applications. Currently, MMN detection relies on visual inspection of ERPs by skilled clinicians which makes for a costly, slow and subjective tool. In this paper, we use MMN to quantify the discriminative abilities of healthy or diagnosed subjects. We introduce a novel algorithmic method to extract and select important trial-specific features for discriminating standard from deviant responses. We utilize machine learning and classification approaches to evaluate our novel model using single-subject trial data while minimizing the number of necessary selection features provided by statistical test parameters and Genetic Algorithm (GA). In this work, a large variety of methods with 27 subjects, hundreds of trials and electrode counts compete for the definitive discrimination of MMN events. Our model requires only one EEG channel, a single subject and as low as five deviant tones. The results show statistically significant detection improvement over the traditional methods while maximizing resource economy.
Collapse
|
14
|
Afshar M, Usefi H. High-dimensional feature selection for genomic datasets. Knowl Based Syst 2020. [DOI: 10.1016/j.knosys.2020.106370] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
15
|
Feature selection for hierarchical classification via joint semantic and structural information of labels. Knowl Based Syst 2020. [DOI: 10.1016/j.knosys.2020.105655] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
|
16
|
Wang Y, Li T. Local feature selection based on artificial immune system for classification. Appl Soft Comput 2020. [DOI: 10.1016/j.asoc.2019.105989] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
17
|
Huang H, Liang Y, Yang X, Hao Z. Pixel-Level Discrete Multiobjective Sampling for Image Matting. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2019; 28:3739-3751. [PMID: 30843834 DOI: 10.1109/tip.2019.2902830] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
In sampling-based matting methods, the alpha is estimated by choosing the best pair of foreground and background color samples. The lack of true samples is the major obstacle in obtaining high-quality alpha mattes. Regrettably, several proposed approaches did not address the conflicts among multiple sampling criteria and the effects of incomplete sample spaces. To address this issue, we propose a pixel-level discrete multiobjective sampling (PDMS) method. The color sampling process at each unknown pixel is formalized as a multiobjective optimization problem (MOP). The strength of PDMS includes its ability to minimize both color difference and spatial distance between unknown and known pixels, and its capacity to adaptively make trade-offs among conflicting sampling criteria. To mitigate the effects of incomplete sample spaces, the sample space is extended to complete known regions in PDMS, which means that the colors of all known pixels can be sampled, instead of mean colors of superpixels. Our experimental results show that PDMS collects a small set of samples while achieving smaller minimum absolute difference in alpha estimation. Moreover, PDMS implements pixel-level sampling by using the proposed multiobjective optimization algorithm to efficiently solve sampling MOPs. The PDMS-based matting method provides high-quality alpha mattes with sharp boundaries and thus outperforms those prior image matting methods in terms of gradient error.
Collapse
|
18
|
Connolly JF, Reilly JP, Fox-Robichaud A, Britz P, Blain-Moraes S, Sonnadara R, Hamielec C, Herrera-Díaz A, Boshra R. Development of a point of care system for automated coma prognosis: a prospective cohort study protocol. BMJ Open 2019; 9:e029621. [PMID: 31320356 PMCID: PMC6661548 DOI: 10.1136/bmjopen-2019-029621] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
INTRODUCTION Coma is a deep state of unconsciousness that can be caused by a variety of clinical conditions. Traditional tests for coma outcome prediction are based mainly on a set of clinical observations. Recently, certain event-related potentials (ERPs), which are transient electroencephalogram (EEG) responses to auditory, visual or tactile stimuli, have been introduced as useful predictors of a positive coma outcome (ie, emergence). However, such tests require the skills of clinical neurophysiologists, who are not commonly available in many clinical settings. Additionally, none of the current standard clinical approaches have sufficient predictive accuracies to provide definitive prognoses. OBJECTIVE The objective of this study is to develop improved machine learning procedures based on EEG/ERP for determining emergence from coma. METHODS AND ANALYSIS Data will be collected from 50 participants in coma. EEG/ERP data will be recorded for 24 consecutive hours at a maximum of five time points spanning 30 days from the date of recruitment to track participants' progression. The study employs paradigms designed to elicit brainstem potentials, middle-latency responses, N100, mismatch negativity, P300 and N400. In the case of patient emergence, data are recorded on that occasion to form an additional basis for comparison. A relevant data set will be developed from the testing of 20 healthy controls, each spanning a 15-hour recording period in order to formulate a baseline. Collected data will be used to develop an automated procedure for analysis and detection of various ERP components that are salient to prognosis. Salient features extracted from the ERP and resting-state EEG will be identified and combined to give an accurate indicator of prognosis. ETHICS AND DISSEMINATION This study is approved by the Hamilton Integrated Research Ethics Board (project number 4840). Results will be disseminated through peer-reviewed journal articles and presentations at scientific conferences. TRIAL REGISTRATION NUMBER NCT03826407.
Collapse
Affiliation(s)
- John F Connolly
- School of Biomedical Engineering, McMaster University, Hamilton, Ontario, Canada
- Vector Institute, MaRS Discovery District, Ontario, Canada
- Department of Psychology, Neuroscience & Behaviour, McMaster University, Hamilton, Ontario, Canada
- ARiEAL Research Centre, McMaster University, Hamilton, Ontario, Canada
- Department of Linguistics and Languages, McMaster University, Hamilton, Ontario, Canada
- Neuroscience Graduate Program, McMaster University, Hamilton, Ontario, Canada
| | - James P Reilly
- School of Biomedical Engineering, McMaster University, Hamilton, Ontario, Canada
- Vector Institute, MaRS Discovery District, Ontario, Canada
- ARiEAL Research Centre, McMaster University, Hamilton, Ontario, Canada
- Department of Electrical and Computer Engineering, McMaster University, Hamilton, Ontario, Canada
| | - Alison Fox-Robichaud
- Department of Medicine, McMaster University, Hamilton, Ontario, Canada
- Critical Care Medicine, Hamilton Health Sciences, Ontario, Canada
| | | | - Stefanie Blain-Moraes
- School of Physical and Occupational Therapy, McGill University, Montreal, Quebec, Canada
| | - Ranil Sonnadara
- School of Biomedical Engineering, McMaster University, Hamilton, Ontario, Canada
- Vector Institute, MaRS Discovery District, Ontario, Canada
- Department of Psychology, Neuroscience & Behaviour, McMaster University, Hamilton, Ontario, Canada
- ARiEAL Research Centre, McMaster University, Hamilton, Ontario, Canada
- Department of Linguistics and Languages, McMaster University, Hamilton, Ontario, Canada
- Neuroscience Graduate Program, McMaster University, Hamilton, Ontario, Canada
- Department of Surgery, McMaster University, Hamilton, Ontario, Canada
| | - Cindy Hamielec
- Department of Medicine, McMaster University, Hamilton, Ontario, Canada
- Critical Care Medicine, Hamilton Health Sciences, Ontario, Canada
| | - Adianes Herrera-Díaz
- ARiEAL Research Centre, McMaster University, Hamilton, Ontario, Canada
- Neuroscience Graduate Program, McMaster University, Hamilton, Ontario, Canada
| | - Rober Boshra
- School of Biomedical Engineering, McMaster University, Hamilton, Ontario, Canada
- Vector Institute, MaRS Discovery District, Ontario, Canada
- ARiEAL Research Centre, McMaster University, Hamilton, Ontario, Canada
| |
Collapse
|
19
|
Shukla AK, Singh P, Vardhan M. A hybrid framework for optimal feature subset selection. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2019. [DOI: 10.3233/jifs-169936] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Affiliation(s)
- Alok Kumar Shukla
- Department of Computer Science & Engineering, NIT Raipur, Chhattisgarh (C.G), India
| | - Pradeep Singh
- Department of Computer Science & Engineering, NIT Raipur, Chhattisgarh (C.G), India
| | - Manu Vardhan
- Department of Computer Science & Engineering, NIT Raipur, Chhattisgarh (C.G), India
| |
Collapse
|
20
|
Xu X, Liang T, Zhu J, Zheng D, Sun T. Review of classical dimensionality reduction and sample selection methods for large-scale data processing. Neurocomputing 2019. [DOI: 10.1016/j.neucom.2018.02.100] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
|
21
|
Granular maximum decision entropy-based monotonic uncertainty measure for attribute reduction. Int J Approx Reason 2019. [DOI: 10.1016/j.ijar.2018.10.014] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
22
|
Xie L, Yin M, Wang L, Tan F, Yin G. Matrix regression preserving projections for robust feature extraction. Knowl Based Syst 2018. [DOI: 10.1016/j.knosys.2018.07.028] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
23
|
Armanfard N, Komeili M, Reilly JP, Connolly JF. A Machine Learning Framework for Automatic and Continuous MMN Detection With Preliminary Results for Coma Outcome Prediction. IEEE J Biomed Health Inform 2018; 23:1794-1804. [PMID: 30369457 DOI: 10.1109/jbhi.2018.2877738] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Mismatch negativity (MMN) is a component of the event-related potential (ERP) that is elicited through an odd-ball paradigm. The existence of the MMN in a coma patient has a good correlation with coma emergence; however, this component can be difficult to detect. Previously, MMN detection was based on visual inspection of the averaged ERPs by a skilled clinician, a process that is expensive and not always feasible in practice. In this paper, we propose a practical machine learning (ML) based approach for detection of MMN component, thus, improving the accuracy of prediction of emergence from coma. Furthermore, the method can operate on an automatic and continuous basis thus alleviating the need for clinician involvement. The proposed method is capable of the MMN detection over intervals as short as two minutes. This finer time resolution enables identification of waxing and waning cycles of a conscious state. An auditory odd-ball paradigm was applied to 22 healthy subjects and 2 coma patients. A coma patient is tested by measuring the similarity of the patient's ERP responses with the aggregate healthy responses. Because the training process for measuring similarity requires only healthy subjects, the complexity and practicality of training procedure of the proposed method are greatly improved relative to training on coma patients directly. Since there are only two coma patients involved with this study, the results are reported on a very preliminary basis. Preliminary results indicate we can detect the MMN component with an accuracy of 92.7% on healthy subjects. The method successfully predicted emergence in both coma patients when conventional methods failed. The proposed method for collecting training data using exclusively healthy subjects is a novel approach that may prove useful in future, unrelated studies where ML methods are used.
Collapse
|
24
|
An S, Wang J, Wei J. Local-Nearest-Neighbors-Based Feature Weighting for Gene Selection. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018; 15:1538-1548. [PMID: 28600259 DOI: 10.1109/tcbb.2017.2712775] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
Selecting functional genes is essential for analyzing microarray data. Among many available feature (gene) selection approaches, the ones on the basis of the large margin nearest neighbor receive more attention due to their low computational costs and high accuracies in analyzing the high-dimensional data. Yet, there still exist some problems that hamper the existing approaches in sifting real target genes, including selecting erroneous nearest neighbors, high sensitivity to irrelevant genes, and inappropriate evaluation criteria. Previous pioneer works have partly addressed some of the problems, but none of them are capable of solving these problems simultaneously. In this paper, we propose a new local-nearest-neighbors-based feature weighting approach to alleviate the above problems. The proposed approach is based on the trick of locally minimizing the within-class distances and maximizing the between-class distances with the nearest neighbors rule. We further define a feature weight vector, and construct it by minimizing the cost function with a regularization term. The proposed approach can be applied naturally to the multi-class problems and does not require extra modification. Experimental results on the UCI and the open microarray data sets validate the effectiveness and efficiency of the new approach.
Collapse
|
25
|
Zhang C, Cheng J, Li C, Tian Q. Image-Specific Classification With Local and Global Discriminations. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2018; 29:4479-4486. [PMID: 28961130 DOI: 10.1109/tnnls.2017.2748952] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
Most image classification methods try to learn classifiers for each class using training images alone. Due to the interclass and intraclass variations, it would be more effective to take the testing images into consideration for classifier learning. In this brief, we propose a novel image-specific classification method by combing the local and global discriminations of training images. We adaptively train classifier for each testing image instead of generating classifiers for each class with training images alone. For each testing image, we first select its ${k}$ nearest neighbors in the training set with the corresponding labels for local classifier training. This helps to model the distinctive characters of each testing image. Besides, we also use all the training images for global discrimination modeling. The local and global discriminations are combined for final classification. In this way, we could not only model the specific character of each testing image but also avoid the local optimum by jointly considering all the training images. To evaluate the usefulness of the proposed image-specific classification with local and global discrimination (ISC-LG) method, we conduct image classification experiments on several public image data sets. The superior performances over other baseline methods prove the effectiveness of the proposed ISC-LG method.
Collapse
|
26
|
Zhang C, Cheng J, Li L, Li C, Tian Q. Object Categorization Using Class-Specific Representations. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2018; 29:4528-4534. [PMID: 29990030 DOI: 10.1109/tnnls.2017.2757497] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Object categorization refers to the task of automatically classifying objects based on the visual content. Existing approaches simply represent each image with the visual features without considering the specific characters of images within the same class. However, objects of the same class may exhibit unique characters, which should be represented accordingly. In this brief, we propose a novel class-specific representation strategy for object categorization. For each class, we first model the characters of images within the same class using Gaussian mixture model (GMM). We then represent each image by calculating the Euclidean distance and relative Euclidean distance between the image and the GMM model for each class. We concatenate the representations of each class for joint representation. In this way, we can represent an image by not only considering the visual contents but also combining the class-specific characters. Experiments on several public available data sets validate the superiority of the proposed class-specific representation method over well-established algorithms for object category predictions.
Collapse
|
27
|
Liu L, Wang Q, Adeli E, Zhang L, Zhang H, Shen D. Exploring diagnosis and imaging biomarkers of Parkinson's disease via iterative canonical correlation analysis based feature selection. Comput Med Imaging Graph 2018; 67:21-29. [PMID: 29702348 PMCID: PMC6374153 DOI: 10.1016/j.compmedimag.2018.04.002] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2017] [Revised: 03/30/2018] [Accepted: 04/02/2018] [Indexed: 10/17/2022]
Abstract
Parkinson's disease (PD) is a neurodegenerative disorder that progressively hampers the brain functions and leads to various movement and non-motor symptoms. However, it is difficult to attain early-stage PD diagnosis based on the subjective judgment of physicians in clinical routines. Therefore, automatic and accurate diagnosis of PD is highly demanded, so that the corresponding treatment can be implemented more appropriately. In this paper, we focus on finding the most discriminative features from different brain regions in PD through T1-weighted MR images, which can help the subsequent PD diagnosis. Specifically, we proposed a novel iterative canonical correlation analysis (ICCA) feature selection method, aiming at exploiting MR images in a more comprehensive manner and fusing features of different types into a common space. To state succinctly, we first extract the feature vectors from the gray matter and the white matter tissues separately, represented as insights of two different anatomical feature spaces for the subject's brain. The ICCA feature selection method aims at iteratively finding the optimal feature subset from two sets of features that have inherent high correlation with each other. In experiments we have conducted thorough investigations on the optimal feature set extracted by our ICCA method. We also demonstrate that using the proposed feature selection method, the PD diagnosis performance is further improved, and also outperforms many state-of-the-art methods.
Collapse
Affiliation(s)
- Luyan Liu
- Institute for Medical Imaging Technology, School of Biomedical Engineering, Shanghai Jiao Tong University, China.
| | - Qian Wang
- Institute for Medical Imaging Technology, School of Biomedical Engineering, Shanghai Jiao Tong University, China.
| | - Ehsan Adeli
- Department of Radiology and BRIC, University of North Carolina at Chapel Hill, United States.
| | - Lichi Zhang
- Institute for Medical Imaging Technology, School of Biomedical Engineering, Shanghai Jiao Tong University, China; Department of Radiology and BRIC, University of North Carolina at Chapel Hill, United States.
| | - Han Zhang
- Department of Radiology and BRIC, University of North Carolina at Chapel Hill, United States.
| | - Dinggang Shen
- Department of Radiology and BRIC, University of North Carolina at Chapel Hill, United States; Department of Brain and Cognitive Engineering, Korea University, Seoul, 02841, Republic of Korea.
| |
Collapse
|
28
|
Armanfard N, Reilly JP, Komeili M. Logistic Localized Modeling of the Sample Space for Feature Selection and Classification. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2018; 29:1396-1413. [PMID: 28333643 DOI: 10.1109/tnnls.2017.2676101] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Conventional feature selection algorithms assign a single common feature set to all regions of the sample space. In contrast, this paper proposes a novel algorithm for localized feature selection for which each region of the sample space is characterized by its individual distinct feature subset that may vary in size and membership. This approach can therefore select an optimal feature subset that adapts to local variations of the sample space, and hence offer the potential for improved performance. Feature subsets are computed by choosing an optimal coordinate space so that, within a localized region, within-class distances and between-class distances are, respectively, minimized and maximized. Distances are measured using a logistic function metric within the corresponding region. This enables the optimization process to focus on a localized region within the sample space. A local classification approach is utilized for measuring the similarity of a new input data point to each class. The proposed logistic localized feature selection (lLFS) algorithm is invariant to the underlying probability distribution of the data; hence, it is appropriate when the data are distributed on a nonlinear or disjoint manifold. lLFS is efficiently formulated as a joint convex/increasing quasi-convex optimization problem with a unique global optimum point. The method is most applicable when the number of available training samples is small. The performance of the proposed localized method is successfully demonstrated on a large variety of data sets. We demonstrate that the number of features selected by the lLFS method saturates at the number of available discriminative features. In addition, we have shown that the Vapnik-Chervonenkis dimension of the localized classifier is finite. Both these factors suggest that the lLFS method is insensitive to the overfitting issue, relative to other methods.
Collapse
|
29
|
Gong YJ, Zhou Y. Differential Evolutionary Superpixel Segmentation. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2018; 27:1390-1404. [PMID: 29990063 DOI: 10.1109/tip.2017.2778569] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Superpixel segmentation has been of increasing importance in many computer vision applications recently. To handle the problem, most state-of-the-art algorithms either adopt a local color variance model or a local optimization algorithm. This paper develops a new approach, named differential evolutionary superpixels, which is able to optimize the global properties of segmentation by means of a global optimizer. We design a comprehensive objective function aggregating within-superpixel error, boundary gradient, and a regularization term. Minimizing the within-superpixel error enforces the homogeneity of superpixels. In addition, the introduction of boundary gradient drives the superpixel boundaries to capture the natural image boundaries, so as to make each superpixel overlaps with a single object. The regularizer further encourages producing similarly sized superpixels that are friendly to human vision. The optimization is then accomplished by a powerful global optimizer-differential evolution. The algorithm constantly evolves the superpixels by mimicking the process of natural evolution, while using a linear complexity to the image size. Experimental results and comparisons with eleven state-of-the-art peer algorithms verify the promising performance of our algorithm.
Collapse
|
30
|
|
31
|
|
32
|
Bocklitz TW, Guo S, Ryabchykov O, Vogler N, Popp J. Raman Based Molecular Imaging and Analytics: A Magic Bullet for Biomedical Applications!? Anal Chem 2015; 88:133-51. [DOI: 10.1021/acs.analchem.5b04665] [Citation(s) in RCA: 72] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Affiliation(s)
- Thomas W. Bocklitz
- Institute
of Physical Chemistry and Abbe Center of Photonics, Friedrich Schiller University Jena, Helmholtzweg 4, 07743 Jena, Germany
- Leibniz Institute of Photonic Technology (IPHT), Albert-Einstein-Strasse 9, 07745 Jena, Germany
| | - Shuxia Guo
- Institute
of Physical Chemistry and Abbe Center of Photonics, Friedrich Schiller University Jena, Helmholtzweg 4, 07743 Jena, Germany
- Leibniz Institute of Photonic Technology (IPHT), Albert-Einstein-Strasse 9, 07745 Jena, Germany
- InfectoGnostics
Forschungscampus Jena e.V., Zentrum für Angewandte Forschung, Philosophenweg 7, 07743 Jena, Germany
| | - Oleg Ryabchykov
- Institute
of Physical Chemistry and Abbe Center of Photonics, Friedrich Schiller University Jena, Helmholtzweg 4, 07743 Jena, Germany
- Leibniz Institute of Photonic Technology (IPHT), Albert-Einstein-Strasse 9, 07745 Jena, Germany
- InfectoGnostics
Forschungscampus Jena e.V., Zentrum für Angewandte Forschung, Philosophenweg 7, 07743 Jena, Germany
| | - Nadine Vogler
- Institute
of Physical Chemistry and Abbe Center of Photonics, Friedrich Schiller University Jena, Helmholtzweg 4, 07743 Jena, Germany
- Leibniz Institute of Photonic Technology (IPHT), Albert-Einstein-Strasse 9, 07745 Jena, Germany
- InfectoGnostics
Forschungscampus Jena e.V., Zentrum für Angewandte Forschung, Philosophenweg 7, 07743 Jena, Germany
| | - Jürgen Popp
- Institute
of Physical Chemistry and Abbe Center of Photonics, Friedrich Schiller University Jena, Helmholtzweg 4, 07743 Jena, Germany
- Leibniz Institute of Photonic Technology (IPHT), Albert-Einstein-Strasse 9, 07745 Jena, Germany
- InfectoGnostics
Forschungscampus Jena e.V., Zentrum für Angewandte Forschung, Philosophenweg 7, 07743 Jena, Germany
| |
Collapse
|