1
|
An S, Guo X, Wang C, Guo G, Dai J. A Soft Neighborhood Rough Set Model and Its Applications. Inf Sci (N Y) 2022. [DOI: 10.1016/j.ins.2022.12.074] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
|
2
|
Dai J, Wang W, Zhang C, Qu S. Semi-supervised attribute reduction via attribute indiscernibility. INT J MACH LEARN CYB 2022. [DOI: 10.1007/s13042-022-01708-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022]
|
3
|
Liang S, Liu Z, You D, Pan W, Zhao J, Cao Y. PSO-NRS: an online group feature selection algorithm based on PSO multi-objective optimization. APPL INTELL 2022. [DOI: 10.1007/s10489-022-04275-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
|
4
|
Streaming Feature Selection via Graph Diffusion. Inf Sci (N Y) 2022. [DOI: 10.1016/j.ins.2022.10.087] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
5
|
Sun L, Wang X, Ding W, Xu J. TSFNFR: Two-stage fuzzy neighborhood-based feature reduction with binary whale optimization algorithm for imbalanced data classification. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.109849] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
6
|
Zhang D, Zhu P. Variable radius neighborhood rough sets and attribute reduction. Int J Approx Reason 2022. [DOI: 10.1016/j.ijar.2022.08.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
7
|
|
8
|
Zhang X, Jiang Z, Xu W. Feature selection using a weighted method in interval-valued decision information systems. APPL INTELL 2022. [DOI: 10.1007/s10489-022-03987-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
9
|
Multi-Label Attribute Reduction Based on Neighborhood Multi-Target Rough Sets. Symmetry (Basel) 2022. [DOI: 10.3390/sym14081652] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
The rough set model has two symmetry approximations called upper approximation and lower approximation, which correspond to a concept’s intension and extension, respectively. Multi-label learning enforces the rough set model, which wants to be applied considering the correlations among labels, while the target concept should not be limited to only one. This paper proposes a multi-target model considering label correlation (Neighborhood Multi-Target Rough Sets, NMTRS) and proposes an attribute reduction approach based on NMTRS. First, some definitions of NMTRS are introduced. Second, some properties of NMTRS are discussed. Third, some discussion about the attribute significance measure is given. Fourth, the attribute reduction approaches based on NMTRS are proposed. Finally, the efficiency and validity of the designed algorithms are verified by experiments. The experiments show that our algorithm shows considerable performance when compared to state-of-the-art approaches.
Collapse
|
10
|
Zaman EAK, Mohamed A, Ahmad A. Feature selection for online streaming high-dimensional data: A state-of-the-art review. Appl Soft Comput 2022. [DOI: 10.1016/j.asoc.2022.109355] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
11
|
Online group streaming feature selection using entropy-based uncertainty measures for fuzzy neighborhood rough sets. COMPLEX INTELL SYST 2022. [DOI: 10.1007/s40747-022-00763-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
AbstractOnline group streaming feature selection, as an essential online processing method, can deal with dynamic feature selection tasks by considering the original group structure information of the features. Due to the fuzziness and uncertainty of the feature stream, some existing methods are unstable and yield low predictive accuracy. To address these issues, this paper presents a novel online group streaming feature selection method (FNE-OGSFS) using fuzzy neighborhood entropy-based uncertainty measures. First, a separability measure integrating the dependency degree with the coincidence degree is proposed and introduced into the fuzzy neighborhood rough sets model to define a new fuzzy neighborhood entropy. Second, inspired by both algebra and information views, some fuzzy neighborhood entropy-based uncertainty measures are investigated and some properties are derived. Furthermore, the optimal features in the group are selected to flow into the feature space according to the significance of features, and the features with interactions are left. Then, all selected features are re-evaluated by the Lasso model to discard the redundant features. Finally, an online group streaming feature selection algorithm is designed. Experimental results compared with eight representative methods on thirteen datasets show that FNE-OGSFS can achieve better comprehensive performance.
Collapse
|
12
|
ASFS: A novel streaming feature selection for multi-label data based on neighborhood rough set. APPL INTELL 2022. [DOI: 10.1007/s10489-022-03366-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
|
13
|
Sun L, Zhang J, Ding W, Xu J. Feature reduction for imbalanced data classification using similarity-based feature clustering with adaptive weighted K-nearest neighbors. Inf Sci (N Y) 2022. [DOI: 10.1016/j.ins.2022.02.004] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
|
14
|
Liang S, Liu Z, You D, Pan W. Online multi-label stream feature selection based on neighborhood rough set with missing labels. Pattern Anal Appl 2022. [DOI: 10.1007/s10044-022-01067-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
15
|
|
16
|
|
17
|
Lu Y, Song J, Wang P, Xu T. Label-specific guidance for efficiently searching reduct. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2022. [DOI: 10.3233/jifs-213112] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
In the era of big data for exploring attribute reduction/rough set-based feature selection related problems, to design efficient strategies for deriving reducts and then reduce the dimensions of data, two fundamental perspectives of Granular Computing may be taken into account: breaking up the whole into pieces and gathering parts into a whole. From this point of view, a novel strategy named label-specific guidance is introduced into the process of searching reduct. Given a formal description of attribute reduction, by considering the corresponding constraint, we divide it into several label-specific based constraints. Consequently, a sequence of these label-specific based constraints can be obtained, it follows that the reduct related to the previous label-specific based constraint may have guidance on the computation of that related to the subsequent label-specific based constraint. The thinking of this label-specific guidance runs through the whole process of searching reduct until the reduct over the whole universe is derived. Compared with five state-of-the-art algorithms over 20 data sets, the experimental results demonstrate that our proposed acceleration strategy can not only significantly accelerate the process of searching reduct but also offer justifiable performance in the task of classification. This study suggests a new trend concerning the problem of quickly deriving reduct.
Collapse
Affiliation(s)
- Yu Lu
- School of Computer, Jiangsu University of Science and Technology, Zhenjiang, Jiangsu, China
| | - Jingjing Song
- School of Computer, Jiangsu University of Science and Technology, Zhenjiang, Jiangsu, China
| | - Pingxin Wang
- School of Computer, Jiangsu University of Science and Technology, Zhenjiang, Jiangsu, China
| | - Taihua Xu
- School of Computer, Jiangsu University of Science and Technology, Zhenjiang, Jiangsu, China
| |
Collapse
|
18
|
Li S, Zhang K, Li Y, Wang S, Zhang S. Online streaming feature selection based on neighborhood rough set. Appl Soft Comput 2021. [DOI: 10.1016/j.asoc.2021.108025] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
|
19
|
|
20
|
Chen Y, Wang P, Yang X, Mi J, Liu D. Granular ball guided selector for attribute reduction. Knowl Based Syst 2021. [DOI: 10.1016/j.knosys.2021.107326] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
21
|
|
22
|
|
23
|
Zhong W, Chen X, Nie F, Huang JZ. Adaptive discriminant analysis for semi-supervised feature selection. Inf Sci (N Y) 2021. [DOI: 10.1016/j.ins.2021.02.035] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
24
|
Zhou P, Wang N, Zhao S. Online group streaming feature selection considering feature interaction. Knowl Based Syst 2021. [DOI: 10.1016/j.knosys.2021.107157] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
|
25
|
Outlier detection based on weighted neighbourhood information network for mixed-valued datasets. Inf Sci (N Y) 2021. [DOI: 10.1016/j.ins.2021.02.045] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
26
|
Paul D, Jain A, Saha S, Mathew J. Multi-objective PSO based online feature selection for multi-label classification. Knowl Based Syst 2021. [DOI: 10.1016/j.knosys.2021.106966] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
|
27
|
Qian W, Xiong C, Wang Y. A ranking-based feature selection for multi-label classification with fuzzy relative discernibility. Appl Soft Comput 2021. [DOI: 10.1016/j.asoc.2020.106995] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
28
|
Liu X, Zhou Y, Zhao H. Robust hierarchical feature selection driven by data and knowledge. Inf Sci (N Y) 2021. [DOI: 10.1016/j.ins.2020.11.003] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
29
|
Venkatesh B, Anuradha J. Fuzzy Rank Based Parallel Online Feature Selection Method using Multiple Sliding Windows. OPEN COMPUTER SCIENCE 2021. [DOI: 10.1515/comp-2020-0169] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Abstract
Nowadays, in real-world applications, the dimensions of data are generated dynamically, and the traditional batch feature selection methods are not suitable for streaming data. So, online streaming feature selection methods gained more attention but the existing methods had demerits like low classification accuracy, fails to avoid redundant and irrelevant features, and a higher number of features selected. In this paper, we propose a parallel online feature selection method using multiple sliding-windows and fuzzy fast-mRMR feature selection analysis, which is used for selecting minimum redundant and maximum relevant features, and also overcomes the drawbacks of existing online streaming feature selection methods. To increase the performance speed of the proposed method parallel processing is used. To evaluate the performance of the proposed online feature selection method k-NN, SVM, and Decision Tree Classifiers are used and compared against the state-of-the-art online feature selection methods. Evaluation metrics like Accuracy, Precision, Recall, F1-Score are used on benchmark datasets for performance analysis. From the experimental analysis, it is proved that the proposed method has achieved more than 95% accuracy for most of the datasets and performs well over other existing online streaming feature selection methods and also, overcomes the drawbacks of the existing methods.
Collapse
Affiliation(s)
- B. Venkatesh
- SCOPE, Vellore Institute of Technology , Vellore , India
| | - J. Anuradha
- SCOPE, Vellore Institute of Technology , Vellore , India
| |
Collapse
|
30
|
Abstract
Data streams are ubiquitous and related to the proliferation of low-cost mobile devices, sensors, wireless networks and the Internet of Things. While it is well known that complex phenomena are not stationary and exhibit a concept drift when observed for a sufficiently long time, relatively few studies have addressed the related problem of feature drift. In this paper, a variation of the QuickReduct algorithm suitable to process data streams is proposed and tested: it builds an evolving reduct that dynamically selects the relevant features in the stream, removing the redundant ones and adding the newly relevant ones as soon as they become such. Tests on five publicly available datasets with an artificially injected drift have confirmed the effectiveness of the proposed method.
Collapse
|
31
|
New Online Streaming Feature Selection Based on Neighborhood Rough Set for Medical Data. Symmetry (Basel) 2020. [DOI: 10.3390/sym12101635] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
Not all features in many real-world applications, such as medical diagnosis and fraud detection, are available from the start. They are formed and individually flow over time. Online streaming feature selection (OSFS) has recently attracted much attention due to its ability to select the best feature subset with growing features. Rough set theory is widely used as an effective tool for feature selection, specifically the neighborhood rough set. However, the two main neighborhood relations, namely k-neighborhood and neighborhood, cannot efficiently deal with the uneven distribution of data. The traditional method of dependency calculation does not take into account the structure of neighborhood covering. In this study, a novel neighborhood relation combined with k-neighborhood and neighborhood relations is initially defined. Then, we propose a weighted dependency degree computation method considering the structure of the neighborhood relation. In addition, we propose a new OSFS approach named OSFS-KW considering the challenge of learning class imbalanced data. OSFS-KW has no adjustable parameters and pretraining requirements. The experimental results on 19 datasets demonstrate that OSFS-KW not only outperforms traditional methods but, also, exceeds the state-of-the-art OSFS approaches.
Collapse
|
32
|
Multilabel feature selection using ML-ReliefF and neighborhood mutual information for multilabel neighborhood decision systems. Inf Sci (N Y) 2020. [DOI: 10.1016/j.ins.2020.05.102] [Citation(s) in RCA: 42] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
|
33
|
Kernelized fuzzy rough sets based online streaming feature selection for large-scale hierarchical classification. APPL INTELL 2020. [DOI: 10.1007/s10489-020-01863-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
|
34
|
Multi-objective feature selection using hybridization of a genetic algorithm and direct multisearch for key quality characteristic selection. Inf Sci (N Y) 2020. [DOI: 10.1016/j.ins.2020.03.032] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
35
|
Jahan MS, Amjad A, Qamar U, Riaz MT, Ayub K. A Novel Approach for Ensemble Feature Selection Using Clustering with Automatic Threshold. COMMUNICATIONS IN COMPUTER AND INFORMATION SCIENCE 2020:390-401. [DOI: 10.1007/978-3-030-62554-2_28] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/01/2023]
|
36
|
|