1
|
Li Y, Zhang B, Mo H, Hu J, Liu Y, Tan X. Unsupervised attribute reduction based on variable precision weighted neighborhood dependency. iScience 2024; 27:111270. [PMID: 39660055 PMCID: PMC11629270 DOI: 10.1016/j.isci.2024.111270] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2024] [Revised: 08/18/2024] [Accepted: 10/24/2024] [Indexed: 12/12/2024] Open
Abstract
Neighborhood rough set (NRS) have been successfully applied to attribute reduction (AR). However, most current methods of AR based on NRS are supervised or semi-supervised. This limits their ability to process data without decision information. When granulating data samples, NRS considers only the number of samples within the neighborhood radius. It does not consider distribution information between samples, which can result in the loss of original data information. To address the aforementioned issue, we propose an unsupervised attribute reduction (UAR) strategy based on variable precision weighted neighborhood dependency (VPWND) (UAR_VPWND). We compare algorithm UAR_VPWND to existing classical UAR algorithms using public datasets. The experimental results show that algorithm UAR_VPWND can select fewer attributes to maintain or improve the performance of clustering learning algorithms.
Collapse
Affiliation(s)
- Yi Li
- Institute of Computer Application Research, Sichuan Minzu College, Kangding 626001, China
| | - Benwen Zhang
- Institute of Computer Application Research, Sichuan Minzu College, Kangding 626001, China
| | - Hongming Mo
- Institute of Computer Application Research, Sichuan Minzu College, Kangding 626001, China
| | - Jiancheng Hu
- College of Applied Mathematics, Chengdu University of Information Technology, Chengdu 610225, China
| | - Yuncheng Liu
- School of Mathematics, Southwest Minzu University, Chengdu 610041, China
| | - Xingqiang Tan
- Institute of Computer Application Research, Sichuan Minzu College, Kangding 626001, China
| |
Collapse
|
2
|
Sheikhpour R. A local spline regression-based framework for semi-supervised sparse feature selection. Knowl Based Syst 2023. [DOI: 10.1016/j.knosys.2023.110265] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
|
3
|
Semi-supervised feature selection for partially labeled mixed-type data based on multi-criteria measure approach. Int J Approx Reason 2022. [DOI: 10.1016/j.ijar.2022.11.020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022]
|
4
|
Semi-supervised feature selection based on pairwise constraint-guided dual space latent representation learning and double sparse graphs discriminant. APPL INTELL 2022. [DOI: 10.1007/s10489-022-04040-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
5
|
Unsupervised feature selection based on incremental forward iterative Laplacian score. Artif Intell Rev 2022; 56:4077-4112. [PMID: 36160366 PMCID: PMC9484723 DOI: 10.1007/s10462-022-10274-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Feature selection facilitates intelligent information processing, and the unsupervised learning of feature selection has become important. In terms of unsupervised feature selection, the Laplacian score (LS) provides a powerful measurement and optimization method, and good performance has been achieved using the recent forward iterative Laplacian score (FILS) algorithm. However, there is still room for advancement. The aim of this paper is to improve the FILS algorithm, and thus, feature significance (SIG) is mainly introduced to develop a high-quality selection method, i.e., the incremental forward iterative Laplacian score (IFILS) algorithm. Based on the modified LS, the metric difference in the incremental feature process motivates SIG. Therefore, SIG offers a dynamic characterization by considering initial and terminal states, and it promotes the current FILS measurement on only the terminal state. Then, both the modified LS and integrated SIG acquire granulation nonmonotonicity and uncertainty, especially on incremental feature chains, and the corresponding verification is achieved by completing examples and experiments. Furthermore, a SIG-based incremental criterion of minimum selection is designed to choose optimization features, and thus, the IFILS algorithm is naturally formulated to implement unsupervised feature selection. Finally, an in-depth comparison of the IFILS algorithm with the FILS algorithm is achieved using data experiments on multiple datasets, including a nominal dataset of COVID-19 surveillance. As validated by the experimental results, the IFILS algorithm outperforms the FILS algorithm and achieves better classification performance.
Collapse
|
6
|
Chen Z, Xu J, Peng T, Yang C. Graph Convolutional Network-Based Method for Fault Diagnosis Using a Hybrid of Measurement and Prior Knowledge. IEEE TRANSACTIONS ON CYBERNETICS 2022; 52:9157-9169. [PMID: 33710969 DOI: 10.1109/tcyb.2021.3059002] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Deep-neural network-based fault diagnosis methods have been widely used according to the state of the art. However, a few of them consider the prior knowledge of the system of interest, which is beneficial for fault diagnosis. To this end, a new fault diagnosis method based on the graph convolutional network (GCN) using a hybrid of the available measurement and the prior knowledge is proposed. Specifically, this method first uses the structural analysis (SA) method to prediagnose the fault and then converts the prediagnosis results into the association graph. Then, the graph and measurements are sent into the GCN model, in which a weight coefficient is introduced to adjust the influence of measurements and the prior knowledge. In this method, the graph structure of GCN is used as a joint point to connect SA based on the model and GCN based on data. In order to verify the effectiveness of the proposed method, an experiment is carried out. The results show that the proposed method, which combines the advantages of both SA and GCN, has better diagnosis results than the existing methods based on common evaluation indicators.
Collapse
|
7
|
Azadifar S, Rostami M, Berahmand K, Moradi P, Oussalah M. Graph-based relevancy-redundancy gene selection method for cancer diagnosis. Comput Biol Med 2022; 147:105766. [PMID: 35779479 DOI: 10.1016/j.compbiomed.2022.105766] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2022] [Revised: 06/12/2022] [Accepted: 06/18/2022] [Indexed: 11/26/2022]
Abstract
Nowadays, microarray data processing is one of the most important applications in molecular biology for cancer diagnosis. A major task in microarray data processing is gene selection, which aims to find a subset of genes with the least inner similarity and most relevant to the target class. Removing unnecessary, redundant, or noisy data reduces the data dimensionality. This research advocates a graph theoretic-based gene selection method for cancer diagnosis. Both unsupervised and supervised modes use well-known and successful social network approaches such as the maximum weighted clique criterion and edge centrality to rank genes. The suggested technique has two goals: (i) to maximize the relevancy of the chosen genes with the target class and (ii) to reduce their inner redundancy. A maximum weighted clique is chosen in a repetitive way in each iteration of this procedure. The appropriate genes are then chosen from among the existing features in this maximum clique using edge centrality and gene relevance. In the experiment, several datasets consisting of Colon, Leukemia, SRBCT, Prostate Tumor, and Lung Cancer, with different properties, are used to demonstrate the efficacy of the developed model. Our performance is compared to that of renowned filter-based gene selection approaches for cancer diagnosis whose results demonstrate a clear superiority.
Collapse
Affiliation(s)
- Saeid Azadifar
- Department of Computer Engineering, University of Khajeh Nasir Toosi, Tehran, Iran
| | - Mehrdad Rostami
- Centre for Machine Vision and Signal Processing, University of Oulu, Oulu, Finland.
| | - Kamal Berahmand
- School of Computer Science, Faculty of Science, Queensland University of Technology (QUT), Brisbane, Australia
| | - Parham Moradi
- Department of Computer Engineering, University of Kurdistan, Sanandaj, Iran
| | - Mourad Oussalah
- Centre for Machine Vision and Signal Processing, University of Oulu, Oulu, Finland; Research Unit of Medical Imaging, Physics, and Technology, Faculty of Medicine, University of Oulu, Finland
| |
Collapse
|
8
|
Information gain-based semi-supervised feature selection for hybrid data. APPL INTELL 2022. [DOI: 10.1007/s10489-022-03770-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
9
|
|
10
|
Robust dual-graph regularized and minimum redundancy based on self-representation for semi-supervised feature selection. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2022.03.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
11
|
Cheng C, Zhou Y, You B, Liu Y, Fei G, Yang L, Dai Y. Multiview Feature Fusion Representation for Interictal Epileptiform Spikes Detection. Int J Neural Syst 2022; 32:2250014. [PMID: 35272587 DOI: 10.1142/s0129065722500149] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Interictal epileptiform spikes (IES) of scalp electroencephalogram (EEG) signals have a strong relation with the epileptogenic region. Since IES are highly unlikely to be detected in scalp EEG signals, the primary diagnosis depends heavily on the visual evaluation of IES. However, visual inspection of EEG signals, the standard IES detection procedure is time-consuming, highly subjective, and error-prone. Furthermore, the highly complex, nonlinear, and nonstationary characteristics of EEG signals lead to the incomplete representation of EEG signals in existing computer-aided methods and consequently unsatisfactory detection performance. Therefore, a novel multiview feature fusion representation (MVFFR) method was developed and combined with a robustness classifier to detect EEG signals with/without IES. MVFFR comprises two steps: First, temporal, frequency, temporal-frequency, spatial, and nonlinear domain features are transformed by the IES to express the latent information effectively. Second, the unsupervised infinite feature-selection method determines the most distinct feature fusion representations. Experimental results using a balanced dataset of six patients showed that MVFFR achieved the optimal detection performance (accuracy: 89.27%, sensitivity: 89.01%, specificity: 89.54%, and precision: 89.82%) compared with other feature ranking methods, and the MVFFR-related method were complementary and indispensable. Additionally, in an independent test, MVFFR maintained excellent generalization capacity with a false detection rate per minute of 0.15 on the unbalanced dataset of one patient.
Collapse
Affiliation(s)
- Chenchen Cheng
- School of Mechanical and Power Engineering, Harbin University of Science and Technology, Harbin 150080, P. R. China.,Suzhou Institute of Biomedical Engineering and Technology, Chinese Academy of Sciences, Suzhou 215163, P. R. China.,Heilongjiang Provincial Key Laboratory, of Complex Intelligent System and Integration, Harbin University of Science and Technology, Harbin 150080, P. R. China
| | - Yuanfeng Zhou
- Department of Neurology, Children's Hospital of Fudan University, Shanghai 200000, P. R. China
| | - Bo You
- School of Mechanical and Power Engineering, Harbin University of Science and Technology, Harbin 150080, P. R. China.,Heilongjiang Provincial Key Laboratory, of Complex Intelligent System and Integration, Harbin University of Science and Technology, Harbin 150080, P. R. China.,School of Automation, Harbin University of Science and Technology, Harbin 150080, P. R. China
| | - Yan Liu
- Suzhou Institute of Biomedical Engineering and Technology, Chinese Academy of Sciences, Suzhou 215163, P. R. China.,Jinan Guoke Medical Engineering Technology Development Co., Ltd, Jinan 250000, P. R. China
| | - Gao Fei
- Department of Radiology, Shandong Provincial Hospital, Cheeloo College of Medicine, Shandong University Jinan, P. R. China
| | - Liling Yang
- Department of Neurology, Shandong Provincial Hospital, Affiliated to Shandong First Medical University, Jinan 250021, P. R. China
| | - Yakang Dai
- Suzhou Institute of Biomedical Engineering and Technology, Chinese Academy of Sciences, Suzhou 215163, P. R. China.,Jinan Guoke Medical Engineering Technology Development Co., Ltd, Jinan 250000, P. R. China
| |
Collapse
|
12
|
Dai J, Liu Q. Semi-supervised attribute reduction for interval data based on misclassification cost. INT J MACH LEARN CYB 2021. [DOI: 10.1007/s13042-021-01483-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
13
|
Wan J, Chen H, Yuan Z, Li T, Yang X, Sang B. A novel hybrid feature selection method considering feature interaction in neighborhood rough set. Knowl Based Syst 2021. [DOI: 10.1016/j.knosys.2021.107167 10.1016/j.knosys.2021.107167] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
|
14
|
Wan J, Chen H, Yuan Z, Li T, Yang X, Sang B. A novel hybrid feature selection method considering feature interaction in neighborhood rough set. Knowl Based Syst 2021. [DOI: 10.1016/j.knosys.2021.107167] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
|
15
|
|
16
|
Discrimination of Tomato Maturity Using Hyperspectral Imaging Combined with Graph-Based Semi-supervised Method Considering Class Probability Information. FOOD ANAL METHOD 2021. [DOI: 10.1007/s12161-020-01955-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|