2
|
Zhang X, Fan M, Wang D, Zhou P, Tao D. Top-k Feature Selection Framework Using Robust 0-1 Integer Programming. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2021; 32:3005-3019. [PMID: 32735538 DOI: 10.1109/tnnls.2020.3009209] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Feature selection (FS), which identifies the relevant features in a data set to facilitate subsequent data analysis, is a fundamental problem in machine learning and has been widely studied in recent years. Most FS methods rank the features in order of their scores based on a specific criterion and then select the k top-ranked features, where k is the number of desired features. However, these features are usually not the top- k features and may present a suboptimal choice. To address this issue, we propose a novel FS framework in this article to select the exact top- k features in the unsupervised, semisupervised, and supervised scenarios. The new framework utilizes the l0,2 -norm as the matrix sparsity constraint rather than its relaxations, such as the l1,2 -norm. Since the l0,2 -norm constrained problem is difficult to solve, we transform the discrete l0,2 -norm-based constraint into an equivalent 0-1 integer constraint and replace the 0-1 integer constraint with two continuous constraints. The obtained top- k FS framework with two continuous constraints is theoretically equivalent to the l0,2 -norm constrained problem and can be optimized by the alternating direction method of multipliers (ADMM). Unsupervised and semisupervised FS methods are developed based on the proposed framework, and extensive experiments on real-world data sets are conducted to demonstrate the effectiveness of the proposed FS framework.
Collapse
|
4
|
Gui J, Sun Z, Ji S, Tao D, Tan T. Feature Selection Based on Structured Sparsity: A Comprehensive Study. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2017; 28:1490-1507. [PMID: 28287983 DOI: 10.1109/tnnls.2016.2551724] [Citation(s) in RCA: 126] [Impact Index Per Article: 15.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Feature selection (FS) is an important component of many pattern recognition tasks. In these tasks, one is often confronted with very high-dimensional data. FS algorithms are designed to identify the relevant feature subset from the original features, which can facilitate subsequent analysis, such as clustering and classification. Structured sparsity-inducing feature selection (SSFS) methods have been widely studied in the last few years, and a number of algorithms have been proposed. However, there is no comprehensive study concerning the connections between different SSFS methods, and how they have evolved. In this paper, we attempt to provide a survey on various SSFS methods, including their motivations and mathematical representations. We then explore the relationship among different formulations and propose a taxonomy to elucidate their evolution. We group the existing SSFS methods into two categories, i.e., vector-based feature selection (feature selection based on lasso) and matrix-based feature selection (feature selection based on lr,p-norm). Furthermore, FS has been combined with other machine learning algorithms for specific applications, such as multitask learning, multilabel learning, multiview learning, classification, and clustering. This paper not only compares the differences and commonalities of these methods based on regression and regularization strategies, but also provides useful guidelines to practitioners working in related fields to guide them how to do feature selection.
Collapse
|
5
|
Cao X, Zhang C, Zhou C, Fu H, Foroosh H. Constrained Multi-View Video Face Clustering. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2015; 24:4381-4393. [PMID: 26259245 DOI: 10.1109/tip.2015.2463223] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
In this paper, we focus on face clustering in videos. To promote the performance of video clustering by multiple intrinsic cues, i.e., pairwise constraints and multiple views, we propose a constrained multi-view video face clustering method under a unified graph-based model. First, unlike most existing video face clustering methods which only employ these constraints in the clustering step, we strengthen the pairwise constraints through the whole video face clustering framework, both in sparse subspace representation and spectral clustering. In the constrained sparse subspace representation, the sparse representation is forced to explore unknown relationships. In the constrained spectral clustering, the constraints are used to guide for learning more reasonable new representations. Second, our method considers both the video face pairwise constraints as well as the multi-view consistence simultaneously. In particular, the graph regularization enforces the pairwise constraints to be respected and the co-regularization penalizes the disagreement among different graphs of multiple views. Experiments on three real-world video benchmark data sets demonstrate the significant improvements of our method over the state-of-the-art methods.
Collapse
|
6
|
Sun Z, Wang L, Tan T. Ordinal feature selection for iris and palmprint recognition. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2014; 23:3922-3934. [PMID: 25029458 DOI: 10.1109/tip.2014.2332396] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
Ordinal measures have been demonstrated as an effective feature representation model for iris and palmprint recognition. However, ordinal measures are a general concept of image analysis and numerous variants with different parameter settings, such as location, scale, orientation, and so on, can be derived to construct a huge feature space. This paper proposes a novel optimization formulation for ordinal feature selection with successful applications to both iris and palmprint recognition. The objective function of the proposed feature selection method has two parts, i.e., misclassification error of intra and interclass matching samples and weighted sparsity of ordinal feature descriptors. Therefore, the feature selection aims to achieve an accurate and sparse representation of ordinal measures. And, the optimization subjects to a number of linear inequality constraints, which require that all intra and interclass matching pairs are well separated with a large margin. Ordinal feature selection is formulated as a linear programming (LP) problem so that a solution can be efficiently obtained even on a large-scale feature pool and training database. Extensive experimental results demonstrate that the proposed LP formulation is advantageous over existing feature selection methods, such as mRMR, ReliefF, Boosting, and Lasso for biometric recognition, reporting state-of-the-art accuracy on CASIA and PolyU databases.
Collapse
|
7
|
Mascelli S, Barla A, Raso A, Mosci S, Nozza P, Biassoni R, Morana G, Huber M, Mircean C, Fasulo D, Noy K, Wittemberg G, Pignatelli S, Piatelli G, Cama A, Garré ML, Capra V, Verri A. Molecular fingerprinting reflects different histotypes and brain region in low grade gliomas. BMC Cancer 2013; 13:387. [PMID: 23947815 PMCID: PMC3765921 DOI: 10.1186/1471-2407-13-387] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2013] [Accepted: 08/06/2013] [Indexed: 01/30/2023] Open
Abstract
BACKGROUND Paediatric low-grade gliomas (LGGs) encompass a heterogeneous set of tumours of different histologies, site of lesion, age and gender distribution, growth potential, morphological features, tendency to progression and clinical course. Among LGGs, Pilocytic astrocytomas (PAs) are the most common central nervous system (CNS) tumours in children. They are typically well-circumscribed, classified as grade I by the World Health Organization (WHO), but recurrence or progressive disease occurs in about 10-20% of cases. Despite radiological and neuropathological features deemed as classic are acknowledged, PA may present a bewildering variety of microscopic features. Indeed, tumours containing both neoplastic ganglion and astrocytic cells occur at a lower frequency. METHODS Gene expression profiling on 40 primary LGGs including PAs and mixed glial-neuronal tumours comprising gangliogliomas (GG) and desmoplastic infantile gangliogliomas (DIG) using Affymetrix array platform was performed. A biologically validated machine learning workflow for the identification of microarray-based gene signatures was devised. The method is based on a sparsity inducing regularization algorithm l₁l₂ that selects relevant variables and takes into account their correlation. The most significant genetic signatures emerging from gene-chip analysis were confirmed and validated by qPCR. RESULTS We identified an expression signature composed by a biologically validated list of 15 genes, able to distinguish infratentorial from supratentorial LGGs. In addition, a specific molecular fingerprinting distinguishes the supratentorial PAs from those originating in the posterior fossa. Lastly, within supratentorial tumours, we also identified a gene expression pattern composed by neurogenesis, cell motility and cell growth genes which dichotomize mixed glial-neuronal tumours versus PAs. Our results reinforce previous observations about aberrant activation of the mitogen-activated protein kinase (MAPK) pathway in LGGs, but still point to an active involvement of TGF-beta signaling pathway in the PA development and pick out some hitherto unreported genes worthy of further investigation for the mixed glial-neuronal tumours. CONCLUSIONS The identification of a brain region-specific gene signature suggests that LGGs, with similar pathological features but located at different sites, may be distinguishable on the basis of cancer genetics. Molecular fingerprinting seems to be able to better sub-classify such morphologically heterogeneous tumours and it is remarkable that mixed glial-neuronal tumours are strikingly separated from PAs.
Collapse
Affiliation(s)
- Samantha Mascelli
- Neurosurgery Unit, Istituto Giannina Gaslini, via G. Gaslini 5, 16147, Genoa, Italy
| | - Annalisa Barla
- DISI - Department of Computer Science, Università degli Studi di Genova, Via Dodecaneso, 35-16146, Genoa, Italy
| | - Alessandro Raso
- Neurosurgery Unit, Istituto Giannina Gaslini, via G. Gaslini 5, 16147, Genoa, Italy
| | - Sofia Mosci
- DISI - Department of Computer Science, Università degli Studi di Genova, Via Dodecaneso, 35-16146, Genoa, Italy
| | - Paolo Nozza
- Pathology Unit, Istituto Giannina Gaslini, via G. Gaslini 5, 16147, Genoa, Italy
| | - Roberto Biassoni
- Molecular Medicine Unit, Istituto Giannina Gaslini, via G. Gaslini 5, 16147, Genoa, Italy
| | - Giovanni Morana
- Neuroradiology Unit, Istituto Giannina Gaslini, via G. Gaslini 5, 16147, Genoa, Italy
| | - Martin Huber
- Siemens AG, Corporate Technology, Freyeslebenstr. 1, 91058, Erlangen, Germany
| | - Cristian Mircean
- Siemens AG, Corporate Technology, Freyeslebenstr. 1, 91058, Erlangen, Germany
| | - Daniel Fasulo
- SCR - Siemens Corporate Research, Princeton, NJ, USA
| | - Karin Noy
- SCR - Siemens Corporate Research, Princeton, NJ, USA
| | | | - Sara Pignatelli
- Neuro-oncology Unit, Istituto Giannina Gaslini, via G. Gaslini 5, 16147, Genoa, Italy
| | - Gianluca Piatelli
- Neurosurgery Unit, Istituto Giannina Gaslini, via G. Gaslini 5, 16147, Genoa, Italy
| | - Armando Cama
- Neurosurgery Unit, Istituto Giannina Gaslini, via G. Gaslini 5, 16147, Genoa, Italy
| | - Maria Luisa Garré
- Neuro-oncology Unit, Istituto Giannina Gaslini, via G. Gaslini 5, 16147, Genoa, Italy
| | - Valeria Capra
- Neurosurgery Unit, Istituto Giannina Gaslini, via G. Gaslini 5, 16147, Genoa, Italy
| | - Alessandro Verri
- DISI - Department of Computer Science, Università degli Studi di Genova, Via Dodecaneso, 35-16146, Genoa, Italy
| |
Collapse
|
8
|
Zhang L, Chen C, Bu J, He X. A unified feature and instance selection framework using optimum experimental design. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2012; 21:2379-2388. [PMID: 22262681 DOI: 10.1109/tip.2012.2183879] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
The goal of feature selection is to identify the most informative features for compact representation, whereas the goal of active learning is to select the most informative instances for prediction. Previous studies separately address these two problems, despite of the fact that selecting features and instances are dual operations over a data matrix. In this paper, we consider the novel problem of simultaneously selecting the most informative features and instances and develop a solution from the perspective of optimum experimental design. That is, by using the selected features as the new representation and the selected instances as training data, the variance of the parameter estimate of a learning function can be minimized. Specifically, we propose a novel approach, which is called Unified criterion for Feature and Instance selection (UFI), to simultaneously identify the most informative features and instances that minimize the trace of the parameter covariance matrix. A greedy algorithm is introduced to efficiently solve the optimization problem. Experimental results on two benchmark data sets demonstrate the effectiveness of our proposed method.
Collapse
Affiliation(s)
- Lijun Zhang
- Zhejiang Provincial Key Laboratory of Service Robot, College of Computer Science, Zhejiang University, Hangzhou, China.
| | | | | | | |
Collapse
|
10
|
Shen C, Paisitkriangkrai S, Zhang J. Efficiently learning a detection cascade with sparse eigenvectors. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2011; 20:22-35. [PMID: 20601314 DOI: 10.1109/tip.2010.2055880] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]
Abstract
Real-time object detection has many computer vision applications. Since Viola and Jones proposed the first real-time AdaBoost based face detection system, much effort has been spent on improving the boosting method. In this work, we first show that feature selection methods other than boosting can also be used for training an efficient object detector. In particular, we introduce greedy sparse linear discriminant analysis (GSLDA) for its conceptual simplicity and computational efficiency; and slightly better detection performance is achieved compared with . Moreover, we propose a new technique, termed boosted greedy sparse linear discriminant analysis (BGSLDA), to efficiently train a detection cascade. BGSLDA exploits the sample reweighting property of boosting and the class-separability criterion of GSLDA. Experiments in the domain of highly skewed data distributions (e.g., face detection) demonstrate that classifiers trained with the proposed BGSLDA outperforms AdaBoost and its variants. This finding provides a significant opportunity to argue that AdaBoost and similar approaches are not the only methods that can achieve high detection results for real-time object detection.
Collapse
Affiliation(s)
- Chunhua Shen
- NICTA, Canberra Research Laboratory, Canberra, ACT 2601, Australia.
| | | | | |
Collapse
|