1
|
Kim J, Lee Y, Liang Z. The Geometry of Nonlinear Embeddings in Kernel Discriminant Analysis. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2023; 45:5203-5217. [PMID: 35857735 DOI: 10.1109/tpami.2022.3192726] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Fisher's linear discriminant analysis is a classical method for classification, yet it is limited to capturing linear features only. Kernel discriminant analysis as an extension is known to successfully alleviate the limitation through a nonlinear feature mapping. We study the geometry of nonlinear embeddings in discriminant analysis with polynomial kernels and Gaussian kernel by identifying the population-level discriminant function that depends on the data distribution and the kernel. In order to obtain the discriminant function, we solve a generalized eigenvalue problem with between-class and within-class covariance operators. The polynomial discriminants are shown to capture the class difference through the population moments explicitly. For approximation of the Gaussian discriminant, we use a particular representation of the Gaussian kernel by utilizing the exponential generating function for Hermite polynomials. We also show that the Gaussian discriminant can be approximated using randomized projections of the data. Our results illuminate how the data distribution and the kernel interact in determination of the nonlinear embedding for discrimination, and provide a guideline for choice of the kernel and its parameters.
Collapse
|
2
|
He F, Wu X, Wu B, Zeng S, Zhu X. Green tea grades identification via Fourier transform near‐infrared spectroscopy and weighted global fuzzy uncorrelated discriminant transform. J FOOD PROCESS ENG 2022. [DOI: 10.1111/jfpe.14109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Fei He
- School of Electrical and Information Engineering Jiangsu University Zhenjiang China
- High‐tech Key Laboratory of Agricultural Equipment and Intelligence of Jiangsu Province Jiangsu University Zhenjiang China
| | - Xiaohong Wu
- School of Electrical and Information Engineering Jiangsu University Zhenjiang China
- High‐tech Key Laboratory of Agricultural Equipment and Intelligence of Jiangsu Province Jiangsu University Zhenjiang China
| | - Bin Wu
- Department of Information Engineering Chuzhou Polytechnic Chuzhou China
| | - Shupeng Zeng
- School of Electrical and Information Engineering Jiangsu University Zhenjiang China
| | - Xingchen Zhu
- School of Electrical and Information Engineering Jiangsu University Zhenjiang China
| |
Collapse
|
3
|
Hedegaard L, Sheikh-Omar OA, Iosifidis A. Supervised Domain Adaptation: A Graph Embedding Perspective and a Rectified Experimental Protocol. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2021; 30:8619-8631. [PMID: 34648445 DOI: 10.1109/tip.2021.3118978] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Domain Adaptation is the process of alleviating distribution gaps between data from different domains. In this paper, we show that Domain Adaptation methods using pair-wise relationships between source and target domain data can be formulated as a Graph Embedding in which the domain labels are incorporated into the structure of the intrinsic and penalty graphs. Specifically, we analyse the loss functions of three existing state-of-the-art Supervised Domain Adaptation methods and demonstrate that they perform Graph Embedding. Moreover, we highlight some generalisation and reproducibility issues related to the experimental setup commonly used to demonstrate the few-shot learning capabilities of these methods. To assess and compare Supervised Domain Adaptation methods accurately, we propose a rectified evaluation protocol, and report updated benchmarks on the standard datasets Office31 (Amazon, DSLR, and Webcam), Digits (MNIST, USPS, SVHN, and MNIST-M) and VisDA (Synthetic, Real).
Collapse
|
4
|
Yan C, Chang X, Luo M, Zheng Q, Zhang X, Li Z, Nie F. Self-weighted Robust LDA for Multiclass Classification with Edge Classes. ACM T INTEL SYST TEC 2021. [DOI: 10.1145/3418284] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
Abstract
Linear discriminant analysis (LDA) is a popular technique to learn the most discriminative features for multi-class classification. A vast majority of existing LDA algorithms are prone to be dominated by the class with very large deviation from the others, i.e., edge class, which occurs frequently in multi-class classification. First, the existence of edge classes often makes the total mean biased in the calculation of between-class scatter matrix. Second, the exploitation of ℓ
2
-norm based between-class distance criterion magnifies the extremely large distance corresponding to edge class. In this regard, a novel self-weighted robust LDA with ℓ
2,1
-norm based pairwise between-class distance criterion, called SWRLDA, is proposed for multi-class classification especially with edge classes. SWRLDA can automatically avoid the optimal mean calculation and simultaneously learn adaptive weights for each class pair without setting any additional parameter. An efficient re-weighted algorithm is exploited to derive the global optimum of the challenging ℓ
2,1
-norm maximization problem. The proposed SWRLDA is easy to implement and converges fast in practice. Extensive experiments demonstrate that SWRLDA performs favorably against other compared methods on both synthetic and real-world datasets while presenting superior computational efficiency in comparison with other techniques.
Collapse
Affiliation(s)
- Caixia Yan
- School of Electronic and Information Engineering, Xi’an Jiaotong University, Shaanxi, China
| | - Xiaojun Chang
- Faculty of Information Technology, Monash University, Australia
| | - Minnan Luo
- School of Electronic and Information Engineering, Xi’an Jiaotong University, Shaanxi, China
| | - Qinghua Zheng
- School of Electronic and Information Engineering, Xi’an Jiaotong University, Shaanxi, China
| | - Xiaoqin Zhang
- College of Computer Science and Artificial Intelligence, Wenzhou University, China
| | - Zhihui Li
- Qilu University of Technology (Shandong Academy of Sciences), Jinan, Shandong, China
| | - Feiping Nie
- Center for Optical Image Analysis and Learning, Northwestern Polytechnical University, Shaanxi, China
| |
Collapse
|
5
|
Iosifidis A. Class mean vector component and discriminant analysis. Pattern Recognit Lett 2020. [DOI: 10.1016/j.patrec.2020.10.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
6
|
Perturbation-based classifier. Soft comput 2020. [DOI: 10.1007/s00500-020-04960-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
7
|
Zhao H, Zhang B, Wang Z, Nie F. Multiclass discriminant analysis via adaptive weighted scheme. Neurocomputing 2020. [DOI: 10.1016/j.neucom.2019.10.070] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
8
|
Wang Y, Yue W, Li X, Liu S, Guo L, Xu H, Zhang H, Yang G. Comparison Study of Radiomics and Deep Learning-Based Methods for Thyroid Nodules Classification Using Ultrasound Images. IEEE ACCESS 2020; 8:52010-52017. [DOI: 10.1109/access.2020.2980290] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/07/2022]
|
9
|
Noh YK, Park JY, Choi BG, Kim KE, Rha SW. A Machine Learning-Based Approach for the Prediction of Acute Coronary Syndrome Requiring Revascularization. J Med Syst 2019; 43:253. [PMID: 31254109 DOI: 10.1007/s10916-019-1359-5] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2017] [Accepted: 05/30/2019] [Indexed: 11/28/2022]
Abstract
The aim of this study is to predict acute coronary syndrome (ACS) requiring revascularization in those patients presenting early-stage angina-like symptom using machine learning algorithms. We obtained data from 2344 ACS patients, who required revascularization and from 3538 non-ACS patients. We analyzed 20 features that are relevant to ACS using standard algorithms, support vector machines and linear discriminant analysis. Based on feature pattern and filter characteristics, we analyzed and extracted a strong prediction function out of the 20 selected features. The obtained prediction functions are relevant showing the area under curve of 0.860 for the prediction of ACS that requiring revascularization. Some features are missing in many data though they are considered to be very informative; it turned out that omitting those features from the input and using more data without those features for training improves the prediction accuracy. Additionally, from the investigation using the receiver operating characteristic curves, a reliable prediction of 2.60% of non-ACS patients could be made with a specificity of 1.0. For those 2.60% non-ACS patients, we can consider the recommendation of medical treatment without risking misdiagnosis of the patients requiring revascularization. We investigated prediction algorithm to select ACS patients requiring revascularization and non-ACS patients presenting angina-like symptoms at an early stage. In the future, a large cohort study is necessary to increase the prediction accuracy and confirm the possibility of safely discriminating the non-ACS patients from the ACS patients with confidence.
Collapse
Affiliation(s)
- Yung-Kyun Noh
- Department of Computer Science, Hanyang University, 222 Wangsimni-ro, Seongdong-gu, Seoul, 04763, South Korea
| | - Ji Young Park
- Division of Cardiology, Nowon Eulji Hospital, Eulji University, 68 Hangeulbiseok-ro, Nowon-gu, Seoul, 01830, South Korea
| | - Byoung Geol Choi
- Cardiovascular Center, Korea University Guro Hospital, 148 Gurodong-ro, Guro-gu, Seoul, 08308, South Korea
| | - Kee-Eung Kim
- School of Computing, Korea Advanced Institute of Science and Technology, 291 Daehak-ro, Yuseong-gu, Daejeon, 34141, South Korea
| | - Seung-Woon Rha
- Cardiovascular Center, Korea University Guro Hospital, 148 Gurodong-ro, Guro-gu, Seoul, 08308, South Korea.
| |
Collapse
|
10
|
Xiong H, Cheng W, Bian J, Hu W, Sun Z, Guo Z. DBSDA : Lowering the Bound of Misclassification Rate for Sparse Linear Discriminant Analysis via Model Debiasing. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2019; 30:707-717. [PMID: 30047901 DOI: 10.1109/tnnls.2018.2846783] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Linear discriminant analysis (LDA) is a well-known technique for linear classification, feature extraction, and dimension reduction. To improve the accuracy of LDA under the high dimension low sample size (HDLSS) settings, shrunken estimators, such as Graphical Lasso, can be used to strike a balance between biases and variances. Although the estimator with induced sparsity obtains a faster convergence rate, however, the introduced bias may also degrade the performance. In this paper, we theoretically analyze how the sparsity and the convergence rate of the precision matrix (also known as inverse covariance matrix) estimator would affect the classification accuracy by proposing an analytic model on the upper bound of an LDA misclassification rate. Guided by the model, we propose a novel classifier, DBSDA , which improves classification accuracy through debiasing. Theoretical analysis shows that DBSDA possesses a reduced upper bound of misclassification rate and better asymptotic properties than sparse LDA (SDA). We conduct experiments on both synthetic datasets and real application datasets to confirm the correctness of our theoretical analysis and demonstrate the superiority of DBSDA over LDA, SDA, and other downstream competitors under HDLSS settings.
Collapse
|
11
|
Cao G, Iosifidis A, Chen K, Gabbouj M. Generalized Multi-View Embedding for Visual Recognition and Cross-Modal Retrieval. IEEE TRANSACTIONS ON CYBERNETICS 2018; 48:2542-2555. [PMID: 28885168 DOI: 10.1109/tcyb.2017.2742705] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
In this paper, the problem of multi-view embedding from different visual cues and modalities is considered. We propose a unified solution for subspace learning methods using the Rayleigh quotient, which is extensible for multiple views, supervised learning, and nonlinear embeddings. Numerous methods including canonical correlation analysis, partial least square regression, and linear discriminant analysis are studied using specific intrinsic and penalty graphs within the same framework. Nonlinear extensions based on kernels and (deep) neural networks are derived, achieving better performance than the linear ones. Moreover, a novel multi-view modular discriminant analysis is proposed by taking the view difference into consideration. We demonstrate the effectiveness of the proposed multi-view embedding methods on visual object recognition and cross-modal image retrieval, and obtain superior results in both applications compared to related methods.
Collapse
|
12
|
Su B, Ding X, Liu C, Wu Y. Heteroscedastic Max-min Distance Analysis for Dimensionality Reduction. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2018; 27:4052-4065. [PMID: 29994529 DOI: 10.1109/tip.2018.2836312] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Max-min distance analysis (MMDA) performs dimensionality reduction by maximizing the minimum pairwise distance between classes in the latent subspace under the homoscedastic assumption, which can address the class separation problem caused by the Fisher criterion, but is incapable of tackling heteroscedastic data properly. In this paper, we propose two heteroscedastic MMDA (HMMDA) methods to employ the differences of class covariances. Whitened HMMDA (WHMMDA) extends MMDA by utilizing the Chernoff distance as the separability measure between classes in the whitened space. Orthogonal HMMDA (OHMMDA) incorporates the maximization of the minimal pairwise Chernoff distance and the minimization of class compactness into a trace quotient formulation with an orthogonal constraint of the transformation, which can be solved by bisection search. Two variants of OHMMDA further encode the margin information by using only neighboring samples to construct the intra-class and inter-class scatters. Experiments on several UCI datasets and two face databases demonstrate the effectiveness of the HMMDA methods.
Collapse
|
13
|
Noh YK, Hamm J, Park FC, Zhang BT, Lee DD. Fluid Dynamic Models for Bhattacharyya-Based Discriminant Analysis. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2018; 40:92-105. [PMID: 28186879 DOI: 10.1109/tpami.2017.2666148] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Classical discriminant analysis attempts to discover a low-dimensional subspace where class label information is maximally preserved under projection. Canonical methods for estimating the subspace optimize an information-theoretic criterion that measures the separation between the class-conditional distributions. Unfortunately, direct optimization of the information-theoretic criteria is generally non-convex and intractable in high-dimensional spaces. In this work, we propose a novel, tractable algorithm for discriminant analysis that considers the class-conditional densities as interacting fluids in the high-dimensional embedding space. We use the Bhattacharyya criterion as a potential function that generates forces between the interacting fluids, and derive a computationally tractable method for finding the low-dimensional subspace that optimally constrains the resulting fluid flow. We show that this model properly reduces to the optimal solution for homoscedastic data as well as for heteroscedastic Gaussian distributions with equal means. We also extend this model to discover optimal filters for discriminating Gaussian processes and provide experimental results and comparisons on a number of datasets.
Collapse
|
14
|
Iosifidis A, Gabbouj M. Class-Specific Kernel Discriminant Analysis Revisited: Further Analysis and Extensions. IEEE TRANSACTIONS ON CYBERNETICS 2017; 47:4485-4496. [PMID: 28113416 DOI: 10.1109/tcyb.2016.2612479] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
In this paper, we revisit class-specific kernel discriminant analysis (KDA) formulation, which has been applied in various problems, such as human face verification and human action recognition. We show that the original optimization problem solved for the determination of class-specific discriminant projections is equivalent to a low-rank kernel regression (LRKR) problem using training data-independent target vectors. In addition, we show that the regularized version of class-specific KDA is equivalent to a regularized LRKR problem, exploiting the same targets. This analysis allows us to devise a novel fast solution. Furthermore, we devise novel incremental, approximate and deep (hierarchical) variants. The proposed methods are tested in human facial image and action video verification problems, where their effectiveness and efficiency is shown.
Collapse
|
15
|
|
16
|
Cao G, Iosifidis A, Gabbouj M. Neural class‐specific regression for face verification. IET BIOMETRICS 2017. [DOI: 10.1049/iet-bmt.2017.0081] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Affiliation(s)
- Guanqun Cao
- Laboratory of Signal ProcessingTampere University of TechnologyTampereFinland
| | - Alexandros Iosifidis
- Laboratory of Signal ProcessingTampere University of TechnologyTampereFinland
- Department of Engineering, Electrical and Computer EngineeringAarhus UniversityAarhusDenmark
| | - Moncef Gabbouj
- Laboratory of Signal ProcessingTampere University of TechnologyTampereFinland
| |
Collapse
|
17
|
|
18
|
Papachristou K, Tefas A, Pitas I. Symmetric subspace learning for image analysis. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2014; 23:5683-5697. [PMID: 25376040 DOI: 10.1109/tip.2014.2367321] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
Subspace learning (SL) is one of the most useful tools for image analysis and recognition. A large number of such techniques have been proposed utilizing a priori knowledge about the data. In this paper, new subspace learning techniques are presented that use symmetry constraints in their objective functions. The rational behind this idea is to exploit the a priori knowledge that geometrical symmetry appears in several types of data, such as images, objects, faces, and so on. Experiments on artificial, facial expression recognition, face recognition, and object categorization databases highlight the superiority and the robustness of the proposed techniques, in comparison with standard SL techniques.
Collapse
|
19
|
|