1
|
Chen A, Zhou X, Fan Y, Chen H. Underground Diagnosis Based on GPR and Learning in the Model Space. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2024; 46:3832-3844. [PMID: 38153824 DOI: 10.1109/tpami.2023.3347739] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/30/2023]
Abstract
Ground Penetrating Radar (GPR) has been widely used in pipeline detection and underground diagnosis. In practical applications, the characteristics of the GPR data of the detected area and the likely underground anomalous structures could be rarely acknowledged before fully analyzing the obtained GPR data, causing challenges to identify the underground structures or anomalies automatically. In this article, a GPR B-scan image diagnosis method based on learning in the model space is proposed. The idea of learning in the model space is to use models fitted on parts of data as more stable and parsimonious representations of the data. For the GPR image, 2-Direction Echo State Network (2D-ESN) is proposed to fit the image segments through the next item prediction. By building the connections between the points on the image in both the horizontal and vertical directions, the 2D-ESN regards the GPR image segment as a whole and could effectively capture the dynamic characteristics of the GPR image. And then, semi-supervised and supervised learning methods could be further implemented on the 2D-ESN models for underground diagnosis. Experiments on real-world datasets are conducted, and the results demonstrate the effectiveness of the proposed model.
Collapse
|
2
|
Chen M, Guo Y, Jin Y, Yang S, Gong D, Yu Z. An environment-driven hybrid evolutionary algorithm for dynamic multi-objective optimization problems. COMPLEX INTELL SYST 2022. [DOI: 10.1007/s40747-022-00824-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
AbstractIn dynamic multi-objective optimization problems, the environmental parameters may change over time, which makes the Pareto fronts shifting. To address the issue, a common idea is to track the moving Pareto front once an environmental change occurs. However, it might be hard to obtain the Pareto optimal solutions if the environment changes rapidly. Moreover, it may be costly to implement a new solution. By contrast, robust Pareto optimization over time provides a novel framework to find the robust solutions whose performance is acceptable for more than one environment, which not only saves the computational costs for tracking solutions, but also minimizes the cost for switching solutions. However, neither of the above two approaches can balance between the quality of the obtained non-dominated solutions and the computation cost. To address this issue, environment-driven hybrid dynamic multi-objective evolutionary optimization method is proposed, aiming to fully use strengths of TMO and RPOOT under various characteristics of environmental changes. Two indexes, i.e., the frequency and intensity of environmental changes, are first defined. Then, a criterion is presented based on the characteristics of dynamic environments and the switching cost of solutions, to select an appropriate optimization method in a given environment. The experimental results on a set of dynamic benchmark functions indicate that the proposed hybrid dynamic multi-objective evolutionary optimization method can choose the most rational method that meets the requirements of decision makers, and balance the convergence and robustness of the obtained non-dominated solutions.
Collapse
|
3
|
Chiu CW, Minku LL. A Diversity Framework for Dealing With Multiple Types of Concept Drift Based on Clustering in the Model Space. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2022; 33:1299-1309. [PMID: 33351764 DOI: 10.1109/tnnls.2020.3041684] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Data stream applications usually suffer from multiple types of concept drift. However, most existing approaches are only able to handle a subset of types of drift well, hindering predictive performance. We propose to use diversity as a framework to handle multiple types of drift. The motivation is that a diverse ensemble can not only contain models representing different concepts, which may be useful to handle recurring concepts, but also accelerate the adaptation to different types of concept drift. Our framework innovatively uses clustering in the model space to build a diverse ensemble and identify recurring concepts. The resulting diversity also accelerates adaptation to different types of drift where the new concept shares similarities with past concepts. Experiments with 20 synthetic and three real-world data streams containing different types of drift show that our diversity framework usually achieves similar or better prequential accuracy than existing approaches, especially when there are recurring concepts or when new concepts share similarities with past concepts.
Collapse
|
4
|
Lyu S, Wu X, Li J, Chen Q, Chen H. Do Models Learn the Directionality of Relations? A New Evaluation: Relation Direction Recognition. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE 2022. [DOI: 10.1109/tetci.2021.3136598] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
5
|
Lu X, Cui L, Sun Z, Zhu Y. ProAID: path-based reasoning for self-attentional disease prediction. Knowl Inf Syst 2021. [DOI: 10.1007/s10115-021-01617-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
6
|
Wang X, Chen L, Ban T, Usman M, Guan Y, Liu S, Wu T, Chen H. WITHDRAWN: Knowledge Graph Quality Control: A Survey. FUNDAMENTAL RESEARCH 2021. [DOI: 10.1016/j.fmre.2021.08.018] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
|
7
|
Wang X, Chen L, Ban T, Usman M, Guan Y, Liu S, Wu T, Chen H. Knowledge graph quality control: A survey. FUNDAMENTAL RESEARCH 2021. [DOI: 10.1016/j.fmre.2021.09.003] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
|
8
|
Shu W, Yao Y, Lyu S, Li J, Chen H. Short isometric shapelet transform for binary time series classification. Knowl Inf Syst 2021. [DOI: 10.1007/s10115-021-01583-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
9
|
Bianchi FM, Scardapane S, Lokse S, Jenssen R. Reservoir Computing Approaches for Representation and Classification of Multivariate Time Series. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2021; 32:2169-2179. [PMID: 32598284 DOI: 10.1109/tnnls.2020.3001377] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Classification of multivariate time series (MTS) has been tackled with a large variety of methodologies and applied to a wide range of scenarios. Reservoir computing (RC) provides efficient tools to generate a vectorial, fixed-size representation of the MTS that can be further processed by standard classifiers. Despite their unrivaled training speed, MTS classifiers based on a standard RC architecture fail to achieve the same accuracy of fully trainable neural networks. In this article, we introduce the reservoir model space, an unsupervised approach based on RC to learn vectorial representations of MTS. Each MTS is encoded within the parameters of a linear model trained to predict a low-dimensional embedding of the reservoir dynamics. Compared with other RC methods, our model space yields better representations and attains comparable computational performance due to an intermediate dimensionality reduction procedure. As a second contribution, we propose a modular RC framework for MTS classification, with an associated open-source Python library. The framework provides different modules to seamlessly implement advanced RC architectures. The architectures are compared with other MTS classifiers, including deep learning models and time series kernels. Results obtained on the benchmark and real-world MTS data sets show that RC classifiers are dramatically faster and, when implemented using our proposed representation, also achieve superior classification accuracy.
Collapse
|
10
|
Yao Y, Chen H, Yao X. Discriminative Learning in the Model Space for Symbolic Sequence Classification. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE 2021. [DOI: 10.1109/tetci.2019.2914266] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
11
|
Symbolic Sequence Classification in the Fractal Space. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE 2021. [DOI: 10.1109/tetci.2018.2876528] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
12
|
Liu R, Reimer B, Song S, Mehler B, Solovey E. Unsupervised fNIRS feature extraction with CAE and ESN autoencoder for driver cognitive load classification. J Neural Eng 2021; 18. [PMID: 33307543 DOI: 10.1088/1741-2552/abd2ca] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2020] [Accepted: 12/11/2020] [Indexed: 11/11/2022]
Abstract
Objective. Understanding the cognitive load of drivers is crucial for road safety. Brain sensing has the potential to provide an objective measure of driver cognitive load. We aim to develop an advanced machine learning framework for classifying driver cognitive load using functional near-infrared spectroscopy (fNIRS).Approach. We conducted a study using fNIRS in a driving simulator with theN-back task used as a secondary task to impart structured cognitive load on drivers. To classify different driver cognitive load levels, we examined the application of convolutional autoencoder (CAE) and Echo State Network (ESN) autoencoder for extracting features from fNIRS.Main results. By using CAE, the accuracies for classifying two and four levels of driver cognitive load with the 30 s window were 73.25% and 47.21%, respectively. The proposed ESN autoencoder achieved state-of-art classification results for group-level models without window selection, with accuracies of 80.61% and 52.45% for classifying two and four levels of driver cognitive load.Significance. This work builds a foundation for using fNIRS to measure driver cognitive load in real-world applications. Also, the results suggest that the proposed ESN autoencoder can effectively extract temporal information from fNIRS data and can be useful for other fNIRS data classification tasks.
Collapse
Affiliation(s)
- Ruixue Liu
- Worcester Polytechnic Institute, P.O. Box 1212, Worcester, MA 016091, United States of America
| | - Bryan Reimer
- Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA 02139, United States of America
| | - Siyang Song
- University of Nottingham, Nottingham NG7 2RD, United Kingdom
| | - Bruce Mehler
- Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA 02139, United States of America
| | - Erin Solovey
- Worcester Polytechnic Institute, P.O. Box 1212, Worcester, MA 016091, United States of America
| |
Collapse
|
13
|
On-line anomaly detection with advanced independent component analysis of multi-variate residual signals from causal relation networks. Inf Sci (N Y) 2020. [DOI: 10.1016/j.ins.2020.06.034] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
14
|
Gonon L, Ortega JP. Reservoir Computing Universality With Stochastic Inputs. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2020; 31:100-112. [PMID: 30892244 DOI: 10.1109/tnnls.2019.2899649] [Citation(s) in RCA: 38] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
The universal approximation properties with respect to L p -type criteria of three important families of reservoir computers with stochastic discrete-time semi-infinite inputs are shown. First, it is proven that linear reservoir systems with either polynomial or neural network readout maps are universal. More importantly, it is proven that the same property holds for two families with linear readouts, namely, trigonometric state-affine systems and echo state networks, which are the most widely used reservoir systems in applications. The linearity in the readouts is a key feature in supervised machine learning applications. It guarantees that these systems can be used in high-dimensional situations and in the presence of large data sets. The L p criteria used in this paper allow the formulation of universality results that do not necessarily impose almost sure uniform boundedness in the inputs or the fading memory property in the filter that needs to be approximated.
Collapse
|
15
|
Gong Z, Chen H, Yuan B, Yao X. Multiobjective Learning in the Model Space for Time Series Classification. IEEE TRANSACTIONS ON CYBERNETICS 2019; 49:918-932. [PMID: 29994189 DOI: 10.1109/tcyb.2018.2789422] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
A well-defined distance is critical for the performance of time series classification. Existing distance measurements can be categorized into two branches. One is to utilize handmade features for calculating distance, e.g., dynamic time warping, which is limited to exploiting the dynamic information of time series. The other methods make use of the dynamic information by approximating the time series with a generative model, e.g., Fisher kernel. However, previous distance measurements for time series seldom exploit the label information, which is helpful for classification by distance metric learning. In order to attain the benefits of the dynamic information of time series and the label information simultaneously, this paper proposes a multiobjective learning algorithm for both time series approximation and classification, termed multiobjective model-metric (MOMM) learning. In MOMM, a recurrent network is exploited as the temporal filter, based on which, a generative model is learned for each time series as a representation of that series. The models span a non-Euclidean space, where the label information is utilized to learn the distance metric. The distance between time series is then calculated as the model distance weighted by the learned metric. The network size is also optimized to learn parsimonious representations. MOMM simultaneously optimizes the data representation, the time series model separation, and the network size. The experiments show that MOMM achieves not only superior overall performance on uni/multivariate time series classification but also promising time series prediction performance.
Collapse
|
16
|
Deng X, Tian X, Chen S, Harris CJ. Nonlinear Process Fault Diagnosis Based on Serial Principal Component Analysis. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2018; 29:560-572. [PMID: 28026785 DOI: 10.1109/tnnls.2016.2635111] [Citation(s) in RCA: 39] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Many industrial processes contain both linear and nonlinear parts, and kernel principal component analysis (KPCA), widely used in nonlinear process monitoring, may not offer the most effective means for dealing with these nonlinear processes. This paper proposes a new hybrid linear-nonlinear statistical modeling approach for nonlinear process monitoring by closely integrating linear principal component analysis (PCA) and nonlinear KPCA using a serial model structure, which we refer to as serial PCA (SPCA). Specifically, PCA is first applied to extract PCs as linear features, and to decompose the data into the PC subspace and residual subspace (RS). Then, KPCA is performed in the RS to extract the nonlinear PCs as nonlinear features. Two monitoring statistics are constructed for fault detection, based on both the linear and nonlinear features extracted by the proposed SPCA. To effectively perform fault identification after a fault is detected, an SPCA similarity factor method is built for fault recognition, which fuses both the linear and nonlinear features. Unlike PCA and KPCA, the proposed method takes into account both linear and nonlinear PCs simultaneously, and therefore, it can better exploit the underlying process's structure to enhance fault diagnosis performance. Two case studies involving a simulated nonlinear process and the benchmark Tennessee Eastman process demonstrate that the proposed SPCA approach is more effective than the existing state-of-the-art approach based on KPCA alone, in terms of nonlinear process fault detection and identification.
Collapse
|
17
|
Zhao J, Hei X, Shi Z, Dong L, Liu Y, Yan R, Li X. Regression learning based on incomplete relationships between attributes. Inf Sci (N Y) 2018. [DOI: 10.1016/j.ins.2017.09.023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
18
|
|
19
|
|
20
|
|
21
|
|
22
|
Huang G, Yang Z, Chen X, Ji G. An innovative one-class least squares support vector machine model based on continuous cognition. Knowl Based Syst 2017. [DOI: 10.1016/j.knosys.2017.02.024] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
23
|
Wong HK, Tiffin PA, Chappell MJ, Nichols TE, Welsh PR, Doyle OM, Lopez-Kolkovska BC, Inglis SK, Coghill D, Shen Y, Tiño P. Personalized Medication Response Prediction for Attention-Deficit Hyperactivity Disorder: Learning in the Model Space vs. Learning in the Data Space. Front Physiol 2017; 8:199. [PMID: 28443027 PMCID: PMC5387107 DOI: 10.3389/fphys.2017.00199] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2016] [Accepted: 03/17/2017] [Indexed: 12/04/2022] Open
Abstract
Attention-Deficit Hyperactive Disorder (ADHD) is one of the most common mental health disorders amongst school-aged children with an estimated prevalence of 5% in the global population (American Psychiatric Association, 2013). Stimulants, particularly methylphenidate (MPH), are the first-line option in the treatment of ADHD (Reeves and Schweitzer, 2004; Dopheide and Pliszka, 2009) and are prescribed to an increasing number of children and adolescents in the US and the UK every year (Safer et al., 1996; McCarthy et al., 2009), though recent studies suggest that this is tailing off, e.g., Holden et al. (2013). Around 70% of children demonstrate a clinically significant treatment response to stimulant medication (Spencer et al., 1996; Schachter et al., 2001; Swanson et al., 2001; Barbaresi et al., 2006). However, it is unclear which patient characteristics may moderate treatment effectiveness. As such, most existing research has focused on investigating univariate or multivariate correlations between a set of patient characteristics and the treatment outcome, with respect to dosage of one or several types of medication. The results of such studies are often contradictory and inconclusive due to a combination of small sample sizes, low-quality data, or a lack of available information on covariates. In this paper, feature extraction techniques such as latent trait analysis were applied to reduce the dimension of on a large dataset of patient characteristics, including the responses to symptom-based questionnaires, developmental health factors, demographic variables such as age and gender, and socioeconomic factors such as parental income. We introduce a Bayesian modeling approach in a "learning in the model space" framework that combines existing knowledge in the literature on factors that may potentially affect treatment response, with constraints imposed by a treatment response model. The model is personalized such that the variability among subjects is accounted for by a set of subject-specific parameters. For remission classification, this approach compares favorably with conventional methods such as support vector machines and mixed effect models on a range of performance measures. For instance, the proposed approach achieved an area under receiver operator characteristic curve of 82-84%, compared to 75-77% obtained from conventional regression or machine learning ("learning in the data space") methods.
Collapse
Affiliation(s)
- Hin K. Wong
- Warwick Manufacturing Group, Institute of Digital Healthcare, University of WarwickCoventry, UK
| | - Paul A. Tiffin
- Mental Health and Addiction Research Group, Department of Health Sciences, University of YorkYork, UK
| | | | - Thomas E. Nichols
- Warwick Manufacturing Group, Institute of Digital Healthcare, University of WarwickCoventry, UK
| | - Patrick R. Welsh
- School of Psychology, Newcastle UniversityNewcastle upon Tyne, UK
| | - Orla M. Doyle
- Centre for Neuroimaging Sciences, King's College LondonLondon, UK
| | | | - Sarah K. Inglis
- Division of Maternal and Child Health Sciences, Ninewells Hospital and Medical School, University of DundeeDundee, UK
| | - David Coghill
- Departments of Paediatrics and Psychiatry, University of MelbourneMelbourne, VIC, Australia
| | - Yuan Shen
- School of Computer Science, University of BirminghamBirmingham, UK
| | - Peter Tiño
- School of Computer Science, University of BirminghamBirmingham, UK
| |
Collapse
|
24
|
Alippi C, Ntalampiras S, Roveri M. Model-Free Fault Detection and Isolation in Large-Scale Cyber-Physical Systems. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE 2017. [DOI: 10.1109/tetci.2016.2641452] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
25
|
Roveri M, Trovò F. An Ensemble Approach for Cognitive Fault Detection and Isolation in Sensor Networks. Int J Neural Syst 2016; 27:1650047. [PMID: 27802791 DOI: 10.1142/s0129065716500477] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Cognitive fault detection and diagnosis systems are systems able to provide timely information about possibly occurring faults without requiring any a priori knowledge about the process generating the data or the possible faults. This ability is crucial in sensor network scenarios where a priori information about the data generating process, the noise level or the dictionary of the possibly occurring faults is generally hard to obtain. We here present a novel cognitive fault detection and isolation system for sensor networks. The proposed solution relies on the modeling of spatial and temporal relationships present in the acquired datastreams and an ensemble of Hidden Markov Model change-detection tests working in the space of estimated parameters for fault detection and isolation purposes. The effectiveness of the proposed solution has been evaluated on both synthetically generated and real datasets.
Collapse
Affiliation(s)
- Manuel Roveri
- 1 Dipartimento di Elettronica, Informazione e Bioingegneria, Politecnico di Milano, piazza L. da Vinci 32, Milano, 20133, Italy
| | - Francesco Trovò
- 1 Dipartimento di Elettronica, Informazione e Bioingegneria, Politecnico di Milano, piazza L. da Vinci 32, Milano, 20133, Italy
| |
Collapse
|
26
|
|
27
|
Gianniotis N, Kügler SD, Tiňo P, Polsterer KL. Model-coupled autoencoder for time series visualisation. Neurocomputing 2016. [DOI: 10.1016/j.neucom.2016.01.086] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
28
|
Fuzzy fault isolation using gradient information and quality criteria from system identification models. Inf Sci (N Y) 2015. [DOI: 10.1016/j.ins.2015.04.008] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
29
|
Dufrenois F. A one-class kernel fisher criterion for outlier detection. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2015; 26:982-994. [PMID: 25051559 DOI: 10.1109/tnnls.2014.2329534] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
Recently, Dufrenois and Noyer proposed a one class Fisher's linear discriminant to isolate normal data from outliers. In this paper, a kernelized version of their criterion is presented. Originally on the basis of an iterative optimization process, alternating between subspace selection and clustering, I show here that their criterion has an upper bound making these two problems independent. In particular, the estimation of the label vector is formulated as an unconstrained binary linear problem (UBLP) which can be solved using an iterative perturbation method. Once the label vector is estimated, an optimal projection subspace is obtained by solving a generalized eigenvalue problem. Like many other kernel methods, the performance of the proposed approach depends on the choice of the kernel. Constructed with a Gaussian kernel, I show that the proposed contrast measure is an efficient indicator for selecting an optimal kernel width. This property simplifies the model selection problem which is typically solved by costly (generalized) cross-validation procedures. Initialization, convergence analysis, and computational complexity are also discussed. Lastly, the proposed algorithm is compared with recent novelty detectors on synthetic and real data sets.
Collapse
|
30
|
|
31
|
Dealing with temporal and spatial correlations to classify outliers in geophysical data streams. Inf Sci (N Y) 2014. [DOI: 10.1016/j.ins.2013.12.009] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
32
|
Chen H, Tiňo P, Yao X. Cognitive fault diagnosis in Tennessee Eastman Process using learning in the model space. Comput Chem Eng 2014. [DOI: 10.1016/j.compchemeng.2014.03.015] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|