1
|
A multiple classifiers time-serial ensemble pruning algorithm based on the mechanism of forward supplement. APPL INTELL 2022. [DOI: 10.1007/s10489-022-03855-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
2
|
Huang K, Wu Y, Long C, Ji H, Sun B, Chen X, Yang C. Adaptive process monitoring via online dictionary learning and its industrial application. ISA TRANSACTIONS 2021; 114:399-412. [PMID: 33397583 DOI: 10.1016/j.isatra.2020.12.046] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/12/2019] [Revised: 12/25/2020] [Accepted: 12/25/2020] [Indexed: 06/12/2023]
Abstract
For industrial processes, one common drawback of conventional process monitoring methods is that they would make an increasing number of false alarms in cases of various factors such as catalyst deactivation, seasonal fluctuation and so forth. To address this issue, the present work proposes an online dictionary learning method, which can fulfill the process monitoring and fault diagnosis task adaptively. The proposed method would incorporate currently available information to update the dictionary and control limit, instead of keeping a fixed monitoring model. The online dictionary learning method are more superior than conventional methods. Firstly, compared with some traditional offline methods based on small amounts of historical data, the proposed method can augment train data with online dictionary updating, thus it copes with time-varying processes well. Secondly, the proposed method enjoys a lower computational complexity than the offline learning method with mass data, which is appealing in the era of industrial big data. Thirdly, the proposed method performs more reliably than the existing recursive principal component analysis-based methods because it can resolve the anomaly of principal component or non-orthogonality of eigenvectors problem which was often confronted in the recursive principal component analysis-based methods. Finally, some experiments were designed and carried out to demonstrate the advantage of the online dictionary learning.
Collapse
Affiliation(s)
- Keke Huang
- School of Automation, Central South University, Changsha 410083, China.
| | - Yiming Wu
- School of Automation, Central South University, Changsha 410083, China
| | - Cheng Long
- School of Computer Science and Engineering, Nanyang Technological University, Singapore, Singapore
| | - Hongquan Ji
- College of Electrical Engineering and Automation, Shandong University of Science and Technology, Qingdao 266590, China
| | - Bei Sun
- School of Automation, Central South University, Changsha 410083, China.
| | - Xiaofang Chen
- School of Automation, Central South University, Changsha 410083, China
| | - Chunhua Yang
- School of Automation, Central South University, Changsha 410083, China
| |
Collapse
|
3
|
Stacked Ensemble of Recurrent Neural Networks for Predicting Turbocharger Remaining Useful Life. APPLIED SCIENCES-BASEL 2019. [DOI: 10.3390/app10010069] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Predictive Maintenance (PM) is a proactive maintenance strategy that tries to minimize a system’s downtime by predicting failures before they happen. It uses data from sensors to measure the component’s state of health and make forecasts about its future degradation. However, existing PM methods typically focus on individual measurements. While it is natural to assume that a history of measurements carries more information than a single one. This paper aims at incorporating such information into PM models. In practice, especially in the automotive domain, diagnostic models have low performance, due to a large amount of noise in the data and limited sensing capability. To address this issue, this paper proposes to use a specific type of ensemble learning known as Stacked Ensemble. The idea is to aggregate predictions of multiple models—consisting of Long Short-Term Memory (LSTM) and Convolutional-LSTM—via a meta model, in order to boost performance. Stacked Ensemble model performs well when its base models are as diverse as possible. To this end, each such model is trained using a specific combination of the following three aspects: feature subsets, past dependency horizon, and model architectures. Experimental results demonstrate benefits of the proposed approach on a case study of heavy-duty truck turbochargers.
Collapse
|
4
|
Dal Pozzolo A, Boracchi G, Caelen O, Alippi C, Bontempi G. Credit Card Fraud Detection: A Realistic Modeling and a Novel Learning Strategy. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2018; 29:3784-3797. [PMID: 28920909 DOI: 10.1109/tnnls.2017.2736643] [Citation(s) in RCA: 36] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
Detecting frauds in credit card transactions is perhaps one of the best testbeds for computational intelligence algorithms. In fact, this problem involves a number of relevant challenges, namely: concept drift (customers' habits evolve and fraudsters change their strategies over time), class imbalance (genuine transactions far outnumber frauds), and verification latency (only a small set of transactions are timely checked by investigators). However, the vast majority of learning algorithms that have been proposed for fraud detection rely on assumptions that hardly hold in a real-world fraud-detection system (FDS). This lack of realism concerns two main aspects: 1) the way and timing with which supervised information is provided and 2) the measures used to assess fraud-detection performance. This paper has three major contributions. First, we propose, with the help of our industrial partner, a formalization of the fraud-detection problem that realistically describes the operating conditions of FDSs that everyday analyze massive streams of credit card transactions. We also illustrate the most appropriate performance measures to be used for fraud-detection purposes. Second, we design and assess a novel learning strategy that effectively addresses class imbalance, concept drift, and verification latency. Third, in our experiments, we demonstrate the impact of class unbalance and concept drift in a real-world data stream containing more than 75 million transactions, authorized over a time window of three years.
Collapse
|
5
|
Bu L, Alippi C, Zhao D. A pdf-Free Change Detection Test Based on Density Difference Estimation. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2018; 29:324-334. [PMID: 28113960 DOI: 10.1109/tnnls.2016.2619909] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
The ability to detect online changes in stationarity or time variance in a data stream is a hot research topic with striking implications. In this paper, we propose a novel probability density function-free change detection test, which is based on the least squares density-difference estimation method and operates online on multidimensional inputs. The test does not require any assumption about the underlying data distribution, and is able to operate immediately after having been configured by adopting a reservoir sampling mechanism. Thresholds requested to detect a change are automatically derived once a false positive rate is set by the application designer. Comprehensive experiments validate the effectiveness in detection of the proposed method both in terms of detection promptness and accuracy.
Collapse
|
6
|
Alippi C, Boracchi G, Roveri M. Hierarchical Change-Detection Tests. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2017; 28:246-258. [PMID: 26800551 DOI: 10.1109/tnnls.2015.2512714] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
We present hierarchical change-detection tests (HCDTs), as effective online algorithms for detecting changes in datastreams. HCDTs are characterized by a hierarchical architecture composed of a detection layer and a validation layer. The detection layer steadily analyzes the input datastream by means of an online, sequential CDT, which operates as a low-complexity trigger that promptly detects possible changes in the process generating the data. The validation layer is activated when the detection one reveals a change, and performs an offline, more sophisticated analysis on recently acquired data to reduce false alarms. Our experiments show that, when the process generating the datastream is unknown, as it is mostly the case in the real world, HCDTs achieve a far more advantageous tradeoff between false-positive rate and detection delay than their single-layered, more traditional counterpart. Moreover, the successful interplay between the two layers permits HCDTs to automatically reconfigure after having detected and validated a change. Thus, HCDTs are able to reveal further departures from the postchange state of the data-generating process.
Collapse
|
7
|
|
8
|
Ditzler G, Roveri M, Alippi C, Polikar R. Learning in Nonstationary Environments: A Survey. IEEE COMPUT INTELL M 2015. [DOI: 10.1109/mci.2015.2471196] [Citation(s) in RCA: 403] [Impact Index Per Article: 40.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
9
|
Yin XC, Huang K, Hao HW. DE2: Dynamic ensemble of ensembles for learning nonstationary data. Neurocomputing 2015. [DOI: 10.1016/j.neucom.2014.06.092] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
10
|
|
11
|
Wei Y, Ji C, Galvan F, Couvillon S, Orellana G, Momoh J. Learning geotemporal nonstationary failure and recovery of power distribution. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2014; 25:229-240. [PMID: 24806656 DOI: 10.1109/tnnls.2013.2271853] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
Smart energy grid is an emerging area for new applications of machine learning in a nonstationary environment. Such a nonstationary environment emerges when large-scale failures occur at power networks because of external disruptions such as hurricanes and severe storms. Power distribution networks lie at the edge of the grid, and are especially vulnerable to external disruptions. Quantifiable approaches are lacking and needed to learn nonstationary behaviors of large-scale failure and recovery of power distribution. This paper studies such nonstationary behaviors in three aspects. First, a novel formulation is derived for an entire life cycle of large-scale failure and recovery of power distribution. Second, spatial-temporal models of failure and recovery of power distribution are developed as geolocation-based multivariate nonstationary GI(t)/G(t)/∞ queues. Third, the nonstationary spatial-temporal models identify a small number of parameters to be learned. Learning is applied to two real-life examples of large-scale disruptions. One is from Hurricane Ike, where data from an operational network is exact on failures and recoveries. The other is from Hurricane Sandy, where aggregated data is used for inferring failure and recovery processes at one of the impacted areas. Model parameters are learned using real data. Two findings emerge as results of learning: 1) failure rates behave similarly at the two different provider networks for two different hurricanes but differently at the geographical regions and 2) both the rapid and slow-recovery are present for Hurricane Ike but only slow recovery is shown for a regional distribution network from Hurricane Sandy.
Collapse
|
12
|
|
13
|
Alippi C, Boracchi G, Roveri M. Just-in-time classifiers for recurrent concepts. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2013; 24:620-634. [PMID: 24808382 DOI: 10.1109/tnnls.2013.2239309] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
Just-in-time (JIT) classifiers operate in evolving environments by classifying instances and reacting to concept drift. In stationary conditions, a JIT classifier improves its accuracy over time by exploiting additional supervised information coming from the field. In nonstationary conditions, however, the classifier reacts as soon as concept drift is detected; the current classification setup is discarded and a suitable one activated to keep the accuracy high. We present a novel generation of JIT classifiers able to deal with recurrent concept drift by means of a practical formalization of the concept representation and the definition of a set of operators working on such representations. The concept-drift detection activity, which is crucial in promptly reacting to changes exactly when needed, is advanced by considering change-detection tests monitoring both inputs and classes distributions.
Collapse
|
14
|
Ghazikhani A, Monsefi R, Sadoghi Yazdi H. Recursive least square perceptron model for non-stationary and imbalanced data stream classification. EVOLVING SYSTEMS 2013. [DOI: 10.1007/s12530-013-9076-7] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
15
|
|