1
|
Disabato S, Roveri M. Tiny Machine Learning for Concept Drift. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:8470-8481. [PMID: 37015671 DOI: 10.1109/tnnls.2022.3229897] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Tiny machine learning (TML) is a new research area whose goal is to design machine and deep learning (DL) techniques able to operate in embedded systems and the Internet-of-Things (IoT) units, hence satisfying the severe technological constraints on memory, computation, and energy characterizing these pervasive devices. Interestingly, the related literature mainly focused on reducing the computational and memory demand of the inference phase of machine and deep learning models. At the same time, the training is typically assumed to be carried out in cloud or edge computing systems (due to the larger memory and computational requirements). This assumption results in TML solutions that might become obsolete when the process generating the data is affected by concept drift (e.g., due to periodicity or seasonality effect, faults or malfunctioning affecting sensors or actuators, or changes in the users' behavior), a common situation in real-world application scenarios. For the first time in the literature, this article introduces a TML for concept drift (TML-CD) solution based on deep learning feature extractors and a k -nearest neighbors ( k -NNs) classifier integrating a hybrid adaptation module able to deal with concept drift affecting the data-generating process. This adaptation module continuously updates (in a passive way) the knowledge base of TML-CD and, at the same time, employs a change detection test (CDT) to inspect for changes (in an active way) to quickly adapt to concept drift by removing obsolete knowledge. Experimental results on both image and audio benchmarks show the effectiveness of the proposed solution, whilst the porting of TML-CD on three off-the-shelf micro-controller units (MCUs) shows the feasibility of what is proposed in real-world pervasive systems.
Collapse
|
2
|
Wang X, Yao L, Wang X, Paik HY, Wang S. Uncertainty Estimation With Neural Processes for Meta-Continual Learning. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2023; 34:6887-6897. [PMID: 36315531 DOI: 10.1109/tnnls.2022.3215633] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
The ability to evaluate uncertainties in evolving data streams has become equally, if not more, crucial than building a static predictor. For instance, during the pandemic, a model should consider possible uncertainties such as governmental policies, meteorological features, and vaccination schedules. Neural process families (NPFs) have recently shone a light on predicting such uncertainties by bridging Gaussian processes (GPs) and neural networks (NNs). Their abilities to output average predictions and the acceptable variances, i.e., uncertainties, made them suitable for predictions with insufficient data, such as meta-learning or few-shot learning. However, existing models have not addressed continual learning which imposes a stricter constraint on the data access. Regarding this, we introduce a member meta-continual learning with neural process (MCLNP) for uncertainty estimation. We enable two levels of uncertainty estimations: the local uncertainty on certain points and the global uncertainty p(z) that represents the function evolution in dynamic environments. To facilitate continual learning, we hypothesize that the previous knowledge can be applied to the current task, hence adopt a coreset as a memory buffer to alleviate catastrophic forgetting. The relationships between the degree of global uncertainties with the intratask diversity and model complexity are discussed. We have estimated prediction uncertainties with multiple evolving types including abrupt/gradual/recurrent shifts. The applications encompass meta-continual learning in the 1-D, 2-D datasets, and a novel spatial-temporal COVID dataset. The results show that our method outperforms the baselines on the likelihood and can rebound quickly even for heavily evolved data streams.
Collapse
|
3
|
Wu X, Jiang B, Wang X, Ban T, Chen H. Feature Selection in the Data Stream Based on Incremental Markov Boundary Learning. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2023; 34:6740-6754. [PMID: 37028034 DOI: 10.1109/tnnls.2023.3249767] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Recent years have witnessed the proliferation of techniques for streaming data mining to meet the demands of many real-time systems, where high-dimensional streaming data are generated at high speed, increasing the burden on both hardware and software. Some feature selection algorithms for streaming data are proposed to tackle this issue. However, these algorithms do not consider the distribution shift due to nonstationary scenarios, leading to performance degradation when the underlying distribution changes in the data stream. To solve this problem, this article investigates feature selection in streaming data through incremental Markov boundary (MB) learning and proposes a novel algorithm. Different from existing algorithms focusing on prediction performance on off-line data, the MB is learned by analyzing conditional dependence/independence in data, which uncovers the underlying mechanism and is naturally more robust against the distribution shift. To learn MB in the data stream, the proposal transforms the learned information in previous data blocks to prior knowledge and employs them to assist MB discovery in current data blocks, where the likelihood of distribution shift and reliability of conditional independence test are monitored to avoid the negative impact from invalid prior information. Extensive experiments on synthetic and real-world datasets demonstrate the superiority of the proposed algorithm.
Collapse
|
4
|
Ma D, Liu M, Zhang H, Wang R, Xie X. Accurate Power Sharing and Voltage Regulation for AC Microgrids: An Event-Triggered Coordinated Control Approach. IEEE TRANSACTIONS ON CYBERNETICS 2022; 52:13001-13011. [PMID: 34406955 DOI: 10.1109/tcyb.2021.3095959] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
The microgrid with the high proportion of renewable sources has become the trend of the future. However, the negative features, such as renewable energy perturbation, nonlinear counterpart, and so on, are prone to causing the low-power quality of the ac microgrid. To deal with these problems, this article proposes an event-triggered consensus control approach. First, the nonlinear state-space function regarding the ac microgrid is built, which is further transformed into the standard linear multiagent model by using the singular perturbation method. It provides indispensable preprocessing for the direct application of advanced linear control approaches. Then, based on this standard linear multiagent model, the secondary consensus approach with the leader is designed to compensate for the output voltage deviation and achieve accurate power sharing. In order to decrease the communication among various distributed generators, the event-triggered communication method is further proposed. Meanwhile, the Zeno behavior is avoided through the theoretical proof. Finally, simulation results are presented to demonstrate the effectiveness of the proposed approach.
Collapse
|
5
|
Jiao B, Guo Y, Gong D, Chen Q. Dynamic Ensemble Selection for Imbalanced Data Streams With Concept Drift. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2022; PP:1278-1291. [PMID: 35731763 DOI: 10.1109/tnnls.2022.3183120] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Ensemble learning, as a popular method to tackle concept drift in data stream, forms a combination of base classifiers according to their global performances. However, concept drift generally occurs in local data space, causing significantly different performances of a base classifier at different locations. Thus, employing global performance as a criterion to select base classifier is inappropriate. Moreover, data stream is often accompanied by class imbalance problem, which affects the classification accuracy of ensemble learning on minority instances. To drawback these problems, a dynamic ensemble selection for imbalanced data streams with concept drift (DES-ICD) is proposed. For data arrived in chunk-by-chunk, a novel synthetic minority oversampling technique with adaptive nearest neighbors (AnnSMOTE) is developed to generate new minority instances that conform to the new concept. Following that, DES-ICD creates a base classifier on newly arrived data chunk balanced by AnnSMOTE and merges it with historical base classifiers to form a candidate classifier pool. For each query instance, the optimal combination is constructed in terms of the performance of candidate classifiers in its neighborhood. Experimental results for nine synthetic and five real-world datasets show that the proposed method outperforms seven comparative methods on classification accuracy and tracks new concepts in an imbalanced data stream more preciously.
Collapse
|
6
|
Wang KF, An J, Wei Z, Cui C, Ma XH, Ma C, Bao HQ. Deep Learning-Based Imbalanced Classification With Fuzzy Support Vector Machine. Front Bioeng Biotechnol 2022; 9:802712. [PMID: 35127672 PMCID: PMC8815771 DOI: 10.3389/fbioe.2021.802712] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2021] [Accepted: 12/20/2021] [Indexed: 12/23/2022] Open
Abstract
Imbalanced classification is widespread in the fields of medical diagnosis, biomedicine, smart city and Internet of Things. The imbalance of data distribution makes traditional classification methods more biased towards majority classes and ignores the importance of minority class. It makes the traditional classification methods ineffective in imbalanced classification. In this paper, a novel imbalance classification method based on deep learning and fuzzy support vector machine is proposed and named as DFSVM. DFSVM first uses a deep neural network to obtain an embedding representation of the data. This deep neural network is trained by using triplet loss to enhance similarities within classes and differences between classes. To alleviate the effects of imbalanced data distribution, oversampling is performed in the embedding space of the data. In this paper, we use an oversampling method based on feature and center distance, which can obtain more diverse new samples and prevent overfitting. To enhance the impact of minority class, we use a fuzzy support vector machine (FSVM) based on cost-sensitive learning as the final classifier. FSVM assigns a higher misclassification cost to minority class samples to improve the classification quality. Experiments were performed on multiple biological datasets and real-world datasets. The experimental results show that DFSVM has achieved promising classification performance.
Collapse
Affiliation(s)
- Ke-Fan Wang
- School of Electrical and Electronic Engineering, Shanghai Institute of Technology, Shanghai, China
| | - Jing An
- School of Electrical and Electronic Engineering, Shanghai Institute of Technology, Shanghai, China
| | - Zhen Wei
- School of Design, East China Normal University, Shanghai, China
- *Correspondence: Zhen Wei,
| | - Can Cui
- College of Electronic and Information Engineering, Tongji University, Shanghai, China
| | - Xiang-Hua Ma
- School of Electrical and Electronic Engineering, Shanghai Institute of Technology, Shanghai, China
| | - Chao Ma
- School of Electrical and Electronic Engineering, Shanghai Institute of Technology, Shanghai, China
| | - Han-Qiu Bao
- College of Electronic and Information Engineering, Tongji University, Shanghai, China
| |
Collapse
|
7
|
Abstract
A crane system often works in a complex environment. It is difficult to model or learn its true dynamics by traditional system identification approaches. If a dynamics model is created by minimizing its prediction error, its use tends to introduce inaccuracies and thus lead to suboptimal performance. Is it possible to learn the dynamics model of a crane that can achieve the best performance, instead of learning its true dynamics? This work answers the question by presenting a performance-driven model predictive control (P-MPC) algorithm for a two-dimensional underactuated bridge crane. In the proposed dual-layer control architecture, an inner-loop controller uses a proportional–integral–derivative controller to achieve anti-sway rapidly. An outer-loop controller uses MPC to ensure accurate trolley positioning under control constraints. Compared with classical MPC, this work proposes a data-driven method for plant modeling and controller parameter updating. By considering the control target at the learning stage, the method can avoid adjusting the controller to deal with uncertainty. We use Bayesian optimization in an active learning framework where a locally linear dynamics model is learned with the intent of maximizing control performance and then used in conjunction with optimal control schemes to efficiently design a controller for a given task. The model is updated directly based on the performance observed in experiments on the physical system in an iterative manner till a desired performance is achieved. The controller parameters and prediction models of the best closed-loop performance can be found through continuous experiments and iterative optimization. Simulation and experiment results show that we can explicitly find the dynamics model that produces the best performance for an actual system, and the method can quickly suppress swing and realize accurate trolley positioning. The results verified its effectiveness, feasibility, and superior performance on comparing it with state-of-the-art methods.
Collapse
|
8
|
B V, V VS. Optimized generative adversarial network with fractional calculus based feature fusion using Twitter stream for spam detection. INFORMATION SECURITY JOURNAL: A GLOBAL PERSPECTIVE 2021. [DOI: 10.1080/19393555.2021.1956024] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Affiliation(s)
- Venkateswarlu B
- Computer Science and Engineering, Koneru Lakshmaiah Education Foundation, Green Fields, Vaddeswaram, Andhra Pradesh
| | - Viswanath Shenoi V
- Computer Science and Engineering, Koneru Lakshmaiah Education Foundation, Green Fields, Vaddeswaram, Andhra Pradesh
| |
Collapse
|