1
|
Adeoye AD, Bemporad A. An Inexact Sequential Quadratic Programming Method for Learning and Control of Recurrent Neural Networks. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2025; 36:2762-2776. [PMID: 38294918 DOI: 10.1109/tnnls.2024.3354855] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/02/2024]
Abstract
This article considers the two-stage approach to solving a partially observable Markov decision process (POMDP): the identification stage and the (optimal) control stage. We present an inexact sequential quadratic programming framework for recurrent neural network learning (iSQPRL) for solving the identification stage of the POMDP, in which the true system is approximated by a recurrent neural network (RNN) with dynamically consistent overshooting (DCRNN). We formulate the learning problem as a constrained optimization problem and study the quadratic programming (QP) subproblem with a convergence analysis under a restarted Krylov-subspace iterative scheme that implicitly exploits the structure of the associated Karush-Kuhn-Tucker (KKT) subsystem. In the control stage, where a feedforward neural network (FNN) controller is designed on top of the RNN model, we adapt a generalized Gauss-Newton (GGN) algorithm that exploits useful approximations to the curvature terms of the training data and selects its mini-batch step size using a known property of some regularization function. Simulation results are provided to demonstrate the effectiveness of our approach.
Collapse
|
2
|
Ma L, Liu X, Gao F, Lee KY. Event-Based Switching Iterative Learning Model Predictive Control for Batch Processes With Randomly Varying Trial Lengths. IEEE TRANSACTIONS ON CYBERNETICS 2023; 53:7881-7894. [PMID: 37022073 DOI: 10.1109/tcyb.2023.3234630] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Iterative learning model predictive control (ILMPC) has been recognized as an excellent batch process control strategy for progressively improving tracking performance along trials. However, as a typical learning-based control method, ILMPC generally requires the strict identity of trial lengths to implement 2-D receding horizon optimization. The randomly varying trial lengths extensively existing in practice can result in the insufficiency of learning prior information, and even the suspension of control update. Regarding this issue, this article embeds a novel prediction-based modification mechanism into ILMPC, to adjust the process data of each trial into the same length by compensating the data of absent running periods with the predictive sequences at the end point. Under this modification scheme, it is proved that the convergence of the classical ILMPC is guaranteed by an inequality condition relative with the probability distribution of trial lengths. Considering the practical batch process with complex nonlinearity, a 2-D neural-network predictive model with parameter adaptability along trials is established to generate highly matched compensation data for the prediction-based modification. To best utilize the real process information of multiple past trials while guaranteeing the learning priority of the latest trials, an event-based switching learning structure is proposed in ILMPC to determine different learning orders according to the probability event with respect to the trial length variation direction. The convergence of the nonlinear event-based switching ILMPC system is analyzed theoretically under two situations divided by the switching condition. The simulations on a numerical example and the injection molding process verify the superiority of the proposed control methods.
Collapse
|
3
|
Chen C, Lu CX, Wang B, Trigoni N, Markham A. DynaNet: Neural Kalman Dynamical Model for Motion Estimation and Prediction. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2021; 32:5479-5491. [PMID: 34559667 DOI: 10.1109/tnnls.2021.3112460] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Dynamical models estimate and predict the temporal evolution of physical systems. State-space models (SSMs) in particular represent the system dynamics with many desirable properties, such as being able to model uncertainty in both the model and measurements, and optimal (in the Bayesian sense) recursive formulations, e.g., the Kalman filter. However, they require significant domain knowledge to derive the parametric form and considerable hand tuning to correctly set all the parameters. Data-driven techniques, e.g., recurrent neural networks, have emerged as compelling alternatives to SSMs with wide success across a number of challenging tasks, in part due to their impressive capability to extract relevant features from rich inputs. They, however, lack interpretability and robustness to unseen conditions. Thus, data-driven models are hard to be applied in safety-critical applications, such as self-driving vehicles. In this work, we present DynaNet, a hybrid deep learning and time-varying SSM, which can be trained end-to-end. Our neural Kalman dynamical model allows us to exploit the relative merits of both SSM and deep neural networks. We demonstrate its effectiveness in the estimation and prediction on a number of physically challenging tasks, including visual odometry, sensor fusion for visual-inertial navigation, and motion prediction. In addition, we show how DynaNet can indicate failures through investigation of properties, such as the rate of innovation (Kalman gain).
Collapse
|
4
|
Narkhede P, Walambe R, Poddar S, Kotecha K. Incremental learning of LSTM framework for sensor fusion in attitude estimation. PeerJ Comput Sci 2021; 7:e662. [PMID: 34435103 PMCID: PMC8356651 DOI: 10.7717/peerj-cs.662] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2020] [Accepted: 07/16/2021] [Indexed: 06/13/2023]
Abstract
This paper presents a novel method for attitude estimation of an object in 3D space by incremental learning of the Long-Short Term Memory (LSTM) network. Gyroscope, accelerometer, and magnetometer are few widely used sensors in attitude estimation applications. Traditionally, multi-sensor fusion methods such as the Extended Kalman Filter and Complementary Filter are employed to fuse the measurements from these sensors. However, these methods exhibit limitations in accounting for the uncertainty, unpredictability, and dynamic nature of the motion in real-world situations. In this paper, the inertial sensors data are fed to the LSTM network which are then updated incrementally to incorporate the dynamic changes in motion occurring in the run time. The robustness and efficiency of the proposed framework is demonstrated on the dataset collected from a commercially available inertial measurement unit. The proposed framework offers a significant improvement in the results compared to the traditional method, even in the case of a highly dynamic environment. The LSTM framework-based attitude estimation approach can be deployed on a standard AI-supported processing module for real-time applications.
Collapse
Affiliation(s)
- Parag Narkhede
- Symbiosis Institute of Technology, Symbiosis International (Deemed University), Pune, Maharashtra, India
| | - Rahee Walambe
- Symbiosis Centre for Applied Artificial Intelligence, Symbiosis International (Deemed University), Pune, Maharashtra, India
| | - Shashi Poddar
- Central Scientific Instruments Organisation, Council of Scientific and Industrial Research, Chandigarh, Chandigarh, India
| | - Ketan Kotecha
- Symbiosis Centre for Applied Artificial Intelligence, Symbiosis International (Deemed University), Pune, Maharashtra, India
| |
Collapse
|
5
|
An Optimized Parallel Implementation of Non-Iteratively Trained Recurrent Neural Networks. JOURNAL OF ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING RESEARCH 2020. [DOI: 10.2478/jaiscr-2021-0003] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Abstract
Recurrent neural networks (RNN) have been successfully applied to various sequential decision-making tasks, natural language processing applications, and time-series predictions. Such networks are usually trained through back-propagation through time (BPTT) which is prohibitively expensive, especially when the length of the time dependencies and the number of hidden neurons increase. To reduce the training time, extreme learning machines (ELMs) have been recently applied to RNN training, reaching a 99% speedup on some applications. Due to its non-iterative nature, ELM training, when parallelized, has the potential to reach higher speedups than BPTT.
In this work, we present Opt-PR-ELM, an optimized parallel RNN training algorithm based on ELM that takes advantage of the GPU shared memory and of parallel QR factorization algorithms to efficiently reach optimal solutions. The theoretical analysis of the proposed algorithm is presented on six RNN architectures, including LSTM and GRU, and its performance is empirically tested on ten time-series prediction applications. Opt-PR-ELM is shown to reach up to 461 times speedup over its sequential counterpart and to require up to 20x less time to train than parallel BPTT. Such high speedups over new generation CPUs are extremely crucial in real-time applications and IoT environments.
Collapse
|
6
|
Motion Planning of Autonomous Mobile Robot Using Recurrent Fuzzy Neural Network Trained by Extended Kalman Filter. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2019; 2019:1934575. [PMID: 30863434 PMCID: PMC6378056 DOI: 10.1155/2019/1934575] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/10/2018] [Revised: 12/07/2018] [Accepted: 01/02/2019] [Indexed: 11/17/2022]
Abstract
This paper proposes a novel motion planning method for an autonomous ground mobile robot to address dynamic surroundings, nonlinear program, and robust optimization problems. A planner based on the recurrent fuzzy neural network (RFNN) is designed to program trajectory and motion of mobile robots to reach target. And, obstacle avoidance is achieved. In RFNN, inference capability of fuzzy logic and learning capability of neural network are combined to improve nonlinear programming performance. A recurrent frame with self-feedback loops in RFNN enhances stability and robustness of the structure. The extended Kalman filter (EKF) is designed to train weights of RFNN considering the kinematic constraint of autonomous mobile robots as well as target and obstacle constraints. EKF's characteristics of fast convergence and little limit in training data make it suitable to train the weights in real time. Convergence of the training process is also analyzed in this paper. Optimization technique and update strategy are designed to improve the robust optimization of a system in dynamic surroundings. Simulation experiment and hardware experiment are implemented to prove the effectiveness of the proposed method. Hardware experiment is carried out on a tracked mobile robot. An omnidirectional vision is used to locate the robot in the surroundings. Forecast improvement of the proposed method is then discussed at the end.
Collapse
|
7
|
On extreme learning machines in sequential and time series prediction: A non-iterative and approximate training algorithm for recurrent neural networks. Neurocomputing 2019. [DOI: 10.1016/j.neucom.2018.09.012] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
8
|
Li Y, Chen J, Jiang L, Zeng N, Jiang H, Du M. The p53–Mdm2 regulation relationship under different radiation doses based on the continuous–discrete extended Kalman filter algorithm. Neurocomputing 2018. [DOI: 10.1016/j.neucom.2017.08.016] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
9
|
Nanda SJ, Jonwal N. Robust nonlinear channel equalization using WNN trained by symbiotic organism search algorithm. Appl Soft Comput 2017. [DOI: 10.1016/j.asoc.2017.03.029] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
10
|
Zheng D, Xie WF, Ren X, Na J. Identification and Control for Singularly Perturbed Systems Using Multitime-Scale Neural Networks. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2017; 28:321-333. [PMID: 26742148 DOI: 10.1109/tnnls.2015.2508738] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Many well-established singular perturbation theories for singularly perturbed systems require the full knowledge of system model parameters. In order to obtain an accurate and faithful model, a new identification scheme for singularly perturbed nonlinear system using multitime-scale recurrent high-order neural networks (NNs) is proposed in this paper. Inspired by the optimal bounded ellipsoid algorithm, which is originally designed for discrete-time systems, a novel weight updating law is developed for continuous-time NNs identification process. Compared with other widely used gradient-descent updating algorithms, this new method can achieve faster convergence, due to its adaptively adjusted learning rate. Based on the identification results, a control scheme using singular perturbation theories is developed. By using singular perturbation methods, the system order is reduced, and the controller structure is simplified. The closed-loop stability is analyzed and the convergence of system states is guaranteed. The effectiveness of the identification and the control scheme is demonstrated by simulation results.
Collapse
|
11
|
Jahanchahi C, Mandic DP. A class of quaternion Kalman filters. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2014; 25:533-544. [PMID: 24807449 DOI: 10.1109/tnnls.2013.2277540] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
The existing Kalman filters for quaternion-valued signals do not operate fully in the quaternion domain, and are combined with the real Kalman filter to enable the tracking in 3-D spaces. Using the recently introduced HR-calculus, we develop the fully quaternion-valued Kalman filter (QKF) and quaternion-extended Kalman filter (QEKF), allowing for the tracking of 3-D and 4-D signals directly in the quaternion domain. To consider the second-order noncircularity of signals, we employ the recently developed augmented quaternion statistics to derive the widely linear QKF (WL-QKF) and widely linear QEKF (WL-QEKF). To reduce computational requirements of the widely linear algorithms, their efficient implementation are proposed and it is shown that the quaternion widely linear model can be simplified when processing 3-D data, further reducing the computational requirements. Simulations using both synthetic and real-world circular and noncircular signals illustrate the advantages offered by widely linear over strictly linear quaternion Kalman filters.
Collapse
|
12
|
|
13
|
de Jesús Rubio J. Evolving intelligent algorithms for the modelling of brain and eye signals. Appl Soft Comput 2014. [DOI: 10.1016/j.asoc.2013.07.023] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
14
|
de Araújo Júnior JM, de Menezes Júnior JMP, de Albuquerque AAM, Almeida ODM, de Araújo FMU. Assessment and certification of neonatal incubator sensors through an inferential neural network. SENSORS 2013; 13:15613-32. [PMID: 24248278 PMCID: PMC3871086 DOI: 10.3390/s131115613] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/11/2013] [Revised: 10/12/2013] [Accepted: 10/12/2013] [Indexed: 12/05/2022]
Abstract
Measurement and diagnostic systems based on electronic sensors have been increasingly essential in the standardization of hospital equipment. The technical standard IEC (International Electrotechnical Commission) 60601-2-19 establishes requirements for neonatal incubators and specifies the calibration procedure and validation tests for such devices using sensors systems. This paper proposes a new procedure based on an inferential neural network to evaluate and calibrate a neonatal incubator. The proposal presents significant advantages over the standard calibration process, i.e., the number of sensors is drastically reduced, and it runs with the incubator under operation. Since the sensors used in the new calibration process are already installed in the commercial incubator, no additional hardware is necessary; and the calibration necessity can be diagnosed in real time without the presence of technical professionals in the neonatal intensive care unit (NICU). Experimental tests involving the aforementioned calibration system are carried out in a commercial incubator in order to validate the proposal.
Collapse
Affiliation(s)
- José Medeiros de Araújo Júnior
- Electrical Engineering Course, Federal University of Piauí (UFPI), 64049-550, Teresina, Piauí, Brazil; E-Mails: (J.M.P.M.J.); (O.M.A.)
- Author to whom correspondence should be addressed; E-Mail: ; Tel.: +55-86-3237-1555
| | | | | | - Otacílio da Mota Almeida
- Electrical Engineering Course, Federal University of Piauí (UFPI), 64049-550, Teresina, Piauí, Brazil; E-Mails: (J.M.P.M.J.); (O.M.A.)
| | - Fábio Meneghetti Ugulino de Araújo
- Department of Computer Engineering and Automation, Federal University of Rio Grande do Norte (UFRN), 59078-900, Natal, Rio Grande do Norte, Brazil; E-Mail:
| |
Collapse
|
15
|
Zhao J, Zhu X, Wang W, Liu Y. Extended Kalman filter-based Elman networks for industrial time series prediction with GPU acceleration. Neurocomputing 2013. [DOI: 10.1016/j.neucom.2013.02.031] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
16
|
Liu X, Gao Q. Parameter estimation and control for a neural mass model based on the unscented Kalman filter. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2013; 88:042905. [PMID: 24229247 DOI: 10.1103/physreve.88.042905] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/03/2013] [Revised: 08/30/2013] [Indexed: 06/02/2023]
Abstract
Recent progress in Kalman filters to estimate states and parameters in nonlinear systems has provided the possibility of applying such approaches to neural systems. We here apply the nonlinear method of unscented Kalman filters (UKFs) to observe states and estimate parameters in a neural mass model that can simulate distinct rhythms in electroencephalography (EEG) including dynamical evolution during epilepsy seizures. We demonstrate the efficiency of the UKF in estimating states and parameters. We also develop an UKF-based control strategy to modulate the dynamics of the neural mass model. In this strategy the UKF plays the role of observing states, and the control law is constructed via the estimated states. We demonstrate the feasibility of using such a strategy to suppress epileptiform spikes in the neural mass model.
Collapse
Affiliation(s)
- Xian Liu
- Key Lab of Industrial Computer Control Engineering of Hebei Province, Institute of Electrical Engineering, Yanshan University, Qinhuangdao 066004, China
| | | |
Collapse
|
17
|
Ornelas-Tellez F, Sanchez EN, Loukianov AG. Discrete-time neural inverse optimal control for nonlinear systems via passivation. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2012; 23:1327-1339. [PMID: 24807528 DOI: 10.1109/tnnls.2012.2200501] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
This paper presents a discrete-time inverse optimal neural controller, which is constituted by combination of two techniques: 1) inverse optimal control to avoid solving the Hamilton-Jacobi-Bellman equation associated with nonlinear system optimal control and 2) on-line neural identification, using a recurrent neural network trained with an extended Kalman filter, in order to build a model of the assumed unknown nonlinear system. The inverse optimal controller is based on passivity theory. The applicability of the proposed approach is illustrated via simulations for an unstable nonlinear system and a planar robot.
Collapse
|
18
|
Dini DH, Mandic DP. Class of widely linear complex Kalman filters. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2012; 23:775-86. [PMID: 24806126 DOI: 10.1109/tnnls.2012.2189893] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/02/2023]
Abstract
Recently, a class of widely linear (augmented) complex-valued Kalman filters (KFs), that make use of augmented complex statistics, have been proposed for sequential state space estimation of the generality of complex signals. This was achieved in the context of neural network training, and has allowed for a unified treatment of both second-order circular and noncircular signals, that is, both those with rotation invariant and rotation-dependent distributions. In this paper, we revisit the augmented complex KF, augmented complex extended KF, and augmented complex unscented KF in a more general context, and analyze their performances for different degrees of noncircularity of input and the state and measurement noises. For rigor, a theoretical bound for the performance advantage of widely linear KFs over their strictly linear counterparts is provided. The analysis also addresses the duality with bivariate real-valued KFs, together with several issues of implementation. Simulations using both synthetic and real world proper and improper signals support the analysis.
Collapse
|
19
|
Zhang R, Xu ZB, Huang GB, Wang D. Global convergence of online BP training with dynamic learning rate. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2012; 23:330-341. [PMID: 24808511 DOI: 10.1109/tnnls.2011.2178315] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
The online backpropagation (BP) training procedure has been extensively explored in scientific research and engineering applications. One of the main factors affecting the performance of the online BP training is the learning rate. This paper proposes a new dynamic learning rate which is based on the estimate of the minimum error. The global convergence theory of the online BP training procedure with the proposed learning rate is further studied. It is proved that: 1) the error sequence converges to the global minimum error; and 2) the weight sequence converges to a fixed point at which the error function attains its global minimum. The obtained global convergence theory underlies the successful applications of the online BP training procedure. Illustrative examples are provided to support the theoretical analysis.
Collapse
|
20
|
Song Q. Robust initialization of a Jordan network with recurrent constrained learning. IEEE TRANSACTIONS ON NEURAL NETWORKS 2011; 22:2460-2473. [PMID: 21965202 DOI: 10.1109/tnn.2011.2168423] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
In this paper, we propose a robust initialization of a Jordan network with a recurrent constrained learning (RIJNRCL) algorithm for multilayered recurrent neural networks (RNNs). This novel algorithm is based on the constrained learning concept of the Jordan network with a recurrent sensitivity and weight convergence analysis, which is used to obtain a tradeoff between the training and testing errors. In addition to using classical techniques for the adaptive learning rate and the adaptive dead zone, RIJNRCL employs a recurrent constrained parameter matrix to switch off excessive contributions from the hidden layer neurons based on weight convergence and stability conditions of the multilayered RNNs. It is well known that a good response from the hidden layer neurons and proper initialization play a dominant role in avoiding local minima in multilayered RNNs. The new RIJNRCL algorithm solves the twin problems of weight initialization and selection of the hidden layer neurons via a novel recurrent sensitivity ratio analysis. We provide the detailed steps for using RIJNRCL in a few benchmark time-series prediction problems and show that the proposed algorithm achieves superior generalization performance.
Collapse
Affiliation(s)
- Qing Song
- School of Electrical and Electronic Engineering, Nanyang Technological University, 639798 Singapore.
| |
Collapse
|