1
|
Namiki W, Nishioka D, Tsuchiya T, Higuchi T, Terabe K. Magnetization Vector Rotation Reservoir Computing Operated by Redox Mechanism. Nano Lett 2024; 24:4383-4392. [PMID: 38513213 DOI: 10.1021/acs.nanolett.3c05029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/23/2024]
Abstract
Physical reservoir computing is a promising way to develop efficient artificial intelligence using physical devices exhibiting nonlinear dynamics. Although magnetic materials have advantages in miniaturization, the need for a magnetic field and large electric current results in high electric power consumption and a complex device structure. To resolve these issues, we propose a redox-based physical reservoir utilizing the planar Hall effect and anisotropic magnetoresistance, which are phenomena described by different nonlinear functions of the magnetization vector that do not need a magnetic field to be applied. The expressive power of this reservoir based on a compact all-solid-state redox transistor is higher than the previous physical reservoir. The normalized mean square error of the reservoir on a second-order nonlinear equation task was 1.69 × 10-3, which is lower than that of a memristor array (3.13 × 10-3) even though the number of reservoir nodes was fewer than half that of the memristor array.
Collapse
Affiliation(s)
- Wataru Namiki
- Research Center for Materials Nanoarchitectonics, National Institute for Materials Science, 1-1 Namiki, Tsukuba, Ibaraki 305-0044, Japan
| | - Daiki Nishioka
- Research Center for Materials Nanoarchitectonics, National Institute for Materials Science, 1-1 Namiki, Tsukuba, Ibaraki 305-0044, Japan
- Department of Applied Physics, Faculty of Science, Tokyo University of Science, 6-3-1 Niijuku, Katsushika, Tokyo 125-8585, Japan
| | - Takashi Tsuchiya
- Research Center for Materials Nanoarchitectonics, National Institute for Materials Science, 1-1 Namiki, Tsukuba, Ibaraki 305-0044, Japan
| | - Tohru Higuchi
- Department of Applied Physics, Faculty of Science, Tokyo University of Science, 6-3-1 Niijuku, Katsushika, Tokyo 125-8585, Japan
| | - Kazuya Terabe
- Research Center for Materials Nanoarchitectonics, National Institute for Materials Science, 1-1 Namiki, Tsukuba, Ibaraki 305-0044, Japan
| |
Collapse
|
2
|
Ebato Y, Nobukawa S, Sakemi Y, Nishimura H, Kanamaru T, Sviridova N, Aihara K. Impact of time-history terms on reservoir dynamics and prediction accuracy in echo state networks. Sci Rep 2024; 14:8631. [PMID: 38622178 PMCID: PMC11018609 DOI: 10.1038/s41598-024-59143-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2024] [Accepted: 04/08/2024] [Indexed: 04/17/2024] Open
Abstract
The echo state network (ESN) is an excellent machine learning model for processing time-series data. This model, utilising the response of a recurrent neural network, called a reservoir, to input signals, achieves high training efficiency. Introducing time-history terms into the neuron model of the reservoir is known to improve the time-series prediction performance of ESN, yet the reasons for this improvement have not been quantitatively explained in terms of reservoir dynamics characteristics. Therefore, we hypothesised that the performance enhancement brought about by time-history terms could be explained by delay capacity, a recently proposed metric for assessing the memory performance of reservoirs. To test this hypothesis, we conducted comparative experiments using ESN models with time-history terms, namely leaky integrator ESNs (LI-ESN) and chaotic echo state networks (ChESN). The results suggest that compared with ESNs without time-history terms, the reservoir dynamics of LI-ESN and ChESN can maintain diversity and stability while possessing higher delay capacity, leading to their superior performance. Explaining ESN performance through dynamical metrics are crucial for evaluating the numerous ESN architectures recently proposed from a general perspective and for the development of more sophisticated architectures, and this study contributes to such efforts.
Collapse
Affiliation(s)
- Yudai Ebato
- Graduate School of Information and Computer Science, Chiba Institute of Technology, 2-17-1 Tsudanuma, Narashino, Chiba, 275-0016, Japan.
| | - Sou Nobukawa
- Graduate School of Information and Computer Science, Chiba Institute of Technology, 2-17-1 Tsudanuma, Narashino, Chiba, 275-0016, Japan
- Department of Computer Science, Chiba Institute of Technology, 2-17-1 Tsudanuma, Narashino, Chiba, 275-0016, Japan
- Department of Preventive Intervention for Psychiatric Disorders, National Center of Neurology and Psychiatry, 4-1-1 Ogawa-Higashi, Kodaira, Tokyo, 187-8551, Japan
- Research Center for Mathematical Engineering, Chiba Institute of Technology, 2-17-1 Tsudanuma, Narashino, Chiba, 275-0016, Japan
| | - Yusuke Sakemi
- Research Center for Mathematical Engineering, Chiba Institute of Technology, 2-17-1 Tsudanuma, Narashino, Chiba, 275-0016, Japan
- International Research Center for Neurointelligence, The University of Tokyo Institutes for Advanced Study, The University of Tokyo, 7 choume 3-1 Hongou, Bunkyu ku, Tokyo, 113-8654, Japan
| | - Haruhiko Nishimura
- Faculty of Informatics, Yamato University, 2-5-1 Katanama chou, Suita, Osaka, 564-0082, Japan
| | - Takashi Kanamaru
- Department of Mechanical Science and Engineering, School of Advanced Engineering, Kogakuin University, 2665-1 Nakano chou, Hachioji, Tokyo, 192-0015, Japan
- International Research Center for Neurointelligence, The University of Tokyo Institutes for Advanced Study, The University of Tokyo, 7 choume 3-1 Hongou, Bunkyu ku, Tokyo, 113-8654, Japan
| | - Nina Sviridova
- Department of Intelligent Systems, Tokyo City University, 1 choume 28-1 Tamazutsumi, Setagaya, Tokyo, 158-8557, Japan
- International Research Center for Neurointelligence, The University of Tokyo Institutes for Advanced Study, The University of Tokyo, 7 choume 3-1 Hongou, Bunkyu ku, Tokyo, 113-8654, Japan
| | - Kazuyuki Aihara
- Research Center for Mathematical Engineering, Chiba Institute of Technology, 2-17-1 Tsudanuma, Narashino, Chiba, 275-0016, Japan
- International Research Center for Neurointelligence, The University of Tokyo Institutes for Advanced Study, The University of Tokyo, 7 choume 3-1 Hongou, Bunkyu ku, Tokyo, 113-8654, Japan
| |
Collapse
|
3
|
Sugiura S, Ariizumi R, Asai T, Azuma SI. Existence of reservoir with finite-dimensional output for universal reservoir computing. Sci Rep 2024; 14:8448. [PMID: 38600157 PMCID: PMC11006892 DOI: 10.1038/s41598-024-56742-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2023] [Accepted: 03/11/2024] [Indexed: 04/12/2024] Open
Abstract
In this paper, we prove the existence of a reservoir that has a finite-dimensional output and makes the reservoir computing model universal. Reservoir computing is a method for dynamical system approximation that trains the static part of a model but fixes the dynamical part called the reservoir. Hence, reservoir computing has the advantage of training models with a low computational cost. Moreover, fixed reservoirs can be implemented as physical systems. Such reservoirs have attracted attention in terms of computation speed and energy consumption. The universality of a reservoir computing model is its ability to approximate an arbitrary system with arbitrary accuracy. Two sufficient reservoir conditions to make the model universal have been proposed. The first is the combination of fading memory and the separation property. The second is the neighborhood separation property, which we proposed recently. To date, it has been unknown whether a reservoir with a finite-dimensional output can satisfy these conditions. In this study, we prove that no reservoir with a finite-dimensional output satisfies the former condition. By contrast, we propose a single output reservoir that satisfies the latter condition. This implies that, for any dimension, a reservoir making the model universal exists with the output of that specified dimension. These results clarify the practical importance of our proposed conditions.
Collapse
Affiliation(s)
- Shuhei Sugiura
- Graduate School of Engineering, Nagoya University, Nagoya, 464-8603, Japan
| | - Ryo Ariizumi
- Department of Mechanical Systems Engineering, Tokyo University of Agriculture and Technology, Koganei, Tokyo, 184-8588, Japan.
| | - Toru Asai
- Graduate School of Engineering, Nagoya University, Nagoya, 464-8603, Japan
| | - Shun-Ichi Azuma
- Graduate School of Informatics, Kyoto University, Kyoto, 606-8501, Japan
| |
Collapse
|
4
|
Szalisznyó K, Silverstein DN. Computational insights on asymmetrical D1 and D2 receptor-mediated chunking: implications for OCD and Schizophrenia. Cogn Neurodyn 2024; 18:217-232. [PMID: 38406202 PMCID: PMC10881457 DOI: 10.1007/s11571-022-09865-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2021] [Revised: 07/06/2022] [Accepted: 07/21/2022] [Indexed: 01/15/2023] Open
Abstract
Repetitive thoughts and motor programs including perseveration are bridge symptoms characteristic of obsessive compulsive disorder (OCD), schizophrenia and in the co-morbid overlap of these conditions. The above pathologies are sensitive to altered activation and kinetics of dopamine D 1 and D 2 receptors that differently influence sequence learning and recall. Recognizing start and stop elements of motor and cognitive behaviors has crucial importance. During chunking, frequent components of temporal strings are concatenated into single units. We extended a published computational model (Asabuki et al. 2018), where two populations of neurons are connected and simulated in a reservoir computing framework. These neural pools were adopted to represent D1 and D2 striatal neuronal populations. We investigated how specific neural and striatal circuit parameters can influence start/stop signaling and found that asymmetric intra-network connection probabilities, synaptic weights and differential time constants may contribute to signaling of start/stop elements within learned sequences. Asymmetric coupling between the striatal D 1 and D 2 neural populations was also demonstrated to be beneficial. Our modeling results predict that dynamical differences between the two dopaminergic striatal populations and the interaction between them may play complementary roles in chunk boundary signaling. Start and stop dichotomies can arise from the larger circuit dynamics as well, since neural and intra-striatal connections only partially support a clear division of labor.
Collapse
Affiliation(s)
- Krisztina Szalisznyó
- Department of Medical Sciences, Psychiatry, Uppsala University Hospital, Uppsala University, 751 85 Uppsala, Sweden
- Theoretical Neuroscience and Complex Systems Research Group, Wigner Research Centre for Physics, Budapest, Hungary
| | | |
Collapse
|
5
|
Wikner A, Harvey J, Girvan M, Hunt BR, Pomerance A, Antonsen T, Ott E. Stabilizing machine learning prediction of dynamics: Novel noise-inspired regularization tested with reservoir computing. Neural Netw 2024; 170:94-110. [PMID: 37977092 DOI: 10.1016/j.neunet.2023.10.054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2022] [Revised: 10/19/2023] [Accepted: 10/30/2023] [Indexed: 11/19/2023]
Abstract
Recent work has shown that machine learning (ML) models can skillfully forecast the dynamics of unknown chaotic systems. Short-term predictions of the state evolution and long-term predictions of the statistical patterns of the dynamics ("climate") can be produced by employing a feedback loop, whereby the model is trained to predict forward only one time step, then the model output is used as input for multiple time steps. In the absence of mitigating techniques, however, this feedback can result in artificially rapid error growth ("instability"). One established mitigating technique is to add noise to the ML model training input. Based on this technique, we formulate a new penalty term in the loss function for ML models with memory of past inputs that deterministically approximates the effect of many small, independent noise realizations added to the model input during training. We refer to this penalty and the resulting regularization as Linearized Multi-Noise Training (LMNT). We systematically examine the effect of LMNT, input noise, and other established regularization techniques in a case study using reservoir computing, a machine learning method using recurrent neural networks, to predict the spatiotemporal chaotic Kuramoto-Sivashinsky equation. We find that reservoir computers trained with noise or with LMNT produce climate predictions that appear to be indefinitely stable and have a climate very similar to the true system, while the short-term forecasts are substantially more accurate than those trained with other regularization techniques. Finally, we show the deterministic aspect of our LMNT regularization facilitates fast reservoir computer regularization hyperparameter tuning.
Collapse
Affiliation(s)
- Alexander Wikner
- Department of Physics, University of Maryland, 4150 Campus Dr, 20742, College Park, United States.
| | - Joseph Harvey
- Hillsdale College, 33 E College St, 49242, Hillsdale, United States
| | - Michelle Girvan
- Department of Physics, University of Maryland, 4150 Campus Dr, 20742, College Park, United States
| | - Brian R Hunt
- Department of Mathematics, University of Maryland, 4176 Campus Dr, 20742, College Park, United States
| | - Andrew Pomerance
- Potomac Research LLC, 801 N Pitt St, 22341, Alexandria, United States
| | - Thomas Antonsen
- Department of Physics, University of Maryland, 4150 Campus Dr, 20742, College Park, United States; Department of Electrical and Computer Engineering, University of Maryland, 8223 Paint Branch Dr, 20742, College Park, United States
| | - Edward Ott
- Department of Physics, University of Maryland, 4150 Campus Dr, 20742, College Park, United States; Department of Electrical and Computer Engineering, University of Maryland, 8223 Paint Branch Dr, 20742, College Park, United States
| |
Collapse
|
6
|
Peng H, Xiong X, Wu M, Wang J, Yang Q, Orellana-Martín D, Pérez-Jiménez MJ. Reservoir computing models based on spiking neural P systems for time series classification. Neural Netw 2024; 169:274-281. [PMID: 37918270 DOI: 10.1016/j.neunet.2023.10.041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2023] [Revised: 09/12/2023] [Accepted: 10/25/2023] [Indexed: 11/04/2023]
Abstract
Nonlinear spiking neural P (NSNP) systems are neural-like membrane computing models with nonlinear spiking mechanisms. Because of this nonlinear spiking mechanism, NSNP systems can show rich nonlinear dynamics. Reservoir computing (RC) is a novel recurrent neural network (RNN) and can overcome some shortcomings of traditional RNNs. Based on NSNP systems, we developed two RC variants for time series classification, RC-SNP and RC-RMS-SNP, which are without and integrated with reservoir model space (RMS), respectively. The two RC variants use NSNP systems as the reservoirs and can be easily implemented in the RC framework. The proposed two RC variants were evaluated on 17 benchmark time series classification datasets and compared with 16 state-of-the-art or baseline classification models. The comparison results demonstrate the effectiveness of the proposed two RC variants for time series classification tasks.
Collapse
Affiliation(s)
- Hong Peng
- School of Computer and Software Engineering, Xihua University, Chengdu, 610039, China.
| | - Xin Xiong
- School of Computer and Software Engineering, Xihua University, Chengdu, 610039, China
| | - Min Wu
- School of Computer and Software Engineering, Xihua University, Chengdu, 610039, China
| | - Jun Wang
- School of Electrical Engineering and Electronic Information, Xihua University, Chengdu, 610039, China
| | - Qian Yang
- School of Computer and Software Engineering, Xihua University, Chengdu, 610039, China
| | - David Orellana-Martín
- Research Group of Natural Computing, Department of Computer Science and Artificial Intelligence, University of Seville, Sevilla, 41012, Spain
| | - Mario J Pérez-Jiménez
- Research Group of Natural Computing, Department of Computer Science and Artificial Intelligence, University of Seville, Sevilla, 41012, Spain
| |
Collapse
|
7
|
Semenov A, Pecherskaya E, Golubkov P, Gurin S, Artamonov D, Shepeleva Y. Parametric identification of the mathematical model of the micro-arc oxidation process. Heliyon 2023; 9:e19995. [PMID: 37810131 PMCID: PMC10559672 DOI: 10.1016/j.heliyon.2023.e19995] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2023] [Revised: 05/01/2023] [Accepted: 09/07/2023] [Indexed: 10/10/2023] Open
Abstract
The article is aimed at solving the problem of parametric identification of non-linear object models using the example of a mathematical model of the micro-arc oxidation process. An algorithm for parametric identification, based on an experiment in the micro-arc oxidation process, the results of which form a training and control sample is proposed; sequential training of neural networks and calculation of the parameters estimates of the nonlinear model according to experimental data are performed. Experimental testing of the proposed method of neural network parametric identification on the example of the micro-arc oxidation process confirmed that the standard deviation of current and voltage from the nominal values does not exceed ±4%. The obtained results were used in the development of an intelligent hardware-software complex for the production of protective coatings by the micro-arc oxidation method.
Collapse
Affiliation(s)
- Anatoliy Semenov
- Department of Information and Measurement Equipment and Metrology, Penza State University, Krasnaya Street 40, Penza 440026, Russia
| | - Ekaterina Pecherskaya
- Department of Information and Measurement Equipment and Metrology, Penza State University, Krasnaya Street 40, Penza 440026, Russia
| | - Pavel Golubkov
- Department of Information and Measurement Equipment and Metrology, Penza State University, Krasnaya Street 40, Penza 440026, Russia
| | - Sergey Gurin
- Department of Information and Measurement Equipment and Metrology, Penza State University, Krasnaya Street 40, Penza 440026, Russia
| | - Dmitriy Artamonov
- Polytechnic Institute, Penza State University, Krasnaya Street 40, Penza 440026, Russia
| | - Yuliya Shepeleva
- Polytechnic Institute, Penza State University, Krasnaya Street 40, Penza 440026, Russia
| |
Collapse
|
8
|
Picco E, Antonik P, Massar S. High speed human action recognition using a photonic reservoir computer. Neural Netw 2023; 165:662-675. [PMID: 37364475 DOI: 10.1016/j.neunet.2023.06.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2023] [Revised: 05/30/2023] [Accepted: 06/07/2023] [Indexed: 06/28/2023]
Abstract
The recognition of human actions in videos is one of the most active research fields in computer vision. The canonical approach consists in a more or less complex preprocessing stages of the raw video data, followed by a relatively simple classification algorithm. Here we address recognition of human actions using the reservoir computing algorithm, which allows us to focus on the classifier stage. We introduce a new training method for the reservoir computer, based on "Timesteps Of Interest", which combines in a simple way short and long time scales. We study the performance of this algorithm using both numerical simulations and a photonic implementation based on a single non-linear node and a delay line on the well known KTH dataset. We solve the task with high accuracy and speed, to the point of allowing for processing multiple video streams in real time. The present work is thus an important step towards developing efficient dedicated hardware for video processing.
Collapse
Affiliation(s)
- Enrico Picco
- Laboratoire d'Information Quantique, CP 224, Université Libre de Bruxelles (ULB), B-1050, Bruxelles, Belgium.
| | - Piotr Antonik
- MICS EA-4037 Laboratory, CentraleSupélec, F-91192, Gif-sur-Yvette, France
| | - Serge Massar
- Laboratoire d'Information Quantique, CP 224, Université Libre de Bruxelles (ULB), B-1050, Bruxelles, Belgium
| |
Collapse
|
9
|
Kawai Y, Park J, Tsuda I, Asada M. Learning long-term motor timing/patterns on an orthogonal basis in random neural networks. Neural Netw 2023; 163:298-311. [PMID: 37087852 DOI: 10.1016/j.neunet.2023.04.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2022] [Revised: 03/15/2023] [Accepted: 04/05/2023] [Indexed: 04/25/2023]
Abstract
The ability of the brain to generate complex spatiotemporal patterns with specific timings is essential for motor learning and temporal processing. An approach that can model this function, using the spontaneous activity of a random neural network (RNN), is associated with orbital instability. We propose a simple system that learns an arbitrary time series as the linear sum of stable trajectories produced by several small network modules. New finding in computer experiments is that the trajectories of the module outputs are orthogonal to each other. They created a dynamic orthogonal basis acquiring a high representational capacity, which enabled the system to learn the timing of extremely long intervals, such as tens of seconds for a millisecond computation unit, and also the complex time series of Lorenz attractors. This self-sustained system satisfies the stability and orthogonality requirements and thus provides a new neurocomputing framework and perspective for the neural mechanisms of motor learning.
Collapse
Affiliation(s)
- Yuji Kawai
- Symbiotic Intelligent Systems Research Center, Institute for Open and Transdisciplinary Research Initiatives, Osaka University, 1-1 Yamadaoka, Suita, Osaka 565-0871, Japan.
| | - Jihoon Park
- Symbiotic Intelligent Systems Research Center, Institute for Open and Transdisciplinary Research Initiatives, Osaka University, 1-1 Yamadaoka, Suita, Osaka 565-0871, Japan; Center for Information and Neural Networks, National Institute of Information and Communications Technology, 1-4 Yamadaoka, Suita, Osaka 565-0871, Japan
| | - Ichiro Tsuda
- Chubu University Academy of Emerging Sciences/Center for Mathematical Science and Artificial Intelligence, Chubu University, 1200 Matsumoto-cho, Kasugai, Aichi 487-8501, Japan
| | - Minoru Asada
- Symbiotic Intelligent Systems Research Center, Institute for Open and Transdisciplinary Research Initiatives, Osaka University, 1-1 Yamadaoka, Suita, Osaka 565-0871, Japan; Center for Information and Neural Networks, National Institute of Information and Communications Technology, 1-4 Yamadaoka, Suita, Osaka 565-0871, Japan; Chubu University Academy of Emerging Sciences/Center for Mathematical Science and Artificial Intelligence, Chubu University, 1200 Matsumoto-cho, Kasugai, Aichi 487-8501, Japan; International Professional University of Technology in Osaka, 3-3-1 Umeda, Kita-ku, Osaka 530-0001, Japan
| |
Collapse
|
10
|
Gupta V, Li LKB, Chen S, Wan M. Model-free forecasting of partially observable spatiotemporally chaotic systems. Neural Netw 2023; 160:297-305. [PMID: 36716509 DOI: 10.1016/j.neunet.2023.01.013] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Revised: 01/09/2023] [Accepted: 01/15/2023] [Indexed: 01/24/2023]
Abstract
Reservoir computing is a powerful tool for forecasting turbulence because its simple architecture has the computational efficiency to handle high-dimensional systems. Its implementation, however, often requires full state-vector measurements and knowledge of the system nonlinearities. We use nonlinear projector functions to expand the system measurements to a high dimensional space and then feed them to a reservoir to obtain forecasts. We demonstrate the application of such reservoir computing networks on spatiotemporally chaotic systems, which model several features of turbulence. We show that using radial basis functions as nonlinear projectors enables complex system nonlinearities to be captured robustly even with only partial observations and without knowing the governing equations. Finally, we show that when measurements are sparse or incomplete and noisy, such that even the governing equations become inaccurate, our networks can still produce reasonably accurate forecasts, thus paving the way towards model-free forecasting of practical turbulent systems.
Collapse
Affiliation(s)
- Vikrant Gupta
- Guangdong Provincial Key Laboratory of Turbulence Research and Applications, Department of Mechanics and Aerospace Engineering, Southern University of Science and Technology, Shenzhen, 518055, PR China; Guangdong-Hong Kong-Macao Joint Laboratory for Data-Driven Fluid Mechanics and Engineering Applications, Southern University of Science and Technology, Shenzhen, 518055, PR China
| | - Larry K B Li
- Department of Mechanical and Aerospace Engineering, Hong Kong University of Science and Technology, Hong Kong, China; Guangdong-Hong Kong-Macao Joint Laboratory for Data-Driven Fluid Mechanics and Engineering Applications, Hong Kong University of Science and Technology, Hong Kong, China
| | - Shiyi Chen
- Guangdong Provincial Key Laboratory of Turbulence Research and Applications, Department of Mechanics and Aerospace Engineering, Southern University of Science and Technology, Shenzhen, 518055, PR China; Guangdong-Hong Kong-Macao Joint Laboratory for Data-Driven Fluid Mechanics and Engineering Applications, Southern University of Science and Technology, Shenzhen, 518055, PR China; Eastern Institute for Advanced Study, Ningbo, 315200, PR China
| | - Minping Wan
- Guangdong Provincial Key Laboratory of Turbulence Research and Applications, Department of Mechanics and Aerospace Engineering, Southern University of Science and Technology, Shenzhen, 518055, PR China; Guangdong-Hong Kong-Macao Joint Laboratory for Data-Driven Fluid Mechanics and Engineering Applications, Southern University of Science and Technology, Shenzhen, 518055, PR China; Jiaxing Research Institute, Southern University of Science and Technology, Jiaxing, 314031, PR China.
| |
Collapse
|
11
|
Scleidorovich P, Weitzenfeld A, Fellous JM, Dominey PF. Integration of velocity-dependent spatio-temporal structure of place cell activation during navigation in a reservoir model of prefrontal cortex. Biol Cybern 2022; 116:585-610. [PMID: 36222887 DOI: 10.1007/s00422-022-00945-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/07/2022] [Accepted: 09/10/2022] [Indexed: 06/16/2023]
Abstract
Sequential behavior unfolds both in space and in time. The same spatial trajectory can be realized in different manners in the same overall time by changing instantaneous speeds. The current research investigates how speed profiles might be given behavioral significance and how cortical networks might encode this information. We first demonstrate that rats can associate different speed patterns on the same trajectory with distinct behavioral choices. In this novel experimental paradigm, rats follow a small baited robot in a large megaspace environment where the rat's speed is precisely controlled by the robot's speed. Based on this proof of concept and research showing that recurrent reservoir networks are ideal for representing spatio-temporal structures, we then test reservoir networks in simulated navigation contexts and demonstrate they can discriminate between traversals of the same path with identical durations but different speed profiles. We then test the networks in an embodied robotic setup, where we use place cell representations from physically navigating robots as input and again successfully discriminate between traversals. To demonstrate that this capability is inherent to recurrent networks, we compared the model against simple linear integrators. Interestingly, although the linear integrators could also perform the speed profile discrimination, a clear difference emerged when examining information coding in both models. Reservoir neurons displayed a form of statistical mixed selectivity as a complex interaction between spatial location and speed that was not as abundant in the linear integrators. This mixed selectivity is characteristic of cortex and reservoirs and allows us to generate specific predictions about the neural activity that will be recorded in rat cortex in future experiments.
Collapse
Affiliation(s)
- Pablo Scleidorovich
- Department of Computer Science and Engineering, University of South Florida, Tampa, USA
| | - Alfredo Weitzenfeld
- Department of Computer Science and Engineering, University of South Florida, Tampa, USA
| | - Jean-Marc Fellous
- Departments of Psychology and Biomedical Engineering, University of Arizona, Tucson, USA
| | - Peter Ford Dominey
- INSERM UMR1093-CAPS, Université Bourgogne Franche-Comté, UFR Des Sciences du Sport, 21000, Dijon, France.
- Robot Cognition Laboratory, Institute Marey, Dijon, France.
| |
Collapse
|
12
|
Platt JA, Penny SG, Smith TA, Chen TC, Abarbanel HDI. A systematic exploration of reservoir computing for forecasting complex spatiotemporal dynamics. Neural Netw 2022; 153:530-552. [PMID: 35839598 DOI: 10.1016/j.neunet.2022.06.025] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2022] [Revised: 06/14/2022] [Accepted: 06/20/2022] [Indexed: 11/16/2022]
Abstract
A reservoir computer (RC) is a type of recurrent neural network architecture with demonstrated success in the prediction of spatiotemporally chaotic dynamical systems. A further advantage of RC is that it reproduces intrinsic dynamical quantities essential for its incorporation into numerical forecasting routines such as the ensemble Kalman filter-used in numerical weather prediction to compensate for sparse and noisy data. We explore here the architecture and design choices for a "best in class" RC for a number of characteristic dynamical systems. Our analysis points to the importance of large scale parameter optimization. We also note in particular the importance of including input bias in the RC design, which has a significant impact on the forecast skill of the trained RC model. In our tests, the use of a nonlinear readout operator does not affect the forecast time or the stability of the forecast. The effects of the reservoir dimension, spinup time, amount of training data, normalization, noise, and the RC time step are also investigated. Finally, we detail how our investigation leads to optimal design choices for a parallel RC scheme applied to the 40 dimensional spatiotemporally chaotic Lorenz 1996 dynamics.
Collapse
Affiliation(s)
- Jason A Platt
- Department of Physics, University of California San Diego, United States of America.
| | - Stephen G Penny
- Cooperative Institute for Research in Environmental Sciences, University of Colorado Boulder, Boulder, CO 80309, United States of America; Physical Sciences Laboratory, National Oceanic and Atmospheric Administration, Boulder, CO 80305, United States of America
| | - Timothy A Smith
- Cooperative Institute for Research in Environmental Sciences, University of Colorado Boulder, Boulder, CO 80309, United States of America; Physical Sciences Laboratory, National Oceanic and Atmospheric Administration, Boulder, CO 80305, United States of America
| | - Tse-Chun Chen
- Cooperative Institute for Research in Environmental Sciences, University of Colorado Boulder, Boulder, CO 80309, United States of America; Physical Sciences Laboratory, National Oceanic and Atmospheric Administration, Boulder, CO 80305, United States of America
| | - Henry D I Abarbanel
- Department of Physics, University of California San Diego, United States of America; Marine Physical Laboratory, Scripps Institution of Oceanography, University of California San Diego, 9500 Gilman Drive, La Jolla, CA 92093, USA
| |
Collapse
|
13
|
Daniels RK, Mallinson JB, Heywood ZE, Bones PJ, Arnold MD, Brown SA. Reservoir computing with 3D nanowire networks. Neural Netw 2022; 154:122-130. [PMID: 35882080 DOI: 10.1016/j.neunet.2022.07.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2022] [Revised: 04/27/2022] [Accepted: 07/06/2022] [Indexed: 10/17/2022]
Abstract
Networks of nanowires are currently being explored for a range of applications in brain-like (or neuromorphic) computing, and especially in reservoir computing (RC). Fabrication of real-world computing devices requires that the nanowires are deposited sequentially, leading to stacking of the wires on top of each other. However, most simulations of computational tasks using these systems treat the nanowires as 1D objects lying in a perfectly 2D plane - the effect of stacking on RC performance has not yet been established. Here we use detailed simulations to compare the performance of perfectly 2D and quasi-3D (stacked) networks of nanowires in two tasks: memory capacity and nonlinear transformation. We also show that our model of the junctions between nanowires is general enough to describe a wide range of memristive networks, and consider the impact of physically realistic electrode configurations on performance. We show that the various networks and configurations have a strikingly similar performance in RC tasks, which is surprising given their radically different topologies. Our results show that networks with an experimentally achievable number of electrodes perform close to the upper bounds achievable when using the information from every wire. However, we also show important differences, in particular that the quasi-3D networks are more resilient to changes in the input parameters, generalizing better to noisy training data. Since previous literature suggests that topology plays an important role in computing performance, these results may have important implications for future applications of nanowire networks in neuromorphic computing.
Collapse
Affiliation(s)
- R K Daniels
- The MacDiarmid Institute for Advanced Materials and Nanotechnology, School of Physical and Chemical Sciences, Te Kura Matū, University of Canterbury, Private Bag 4800, Christchurch 8140, New Zealand
| | - J B Mallinson
- The MacDiarmid Institute for Advanced Materials and Nanotechnology, School of Physical and Chemical Sciences, Te Kura Matū, University of Canterbury, Private Bag 4800, Christchurch 8140, New Zealand
| | - Z E Heywood
- Electrical and Computer Engineering, University of Canterbury, Private Bag 4800, Christchurch 8140, New Zealand
| | - P J Bones
- Electrical and Computer Engineering, University of Canterbury, Private Bag 4800, Christchurch 8140, New Zealand
| | - M D Arnold
- School of Mathematical and Physical Sciences, University of Technology Sydney, PO Box 123 Broadway NSW 2007, Australia
| | - S A Brown
- The MacDiarmid Institute for Advanced Materials and Nanotechnology, School of Physical and Chemical Sciences, Te Kura Matū, University of Canterbury, Private Bag 4800, Christchurch 8140, New Zealand.
| |
Collapse
|
14
|
Ibrahim H, Loo CK, Alnajjar F. Bidirectional parallel echo state network for speech emotion recognition. Neural Comput Appl 2022;:1-19. [PMID: 35669535 DOI: 10.1007/s00521-022-07410-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2021] [Accepted: 05/09/2022] [Indexed: 10/28/2022]
Abstract
Speech is an effective way for communicating and exchanging complex information between humans. Speech signal has involved a great attention in human-computer interaction. Therefore, emotion recognition from speech has become a hot research topic in the field of interacting machines with humans. In this paper, we proposed a novel speech emotion recognition system by adopting multivariate time series handcrafted feature representation from speech signals. Bidirectional echo state network with two parallel reservoir layers has been applied to capture additional independent information. The parallel reservoirs produce multiple representations for each direction from the bidirectional data with two stages of concatenation. The sparse random projection approach has been adopted to reduce the high-dimensional sparse output for each direction separately from both reservoirs. Random over-sampling and random under-sampling methods are used to overcome the imbalanced nature of the used speech emotion datasets. The performance of the proposed parallel ESN model is evaluated from the speaker-independent experiments on EMO-DB, SAVEE, RAVDESS, and FAU Aibo datasets. The results show that the proposed SER model is superior to the single reservoir and the state-of-the-art studies.
Collapse
|
15
|
Mammedov YD, Olugu EU, Farah GA. Weather forecasting based on data-driven and physics-informed reservoir computing models. Environ Sci Pollut Res Int 2022; 29:24131-24144. [PMID: 34825327 DOI: 10.1007/s11356-021-17668-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/08/2021] [Accepted: 11/17/2021] [Indexed: 06/13/2023]
Abstract
In response to the growing demand for the global energy supply chain, wind power has become an important research subject among studies in the advancement of renewable energy sources. The major concern is the stochastic volatility of weather conditions that hinder the development of wind power forecasting approaches. To address this issue, the current study proposes a weather prediction method divided into two models for wind speed and atmospheric system forecasting. First, the data-based model incorporated with wavelet transform and recurrent neural networks is employed to predict the wind speed. Second, the physics-informed echo state network was used to learn the chaotic behavior of the atmospheric system. The findings were validated with a case study conducted on wind speed data from Turkmenistan. The results suggest the outperformance of physics-informed model for accurate and reliable forecasting analysis, which indicates the potential for implementation in wind energy analysis.
Collapse
Affiliation(s)
- Yslam D Mammedov
- Department of Industrial and Petroleum Engineering, Faculty of Engineering, Technology and Built Environment, UCSI University, 56000, Kuala Lumpur, Malaysia.
| | - Ezutah Udoncy Olugu
- Department of Mechanical Engineering, Faculty of Engineering, Technology and Built Environment, UCSI University, 56000, Kuala Lumpur, Malaysia.
| | - Guleid A Farah
- Department of Computer Science, College of Computing, Georgia Institute of Technology, Atlanta, GA, 30332, USA
| |
Collapse
|
16
|
Tamura H, Tanaka G. Transfer-RLS method and transfer-FORCE learning for simple and fast training of reservoir computing models. Neural Netw 2021; 143:550-63. [PMID: 34304003 DOI: 10.1016/j.neunet.2021.06.031] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2020] [Revised: 05/13/2021] [Accepted: 06/29/2021] [Indexed: 11/22/2022]
Abstract
Reservoir computing is a machine learning framework derived from a special type of recurrent neural network. Following recent advances in physical reservoir computing, some reservoir computing devices are thought to be promising as energy-efficient machine learning hardware for real-time information processing. To realize efficient online learning with low-power reservoir computing devices, it is beneficial to develop fast convergence learning methods with simpler operations. This study proposes a training method located in the middle between the recursive least squares (RLS) method and the least mean squares (LMS) method, which are standard online learning methods for reservoir computing models. The RLS method converges fast but requires updates of a huge matrix called a gain matrix, whereas the LMS method does not use a gain matrix but converges very slow. On the other hand, the proposed method called a transfer-RLS method does not require updates of the gain matrix in the main-training phase by updating that in advance (i.e., in a pre-training phase). As a result, the transfer-RLS method can work with simpler operations than the original RLS method without sacrificing much convergence speed. We numerically and analytically show that the transfer-RLS method converges much faster than the LMS method. Furthermore, we show that a modified version of the transfer-RLS method (called transfer-FORCE learning) can be applied to the first-order reduced and controlled error (FORCE) learning for a reservoir computing model with a closed-loop, which is challenging to train.
Collapse
|
17
|
Lin X, Zou X, Ji Z, Huang T, Wu S, Mi Y. A brain-inspired computational model for spatio-temporal information processing. Neural Netw 2021; 143:74-87. [PMID: 34091238 DOI: 10.1016/j.neunet.2021.05.015] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2020] [Revised: 03/31/2021] [Accepted: 05/12/2021] [Indexed: 11/28/2022]
Abstract
Spatio-temporal information processing is fundamental in both brain functions and AI applications. Current strategies for spatio-temporal pattern recognition usually involve explicit feature extraction followed by feature aggregation, which requires a large amount of labeled data. In the present study, motivated by the subcortical visual pathway and early stages of the auditory pathway for motion and sound processing, we propose a novel brain-inspired computational model for generic spatio-temporal pattern recognition. The model consists of two modules, a reservoir module and a decision-making module. The former projects complex spatio-temporal patterns into spatially separated neural representations via its recurrent dynamics, the latter reads out neural representations via integrating information over time, and the two modules are linked together using known examples. Using synthetic data, we demonstrate that the model can extract the frequency and order information of temporal inputs. We apply the model to reproduce the looming pattern discrimination behavior as observed in experiments successfully. Furthermore, we apply the model to the gait recognition task, and demonstrate that our model accomplishes the recognition in an event-based manner and outperforms deep learning counterparts when training data is limited.
Collapse
Affiliation(s)
- Xiaohan Lin
- School of Electronics Engineering and Computer Science, Peking University, No.5 Yiheyuan Road Haidian District, Beijing 100871, PR China.
| | - Xiaolong Zou
- School of Electronics Engineering and Computer Science, Peking University, No.5 Yiheyuan Road Haidian District, Beijing 100871, PR China; School of Psychological and Cognitive Sciences, IDG/McGovern Institute for Brain Research, PKU-Tsinghua Center for Life Sciences, Peking University, No.5 Yiheyuan Road Haidian District, Beijing 100871, PR China.
| | - Zilong Ji
- School of Electronics Engineering and Computer Science, Peking University, No.5 Yiheyuan Road Haidian District, Beijing 100871, PR China; School of Psychological and Cognitive Sciences, IDG/McGovern Institute for Brain Research, PKU-Tsinghua Center for Life Sciences, Peking University, No.5 Yiheyuan Road Haidian District, Beijing 100871, PR China.
| | - Tiejun Huang
- School of Electronics Engineering and Computer Science, Peking University, No.5 Yiheyuan Road Haidian District, Beijing 100871, PR China.
| | - Si Wu
- School of Electronics Engineering and Computer Science, Peking University, No.5 Yiheyuan Road Haidian District, Beijing 100871, PR China; School of Psychological and Cognitive Sciences, IDG/McGovern Institute for Brain Research, PKU-Tsinghua Center for Life Sciences, Peking University, No.5 Yiheyuan Road Haidian District, Beijing 100871, PR China.
| | - Yuanyuan Mi
- Center for Neurointelligence, School of Medicine, Chongqing University, No.174 Shazhengjie, Shapingba, Chongqing 400044, PR China; AI Research Center, Peng Cheng Laboratory, No.2, Xingke First Street, Nanshan District, Shenzhen 518005, PR China.
| |
Collapse
|
18
|
Abstract
Echo state networks (ESNs) have been recently proved to be universal approximants for input/output systems with respect to various Lp-type criteria. When 1≤p<∞, only p-integrability hypotheses need to be imposed, while in the case p=∞ a uniform boundedness hypotheses on the inputs is required. This note shows that, in the last case, a universal family of ESNs can be constructed that contains exclusively elements that have the echo state and the fading memory properties. This conclusion could not be drawn with the results and methods available so far in the literature.
Collapse
Affiliation(s)
- Lukas Gonon
- Ludwig-Maximilians-Universität München, Mathematics Institute, Theresienstrasse 39, D-80333 Munich, Germany.
| | - Juan-Pablo Ortega
- Universität Sankt Gallen, Faculty of Mathematics and Statistics, Bodanstrasse 6, CH-9000 Sankt Gallen, Switzerland; Centre National de la Recherche Scientifique (CNRS), France.
| |
Collapse
|
19
|
Tokuda K, Fujiwara N, Sudo A, Katori Y. Chaos may enhance expressivity in cerebellar granular layer. Neural Netw 2021; 136:72-86. [PMID: 33450654 DOI: 10.1016/j.neunet.2020.12.020] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2020] [Revised: 11/23/2020] [Accepted: 12/20/2020] [Indexed: 11/22/2022]
Abstract
Recent evidence suggests that Golgi cells in the cerebellar granular layer are densely connected to each other with massive gap junctions. Here, we propose that the massive gap junctions between the Golgi cells contribute to the representational complexity of the granular layer of the cerebellum by inducing chaotic dynamics. We construct a model of cerebellar granular layer with diffusion coupling through gap junctions between the Golgi cells, and evaluate the representational capability of the network with the reservoir computing framework. First, we show that the chaotic dynamics induced by diffusion coupling results in complex output patterns containing a wide range of frequency components. Second, the long non-recursive time series of the reservoir represents the passage of time from an external input. These properties of the reservoir enable mapping different spatial inputs into different temporal patterns.
Collapse
|
20
|
Zhang A, Xu Z. Chaotic time series prediction using phase space reconstruction based conceptor network. Cogn Neurodyn 2020; 14:849-857. [PMID: 33101536 DOI: 10.1007/s11571-020-09612-7] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2019] [Revised: 04/30/2020] [Accepted: 06/20/2020] [Indexed: 10/23/2022] Open
Abstract
The Conceptor network is a new framework of reservoir computing (RC), in addition to the features of easy training, global convergence, it can online learn new classes of input patterns without complete re-learning from all the training data. The conventional connection topology and weights of the hidden layer (reservoir) of RC are initialized randomly, and are fixed to be no longer fine-tuned after initialization. However, it has been demonstrated that the reservoir connection of RC plays an important role in the computational performance of RC. Therefore, in this paper, we optimize the Conceptor's reservoir connection and propose a phase space reconstruction (PSR) -based reservoir generation method. We tested the generation method on time series prediction task, and the experiment results showed that the proposed PSR-based method can improve the prediction accuracy of Conceptor networks. Further, we compared the PSR-based Conceptor with two Conceptor networks of other typical reservoir topologies (random connected, cortex-like connected), and found that all of their prediction accuracy showed a nonlinear decline trend with increasing storage load, but in comparison, our proposed PSR-based method has the best accuracy under different storage loads.
Collapse
Affiliation(s)
- Anguo Zhang
- College of Physics and Information Engineering, Fuzhou University, Fuzhou, 350108 China.,Key Laboratory of Medical Instrumentation and Pharmaceutical Technology of Fujian Province, Fuzhou, 350116 China.,Research Institute of Ruijie, Ruijie Networks Co., Ltd., Fuzhou, 350002 China
| | - Zheng Xu
- School of Mechanical Engineering, Nanjing University of Science and Technology, Nanjing, 210094 China
| |
Collapse
|
21
|
Matsuki T, Shibata K. Adaptive balancing of exploration and exploitation around the edge of chaos in internal-chaos-based learning. Neural Netw 2020; 132:19-29. [PMID: 32861145 DOI: 10.1016/j.neunet.2020.08.002] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2019] [Revised: 06/02/2020] [Accepted: 08/04/2020] [Indexed: 11/15/2022]
Abstract
This paper addresses learning with exploration driven by chaotic internal dynamics of a neural network. Hoerzer et al. showed that a chaotic reservoir network (RN) can learn with exploration driven by external random noise and a sequential reward. In this paper, we demonstrate that a chaotic RN can learn without external noise because the output fluctuation originated from its internal chaotic dynamics functions as exploration. As learning progresses, the chaoticity decreases and the network can automatically switch from exploration mode to exploitation mode. Furthermore, the network can resume exploration when presented with a new situation. In addition, we found that even when the two parameters that influence the chaoticity are varied, learning performance always improves around the edge of chaos. From these results, we think that exploration is generated from internal chaotic dynamics, and exploitation appears in the process of forming attractors on the chaotic dynamics through learning. Consequently, exploration and exploitation are well-balanced around the edge of chaos, which leads to good learning performance.
Collapse
|
22
|
Pontes-Filho S, Lind P, Yazidi A, Zhang J, Hammer H, Mello GBM, Sandvig I, Tufte G, Nichele S. A neuro-inspired general framework for the evolution of stochastic dynamical systems: Cellular automata, random Boolean networks and echo state networks towards criticality. Cogn Neurodyn 2020; 14:657-74. [PMID: 33014179 DOI: 10.1007/s11571-020-09600-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2019] [Revised: 05/08/2020] [Accepted: 05/14/2020] [Indexed: 10/27/2022] Open
Abstract
Although deep learning has recently increased in popularity, it suffers from various problems including high computational complexity, energy greedy computation, and lack of scalability, to mention a few. In this paper, we investigate an alternative brain-inspired method for data analysis that circumvents the deep learning drawbacks by taking the actual dynamical behavior of biological neural networks into account. For this purpose, we develop a general framework for dynamical systems that can evolve and model a variety of substrates that possess computational capacity. Therefore, dynamical systems can be exploited in the reservoir computing paradigm, i.e., an untrained recurrent nonlinear network with a trained linear readout layer. Moreover, our general framework, called EvoDynamic, is based on an optimized deep neural network library. Hence, generalization and performance can be balanced. The EvoDynamic framework contains three kinds of dynamical systems already implemented, namely cellular automata, random Boolean networks, and echo state networks. The evolution of such systems towards a dynamical behavior, called criticality, is investigated because systems with such behavior may be better suited to do useful computation. The implemented dynamical systems are stochastic and their evolution with genetic algorithm mutates their update rules or network initialization. The obtained results are promising and demonstrate that criticality is achieved. In addition to the presented results, our framework can also be utilized to evolve the dynamical systems connectivity, update and learning rules to improve the quality of the reservoir used for solving computational tasks and physical substrate modeling.
Collapse
|
23
|
Cazin N, Scleidorovich P, Weitzenfeld A, Dominey PF. Real-time sensory-motor integration of hippocampal place cell replay and prefrontal sequence learning in simulated and physical rat robots for novel path optimization. Biol Cybern 2020; 114:249-268. [PMID: 32095878 DOI: 10.1007/s00422-020-00820-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/03/2019] [Accepted: 02/04/2020] [Indexed: 06/10/2023]
Abstract
An open problem in the cognitive dimensions of navigation concerns how previous exploratory experience is reorganized in order to allow the creation of novel efficient navigation trajectories. This behavior is revealed in the "traveling salesrat problem" (TSP) when rats discover the shortest path linking baited food wells after a few exploratory traversals. We have recently published a model of navigation sequence learning, where sharp wave ripple replay of hippocampal place cells transmit "snippets" of the recent trajectories that the animal has explored to the prefrontal cortex (PFC) (Cazin et al. in PLoS Comput Biol 15:e1006624, 2019). PFC is modeled as a recurrent reservoir network that is able to assemble these snippets into the efficient sequence (trajectory of spatial locations coded by place cell activation). The model of hippocampal replay generates a distribution of snippets as a function of their proximity to a reward, thus implementing a form of spatial credit assignment that solves the TSP task. The integrative PFC reservoir reconstructs the efficient TSP sequence based on exposure to this distribution of snippets that favors paths that are most proximal to rewards. While this demonstrates the theoretical feasibility of the PFC-HIPP interaction, the integration of such a dynamic system into a real-time sensory-motor system remains a challenge. In the current research, we test the hypothesis that the PFC reservoir model can operate in a real-time sensory-motor loop. Thus, the main goal of the paper is to validate the model in simulated and real robot scenarios. Place cell activation encoding the current position of the simulated and physical rat robot feeds the PFC reservoir which generates the successor place cell activation that represents the next step in the reproduced sequence in the readout. This is input to the robot, which advances to the coded location and then generates de novo the current place cell activation. This allows demonstration of the crucial role of embodiment. If the spatial code readout from PFC is played back directly into PFC, error can accumulate, and the system can diverge from desired trajectories. This required a spatial filter to decode the PFC code to a location and then recode a new place cell code for that location. In the robot, the place cell vector output of PFC is used to physically displace the robot and then generate a new place cell coded input to the PFC, replacing part of the software recoding procedure that was required otherwise. We demonstrate how this integrated sensory-motor system can learn simple navigation sequences and then, importantly, how it can synthesize novel efficient sequences based on prior experience, as previously demonstrated (Cazin et al. 2019). This contributes to the understanding of hippocampal replay in novel navigation sequence formation and the important role of embodiment.
Collapse
Affiliation(s)
- Nicolas Cazin
- INSERM UMR1093-CAPS, Université Bourgogne Franche-Comté, UFR des Sciences du Sport, 21000, Dijon, France
- Robot Cognition Laboratory, Institut Marey, INSERM U1093 CAPS, UBFC, Dijon, France
| | | | | | - Peter Ford Dominey
- INSERM UMR1093-CAPS, Université Bourgogne Franche-Comté, UFR des Sciences du Sport, 21000, Dijon, France.
- Robot Cognition Laboratory, Institut Marey, INSERM U1093 CAPS, UBFC, Dijon, France.
| |
Collapse
|
24
|
Stelzer F, Röhm A, Lüdge K, Yanchuk S. Performance boost of time-delay reservoir computing by non-resonant clock cycle. Neural Netw 2020; 124:158-169. [PMID: 32006747 DOI: 10.1016/j.neunet.2020.01.010] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2019] [Revised: 01/09/2020] [Accepted: 01/09/2020] [Indexed: 11/25/2022]
Abstract
The time-delay-based reservoir computing setup has seen tremendous success in both experiment and simulation. It allows for the construction of large neuromorphic computing systems with only few components. However, until now the interplay of the different timescales has not been investigated thoroughly. In this manuscript, we investigate the effects of a mismatch between the time-delay and the clock cycle for a general model. Typically, these two time scales are considered to be equal. Here we show that the case of equal or resonant time-delay and clock cycle could be actively detrimental and leads to an increase of the approximation error of the reservoir. In particular, we can show that non-resonant ratios of these time scales have maximal memory capacities. We achieve this by translating the periodically driven delay-dynamical system into an equivalent network. Networks that originate from a system with resonant delay-times and clock cycles fail to utilize all of their degrees of freedom, which causes the degradation of their performance.
Collapse
Affiliation(s)
- Florian Stelzer
- Institute of Mathematics, Technische Universität Berlin, D-10623, Germany; Department of Mathematics, Humboldt-Universität zu Berlin, D-12489, Germany.
| | - André Röhm
- Institute of Theoretical Physics, Technische Universität Berlin, D-10623, Germany; Instituto de Física Interdisciplinar y Sistemas Complejos, IFISC (CSIC-UIB), Campus Universitat de les Illes Balears, E-07122 Palma de Mallorca, Spain.
| | - Kathy Lüdge
- Institute of Theoretical Physics, Technische Universität Berlin, D-10623, Germany.
| | - Serhiy Yanchuk
- Institute of Mathematics, Technische Universität Berlin, D-10623, Germany.
| |
Collapse
|
25
|
Kiefer C. Sample-level sound synthesis with recurrent neural networks and conceptors. PeerJ Comput Sci 2019; 5:e205. [PMID: 33816858 PMCID: PMC7924416 DOI: 10.7717/peerj-cs.205] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2018] [Accepted: 06/13/2019] [Indexed: 06/12/2023]
Abstract
Conceptors are a recent development in the field of reservoir computing; they can be used to influence the dynamics of recurrent neural networks (RNNs), enabling generation of arbitrary patterns based on training data. Conceptors allow interpolation and extrapolation between patterns, and also provide a system of boolean logic for combining patterns together. Generation and manipulation of arbitrary patterns using conceptors has significant potential as a sound synthesis method for applications in computer music but has yet to be explored. Conceptors are untested with the generation of multi-timbre audio patterns, and little testing has been done on scalability to longer patterns required for audio. A novel method of sound synthesis based on conceptors is introduced. Conceptular Synthesis is based on granular synthesis; sets of conceptors are trained to recall varying patterns from a single RNN, then a runtime mechanism switches between them, generating short patterns which are recombined into a longer sound. The quality of sound resynthesis using this technique is experimentally evaluated. Conceptor models are shown to resynthesise audio with a comparable quality to a close equivalent technique using echo state networks with stored patterns and output feedback. Conceptor models are also shown to excel in their malleability and potential for creative sound manipulation, in comparison to echo state network models which tend to fail when the same manipulations are applied. Examples are given demonstrating creative sonic possibilities, by exploiting conceptor pattern morphing, boolean conceptor logic and manipulation of RNN dynamics. Limitations of conceptor models are revealed with regards to reproduction quality, and pragmatic limitations are also shown, where rises in computation and memory requirements preclude the use of these models for training with longer sound samples. The techniques presented here represent an initial exploration of the sound synthesis potential of conceptors, demonstrating possible creative applications in sound design; future possibilities and research questions are outlined.
Collapse
Affiliation(s)
- Chris Kiefer
- Experimental Music Technologies Lab, Department of Music, University of Sussex, Brighton, United Kingdom
| |
Collapse
|
26
|
Abstract
The neural network is a powerful computing framework that has been exploited by biological evolution and by humans for solving diverse problems. Although the computational capabilities of neural networks are determined by their structure, the current understanding of the relationships between a neural network's architecture and function is still primitive. Here we reveal that a neural network's modular architecture plays a vital role in determining the neural dynamics and memory performance of the network of threshold neurons. In particular, we demonstrate that there exists an optimal modularity for memory performance, where a balance between local cohesion and global connectivity is established, allowing optimally modular networks to remember longer. Our results suggest that insights from dynamical analysis of neural networks and information-spreading processes can be leveraged to better design neural networks and may shed light on the brain's modular organization.
Collapse
Affiliation(s)
- Nathaniel Rodriguez
- School of Informatics, Computing, and Engineering, Indiana University, Bloomington, IN, USA
| | - Eduardo Izquierdo
- School of Informatics, Computing, and Engineering, Indiana University, Bloomington, IN, USA
- Cognitive Science Program, Indiana University, Bloomington, IN, USA
| | - Yong-Yeol Ahn
- School of Informatics, Computing, and Engineering, Indiana University, Bloomington, IN, USA
- Indiana University Network Science Institute, Bloomington, IN, USA
| |
Collapse
|
27
|
Tanaka G, Yamane T, Héroux JB, Nakane R, Kanazawa N, Takeda S, Numata H, Nakano D, Hirose A. Recent advances in physical reservoir computing: A review. Neural Netw 2019; 115:100-123. [PMID: 30981085 DOI: 10.1016/j.neunet.2019.03.005] [Citation(s) in RCA: 293] [Impact Index Per Article: 58.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2018] [Revised: 02/24/2019] [Accepted: 03/07/2019] [Indexed: 02/06/2023]
Abstract
Reservoir computing is a computational framework suited for temporal/sequential data processing. It is derived from several recurrent neural network models, including echo state networks and liquid state machines. A reservoir computing system consists of a reservoir for mapping inputs into a high-dimensional space and a readout for pattern analysis from the high-dimensional states in the reservoir. The reservoir is fixed and only the readout is trained with a simple method such as linear regression and classification. Thus, the major advantage of reservoir computing compared to other recurrent neural networks is fast learning, resulting in low training cost. Another advantage is that the reservoir without adaptive updating is amenable to hardware implementation using a variety of physical systems, substrates, and devices. In fact, such physical reservoir computing has attracted increasing attention in diverse fields of research. The purpose of this review is to provide an overview of recent advances in physical reservoir computing by classifying them according to the type of the reservoir. We discuss the current issues and perspectives related to physical reservoir computing, in order to further expand its practical applications and develop next-generation machine learning systems.
Collapse
Affiliation(s)
- Gouhei Tanaka
- Institute for Innovation in International Engineering Education, Graduate School of Engineering, The University of Tokyo, Tokyo 113-8656, Japan; Department of Electrical Engineering and Information Systems, Graduate School of Engineering, The University of Tokyo, Tokyo 113-8656, Japan.
| | | | | | - Ryosho Nakane
- Institute for Innovation in International Engineering Education, Graduate School of Engineering, The University of Tokyo, Tokyo 113-8656, Japan; Department of Electrical Engineering and Information Systems, Graduate School of Engineering, The University of Tokyo, Tokyo 113-8656, Japan
| | | | | | | | | | - Akira Hirose
- Institute for Innovation in International Engineering Education, Graduate School of Engineering, The University of Tokyo, Tokyo 113-8656, Japan; Department of Electrical Engineering and Information Systems, Graduate School of Engineering, The University of Tokyo, Tokyo 113-8656, Japan
| |
Collapse
|
28
|
Thiede LA, Parlitz U. Gradient based hyperparameter optimization in Echo State Networks. Neural Netw 2019; 115:23-29. [PMID: 30921562 DOI: 10.1016/j.neunet.2019.02.001] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2018] [Revised: 02/01/2019] [Accepted: 02/04/2019] [Indexed: 10/27/2022]
Abstract
Like most machine learning algorithms, Echo State Networks possess several hyperparameters that have to be carefully tuned for achieving best performance. For minimizing the error on a specific task, we present a gradient based optimization algorithm, for the input scaling, the spectral radius, the leaking rate, and the regularization parameter.
Collapse
Affiliation(s)
- Luca Anthony Thiede
- Max Planck Institute for Dynamics and Self-Organization, Am Fassberg 17, 37077 Göttingen, Germany; Institut für Dynamik komplexer Systeme, Georg-August-Universität Göttingen, Friedrich-Hund-Platz 1, 37077 Göttingen, Germany.
| | - Ulrich Parlitz
- Max Planck Institute for Dynamics and Self-Organization, Am Fassberg 17, 37077 Göttingen, Germany; Institut für Dynamik komplexer Systeme, Georg-August-Universität Göttingen, Friedrich-Hund-Platz 1, 37077 Göttingen, Germany.
| |
Collapse
|
29
|
Ortín S, Soriano MC, Alfaras M, Mirasso CR. Automated real-time method for ventricular heartbeat classification. Comput Methods Programs Biomed 2019; 169:1-8. [PMID: 30638588 DOI: 10.1016/j.cmpb.2018.11.005] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/21/2018] [Revised: 08/31/2018] [Accepted: 11/19/2018] [Indexed: 06/09/2023]
Abstract
BACKGROUND AND OBJECTIVE In this work, we develop a fully automatic and real-time ventricular heartbeat classifier based on a single ECG lead. Single ECG lead classifiers can be especially useful for wearable technologies that provide continuous and long-term monitoring of the electrocardiogram. These wearables usually have a few non-standard leads and the quality of the signals depends on the user physical activity. METHODS The proposed method uses an Echo State Network (ESN) to classify ECG signals following the Association for the Advancement of Medical Instrumentation (AAMI) recommendations with an inter-patient scheme. To achieve real-time classification, the classifier itself and the feature extraction approach are fast and computationally efficient. In addition, our approach allows transferring the knowledge from one database to another without additional training. RESULTS The classification performance of the proposed model is validated on the MIT-BIH arrhythmia and INCART databases. The sensitivity and precision of the proposed method for MIT-BIH arrhythmia database are 95.3 and 88.8 for the modified lead II and 90.9 and 89.2 for the V1 lead. The results reported are further compared to the existing methodologies in literature. Our methodology is a competitive single lead ventricular heartbeat classifier, that is comparable to state-of-the-art algorithms using multiple leads. CONCLUSIONS The proposed fully automated, single-lead and real-time heartbeat classifier of ventricular heartbeats reports an improved classification accuracy in different leads of the evaluated databases in comparison with other single lead heartbeat classifiers. These results open the possibility of applying our methodology to wearable long-term monitoring devices with an unconventional placement of the electrodes.
Collapse
Affiliation(s)
- Silvia Ortín
- Instituto de Física Interdisciplinar y Sistemas Complejos, IFISC (CSIC-UIB), Campus Universitat de les Illes Balears, E-07122 Palma de Mallorca, Spain.
| | - Miguel C Soriano
- Instituto de Física Interdisciplinar y Sistemas Complejos, IFISC (CSIC-UIB), Campus Universitat de les Illes Balears, E-07122 Palma de Mallorca, Spain.
| | - Miquel Alfaras
- Instituto de Física Interdisciplinar y Sistemas Complejos, IFISC (CSIC-UIB), Campus Universitat de les Illes Balears, E-07122 Palma de Mallorca, Spain.
| | - Claudio R Mirasso
- Instituto de Física Interdisciplinar y Sistemas Complejos, IFISC (CSIC-UIB), Campus Universitat de les Illes Balears, E-07122 Palma de Mallorca, Spain.
| |
Collapse
|
30
|
Gallicchio C, Micheli A, Pedrelli L. Design of deep echo state networks. Neural Netw 2018; 108:33-47. [PMID: 30138751 DOI: 10.1016/j.neunet.2018.08.002] [Citation(s) in RCA: 39] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2017] [Revised: 07/27/2018] [Accepted: 08/02/2018] [Indexed: 11/21/2022]
Abstract
In this paper, we provide a novel approach to the architectural design of deep Recurrent Neural Networks using signal frequency analysis. In particular, focusing on the Reservoir Computing framework and inspired by the principles related to the inherent effect of layering, we address a fundamental open issue in deep learning, namely the question of how to establish the number of layers in recurrent architectures in the form of deep echo state networks (DeepESNs). The proposed method is first analyzed and refined on a controlled scenario and then it is experimentally assessed on challenging real-world tasks. The achieved results also show the ability of properly designed DeepESNs to outperform RC approaches on a speech recognition task, and to compete with the state-of-the-art in time-series prediction on polyphonic music tasks.
Collapse
|
31
|
Szalisznyó K, Silverstein D, Teichmann M, Duffau H, Smits A. Cortico-striatal language pathways dynamically adjust for syntactic complexity: A computational study. Brain Lang 2017; 164:53-62. [PMID: 27792887 DOI: 10.1016/j.bandl.2016.08.005] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/17/2016] [Revised: 08/05/2016] [Accepted: 08/14/2016] [Indexed: 06/06/2023]
Abstract
A growing body of literature supports a key role of fronto-striatal circuits in language perception. It is now known that the striatum plays a role in engaging attentional resources and linguistic rule computation while also serving phonological short-term memory capabilities. The ventral semantic and the dorsal phonological stream dichotomy assumed for spoken language processing also seems to play a role in cortico-striatal perception. Based on recent studies that correlate deep Broca-striatal pathways with complex syntax performance, we used a previously developed computational model of frontal-striatal syntax circuits and hypothesized that different parallel language pathways may contribute to canonical and non-canonical sentence comprehension separately. We modified and further analyzed a thematic role assignment task and corresponding reservoir computing model of language circuits, as previously developed by Dominey and coworkers. We examined the models performance under various parameter regimes, by influencing how fast the presented language input decays and altering the temporal dynamics of activated word representations. This enabled us to quantify canonical and non-canonical sentence comprehension abilities. The modeling results suggest that separate cortico-cortical and cortico-striatal circuits may be recruited differently for processing syntactically more difficult and less complicated sentences. Alternatively, a single circuit would need to dynamically and adaptively adjust to syntactic complexity.
Collapse
Affiliation(s)
- Krisztina Szalisznyó
- Department of Neuroscience, Psychiatry, University Hospital, Uppsala University, 751 85 Uppsala, Sweden; Computational Neuroscience Group, Wigner Research Institute, Hungarian Academy of Sciences, P.O. Box 49, Budapest, Hungary.
| | - David Silverstein
- Department of Computational Science and Technology, KTH Royal Institute of Technology; Stockholm Brain Institute, Karolinska Institutet, Stockholm, Sweden
| | - Marc Teichmann
- Department of Neurology, Institut de la mémoire et de la maladie d'Alzheimer, Centre de Référence Démences Rares, Hopital de la Pitié-Salpetriére, AP-HP, Paris, France; Institut du Cerveau et de la Moelle Epiniére (ICM), ICM-INSERM 1127, FrontLab, Paris, France
| | - Hugues Duffau
- Department of Neurosurgery, Gui de Chauliac Hospital, University of Montpellier, Institute for Neurosciences of Montpellier, Montpellier, France
| | - Anja Smits
- Department of Neuroscience, Neurology, Uppsala University, Uppsala, Sweden; Department of Clinical Neurosciences and Rehabilitation, Sahlgrenska Academy, Gothenburg, Institute of Neurosciences and Physiology, University of Gothenburg, Gothenburg, Sweden
| |
Collapse
|
32
|
Kainz P, Burgsteiner H, Asslaber M, Ahammer H. Training echo state networks for rotation-invariant bone marrow cell classification. Neural Comput Appl 2017; 28:1277-92. [PMID: 28706349 DOI: 10.1007/s00521-016-2609-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2016] [Accepted: 09/07/2016] [Indexed: 11/26/2022]
Abstract
The main principle of diagnostic pathology is the reliable interpretation of individual cells in context of the tissue architecture. Especially a confident examination of bone marrow specimen is dependent on a valid classification of myeloid cells. In this work, we propose a novel rotation-invariant learning scheme for multi-class echo state networks (ESNs), which achieves very high performance in automated bone marrow cell classification. Based on representing static images as temporal sequence of rotations, we show how ESNs robustly recognize cells of arbitrary rotations by taking advantage of their short-term memory capacity. The performance of our approach is compared to a classification random forest that learns rotation-invariance in a conventional way by exhaustively training on multiple rotations of individual samples. The methods were evaluated on a human bone marrow image database consisting of granulopoietic and erythropoietic cells in different maturation stages. Our ESN approach to cell classification does not rely on segmentation of cells or manual feature extraction and can therefore directly be applied to image data.
Collapse
|
33
|
Mannella F, Baldassarre G. Selection of cortical dynamics for motor behaviour by the basal ganglia. Biol Cybern 2015; 109:575-595. [PMID: 26537483 PMCID: PMC4656718 DOI: 10.1007/s00422-015-0662-6] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/21/2014] [Accepted: 09/29/2015] [Indexed: 06/05/2023]
Abstract
The basal ganglia and cortex are strongly implicated in the control of motor preparation and execution. Re-entrant loops between these two brain areas are thought to determine the selection of motor repertoires for instrumental action. The nature of neural encoding and processing in the motor cortex as well as the way in which selection by the basal ganglia acts on them is currently debated. The classic view of the motor cortex implementing a direct mapping of information from perception to muscular responses is challenged by proposals viewing it as a set of dynamical systems controlling muscles. Consequently, the common idea that a competition between relatively segregated cortico-striato-nigro-thalamo-cortical channels selects patterns of activity in the motor cortex is no more sufficient to explain how action selection works. Here, we contribute to develop the dynamical view of the basal ganglia-cortical system by proposing a computational model in which a thalamo-cortical dynamical neural reservoir is modulated by disinhibitory selection of the basal ganglia guided by top-down information, so that it responds with different dynamics to the same bottom-up input. The model shows how different motor trajectories can so be produced by controlling the same set of joint actuators. Furthermore, the model shows how the basal ganglia might modulate cortical dynamics by preserving coarse-grained spatiotemporal information throughout cortico-cortical pathways.
Collapse
Affiliation(s)
- Francesco Mannella
- Laboratory of Computational Embodied Neuroscience, Institute of Cognitive Sciences and Technologies, National Research Council (CNR-ISTC-LOCEN), Via San Martino della Battaglia 44, 00185, Rome, Italy.
| | - Gianluca Baldassarre
- Laboratory of Computational Embodied Neuroscience, Institute of Cognitive Sciences and Technologies, National Research Council (CNR-ISTC-LOCEN), Via San Martino della Battaglia 44, 00185, Rome, Italy.
| |
Collapse
|
34
|
Abstract
Standard methods for the analysis of functional MRI data strongly rely on prior implicit and explicit hypotheses made to simplify the analysis. In this work the attention is focused on two such commonly accepted hypotheses: (i) the hemodynamic response function (HRF) to be searched in the BOLD signal can be described by a specific parametric model e.g., double-gamma; (ii) the effect of stimuli on the signal is taken to be linearly additive. While these assumptions have been empirically proven to generate high sensitivity for statistical methods, they also limit the identification of relevant voxels to what is already postulated in the signal, thus not allowing the discovery of unknown correlates in the data due to the presence of unexpected hemodynamics. This paper tries to overcome these limitations by proposing a method wherein the HRF is learned directly from data rather than induced from its basic form assumed in advance. This approach produces a set of voxel-wise models of HRF and, as a result, relevant voxels are filterable according to the accuracy of their prediction in a machine learning framework. This approach is instantiated using a temporal architecture based on the paradigm of Reservoir Computing wherein a Liquid State Machine is combined with a decoding Feed-Forward Neural Network. This splits the modeling into two parts: first a representation of the complex temporal reactivity of the hemodynamic response is determined by a universal global "reservoir" which is essentially temporal; second an interpretation of the encoded representation is determined by a standard feed-forward neural network, which is trained by the data. Thus the reservoir models the temporal state of information during and following temporal stimuli in a feed-back system, while the neural network "translates" this data to fit the specific HRF response as given, e.g. by BOLD signal measurements in fMRI. An empirical analysis on synthetic datasets shows that the learning process can be robust both to noise and to the varying shape of the underlying HRF. A similar investigation on real fMRI datasets provides evidence that BOLD predictability allows for discrimination between relevant and irrelevant voxels for a given set of stimuli.
Collapse
Affiliation(s)
- Paolo Avesani
- NeuroInformatics Laboratory (NILab), Fondazione Bruno Kessler, Trento, Italy; Centro Interdipartimentale Mente e Cervello (CIMeC), Università di Trento, Italy
| | - Hananel Hazan
- Department of Computer Science, University of Haifa, Israel; Network Biology Research Laboratory, Technion, Haifa, Israel
| | - Ester Koilis
- Department of Computer Science, University of Haifa, Israel
| | | | - Diego Sona
- NeuroInformatics Laboratory (NILab), Fondazione Bruno Kessler, Trento, Italy; Pattern Analysis and Computer Vision, Istituto Italiano di Tecnologia, Genova, Italy
| |
Collapse
|
35
|
Yamazaki T, Igarashi J. Realtime cerebellum: a large-scale spiking network model of the cerebellum that runs in realtime using a graphics processing unit. Neural Netw 2013; 47:103-11. [PMID: 23434303 DOI: 10.1016/j.neunet.2013.01.019] [Citation(s) in RCA: 59] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2012] [Revised: 01/24/2013] [Accepted: 01/25/2013] [Indexed: 11/29/2022]
Abstract
The cerebellum plays an essential role in adaptive motor control. Once we are able to build a cerebellar model that runs in realtime, which means that a computer simulation of 1 s in the simulated world completes within 1 s in the real world, the cerebellar model could be used as a realtime adaptive neural controller for physical hardware such as humanoid robots. In this paper, we introduce "Realtime Cerebellum (RC)", a new implementation of our large-scale spiking network model of the cerebellum, which was originally built to study cerebellar mechanisms for simultaneous gain and timing control and acted as a general-purpose supervised learning machine of spatiotemporal information known as reservoir computing, on a graphics processing unit (GPU). Owing to the massive parallel computing capability of a GPU, RC runs in realtime, while reproducing qualitatively the same simulation results of the Pavlovian delay eyeblink conditioning with the previous version. RC is adopted as a realtime adaptive controller of a humanoid robot, which is instructed to learn a proper timing to swing a bat to hit a flying ball online. These results suggest that RC provides a means to apply the computational power of the cerebellum as a versatile supervised learning machine towards engineering applications.
Collapse
Affiliation(s)
- Tadashi Yamazaki
- RIKEN Brain Science Institute, 2-1 Hirosawa, Wako, Saitama 351-0198, Japan.
| | | |
Collapse
|