1
|
Liu Q, Yu M, Bai M. A study on a recommendation algorithm based on spectral clustering and GRU. iScience 2024; 27:108660. [PMID: 38313050 PMCID: PMC10835353 DOI: 10.1016/j.isci.2023.108660] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2023] [Revised: 11/29/2023] [Accepted: 12/05/2023] [Indexed: 02/06/2024] Open
Abstract
With the development of e-commerce, the importance of recommendation algorithms has significantly increased. However, traditional recommendation systems struggle to address issues such as data sparsity and cold start. This article proposes an optimization method for a recommendation system based on spectral clustering (SC) and gated recurrent unit (GRU), named the GRU-KSC algorithm. Firstly, this paper improves the original spectral clustering algorithm by introducing Kmc2, proposing a novel spectral clustering recommendation algorithm (K-means++ SC, KSC) based on the existing SC algorithm. Secondly, building upon the original GRU model, the paper presents a hybrid recommendation algorithm (Hybrid GRU, HGRU) capable of capturing long-term user interests for a more personalized recommendation. Experiments conducted on real datasets demonstrate that our method outperforms existing benchmark methods in terms of accuracy and robustness.
Collapse
Affiliation(s)
- Qingyuan Liu
- College of Computer and Control Engineering, Northeast Forestry University, Harbin, Heilongjiang Province, China
| | - Ming Yu
- College of Computer and Control Engineering, Northeast Forestry University, Harbin, Heilongjiang Province, China
| | - Miaoyuan Bai
- College of Computer and Control Engineering, Northeast Forestry University, Harbin, Heilongjiang Province, China
| |
Collapse
|
2
|
Prescott TJ, Wilson SP. Understanding brain functional architecture through robotics. Sci Robot 2023; 8:eadg6014. [PMID: 37256968 DOI: 10.1126/scirobotics.adg6014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2023] [Accepted: 05/05/2023] [Indexed: 06/02/2023]
Abstract
Robotics is increasingly seen as a useful test bed for computational models of the brain functional architecture underlying animal behavior. We provide an overview of past and current work, focusing on probabilistic and dynamical models, including approaches premised on the free energy principle, situating this endeavor in relation to evidence that the brain constitutes a layered control system. We argue that future neurorobotic models should integrate multiple neurobiological constraints and be hybrid in nature.
Collapse
Affiliation(s)
- Tony J Prescott
- Department of Computer Science, University of Sheffield, Sheffield, UK
| | - Stuart P Wilson
- Department of Computer Science, University of Sheffield, Sheffield, UK
| |
Collapse
|
3
|
Mahmoud S, Billing E, Svensson H, Thill S. How to train a self-driving vehicle: On the added value (or lack thereof) of curriculum learning and replay buffers. Front Artif Intell 2023; 6:1098982. [PMID: 36762255 PMCID: PMC9905678 DOI: 10.3389/frai.2023.1098982] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2022] [Accepted: 01/05/2023] [Indexed: 01/26/2023] Open
Abstract
Learning from only real-world collected data can be unrealistic and time consuming in many scenario. One alternative is to use synthetic data as learning environments to learn rare situations and replay buffers to speed up the learning. In this work, we examine the hypothesis of how the creation of the environment affects the training of reinforcement learning agent through auto-generated environment mechanisms. We take the autonomous vehicle as an application. We compare the effect of two approaches to generate training data for artificial cognitive agents. We consider the added value of curriculum learning-just as in human learning-as a way to structure novel training data that the agent has not seen before as well as that of using a replay buffer to train further on data the agent has seen before. In other words, the focus of this paper is on characteristics of the training data rather than on learning algorithms. We therefore use two tasks that are commonly trained early on in autonomous vehicle research: lane keeping and pedestrian avoidance. Our main results show that curriculum learning indeed offers an additional benefit over a vanilla reinforcement learning approach (using Deep-Q Learning), but the replay buffer actually has a detrimental effect in most (but not all) combinations of data generation approaches we considered here. The benefit of curriculum learning does depend on the existence of a well-defined difficulty metric with which various training scenarios can be ordered. In the lane-keeping task, we can define it as a function of the curvature of the road, in which the steeper and more occurring curves on the road, the more difficult it gets. Defining such a difficulty metric in other scenarios is not always trivial. In general, the results of this paper emphasize both the importance of considering data characterization, such as curriculum learning, and the importance of defining an appropriate metric for the task.
Collapse
Affiliation(s)
- Sara Mahmoud
- Interaction Lab, School of Informatics, University of Skövde, Skövde, Sweden,*Correspondence: Sara Mahmoud ✉
| | - Erik Billing
- Interaction Lab, School of Informatics, University of Skövde, Skövde, Sweden
| | - Henrik Svensson
- Interaction Lab, School of Informatics, University of Skövde, Skövde, Sweden
| | - Serge Thill
- Donders Institute for Brain, Cognition, and Behaviour, Radboud University, Nijmegen, Netherlands
| |
Collapse
|
4
|
Mahmoud S, Billing E, Svensson H, Thill S. Where to from here? On the future development of autonomous vehicles from a cognitive systems perspective. COGN SYST RES 2022. [DOI: 10.1016/j.cogsys.2022.09.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
5
|
Gumbsch C, Butz MV, Martius G. Autonomous Identification and Goal-Directed Invocation of Event-Predictive Behavioral Primitives. IEEE Trans Cogn Dev Syst 2021. [DOI: 10.1109/tcds.2019.2925890] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
6
|
Ciria A, Schillaci G, Pezzulo G, Hafner VV, Lara B. Predictive Processing in Cognitive Robotics: A Review. Neural Comput 2021; 33:1402-1432. [PMID: 34496394 DOI: 10.1162/neco_a_01383] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2020] [Accepted: 12/31/2020] [Indexed: 11/04/2022]
Abstract
Predictive processing has become an influential framework in cognitive sciences. This framework turns the traditional view of perception upside down, claiming that the main flow of information processing is realized in a top-down, hierarchical manner. Furthermore, it aims at unifying perception, cognition, and action as a single inferential process. However, in the related literature, the predictive processing framework and its associated schemes, such as predictive coding, active inference, perceptual inference, and free-energy principle, tend to be used interchangeably. In the field of cognitive robotics, there is no clear-cut distinction on which schemes have been implemented and under which assumptions. In this letter, working definitions are set with the main aim of analyzing the state of the art in cognitive robotics research working under the predictive processing framework as well as some related nonrobotic models. The analysis suggests that, first, research in both cognitive robotics implementations and nonrobotic models needs to be extended to the study of how multiple exteroceptive modalities can be integrated into prediction error minimization schemes. Second, a relevant distinction found here is that cognitive robotics implementations tend to emphasize the learning of a generative model, while in nonrobotics models, it is almost absent. Third, despite the relevance for active inference, few cognitive robotics implementations examine the issues around control and whether it should result from the substitution of inverse models with proprioceptive predictions. Finally, limited attention has been placed on precision weighting and the tracking of prediction error dynamics. These mechanisms should help to explore more complex behaviors and tasks in cognitive robotics research under the predictive processing framework.
Collapse
Affiliation(s)
- Alejandra Ciria
- Facultad de Psicología, Universidad Nacional Autónoma de México, Mexico City, CP 04510, Mexico
| | - Guido Schillaci
- BioRobotics Institute, Scuola Superiore Sant'Anna, 34 56025 Pontedera, Italy
| | - Giovanni Pezzulo
- Institute of Cognitive Sciences and Technologies, National Research Council, 44 00185 Rome, Italy
| | - Verena V Hafner
- Adaptive Systems Group, Department of Computer Science, Humboldt-Universität zu Berlin, D-12489, Germany
| | - Bruno Lara
- Laboratorio de Robótica Cognitiva, Centro de Investigación en Ciencias, Universidad Autónoma del Estado de Morelos, Cuernavaca CP 62209, Mexico
| |
Collapse
|
7
|
Afebu KO, Liu Y, Papatheou E, Guo B. LSTM-based approach for predicting periodic motions of an impacting system via transient dynamics. Neural Netw 2021; 140:49-64. [PMID: 33744713 DOI: 10.1016/j.neunet.2021.02.027] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2020] [Revised: 12/12/2020] [Accepted: 02/25/2021] [Indexed: 11/18/2022]
Abstract
Dynamically impacting systems are characterised with inherent instability and complex non-linear phenomena which makes it practically difficult to predict the steady state response of the system at transient periods. This study investigates the ability of a data driven machine learning method using Long Short-Term Memory networks to learn the complex nonlinearity associated with co-existing impact responses from limited transient data. A one-degree-of-freedom impact oscillator has been used to represent the bit-rock interaction for percussive drilling. Simulated data results show velocity measurements to contribute most to predicting steady state responses from transient dynamics with most of the network models reaching an accuracy of over 95%. Limitations to practically measurable variables in dynamic systems warranted the development of a feature based network model for impact motion classification. Experimental data from a two-degrees-of-freedom impacting system representing percussive bit penetration has been used to demonstrate the effectiveness of this method. The study thus provides a precise and less computational means of detecting and avoiding underperforming impact modes in percussive drilling.
Collapse
Affiliation(s)
- Kenneth Omokhagbo Afebu
- College of Engineering, Mathematics and Physical Sciences, University of Exeter, North Park Road, Exeter, EX4 4QF, UK.
| | - Yang Liu
- College of Engineering, Mathematics and Physical Sciences, University of Exeter, North Park Road, Exeter, EX4 4QF, UK.
| | - Evangelos Papatheou
- College of Engineering, Mathematics and Physical Sciences, University of Exeter, North Park Road, Exeter, EX4 4QF, UK.
| | - Bingyong Guo
- College of Engineering, Mathematics and Physical Sciences, University of Exeter, North Park Road, Exeter, EX4 4QF, UK.
| |
Collapse
|
8
|
Ohata W, Tani J. Investigation of the Sense of Agency in Social Cognition, Based on Frameworks of Predictive Coding and Active Inference: A Simulation Study on Multimodal Imitative Interaction. Front Neurorobot 2020; 14:61. [PMID: 33013346 PMCID: PMC7509423 DOI: 10.3389/fnbot.2020.00061] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2020] [Accepted: 07/28/2020] [Indexed: 12/31/2022] Open
Abstract
When agents interact socially with different intentions (or wills), conflicts are difficult to avoid. Although the means by which social agents can resolve such problems autonomously has not been determined, dynamic characteristics of agency may shed light on underlying mechanisms. Therefore, the current study focused on the sense of agency, a specific aspect of agency referring to congruence between the agent's intention in acting and the outcome, especially in social interaction contexts. Employing predictive coding and active inference as theoretical frameworks of perception and action generation, we hypothesize that regulation of complexity in the evidence lower bound of an agent's model should affect the strength of the agent's sense of agency and should have a significant impact on social interactions. To evaluate this hypothesis, we built a computational model of imitative interaction between a robot and a human via visuo-proprioceptive sensation with a variational Bayes recurrent neural network, and simulated the model in the form of pseudo-imitative interaction using recorded human body movement data, which serve as the counterpart in the interactions. A key feature of the model is that the complexity of each modality can be regulated differently by changing the values of a hyperparameter assigned to each local module of the model. We first searched for an optimal setting of hyperparameters that endow the model with appropriate coordination of multimodal sensation. These searches revealed that complexity of the vision module should be more tightly regulated than that of the proprioception module because of greater uncertainty in visual information flow. Using this optimally trained model as a default model, we investigated how changing the tightness of complexity regulation in the entire network after training affects the strength of the sense of agency during imitative interactions. The results showed that with looser regulation of complexity, an agent tends to act more egocentrically, without adapting to the other. In contrast, with tighter regulation, the agent tends to follow the other by adjusting its intention. We conclude that the tightness of complexity regulation significantly affects the strength of the sense of agency and the dynamics of interactions between agents in social settings.
Collapse
Affiliation(s)
- Wataru Ohata
- Cognitive Neurorobotics Research Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa, Japan
| | - Jun Tani
- Cognitive Neurorobotics Research Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa, Japan
| |
Collapse
|
9
|
Matsumoto T, Tani J. Goal-Directed Planning for Habituated Agents by Active Inference Using a Variational Recurrent Neural Network. ENTROPY 2020; 22:e22050564. [PMID: 33286336 PMCID: PMC7517093 DOI: 10.3390/e22050564] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/20/2020] [Revised: 05/12/2020] [Accepted: 05/15/2020] [Indexed: 11/16/2022]
Abstract
It is crucial to ask how agents can achieve goals by generating action plans using only partial models of the world acquired through habituated sensory-motor experiences. Although many existing robotics studies use a forward model framework, there are generalization issues with high degrees of freedom. The current study shows that the predictive coding (PC) and active inference (AIF) frameworks, which employ a generative model, can develop better generalization by learning a prior distribution in a low dimensional latent state space representing probabilistic structures extracted from well habituated sensory-motor trajectories. In our proposed model, learning is carried out by inferring optimal latent variables as well as synaptic weights for maximizing the evidence lower bound, while goal-directed planning is accomplished by inferring latent variables for maximizing the estimated lower bound. Our proposed model was evaluated with both simple and complex robotic tasks in simulation, which demonstrated sufficient generalization in learning with limited training data by setting an intermediate value for a regularization coefficient. Furthermore, comparative simulation results show that the proposed model outperforms a conventional forward model in goal-directed planning, due to the learned prior confining the search of motor plans within the range of habituated trajectories.
Collapse
|
10
|
Starzyk JA, Maciura L, Horzyk A. Associative Memories With Synaptic Delays. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2020; 31:331-344. [PMID: 31295124 DOI: 10.1109/tnnls.2019.2921143] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
In this paper, we introduce a new concept of associative memories in which synaptic connections of the self-organizing neural network learn time delays between input sequence elements. Synaptic connections represent both the synaptic weights and expected delays between the network inputs. This property of synaptic connections facilitates recognition of time sequences and provides context-based associations between sequence elements. Characteristics of time delays are learned and are updated each time an input sequence is presented. There are no separate learning and testing modes typically used in other neural networks, as the network starts to predict the next input element as soon as there is no expected input signal. The network generates output signals useful for associative recall and prediction. These output signals depend on the presented input context and the knowledge stored in the graph. Such a mode of operation is preferred for the organization of episodic memories used to store the observed episodes and to recall them if a sufficient context is provided. The associative sequential recall is useful for the operation of working memory in a cognitive agent. Test results demonstrate that the network correctly recognizes the input sequences with variable delays and that it is more efficient than other recently developed sequential memory networks based on associative neurons.
Collapse
|
11
|
Isomura T, Parr T, Friston K. Bayesian Filtering with Multiple Internal Models: Toward a Theory of Social Intelligence. Neural Comput 2019; 31:2390-2431. [PMID: 31614100 DOI: 10.1162/neco_a_01239] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
To exhibit social intelligence, animals have to recognize whom they are communicating with. One way to make this inference is to select among internal generative models of each conspecific who may be encountered. However, these models also have to be learned via some form of Bayesian belief updating. This induces an interesting problem: When receiving sensory input generated by a particular conspecific, how does an animal know which internal model to update? We consider a theoretical and neurobiologically plausible solution that enables inference and learning of the processes that generate sensory inputs (e.g., listening and understanding) and reproduction of those inputs (e.g., talking or singing), under multiple generative models. This is based on recent advances in theoretical neurobiology-namely, active inference and post hoc (online) Bayesian model selection. In brief, this scheme fits sensory inputs under each generative model. Model parameters are then updated in proportion to the probability that each model could have generated the input (i.e., model evidence). The proposed scheme is demonstrated using a series of (real zebra finch) birdsongs, where each song is generated by several different birds. The scheme is implemented using physiologically plausible models of birdsong production. We show that generalized Bayesian filtering, combined with model selection, leads to successful learning across generative models, each possessing different parameters. These results highlight the utility of having multiple internal models when making inferences in social environments with multiple sources of sensory information.
Collapse
Affiliation(s)
- Takuya Isomura
- Laboratory for Neural Computation and Adaptation, RIKEN Center for Brain Science, Wako, Saitama 351-0198, Japan
| | - Thomas Parr
- Wellcome Centre for Human Neuroimaging, Institute of Neurology, University College London, London, WC1N 3AR, U.K.
| | - Karl Friston
- Wellcome Centre for Human Neuroimaging, Institute of Neurology, University College London, London, WC1N 3AR, U.K.
| |
Collapse
|
12
|
Ahmadi A, Tani J. A Novel Predictive-Coding-Inspired Variational RNN Model for Online Prediction and Recognition. Neural Comput 2019; 31:2025-2074. [PMID: 31525309 DOI: 10.1162/neco_a_01228] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
This study introduces PV-RNN, a novel variational RNN inspired by predictive-coding ideas. The model learns to extract the probabilistic structures hidden in fluctuating temporal patterns by dynamically changing the stochasticity of its latent states. Its architecture attempts to address two major concerns of variational Bayes RNNs: how latent variables can learn meaningful representations and how the inference model can transfer future observations to the latent variables. PV-RNN does both by introducing adaptive vectors mirroring the training data, whose values can then be adapted differently during evaluation. Moreover, prediction errors during backpropagation-rather than external inputs during the forward computation-are used to convey information to the network about the external data. For testing, we introduce error regression for predicting unseen sequences as inspired by predictive coding that leverages those mechanisms. As in other variational Bayes RNNs, our model learns by maximizing a lower bound on the marginal likelihood of the sequential data, which is composed of two terms: the negative of the expectation of prediction errors and the negative of the Kullback-Leibler divergence between the prior and the approximate posterior distributions. The model introduces a weighting parameter, the meta-prior, to balance the optimization pressure placed on those two terms. We test the model on two data sets with probabilistic structures and show that with high values of the meta-prior, the network develops deterministic chaos through which the randomness of the data is imitated. For low values, the model behaves as a random process. The network performs best on intermediate values and is able to capture the latent probabilistic structure with good generalization. Analyzing the meta-prior's impact on the network allows us to precisely study the theoretical value and practical benefits of incorporating stochastic dynamics in our model. We demonstrate better prediction performance on a robot imitation task with our model using error regression compared to a standard variational Bayes model lacking such a procedure.
Collapse
Affiliation(s)
- Ahmadreza Ahmadi
- Okinawa Institute of Science and Technology, Okinawa, Japan 904-0495, and School of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, 305-701, Republic of Korea
| | - Jun Tani
- Okinawa Institute of Science and Technology, Okinawa, Japan 904-0495
| |
Collapse
|
13
|
Abstract
Hierarchically organized brains communicate through feedforward (FF) and feedback (FB) pathways. In mammals, FF and FB are mediated by higher and lower frequencies during wakefulness. FB is preferentially impaired by general anesthetics in multiple mammalian species. This suggests FB serves critical functions in waking brains. The brain of Drosophila melanogaster (fruit fly) is also hierarchically organized, but the presence of FB in these brains is not established. Here, we studied FB in the fly brain, by simultaneously recording local field potentials (LFPs) from low-order peripheral structures and higher-order central structures. We analyzed the data using Granger causality (GC), the first application of this analysis technique to recordings from the insect brain. Our analysis revealed that low frequencies (0.1–5 Hz) mediated FB from the center to the periphery, while higher frequencies (10–45 Hz) mediated FF in the opposite direction. Further, isoflurane anesthesia preferentially reduced FB. Our results imply that the spectral characteristics of FF and FB may be a signature of hierarchically organized brains that is conserved from insects to mammals. We speculate that general anesthetics may induce unresponsiveness across species by targeting the mechanisms that support FB.
Collapse
|
14
|
Parisi GI, Tani J, Weber C, Wermter S. Lifelong learning of human actions with deep neural network self-organization. Neural Netw 2017; 96:137-149. [DOI: 10.1016/j.neunet.2017.09.001] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2017] [Revised: 08/23/2017] [Accepted: 09/01/2017] [Indexed: 10/18/2022]
|
15
|
Unsupervised identification and recognition of situations for high-dimensional sensori-motor streams. Neurocomputing 2017. [DOI: 10.1016/j.neucom.2017.02.090] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
16
|
Choi M, Tani J. Predictive Coding for Dynamic Visual Processing: Development of Functional Hierarchy in a Multiple Spatiotemporal Scales RNN Model. Neural Comput 2017; 30:237-270. [PMID: 29064785 DOI: 10.1162/neco_a_01026] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
This letter proposes a novel predictive coding type neural network model, the predictive multiple spatiotemporal scales recurrent neural network (P-MSTRNN). The P-MSTRNN learns to predict visually perceived human whole-body cyclic movement patterns by exploiting multiscale spatiotemporal constraints imposed on network dynamics by using differently sized receptive fields as well as different time constant values for each layer. After learning, the network can imitate target movement patterns by inferring or recognizing corresponding intentions by means of the regression of prediction error. Results show that the network can develop a functional hierarchy by developing a different type of dynamic structure at each layer. The letter examines how model performance during pattern generation, as well as predictive imitation, varies depending on the stage of learning. The number of limit cycle attractors corresponding to target movement patterns increases as learning proceeds. Transient dynamics developing early in the learning process successfully perform pattern generation and predictive imitation tasks. The letter concludes that exploitation of transient dynamics facilitates successful task performance during early learning periods.
Collapse
Affiliation(s)
- Minkyu Choi
- School of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, 305-701, Republic of Korea
| | - Jun Tani
- Okinawa Institute of Science and Technology, Okinawa, Japan 904-0495, and School of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, 305-701, Republic of Korea
| |
Collapse
|
17
|
Abstract
In recent decades, embodiment has become an influential concept in psychology and cognitive neuroscience. Embodiment denotes the study of the reciprocal (causal) relationships between mind and body, with the mind not only affecting the body but also vice versa. Embodied cognition comes to the fore in sensorimotor coupling, predictive coding, and nonverbal behavior. Additionally, the embodiment of the mind constitutes the basis of social interaction and communication, as evident in research on nonverbal synchrony and mimicry. These theoretical and empirical developments portend a range of implications for schizophrenia research and treatment. Sensorimotor dysfunctions are closely associated with affective and psychotic psychopathology, leading to altered timing in the processing of stimuli and to disordered appraisals of the environment. Problems of social cognition may be newly viewed as disordered embodied communication. The embodiment perspective suggests novel treatment strategies through psychotherapy and body-oriented interventions, and may ultimately provide biomarkers for diagnosis.
Collapse
Affiliation(s)
- Wolfgang Tschacher
- Universitätsklinik für Psychiatrie und Psychotherapie, Universität Bern, Bolligenstrasse 111, 3060 Bern, Schweiz
| | - Anne Giersch
- INSERM U1114, FMTS, Departement de Psychiatrie, CHRU de Strasbourg, Strasbourg, France
| | - Karl Friston
- The Wellcome Trust Centre for Neuroimaging, Institute of Neurology, University College London, London, UK
| |
Collapse
|
18
|
|
19
|
Abstract
In this paper, we show how the development of plastic behaviours, i.e., behaviour displaying a modular organisation characterised by behavioural subunits that are alternated in a context-dependent manner, can enable evolving robots to solve their adaptive task more efficiently also when it does not require the accomplishment of multiple conflicting functions. The comparison of the results obtained in different experimental conditions indicates that the most important prerequisites for the evolution of behavioural plasticity are: the possibility to generate and perceive affordances (i.e., opportunities for behaviour execution), the possibility to rely on flexible regulatory processes that exploit both external and internal cues, and the possibility to realise smooth and effective transitions between behaviours.
Collapse
|
20
|
Abstract
This letter offers a computational account of Pavlovian conditioning in the cerebellum based on active inference and predictive coding. Using eyeblink conditioning as a canonical paradigm, we formulate a minimal generative model that can account for spontaneous blinking, startle responses, and (delay or trace) conditioning. We then establish the face validity of the model using simulated responses to unconditioned and conditioned stimuli to reproduce the sorts of behavior that are observed empirically. The scheme's anatomical validity is then addressed by associating variables in the predictive coding scheme with nuclei and neuronal populations to match the (extrinsic and intrinsic) connectivity of the cerebellar (eyeblink conditioning) system. Finally, we try to establish predictive validity by reproducing selective failures of delay conditioning, trace conditioning, and extinction using (simulated and reversible) focal lesions. Although rather metaphorical, the ensuing scheme can account for a remarkable range of anatomical and neurophysiological aspects of cerebellar circuitry-and the specificity of lesion-deficit mappings that have been established experimentally. From a computational perspective, this work shows how conditioning or learning can be formulated in terms of minimizing variational free energy (or maximizing Bayesian model evidence) using exactly the same principles that underlie predictive coding in perception.
Collapse
Affiliation(s)
- Karl Friston
- Wellcome Trust Centre for Neuroimaging, UCL, London WC1N 3BG, U.K.
| | - Ivan Herreros
- Catalan Institute of Advanced Research, Technology Department, Universitat Pompeu Fabra, Barcelona 08018, Spain
| |
Collapse
|
21
|
Taniguchi T, Nagai T, Nakamura T, Iwahashi N, Ogata T, Asoh H. Symbol emergence in robotics: a survey. Adv Robot 2016. [DOI: 10.1080/01691864.2016.1164622] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
22
|
Pezzulo G, Rigoli F, Friston K. Active Inference, homeostatic regulation and adaptive behavioural control. Prog Neurobiol 2015; 134:17-35. [PMID: 26365173 PMCID: PMC4779150 DOI: 10.1016/j.pneurobio.2015.09.001] [Citation(s) in RCA: 280] [Impact Index Per Article: 31.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2014] [Revised: 07/20/2015] [Accepted: 09/08/2015] [Indexed: 11/30/2022]
Abstract
We review a theory of homeostatic regulation and adaptive behavioural control within the Active Inference framework. Our aim is to connect two research streams that are usually considered independently; namely, Active Inference and associative learning theories of animal behaviour. The former uses a probabilistic (Bayesian) formulation of perception and action, while the latter calls on multiple (Pavlovian, habitual, goal-directed) processes for homeostatic and behavioural control. We offer a synthesis these classical processes and cast them as successive hierarchical contextualisations of sensorimotor constructs, using the generative models that underpin Active Inference. This dissolves any apparent mechanistic distinction between the optimization processes that mediate classical control or learning. Furthermore, we generalize the scope of Active Inference by emphasizing interoceptive inference and homeostatic regulation. The ensuing homeostatic (or allostatic) perspective provides an intuitive explanation for how priors act as drives or goals to enslave action, and emphasises the embodied nature of inference.
Collapse
Affiliation(s)
- Giovanni Pezzulo
- Institute of Cognitive Sciences and Technologies, National Research Council, Rome, Italy.
| | | | | |
Collapse
|
23
|
Antonelo EA, Schrauwen B. On learning navigation behaviors for small mobile robots with reservoir computing architectures. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2015; 26:763-780. [PMID: 25794381 DOI: 10.1109/tnnls.2014.2323247] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
This paper proposes a general reservoir computing (RC) learning framework that can be used to learn navigation behaviors for mobile robots in simple and complex unknown partially observable environments. RC provides an efficient way to train recurrent neural networks by letting the recurrent part of the network (called reservoir) be fixed while only a linear readout output layer is trained. The proposed RC framework builds upon the notion of navigation attractor or behavior that can be embedded in the high-dimensional space of the reservoir after learning. The learning of multiple behaviors is possible because the dynamic robot behavior, consisting of a sensory-motor sequence, can be linearly discriminated in the high-dimensional nonlinear space of the dynamic reservoir. Three learning approaches for navigation behaviors are shown in this paper. The first approach learns multiple behaviors based on the examples of navigation behaviors generated by a supervisor, while the second approach learns goal-directed navigation behaviors based only on rewards. The third approach learns complex goal-directed behaviors, in a supervised way, using a hierarchical architecture whose internal predictions of contextual switches guide the sequence of basic navigation behaviors toward the goal.
Collapse
|
24
|
Friston K, Rigoli F, Ognibene D, Mathys C, Fitzgerald T, Pezzulo G. Active inference and epistemic value. Cogn Neurosci 2015; 6:187-214. [PMID: 25689102 DOI: 10.1080/17588928.2015.1020053] [Citation(s) in RCA: 287] [Impact Index Per Article: 31.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
We offer a formal treatment of choice behavior based on the premise that agents minimize the expected free energy of future outcomes. Crucially, the negative free energy or quality of a policy can be decomposed into extrinsic and epistemic (or intrinsic) value. Minimizing expected free energy is therefore equivalent to maximizing extrinsic value or expected utility (defined in terms of prior preferences or goals), while maximizing information gain or intrinsic value (or reducing uncertainty about the causes of valuable outcomes). The resulting scheme resolves the exploration-exploitation dilemma: Epistemic value is maximized until there is no further information gain, after which exploitation is assured through maximization of extrinsic value. This is formally consistent with the Infomax principle, generalizing formulations of active vision based upon salience (Bayesian surprise) and optimal decisions based on expected utility and risk-sensitive (Kullback-Leibler) control. Furthermore, as with previous active inference formulations of discrete (Markovian) problems, ad hoc softmax parameters become the expected (Bayes-optimal) precision of beliefs about, or confidence in, policies. This article focuses on the basic theory, illustrating the ideas with simulations. A key aspect of these simulations is the similarity between precision updates and dopaminergic discharges observed in conditioning paradigms.
Collapse
Affiliation(s)
- Karl Friston
- a The Wellcome Trust Centre for Neuroimaging , Institute of Neurology , London , UK
| | - Francesco Rigoli
- a The Wellcome Trust Centre for Neuroimaging , Institute of Neurology , London , UK
| | - Dimitri Ognibene
- b Centre for Robotics Research, Department of Informatics , King's College London , London , UK
| | - Christoph Mathys
- a The Wellcome Trust Centre for Neuroimaging , Institute of Neurology , London , UK.,c Translational Neuromodeling Unit (TNU) , Institute for Biomedical Engineering, University of Zürich and ETH Zürich , Zürich , Switzerland.,d Laboratory for Social and Neural Systems Research (SNS Lab), Department of Economics , University of Zürich , Zürich , Switzerland
| | - Thomas Fitzgerald
- a The Wellcome Trust Centre for Neuroimaging , Institute of Neurology , London , UK
| | - Giovanni Pezzulo
- e Institute of Cognitive Sciences and Technologies , National Research Council , Rome , Italy
| |
Collapse
|
25
|
Emergence of Discrete and Abstract State Representation through Reinforcement Learning in a Continuous Input Task. ACTA ACUST UNITED AC 2013. [DOI: 10.1007/978-3-642-37374-9_2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register]
|
26
|
Friston K, Adams R, Montague R. What is value-accumulated reward or evidence? Front Neurorobot 2012; 6:11. [PMID: 23133414 PMCID: PMC3487150 DOI: 10.3389/fnbot.2012.00011] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2012] [Accepted: 10/16/2012] [Indexed: 12/04/2022] Open
Abstract
Why are you reading this abstract? In some sense, your answer will cast the exercise as valuable—but what is value? In what follows, we suggest that value is evidence or, more exactly, log Bayesian evidence. This implies that a sufficient explanation for valuable behavior is the accumulation of evidence for internal models of our world. This contrasts with normative models of optimal control and reinforcement learning, which assume the existence of a value function that explains behavior, where (somewhat tautologically) behavior maximizes value. In this paper, we consider an alternative formulation—active inference—that replaces policies in normative models with prior beliefs about the (future) states agents should occupy. This enables optimal behavior to be cast purely in terms of inference: where agents sample their sensorium to maximize the evidence for their generative model of hidden states in the world, and minimize their uncertainty about those states. Crucially, this formulation resolves the tautology inherent in normative models and allows one to consider how prior beliefs are themselves optimized in a hierarchical setting. We illustrate these points by showing that any optimal policy can be specified with prior beliefs in the context of Bayesian inference. We then show how these prior beliefs are themselves prescribed by an imperative to minimize uncertainty. This formulation explains the saccadic eye movements required to read this text and defines the value of the visual sensations you are soliciting.
Collapse
Affiliation(s)
- Karl Friston
- Wellcome Trust Centre for Neuroimaging, University College London London, UK
| | | | | |
Collapse
|
27
|
Möller R, Schenck W. Bootstrapping cognition from behavior-a computerized thought experiment. Cogn Sci 2012; 32:504-42. [PMID: 21635344 DOI: 10.1080/03640210802035241] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
Abstract
We show that simple perceptual competences can emerge from an internal simulation of action effects and are thus grounded in behavior. A simulated agent learns to distinguish between dead ends and corridors without the necessity to represent these concepts in the sensory domain. Initially, the agent is only endowed with a simple value system and the means to extract low-level features from an image. In the interaction with the environment, it acquires a visuo-tactile forward model that allows the agent to predict how the visual input is changing under its movements, and whether movements will lead to a collision. From short-term predictions based on the forward model, the agent learns an inverse model. The inverse model in turn produces suggestions about which actions should be simulated in long-term predictions, and long-term predictions eventually give rise to the perceptual ability.
Collapse
Affiliation(s)
- Ralf Möller
- Computer Engineering Group, Faculty of Technology, Bielefeld University
| | | |
Collapse
|
28
|
Taniguchi T, Sawaragi T. Incremental acquisition of behaviors and signs based on a reinforcement learning schemata model and a spike timing-dependent plasticity network. Adv Robot 2012. [DOI: 10.1163/156855307781389374] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Affiliation(s)
- Tadahiro Taniguchi
- a Graduate School of Engineering, Kyoto University Yoshida-honmachi, Sakyo, Kyoto, 606-8501, Japan
| | - Tetsuo Sawaragi
- b Graduate School of Engineering, Kyoto University Yoshida-honmachi, Sakyo, Kyoto, 606-8501, Japan
| |
Collapse
|
29
|
Affiliation(s)
- Takeshi Ando
- a Faculty of Science and Engineering, Waseda University, 3-4-1 59-309 Ohkubo, Shinjuku-ku, Tokyo 169-8555, Japan
| | - Jun Okamoto
- b Faculty of Science and Engineering, Waseda University, 3-4-1 59-309 Ohkubo, Shinjuku-ku, Tokyo 169-8555, Japan
| | - Masakatsu G. Fujie
- c Faculty of Science and Engineering, Waseda University, 3-4-1 59-309 Ohkubo, Shinjuku-ku, Tokyo 169-8555, Japan
| |
Collapse
|
30
|
Taniguchi T, Hamahata K, Iwahashi N. Unsupervised Segmentation of Human Motion Data Using a Sticky Hierarchical Dirichlet Process-Hidden Markov Model and Minimal Description Length-Based Chunking Method for Imitation Learning. Adv Robot 2012. [DOI: 10.1163/016918611x594775] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Affiliation(s)
- Tadahiro Taniguchi
- a Ritsumeikan Univeirsity, 1-1-1 Noji Higashi, Kusatsu, Shiga 525-8577, Japan;,
| | - Keita Hamahata
- b Ritsumeikan Univeirsity, 1-1-1 Noji Higashi, Kusatsu, Shiga 525-8577, Japan
| | - Naoto Iwahashi
- c National Institute of Information and Communications Technology, 3-5 Hikaridai, Seika-cho, Sohraku-gun, Kyoto 619-0289, Japan
| |
Collapse
|
31
|
Ando T, Watanabe M, Nishimoto K, Matsumoto Y, Seki M, Fujie MG. Myoelectric-Controlled Exoskeletal Elbow Robot to Suppress Essential Tremor: Extraction of Elbow Flexion Movement Using STFTs and TDNN. JOURNAL OF ROBOTICS AND MECHATRONICS 2012. [DOI: 10.20965/jrm.2012.p0141] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Essential tremor is the most common of all involuntary movements. Many patients with an upper-limb tremor have serious difficulties in performing daily activities. We developed a myoelectric-controlled exoskeletal robot to suppress tremor. In this article, we focus on developing a signal processing method to extract voluntary movement from a myoelectric in which the voluntary movement and tremor were mixed. First, a Low-Pass Filter (LPF) and Neural Network (NN) were used to recognize the tremor patient’s movement. Using these techniques, it was difficult to recognize the movement accurately because the myoelectric signal of the tremor patient periodically oscillated. Then, Short-Time Fourier Transformation (STFT) and NN were used to recognize the movement. This method was more suitable than LPF and NN. However, the recognition timing at the start of the movement was late. Finally, a hybrid algorithm for using both short and long windows’ STFTs, which is a kind of “mixture of experts,” was proposed and developed. With this type of signal processing, elbow flexion was accurately recognized without the time delay in starting the movement.
Collapse
|
32
|
GOUKO MANABU, ITO KOJI. AN ACTION GENERATION MODEL BY USING TIME SERIES PREDICTION AND ITS APPLICATION TO ROBOT NAVIGATION. Int J Neural Syst 2011; 19:105-13. [DOI: 10.1142/s0129065709001872] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
This paper proposes an action generation model which consists of many motor primitive modules. The motor primitive modules output motor commands based on sensory information. Complicated behavior is generated by sequentially switching the modules. The model also has a prediction unit. This unit predicts which module will be used for current action generation. We have confirmed the effectiveness of the model by applying it to a robot navigation task simulation, and have investigated the influence of the prediction on the action generation.
Collapse
Affiliation(s)
- MANABU GOUKO
- Department of Computational Intelligence and Systems Science, Interdisciplinary Graduate School of Science and Engineering, Tokyo Institute of Technology, 4259 Nagatsuta, Midori-ku, Yokohama, 226-8502, Japan
| | - KOJI ITO
- Department of Computational Intelligence and Systems Science, Interdisciplinary Graduate School of Science and Engineering, Tokyo Institute of Technology, 4259 Nagatsuta, Midori-ku, Yokohama, 226-8502, Japan
| |
Collapse
|
33
|
Namikawa J, Nishimoto R, Tani J. A neurodynamic account of spontaneous behaviour. PLoS Comput Biol 2011; 7:e1002221. [PMID: 22028634 PMCID: PMC3197631 DOI: 10.1371/journal.pcbi.1002221] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2011] [Accepted: 08/19/2011] [Indexed: 01/25/2023] Open
Abstract
The current article suggests that deterministic chaos self-organized in cortical dynamics could be responsible for the generation of spontaneous action sequences. Recently, various psychological observations have suggested that humans and primates can learn to extract statistical structures hidden in perceptual sequences experienced during active environmental interactions. Although it has been suggested that such statistical structures involve chunking or compositional primitives, their neuronal implementations in brains have not yet been clarified. Therefore, to reconstruct the phenomena, synthetic neuro-robotics experiments were conducted by using a neural network model, which is characterized by a generative model with intentional states and its multiple timescales dynamics. The experimental results showed that the robot successfully learned to imitate tutored behavioral sequence patterns by extracting the underlying transition probability among primitive actions. An analysis revealed that a set of primitive action patterns was embedded in the fast dynamics part, and the chaotic dynamics of spontaneously sequencing these action primitive patterns was structured in the slow dynamics part, provided that the timescale was adequately set for each part. It was also shown that self-organization of this type of functional hierarchy ensured robust action generation by the robot in its interactions with a noisy environment. This article discusses the correspondence of the synthetic experiments with the known hierarchy of the prefrontal cortex, the supplementary motor area, and the primary motor cortex for action generation. We speculate that deterministic dynamical structures organized in the prefrontal cortex could be essential because they can account for the generation of both intentional behaviors of fixed action sequences and spontaneous behaviors of pseudo-stochastic action sequences by the same mechanism.
Collapse
Affiliation(s)
| | | | - Jun Tani
- Brain Science Institute, RIKEN, Wako, Japan
| |
Collapse
|
34
|
Ando T, Okamoto J, Takahashi M, Fujie MG. Response Evaluation of Rollover Recognition in Myoelectric Controlled Orthosis Using Pneumatic Rubber Muscle for Cancer Bone Metastasis Patient. JOURNAL OF ROBOTICS AND MECHATRONICS 2011. [DOI: 10.20965/jrm.2011.p0302] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The myoelectric controlled rollover support orthosis we have been developing for use in bone cancer metastasis requires high accuracy and quick response in signal processing to recognize movement. We quantitatively evaluated the response performance of recognizing rollover using our original Micro Macro Neural Network (MMNN) algorithm. Required response time was calculated as 60 ms by measuring contraction time for the muscle used in the orthosis to support rollover. TheMMNN recognized rollover 65 ms before it started. Rollover was recognized 5 ms after a myoelectric signal was generated, so the MMNN response was sufficient for the muscle to finish contraction before rollover started.
Collapse
|
35
|
Cangelosi A, Metta G, Sagerer G, Nolfi S, Nehaniv C, Fischer K, Tani J, Belpaeme T, Sandini G, Nori F, Fadiga L, Wrede B, Rohlfing K, Tuci E, Dautenhahn K, Saunders J, Zeschel A. Integration of Action and Language Knowledge: A Roadmap for Developmental Robotics. ACTA ACUST UNITED AC 2010. [DOI: 10.1109/tamd.2010.2053034] [Citation(s) in RCA: 107] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
36
|
Namikawa J, Tani J. Learning to imitate stochastic time series in a compositional way by chaos. Neural Netw 2009; 23:625-38. [PMID: 20045751 DOI: 10.1016/j.neunet.2009.12.006] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2008] [Revised: 12/13/2009] [Accepted: 12/14/2009] [Indexed: 10/20/2022]
Abstract
This study shows that a mixture of RNN experts model can acquire the ability to generate sequences that are combination of multiple primitive patterns by means of self-organizing chaos. By training the model, each expert learns a primitive sequence pattern, and a gating network learns to imitate stochastic switching of the multiple primitives via chaotic dynamics, utilizing a sensitive dependence on initial conditions. As a demonstration, we present a numerical simulation in which the model learns Markov chain switching among some Lissajous curves by chaotic dynamics. Our analysis shows that by using a sufficient amount of training data, balanced with the network memory capacity, it is possible to satisfy the conditions for embedding the target stochastic sequences into a chaotic dynamical system. It is also shown that reconstruction of a stochastic time series by a chaotic model can be stabilized by adding a negligible amount of noise to the dynamics of the model.
Collapse
Affiliation(s)
- Jun Namikawa
- Brain Science Institute, RIKEN, 2-1 Hirosawa, Wako-shi, Saitama, 351-0198, Japan.
| | | |
Collapse
|
37
|
Portegys TE. A maze learning comparison of Elman, long short-term memory, and Mona neural networks. Neural Netw 2009; 23:306-13. [PMID: 19945822 DOI: 10.1016/j.neunet.2009.11.002] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2007] [Revised: 11/04/2009] [Accepted: 11/06/2009] [Indexed: 11/30/2022]
Abstract
This study compares the maze learning performance of three artificial neural network architectures: an Elman recurrent neural network, a long short-term memory (LSTM) network, and Mona, a goal-seeking neural network. The mazes are networks of distinctly marked rooms randomly interconnected by doors that open probabilistically. The mazes are used to examine two important problems related to artificial neural networks: (1) the retention of long-term state information and (2) the modular use of learned information. For the former, mazes impose a context learning demand: at the beginning of the maze, an initial door choice forms a context that must be remembered until the end of the maze, where the same numbered door must be chosen again in order to reach the goal. For the latter, the effect of modular and non-modular training is examined. In modular training, the door associations are trained in separate trials from the intervening maze paths, and only presented together in testing trials. All networks performed well on mazes without the context learning requirement. The Mona and LSTM networks performed well on context learning with non-modular training; the Elman performance degraded as the task length increased. Mona also performed well for modular training; both the LSTM and Elman networks performed poorly with modular training.
Collapse
Affiliation(s)
- Thomas E Portegys
- School of Information Technology, Illinois State University, Normal, IL 61790, USA.
| |
Collapse
|
38
|
Building Recurrent Neural Networks to Implement Multiple Attractor Dynamics Using the Gradient Descent Method. ACTA ACUST UNITED AC 2009. [DOI: 10.1155/2009/846040] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
The present paper proposes a recurrent neural network model and learning algorithm that can acquire the ability to generate desired multiple sequences. The network model is a dynamical system in which the transition function is a contraction mapping, and the learning algorithm is based on the gradient descent method. We show a numerical simulation in which a recurrent neural network obtains a multiple periodic attractor consisting of five Lissajous curves, or a Van der Pol oscillator with twelve different parameters. The present analysis clarifies that the model contains many stable regions as attractors, and multiple time series can be embedded into these regions by using the present learning method.
Collapse
|
39
|
Starzyk J, He H. Spatio–Temporal Memories for Machine Learning: A Long-Term Memory Organization. ACTA ACUST UNITED AC 2009; 20:768-80. [DOI: 10.1109/tnn.2009.2012854] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
40
|
Gentili RJ, Papaxanthis C, Ebadzadeh M, Eskiizmirliler S, Ouanezar S, Darlot C. Integration of gravitational torques in cerebellar pathways allows for the dynamic inverse computation of vertical pointing movements of a robot arm. PLoS One 2009; 4:e5176. [PMID: 19384420 PMCID: PMC2668755 DOI: 10.1371/journal.pone.0005176] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2008] [Accepted: 03/03/2009] [Indexed: 11/18/2022] Open
Abstract
BACKGROUND Several authors suggested that gravitational forces are centrally represented in the brain for planning, control and sensorimotor predictions of movements. Furthermore, some studies proposed that the cerebellum computes the inverse dynamics (internal inverse model) whereas others suggested that it computes sensorimotor predictions (internal forward model). METHODOLOGY/PRINCIPAL FINDINGS This study proposes a model of cerebellar pathways deduced from both biological and physical constraints. The model learns the dynamic inverse computation of the effect of gravitational torques from its sensorimotor predictions without calculating an explicit inverse computation. By using supervised learning, this model learns to control an anthropomorphic robot arm actuated by two antagonists McKibben artificial muscles. This was achieved by using internal parallel feedback loops containing neural networks which anticipate the sensorimotor consequences of the neural commands. The artificial neural networks architecture was similar to the large-scale connectivity of the cerebellar cortex. Movements in the sagittal plane were performed during three sessions combining different initial positions, amplitudes and directions of movements to vary the effects of the gravitational torques applied to the robotic arm. The results show that this model acquired an internal representation of the gravitational effects during vertical arm pointing movements. CONCLUSIONS/SIGNIFICANCE This is consistent with the proposal that the cerebellar cortex contains an internal representation of gravitational torques which is encoded through a learning process. Furthermore, this model suggests that the cerebellum performs the inverse dynamics computation based on sensorimotor predictions. This highlights the importance of sensorimotor predictions of gravitational torques acting on upper limb movements performed in the gravitational field.
Collapse
Affiliation(s)
- Rodolphe J. Gentili
- CNRS UMR 7060, Université Paris Descartes, Paris-5, Paris, France
- Université Paris Diderot, Paris-7, Paris, France
- INSERM U887, Motricité-Plasticité, Université de Bourgogne, Dijon, France
- Ecole supérieure des Télécommunications, Paris, France
- Cognitive Motor Neuroscience laboratory, Department of Kinesiology, University of Maryland, School of Public Health, College Park, Maryland, United States of America
| | | | - Mehdi Ebadzadeh
- Amirkabir University of Technology, Computer Engineering and Information Technology Department, Tehran, Iran
| | - Selim Eskiizmirliler
- CNRS UMR 7060, Université Paris Descartes, Paris-5, Paris, France
- Université Paris Diderot, Paris-7, Paris, France
| | - Sofiane Ouanezar
- CNRS UMR 7060, Université Paris Descartes, Paris-5, Paris, France
- Université Paris Diderot, Paris-7, Paris, France
- Ecole supérieure des Télécommunications, Paris, France
| | - Christian Darlot
- INSERM U887, Motricité-Plasticité, Université de Bourgogne, Dijon, France
- Ecole supérieure des Télécommunications, Paris, France
| |
Collapse
|
41
|
Development of hierarchical structures for actions and motor imagery: a constructivist view from synthetic neuro-robotics study. PSYCHOLOGICAL RESEARCH 2009; 73:545-58. [PMID: 19352697 DOI: 10.1007/s00426-009-0236-0] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2008] [Accepted: 01/08/2009] [Indexed: 10/19/2022]
Abstract
The current paper shows a neuro-robotics experiment on developmental learning of goal-directed actions. The robot was trained to predict visuo-proprioceptive flow of achieving a set of goal-directed behaviors through iterative tutor training processes. The learning was conducted by employing a dynamic neural network model which is characterized by their multiple time-scale dynamics. The experimental results showed that functional hierarchical structures emerge through stages of developments where behavior primitives are generated in earlier stages and their sequences of achieving goals appear in later stages. It was also observed that motor imagery is generated in earlier stages compared to actual behaviors. Our claim that manipulatable inner representation should emerge through the sensory-motor interactions is corresponded to Piaget's constructivist view.
Collapse
|
42
|
|
43
|
Yamashita Y, Tani J. Emergence of functional hierarchy in a multiple timescale neural network model: a humanoid robot experiment. PLoS Comput Biol 2008; 4:e1000220. [PMID: 18989398 PMCID: PMC2570613 DOI: 10.1371/journal.pcbi.1000220] [Citation(s) in RCA: 338] [Impact Index Per Article: 21.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2008] [Accepted: 09/30/2008] [Indexed: 11/17/2022] Open
Abstract
It is generally thought that skilled behavior in human beings results from a functional hierarchy of the motor control system, within which reusable motor primitives are flexibly integrated into various sensori-motor sequence patterns. The underlying neural mechanisms governing the way in which continuous sensori-motor flows are segmented into primitives and the way in which series of primitives are integrated into various behavior sequences have, however, not yet been clarified. In earlier studies, this functional hierarchy has been realized through the use of explicit hierarchical structure, with local modules representing motor primitives in the lower level and a higher module representing sequences of primitives switched via additional mechanisms such as gate-selecting. When sequences contain similarities and overlap, however, a conflict arises in such earlier models between generalization and segmentation, induced by this separated modular structure. To address this issue, we propose a different type of neural network model. The current model neither makes use of separate local modules to represent primitives nor introduces explicit hierarchical structure. Rather than forcing architectural hierarchy onto the system, functional hierarchy emerges through a form of self-organization that is based on two distinct types of neurons, each with different time properties ("multiple timescales"). Through the introduction of multiple timescales, continuous sequences of behavior are segmented into reusable primitives, and the primitives, in turn, are flexibly integrated into novel sequences. In experiments, the proposed network model, coordinating the physical body of a humanoid robot through high-dimensional sensori-motor control, also successfully situated itself within a physical environment. Our results suggest that it is not only the spatial connections between neurons but also the timescales of neural activity that act as important mechanisms leading to functional hierarchy in neural systems.
Collapse
Affiliation(s)
- Yuichi Yamashita
- Laboratory for Behavior and Dynamic Cognition, RIKEN Brain Science Institute, Wako-shi, Saitama, Japan.
| | | |
Collapse
|
44
|
Namikawa J, Tani J. A model for learning to segment temporal sequences, utilizing a mixture of RNN experts together with adaptive variance. Neural Netw 2008; 21:1466-75. [PMID: 18938059 DOI: 10.1016/j.neunet.2008.09.005] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2008] [Revised: 06/17/2008] [Accepted: 09/17/2008] [Indexed: 11/19/2022]
Abstract
This paper proposes a novel learning method for a mixture of recurrent neural network (RNN) experts model, which can acquire the ability to generate desired sequences by dynamically switching between experts. Our method is based on maximum likelihood estimation, using a gradient descent algorithm. This approach is similar to that used in conventional methods; however, we modify the likelihood function by adding a mechanism to alter the variance for each expert. The proposed method is demonstrated to successfully learn Markov chain switching among a set of 9 Lissajous curves, for which the conventional method fails. The learning performance, analyzed in terms of the generalization capability, of the proposed method is also shown to be superior to that of the conventional method. With the addition of a gating network, the proposed method is successfully applied to the learning of sensory-motor flows for a small humanoid robot as a realistic problem of time series prediction and generation.
Collapse
Affiliation(s)
- Jun Namikawa
- Brain Science Institute, RIKEN, 2-1 Hirosawa, Wako-shi, Saitama, 351-0198, Japan.
| | | |
Collapse
|
45
|
Tani J, Nishimoto R, Paine RW. Achieving "organic compositionality" through self-organization: reviews on brain-inspired robotics experiments. Neural Netw 2008; 21:584-603. [PMID: 18495423 DOI: 10.1016/j.neunet.2008.03.008] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2007] [Revised: 03/07/2008] [Accepted: 03/07/2008] [Indexed: 10/22/2022]
Abstract
The current paper examines how compositional structures can self-organize in given neuro-dynamical systems when robot agents are forced to learn multiple goal-directed behaviors simultaneously. Firstly, we propose a basic model accounting for the roles of parietal-premotor interactions for representing skills for goal-directed behaviors. The basic model had been implemented in a set of robotics experiments employing different neural network architectures. The comparative reviews among those experimental results address the issues of local vs distributed representations in representing behavior and the effectiveness of level structures associated with different sensory-motor articulation mechanisms. It is concluded that the compositional structures can be acquired "organically" by achieving generalization in learning and by capturing the contextual nature of skilled behaviors under specific conditions. Furthermore, the paper discusses possible feedback for empirical neuroscience studies in the future.
Collapse
Affiliation(s)
- Jun Tani
- RIKEN Brain Science Institute, 2-1 Hirosawa, Wako-shi, Saitama, Japan.
| | | | | |
Collapse
|
46
|
Tani J, Nishimoto R, Namikawa J, Ito M. Codevelopmental learning between human and humanoid robot using a dynamic neural-network model. ACTA ACUST UNITED AC 2008; 38:43-59. [PMID: 18270081 DOI: 10.1109/tsmcb.2007.907738] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
This paper examines characteristics of interactive learning between human tutors and a robot having a dynamic neural-network model, which is inspired by human parietal cortex functions. A humanoid robot, with a recurrent neural network that has a hierarchical structure, learns to manipulate objects. Robots learn tasks in repeated self-trials with the assistance of human interaction, which provides physical guidance until the tasks are mastered and learning is consolidated within the neural networks. Experimental results and the analyses showed the following: 1) codevelopmental shaping of task behaviors stems from interactions between the robot and a tutor; 2) dynamic structures for articulating and sequencing of behavior primitives are self-organized in the hierarchically organized network; and 3) such structures can afford both generalization and context dependency in generating skilled behaviors.
Collapse
Affiliation(s)
- Jun Tani
- RIKEN Brain Science Institute, Wako 351-0198, Japan
| | | | | | | |
Collapse
|
47
|
|
48
|
Taniguchi T, Sawaragi T. Incremental acquisition of multiple nonlinear forward models based on differentiation process of schema model. Neural Netw 2008; 21:13-27. [PMID: 18226495 DOI: 10.1016/j.neunet.2007.10.007] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2005] [Accepted: 10/23/2007] [Indexed: 11/25/2022]
Abstract
We introduce the schema model as an alternative computational model representing multiple internal models. The human central nervous system is believed to obtain multiple forward-inverse models. The schema model enables agents to obtain multiple nonlinear forward models incrementally. This model is based on hypothesis testing theory whereas most modular learning methods are based on a Bayesian framework. As a specific example, we describe a schema model with a normalized Gaussian network (NGSM). Simulation revealed that NGSM has two advantages over MOSAIC's learning method: NGSM can obtain multiple models incrementally and does not depend on the initial parameters of the forward models.
Collapse
|
49
|
Demiris Y, Meltzoff A. The Robot in the Crib: A Developmental Analysis of Imitation Skills in Infants and Robots. INFANT AND CHILD DEVELOPMENT 2008; 17:43-53. [PMID: 18458795 DOI: 10.1002/icd.543] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Interesting systems, whether biological or artificial, develop. Starting from some initial conditions, they respond to environmental changes, and continuously improve their capabilities. Developmental psychologists have dedicated significant effort to studying the developmental progression of infant imitation skills, because imitation underlies the infant's ability to understand and learn from his or her social environment. In a converging intellectual endeavour, roboticists have been equipping robots with the ability to observe and imitate human actions because such abilities can lead to rapid teaching of robots to perform tasks. We provide here a comparative analysis between studies of infants imitating and learning from human demonstrators, and computational experiments aimed at equipping a robot with such abilities. We will compare the research across the following two dimensions: (a) initial conditions-what is innate in infants, and what functionality is initially given to robots, and (b) developmental mechanisms-how does the performance of infants improve over time, and what mechanisms are given to robots to achieve equivalent behaviour. Both developmental science and robotics are critically concerned with: (a) how their systems can and do go 'beyond the stimulus' given during the demonstration, and (b) how the internal models used in this process are acquired during the lifetime of the system.
Collapse
Affiliation(s)
- Yiannis Demiris
- Department of Electrical and Electronic Engineering, Imperial College London, London, UK
| | | |
Collapse
|
50
|
Tani J. On the interactions between top-down anticipation and bottom-up regression. Front Neurorobot 2007; 1:2. [PMID: 18958273 PMCID: PMC2533585 DOI: 10.3389/neuro.12.002.2007] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2007] [Accepted: 10/11/2007] [Indexed: 11/13/2022] Open
Abstract
This paper discusses the importance of anticipation and regression in modeling cognitive behavior. The meanings of these cognitive functions are explained by describing our proposed neural network model which has been implemented on a set of cognitive robotics experiments. The reviews of these experiments suggest that the essences of embodied cognition may reside in the phenomena of the break-down between the top-down anticipation and the bottom-up regression and in its recovery process.
Collapse
Affiliation(s)
- Jun Tani
- Brain Science Institute, RIKEN Japan
| |
Collapse
|