1
|
Schilling M, Ritter HJ, Ohl FW. Linking meta-learning to meta-structure. Behav Brain Sci 2024; 47:e164. [PMID: 39311506 DOI: 10.1017/s0140525x24000232] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/25/2024]
Abstract
We propose that a principled understanding of meta-learning, as aimed for by the authors, benefits from linking the focus on learning with an equally strong focus on structure, which means to address the question: What are the meta-structures that can guide meta-learning?
Collapse
Affiliation(s)
- Malte Schilling
- Autonomous Intelligent Systems Group, Computer Science Department, University of Münster, Münster, Germany https://www.uni-muenster.de/AISystems/
| | - Helge J Ritter
- Neuroinformatics Group Faculty of Technology/CITEC, Bielefeld University, Bielefeld, Germany ://ni.www.techfak.uni-bielefeld.de/people/helge/
| | - Frank W Ohl
- Department of Systems Physiology of Learning, Leibniz Institute for Neurobiology, Magdeburg, Germany
- Institute of Biology, Otto-von-Guericke University, Magdeburg, Germanyhttps://www.ovgu.de/Ohl.html
| |
Collapse
|
2
|
Modularity in Nervous Systems—a Key to Efficient Adaptivity for Deep Reinforcement Learning. Cognit Comput 2023. [DOI: 10.1007/s12559-022-10080-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
AbstractModularity as observed in biological systems has proven valuable for guiding classical motor theories towards good answers about action selection and execution. New challenges arise when we turn to learning: Trying to scale current computational models, such as deep reinforcement learning (DRL), to action spaces, input dimensions, and time horizons seen in biological systems still faces severe obstacles unless vast amounts of training data are available. This leads to the question: does biological modularity also hold an important key for better answers to obtain efficient adaptivity for deep reinforcement learning? We review biological experimental work on modularity in biological motor control and link this with current examples of (deep) RL approaches. Analyzing outcomes of simulation studies, we show that these approaches benefit from forms of modularization as found in biological systems. We identify three different strands of modularity exhibited in biological control systems. Two of them—modularity in state (i) and in action (ii) spaces—appear as a consequence of local interconnectivity (as in reflexes) and are often modulated by higher levels in a control hierarchy. A third strand arises from chunking of action elements along a (iii) temporal dimension. Usually interacting in an overarching spatio-temporal hierarchy of the overall system, the three strands offer major “factors” decomposing the entire modularity structure. We conclude that modularity with its above strands can provide an effective prior for DRL approaches to speed up learning considerably and making learned controllers more robust and adaptive.
Collapse
|
3
|
Jamaludeen N, Unnikrishnan V, Brechmann A, Spiliopoulou M. Discovering Instantaneous Granger Causalities in Non-stationary Categorical Time Series Data. Artif Intell Med 2022. [DOI: 10.1007/978-3-031-09342-5_19] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
4
|
Jin B, Alam M, Tierno A, Zhong H, Roy RR, Gerasimenko Y, Lu DC, Edgerton VR. Serotonergic Facilitation of Forelimb Functional Recovery in Rats with Cervical Spinal Cord Injury. Neurotherapeutics 2021; 18:1226-1243. [PMID: 33420588 PMCID: PMC8423890 DOI: 10.1007/s13311-020-00974-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/14/2020] [Indexed: 10/22/2022] Open
Abstract
Serotonergic agents can improve the recovery of motor ability after a spinal cord injury. Herein, we compare the effects of buspirone, a 5-HT1A receptor partial agonist, to fluoxetine, a selective serotonin reuptake inhibitor, on forelimb motor function recovery after a C4 bilateral dorsal funiculi crush in adult female rats. After injury, single pellet reaching performance and forelimb muscle activity decreased in all rats. From 1 to 6 weeks after injury, rats were tested on these tasks with and without buspirone (1-2 mg/kg) or fluoxetine (1-5 mg/kg). Reaching and grasping success rates of buspirone-treated rats improved rapidly within 2 weeks after injury and plateaued over the next 4 weeks of testing. Electromyography (EMG) from selected muscles in the dominant forelimb showed that buspirone-treated animals used new reaching strategies to achieve success after the injury. However, forelimb performance dramatically decreased within 2 weeks of buspirone withdrawal. In contrast, fluoxetine treatment resulted in a more progressive rate of improvement in forelimb performance over 8 weeks after injury. Neither buspirone nor fluoxetine significantly improved quadrupedal locomotion on the horizontal ladder test. The improved accuracy of reaching and grasping, patterns of muscle activity, and increased excitability of spinal motor-evoked potentials after buspirone administration reflect extensive reorganization of connectivity within and between supraspinal and spinal sensory-motor netxcopy works. Thus, both serotonergic drugs, buspirone and fluoxetine, neuromodulated these networks to physiological states that enabled markedly improved forelimb function after cervical spinal cord injury.
Collapse
Affiliation(s)
- Benita Jin
- Department of Integrative Biology and Physiology, University of California, Los Angeles, 610 Charles E. Young Drive, Los Angeles, CA, 90095-1527, USA
| | - Monzurul Alam
- Department of Integrative Biology and Physiology, University of California, Los Angeles, 610 Charles E. Young Drive, Los Angeles, CA, 90095-1527, USA
| | - Alexa Tierno
- Department of Integrative Biology and Physiology, University of California, Los Angeles, 610 Charles E. Young Drive, Los Angeles, CA, 90095-1527, USA
| | - Hui Zhong
- Department of Integrative Biology and Physiology, University of California, Los Angeles, 610 Charles E. Young Drive, Los Angeles, CA, 90095-1527, USA
| | - Roland R Roy
- Department of Integrative Biology and Physiology, University of California, Los Angeles, 610 Charles E. Young Drive, Los Angeles, CA, 90095-1527, USA
- Brain Research Institute, University of California, Los Angeles, Los Angeles, CA, 90095, USA
| | - Yury Gerasimenko
- Department of Integrative Biology and Physiology, University of California, Los Angeles, 610 Charles E. Young Drive, Los Angeles, CA, 90095-1527, USA
- Pavlov Institute of Physiology, St. Petersburg, 199034, Russia
- Institute of Fundamental Medicine and Biology, Kazan Federal University, Kazan, 420006, Russia
| | - Daniel C Lu
- Department of Neurosurgery, University of California, Los Angeles, CA, 90095, USA
| | - V Reggie Edgerton
- Department of Integrative Biology and Physiology, University of California, Los Angeles, 610 Charles E. Young Drive, Los Angeles, CA, 90095-1527, USA.
- Brain Research Institute, University of California, Los Angeles, Los Angeles, CA, 90095, USA.
- Department of Neurosurgery, University of California, Los Angeles, CA, 90095, USA.
- Department of Neurobiology, University of California, Los Angeles, CA, 90095, USA.
- Faculty of Science, The Centre for Neuroscience and Regenerative Medicine, University of Technology Sydney, Ultimo, NSW, Australia.
- Institut Guttmann, Hospital de Neurorehabilitació, Institut Universitari adscript a la Universitat Autònoma de Barcelona, 08916, Badalona, Spain.
| |
Collapse
|
5
|
Abolfazli A, Brechmann A, Wolff S, Spiliopoulou M. Machine learning identifies the dynamics and influencing factors in an auditory category learning experiment. Sci Rep 2020; 10:6548. [PMID: 32300111 PMCID: PMC7162940 DOI: 10.1038/s41598-020-61703-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2019] [Accepted: 02/16/2020] [Indexed: 12/04/2022] Open
Abstract
Human learning is one of the main topics in psychology and cognitive neuroscience. The analysis of experimental data, e.g. from category learning experiments, is a major challenge due to confounding factors related to perceptual processing, feedback value, response selection, as well as inter-individual differences in learning progress due to differing strategies or skills. We use machine learning to investigate (Q1) how participants of an auditory category-learning experiment evolve towards learning, (Q2) how participant performance saturates and (Q3) how early we can differentiate whether a participant has learned the categories or not. We found that a Gaussian Mixture Model describes well the evolution of participant performance and serves as basis for identifying influencing factors of task configuration (Q1). We found early saturation trends (Q2) and that CatBoost, an advanced classification algorithm, can separate between participants who learned the categories and those who did not, well before the end of the learning session, without much degradation of separation quality (Q3). Our results show that machine learning can model participant dynamics, identify influencing factors of task design and performance trends. This will help to improve computational models of auditory category learning and define suitable time points for interventions into learning, e.g. by tutorial systems.
Collapse
Affiliation(s)
- Amir Abolfazli
- Faculty of Computer Science, Otto von Guericke University Magdeburg, Magdeburg, 39106, Germany
| | - André Brechmann
- Special Lab Non-Invasive Brain Imaging, Leibniz Institute for Neurobiology, Magdeburg, 39118, Germany.
| | - Susann Wolff
- Special Lab Non-Invasive Brain Imaging, Leibniz Institute for Neurobiology, Magdeburg, 39118, Germany
| | - Myra Spiliopoulou
- Faculty of Computer Science, Otto von Guericke University Magdeburg, Magdeburg, 39106, Germany
| |
Collapse
|
6
|
Schicknick H, Henschke JU, Budinger E, Ohl FW, Gundelfinger ED, Tischmeyer W. β-adrenergic modulation of discrimination learning and memory in the auditory cortex. Eur J Neurosci 2019; 50:3141-3163. [PMID: 31162753 PMCID: PMC6900137 DOI: 10.1111/ejn.14480] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2018] [Revised: 05/27/2019] [Accepted: 05/31/2019] [Indexed: 01/11/2023]
Abstract
Despite vast literature on catecholaminergic neuromodulation of auditory cortex functioning in general, knowledge about its role for long‐term memory formation is scarce. Our previous pharmacological studies on cortex‐dependent frequency‐modulated tone‐sweep discrimination learning of Mongolian gerbils showed that auditory‐cortical D1/5‐dopamine receptor activity facilitates memory consolidation and anterograde memory formation. Considering overlapping functions of D1/5‐dopamine receptors and β‐adrenoceptors, we hypothesised a role of β‐adrenergic signalling in the auditory cortex for sweep discrimination learning and memory. Supporting this hypothesis, the β1/2‐adrenoceptor antagonist propranolol bilaterally applied to the gerbil auditory cortex after task acquisition prevented the discrimination increment that was normally monitored 1 day later. The increment in the total number of hurdle crossings performed in response to the sweeps per se was normal. Propranolol infusion after the seventh training session suppressed the previously established sweep discrimination. The suppressive effect required antagonist injection in a narrow post‐session time window. When applied to the auditory cortex 1 day before initial conditioning, β1‐adrenoceptor‐antagonising and β1‐adrenoceptor‐stimulating agents retarded and facilitated, respectively, sweep discrimination learning, whereas β2‐selective drugs were ineffective. In contrast, single‐sweep detection learning was normal after propranolol infusion. By immunohistochemistry, β1‐ and β2‐adrenoceptors were identified on the neuropil and somata of pyramidal and non‐pyramidal neurons of the gerbil auditory cortex. The present findings suggest that β‐adrenergic signalling in the auditory cortex has task‐related importance for discrimination learning of complex sounds: as previously shown for D1/5‐dopamine receptor signalling, β‐adrenoceptor activity supports long‐term memory consolidation and reconsolidation; additionally, tonic input through β1‐adrenoceptors may control mechanisms permissive for memory acquisition.
Collapse
Affiliation(s)
- Horst Schicknick
- Special Lab Molecular Biological Techniques, Leibniz Institute for Neurobiology, Magdeburg, Germany
| | - Julia U Henschke
- Department Systems Physiology of Learning, Leibniz Institute for Neurobiology, Magdeburg, Germany.,Institute of Cognitive Neurology and Dementia Research, Otto von Guericke University Magdeburg, Magdeburg, Germany
| | - Eike Budinger
- Department Systems Physiology of Learning, Leibniz Institute for Neurobiology, Magdeburg, Germany.,Center for Behavioral Brain Sciences, Magdeburg, Germany
| | - Frank W Ohl
- Department Systems Physiology of Learning, Leibniz Institute for Neurobiology, Magdeburg, Germany.,Center for Behavioral Brain Sciences, Magdeburg, Germany.,Institute of Biology, Otto von Guericke University Magdeburg, Magdeburg, Germany
| | - Eckart D Gundelfinger
- Center for Behavioral Brain Sciences, Magdeburg, Germany.,Department Neurochemistry and Molecular Biology, Leibniz Institute for Neurobiology, Magdeburg, Germany.,Molecular Neurobiology, Medical Faculty, Otto von Guericke University Magdeburg, Magdeburg, Germany
| | - Wolfgang Tischmeyer
- Special Lab Molecular Biological Techniques, Leibniz Institute for Neurobiology, Magdeburg, Germany.,Center for Behavioral Brain Sciences, Magdeburg, Germany
| |
Collapse
|
7
|
Prezenski S, Brechmann A, Wolff S, Russwinkel N. A Cognitive Modeling Approach to Strategy Formation in Dynamic Decision Making. Front Psychol 2017; 8:1335. [PMID: 28824512 PMCID: PMC5543095 DOI: 10.3389/fpsyg.2017.01335] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2017] [Accepted: 07/20/2017] [Indexed: 11/13/2022] Open
Abstract
Decision-making is a high-level cognitive process based on cognitive processes like perception, attention, and memory. Real-life situations require series of decisions to be made, with each decision depending on previous feedback from a potentially changing environment. To gain a better understanding of the underlying processes of dynamic decision-making, we applied the method of cognitive modeling on a complex rule-based category learning task. Here, participants first needed to identify the conjunction of two rules that defined a target category and later adapt to a reversal of feedback contingencies. We developed an ACT-R model for the core aspects of this dynamic decision-making task. An important aim of our model was that it provides a general account of how such tasks are solved and, with minor changes, is applicable to other stimulus materials. The model was implemented as a mixture of an exemplar-based and a rule-based approach which incorporates perceptual-motor and metacognitive aspects as well. The model solves the categorization task by first trying out one-feature strategies and then, as a result of repeated negative feedback, switching to two-feature strategies. Overall, this model solves the task in a similar way as participants do, including generally successful initial learning as well as reversal learning after the change of feedback contingencies. Moreover, the fact that not all participants were successful in the two learning phases is also reflected in the modeling data. However, we found a larger variance and a lower overall performance of the modeling data as compared to the human data which may relate to perceptual preferences or additional knowledge and rules applied by the participants. In a next step, these aspects could be implemented in the model for a better overall fit. In view of the large interindividual differences in decision performance between participants, additional information about the underlying cognitive processes from behavioral, psychobiological and neurophysiological data may help to optimize future applications of this model such that it can be transferred to other domains of comparable dynamic decision tasks.
Collapse
Affiliation(s)
- Sabine Prezenski
- Cognitive Modeling in Dynamic Human-Machine Systems, Department of Psychology and Ergonomics, Technical University BerlinBerlin, Germany
| | - André Brechmann
- Special Lab Non-Invasive Brain Imaging, Leibniz Institute for NeurobiologyMagdeburg, Germany
| | - Susann Wolff
- Special Lab Non-Invasive Brain Imaging, Leibniz Institute for NeurobiologyMagdeburg, Germany
| | - Nele Russwinkel
- Cognitive Modeling in Dynamic Human-Machine Systems, Department of Psychology and Ergonomics, Technical University BerlinBerlin, Germany
| |
Collapse
|