Embodied Synaptic Plasticity With Online Reinforcement Learning.

neuromorphic vision neurorobotics reinforcement learning spiking neural networks synaptic plasticity

Journal

Frontiers in neurorobotics
ISSN: 1662-5218
Titre abrégé: Front Neurorobot
Pays: Switzerland
ID NLM: 101477958

Informations de publication

Date de publication:
2019
Historique:
received: 01 02 2019
accepted: 13 09 2019
entrez: 22 10 2019
pubmed: 22 10 2019
medline: 22 10 2019
Statut: epublish

Résumé

The endeavor to understand the brain involves multiple collaborating research fields. Classically, synaptic plasticity rules derived by theoretical neuroscientists are evaluated in isolation on pattern classification tasks. This contrasts with the biological brain which purpose is to control a body in closed-loop. This paper contributes to bringing the fields of computational neuroscience and robotics closer together by integrating open-source software components from these two fields. The resulting framework allows to evaluate the validity of biologically-plausibe plasticity models in closed-loop robotics environments. We demonstrate this framework to evaluate Synaptic Plasticity with Online REinforcement learning (SPORE), a reward-learning rule based on synaptic sampling, on two visuomotor tasks: reaching and lane following. We show that SPORE is capable of learning to perform policies within the course of simulated hours for both tasks. Provisional parameter explorations indicate that the learning rate and the temperature driving the stochastic processes that govern synaptic learning dynamics need to be regulated for performance improvements to be retained. We conclude by discussing the recent deep reinforcement learning techniques which would be beneficial to increase the functionality of SPORE on visuomotor tasks.

Identifiants

pubmed: 31632262
doi: 10.3389/fnbot.2019.00081
pmc: PMC6786305
doi:

Types de publication

Journal Article

Langues

eng

Pagination

81

Informations de copyright

Copyright © 2019 Kaiser, Hoff, Konle, Vasquez Tieck, Kappel, Reichard, Subramoney, Legenstein, Roennau, Maass and Dillmann.

Références

Elife. 2017 Nov 27;6:
pubmed: 29173280
Neural Comput. 2006 Jun;18(6):1318-48
pubmed: 16764506
Nature. 1997 Feb 6;385(6616):533-6
pubmed: 9020359
Neural Comput. 2007 Jun;19(6):1468-502
pubmed: 17444757
Front Neurorobot. 2017 Jan 25;11:2
pubmed: 28179882
J Psychopharmacol. 2016 Jan;30(1):3-12
pubmed: 26601905
eNeuro. 2018 Apr 24;5(2):
pubmed: 29696150
PLoS Comput Biol. 2015 Nov 06;11(11):e1004485
pubmed: 26545099
Cereb Cortex. 2007 Oct;17(10):2443-52
pubmed: 17220510
PLoS Comput Biol. 2008 Oct;4(10):e1000180
pubmed: 18846203
Neuroinformatics. 2010 Mar;8(1):43-60
pubmed: 20195795
Front Neuroinform. 2016 Aug 03;10:31
pubmed: 27536234
IEEE Trans Pattern Anal Mach Intell. 2013 Aug;35(8):1847-71
pubmed: 23787340
PLoS One. 2015 Mar 03;10(3):e0115620
pubmed: 25734662
Neural Comput. 2018 Jun;30(6):1514-1541
pubmed: 29652587
Neuron. 2014 Feb 5;81(3):521-8
pubmed: 24507189
PLoS Comput Biol. 2014 Mar 27;10(3):e1003511
pubmed: 24675787
Front Neurorobot. 2018 Jul 06;12:35
pubmed: 30034334
J Neurosci. 2005 Jun 29;25(26):6235-42
pubmed: 15987953
Nat Neurosci. 2016 Jan;19(1):117-26
pubmed: 26595651
Front Neurosci. 2020 May 12;14:424
pubmed: 32477050
Nature. 2015 Feb 26;518(7540):529-33
pubmed: 25719670

Auteurs

Jacques Kaiser (J)

FZI Research Center for Information Technology, Karlsruhe, Germany.

Michael Hoff (M)

FZI Research Center for Information Technology, Karlsruhe, Germany.
Institute for Theoretical Computer Science, Graz University of Technology, Graz, Austria.

Andreas Konle (A)

FZI Research Center for Information Technology, Karlsruhe, Germany.

J Camilo Vasquez Tieck (JC)

FZI Research Center for Information Technology, Karlsruhe, Germany.

David Kappel (D)

Institute for Theoretical Computer Science, Graz University of Technology, Graz, Austria.
Bernstein Center for Computational Neuroscience, III Physikalisches Institut-Biophysik, Georg-August Universität, Göttingen, Germany.
Technische Universität Dresden, Chair of Highly Parallel VLSI Systems and Neuromorphic Circuits, Dresden, Germany.

Daniel Reichard (D)

FZI Research Center for Information Technology, Karlsruhe, Germany.

Anand Subramoney (A)

Institute for Theoretical Computer Science, Graz University of Technology, Graz, Austria.

Robert Legenstein (R)

Institute for Theoretical Computer Science, Graz University of Technology, Graz, Austria.

Arne Roennau (A)

FZI Research Center for Information Technology, Karlsruhe, Germany.

Wolfgang Maass (W)

Institute for Theoretical Computer Science, Graz University of Technology, Graz, Austria.

Rüdiger Dillmann (R)

FZI Research Center for Information Technology, Karlsruhe, Germany.

Classifications MeSH