Embodied Synaptic Plasticity With Online Reinforcement Learning.
neuromorphic vision
neurorobotics
reinforcement learning
spiking neural networks
synaptic plasticity
Journal
Frontiers in neurorobotics
ISSN: 1662-5218
Titre abrégé: Front Neurorobot
Pays: Switzerland
ID NLM: 101477958
Informations de publication
Date de publication:
2019
2019
Historique:
received:
01
02
2019
accepted:
13
09
2019
entrez:
22
10
2019
pubmed:
22
10
2019
medline:
22
10
2019
Statut:
epublish
Résumé
The endeavor to understand the brain involves multiple collaborating research fields. Classically, synaptic plasticity rules derived by theoretical neuroscientists are evaluated in isolation on pattern classification tasks. This contrasts with the biological brain which purpose is to control a body in closed-loop. This paper contributes to bringing the fields of computational neuroscience and robotics closer together by integrating open-source software components from these two fields. The resulting framework allows to evaluate the validity of biologically-plausibe plasticity models in closed-loop robotics environments. We demonstrate this framework to evaluate Synaptic Plasticity with Online REinforcement learning (SPORE), a reward-learning rule based on synaptic sampling, on two visuomotor tasks: reaching and lane following. We show that SPORE is capable of learning to perform policies within the course of simulated hours for both tasks. Provisional parameter explorations indicate that the learning rate and the temperature driving the stochastic processes that govern synaptic learning dynamics need to be regulated for performance improvements to be retained. We conclude by discussing the recent deep reinforcement learning techniques which would be beneficial to increase the functionality of SPORE on visuomotor tasks.
Identifiants
pubmed: 31632262
doi: 10.3389/fnbot.2019.00081
pmc: PMC6786305
doi:
Types de publication
Journal Article
Langues
eng
Pagination
81Informations de copyright
Copyright © 2019 Kaiser, Hoff, Konle, Vasquez Tieck, Kappel, Reichard, Subramoney, Legenstein, Roennau, Maass and Dillmann.
Références
Elife. 2017 Nov 27;6:
pubmed: 29173280
Neural Comput. 2006 Jun;18(6):1318-48
pubmed: 16764506
Nature. 1997 Feb 6;385(6616):533-6
pubmed: 9020359
Neural Comput. 2007 Jun;19(6):1468-502
pubmed: 17444757
Front Neurorobot. 2017 Jan 25;11:2
pubmed: 28179882
J Psychopharmacol. 2016 Jan;30(1):3-12
pubmed: 26601905
eNeuro. 2018 Apr 24;5(2):
pubmed: 29696150
PLoS Comput Biol. 2015 Nov 06;11(11):e1004485
pubmed: 26545099
Cereb Cortex. 2007 Oct;17(10):2443-52
pubmed: 17220510
PLoS Comput Biol. 2008 Oct;4(10):e1000180
pubmed: 18846203
Neuroinformatics. 2010 Mar;8(1):43-60
pubmed: 20195795
Front Neuroinform. 2016 Aug 03;10:31
pubmed: 27536234
IEEE Trans Pattern Anal Mach Intell. 2013 Aug;35(8):1847-71
pubmed: 23787340
PLoS One. 2015 Mar 03;10(3):e0115620
pubmed: 25734662
Neural Comput. 2018 Jun;30(6):1514-1541
pubmed: 29652587
Neuron. 2014 Feb 5;81(3):521-8
pubmed: 24507189
PLoS Comput Biol. 2014 Mar 27;10(3):e1003511
pubmed: 24675787
Front Neurorobot. 2018 Jul 06;12:35
pubmed: 30034334
J Neurosci. 2005 Jun 29;25(26):6235-42
pubmed: 15987953
Nat Neurosci. 2016 Jan;19(1):117-26
pubmed: 26595651
Front Neurosci. 2020 May 12;14:424
pubmed: 32477050
Nature. 2015 Feb 26;518(7540):529-33
pubmed: 25719670