A Differentiable Physics Engine for Deep Learning in Robotics.

deep learning differentiable physics engine gradient descent neural network controller robotics

Journal

Frontiers in neurorobotics
ISSN: 1662-5218
Titre abrégé: Front Neurorobot
Pays: Switzerland
ID NLM: 101477958

Informations de publication

Date de publication:
2019
Historique:
received: 07 06 2018
accepted: 11 02 2019
entrez: 23 3 2019
pubmed: 23 3 2019
medline: 23 3 2019
Statut: epublish

Résumé

An important field in robotics is the optimization of controllers. Currently, robots are often treated as a black box in this optimization process, which is the reason why derivative-free optimization methods such as evolutionary algorithms or reinforcement learning are omnipresent. When gradient-based methods are used, models are kept small or rely on finite difference approximations for the Jacobian. This method quickly grows expensive with increasing numbers of parameters, such as found in deep learning. We propose the implementation of a modern physics engine, which can differentiate control parameters. This engine is implemented for both CPU and GPU. Firstly, this paper shows how such an engine speeds up the optimization process, even for small problems. Furthermore, it explains why this is an alternative approach to deep Q-learning, for using deep learning in robotics. Finally, we argue that this is a big step for deep learning in robotics, as it opens up new possibilities to optimize robots, both in hardware and software.

Identifiants

pubmed: 30899218
doi: 10.3389/fnbot.2019.00006
pmc: PMC6416213
doi:

Types de publication

Journal Article

Langues

eng

Pagination

6

Références

Science. 2006 Nov 17;314(5802):1118-21
pubmed: 17110570
PLoS One. 2014 Jan 31;9(1):e86696
pubmed: 24497969
Front Neurorobot. 2019 Mar 07;13:6
pubmed: 30899218
Neural Comput. 1997 Nov 15;9(8):1735-80
pubmed: 9377276

Auteurs

Jonas Degrave (J)

IDLab-AIRO, Department of Electronics and Information Systems, Ghent University - imec, Ghent, Belgium.

Michiel Hermans (M)

Independent Researcher, Ghent, Belgium.

Joni Dambre (J)

IDLab-AIRO, Department of Electronics and Information Systems, Ghent University - imec, Ghent, Belgium.

Francis Wyffels (F)

IDLab-AIRO, Department of Electronics and Information Systems, Ghent University - imec, Ghent, Belgium.

Classifications MeSH