Reinforcement Learning With Vision-Proprioception Model for Robot Planar Pushing.

Soft Actor-Critic Variational Autoencoder multimodal planar pushing reinforcement learning robot manipulation

Journal

Frontiers in neurorobotics
ISSN: 1662-5218
Titre abrégé: Front Neurorobot
Pays: Switzerland
ID NLM: 101477958

Informations de publication

Date de publication:
2022
Historique:
received: 05 12 2021
accepted: 17 01 2022
entrez: 21 3 2022
pubmed: 22 3 2022
medline: 22 3 2022
Statut: epublish

Résumé

We propose a vision-proprioception model for planar object pushing, efficiently integrating all necessary information from the environment. A Variational Autoencoder (VAE) is used to extract compact representations from the task-relevant part of the image. With the real-time robot state obtained easily from the hardware system, we fuse the latent representations from the VAE and the robot end-effector position together as the state of a Markov Decision Process. We use Soft Actor-Critic to train the robot to push different objects from random initial poses to target positions in simulation. Hindsight Experience replay is applied during the training process to improve the sample efficiency. Experiments demonstrate that our algorithm achieves a pushing performance superior to a state-based baseline model that cannot be generalized to a different object and outperforms state-of-the-art policies which operate on raw image observations. At last, we verify that our trained model has a good generalization ability to unseen objects in the real world.

Identifiants

pubmed: 35308311
doi: 10.3389/fnbot.2022.829437
pmc: PMC8926160
doi:

Types de publication

Journal Article

Langues

eng

Pagination

829437

Informations de copyright

Copyright © 2022 Cong, Liang, Ruppel, Shi, Görner, Hendrich and Zhang.

Déclaration de conflit d'intérêts

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Auteurs

Lin Cong (L)

TAMS Group, Department of Informatics, Universität Hamburg, Hamburg, Germany.

Hongzhuo Liang (H)

TAMS Group, Department of Informatics, Universität Hamburg, Hamburg, Germany.

Philipp Ruppel (P)

TAMS Group, Department of Informatics, Universität Hamburg, Hamburg, Germany.

Yunlei Shi (Y)

TAMS Group, Department of Informatics, Universität Hamburg, Hamburg, Germany.

Michael Görner (M)

TAMS Group, Department of Informatics, Universität Hamburg, Hamburg, Germany.

Norman Hendrich (N)

TAMS Group, Department of Informatics, Universität Hamburg, Hamburg, Germany.

Jianwei Zhang (J)

TAMS Group, Department of Informatics, Universität Hamburg, Hamburg, Germany.

Classifications MeSH