An Open-Loop Baseline for Reinforcement Learning Locomotion Tasks

Antonin Raffin, Olivier Sigaud, Jens Kober, Alin Albu-Schäffer, Joao Silvério, Freek Stulp

October 2023

Preprint PDF Code Project Slides Video

Abstract

Outstanding Paper Award on Empirical Resourcefulness in RL
In search of a simple baseline for Deep Reinforcement Learning in locomotion tasks, we propose a model-free open-loop strategy. By leveraging prior knowledge and the elegance of simple oscillators to generate periodic joint motions, it achieves respectable performance in five different locomotion environments, with a number of tunable parameters that is a tiny fraction of the thousands typically required by DRL algorithms. We conduct two additional experiments using open-loop oscillators to identify current shortcomings of these algorithms. Our results show that, compared to the baseline, DRL is more prone to performance degradation when exposed to sensor noise or failure. Furthermore, we demonstrate a successful transfer from simulation to reality using an elastic quadruped, where RL fails without randomization or reward engineering. Overall, the proposed baseline and associated experiments highlight the existing limitations of DRL for robotic applications, provide insights on how to address them, and encourage reflection on the costs of complexity and generality.

Type

Conference paper

Publication

RL Conference

Reinforcement Learning, Robotics

Antonin Raffin

Research Engineer in Robotics and Machine Learning

Robots. Machine Learning. Blues Dance.

An Open-Loop Baseline for Reinforcement Learning Locomotion Tasks

Abstract

Antonin Raffin

Research Engineer in Robotics and Machine Learning

Related