Antonin Raffin

Research Engineer in Robotics and Machine Learning

German Aerospace Center (DLR)

Bio

Antonin Raffin is a research engineer at the German Aerospace Center (DLR) who specializes in reinforcement learning (RL). He is the lead developer of Stable-Baselines3 (SB3), an open-source library that implements Deep RL algorithms. His main focus is on learning controllers directly on real robots and improving the reproducibility of RL.

Interests

Robotics
Reinforcement Learning
State Representation Learning
Machine Learning

Projects

SBX: Stable Baselines Jax

Proof-of-concept version of Stable-Baselines3 in JAX.

Datasaurust

Blazingly fast implementation of the Datasaurus paper in Rust. Same Stats, Different Graphs.

Stable Baselines3

A set of improved implementations of reinforcement learning algorithms in PyTorch.

Learning to Drive Smoothly in Minutes

Learning to drive smoothly in minutes using reinforcement learning on a Donkey Car.

RL Baselines Zoo

A collection of 70+ pre-trained RL agents using Stable Baselines.

S-RL Toolbox

S-RL Toolbox: Reinforcement Learning (RL) and State Representation Learning (SRL) for Robotics

Stable Baselines

A fork of OpenAI Baselines with implementations of reinforcement learning algorithms.

Racing Robot

Autonomous Racing Robot With an Arduino, a Raspberry Pi and a Pi Camera

Arduino Robust Serial

A simple and robust serial communication protocol, with implementations in C (Arduino), C++, Python, and Rust.

Selected Publications

Antonin Raffin, Olivier Sigaud, Jens Kober, Alin Albu-Schäffer, Joao Silvério, Freek Stulp

October 2023 RLC 2024

An Open-Loop Baseline for Reinforcement Learning Locomotion Tasks

Outstanding Paper Award on Empirical Resourcefulness in RL
In search of a simple baseline for Deep Reinforcement Learning in locomotion tasks, we propose a model-free open-loop strategy. By leveraging prior knowledge and the elegance of simple oscillators to generate periodic joint motions, it achieves respectable performance in five different locomotion environments, with a number of tunable parameters that is a tiny fraction of the thousands typically required by DRL algorithms. We conduct two additional experiments using open-loop oscillators to identify current shortcomings of these algorithms. Our results show that, compared to the baseline, DRL is more prone to performance degradation when exposed to sensor noise or failure. Furthermore, we demonstrate a successful transfer from simulation to reality using an elastic quadruped, where RL fails without randomization or reward engineering. Overall, the proposed baseline and associated experiments highlight the existing limitations of DRL for robotic applications, provide insights on how to address them, and encourage reflection on the costs of complexity and generality.

Preprint PDF Code Project Slides Video
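The open-loop strategy described in the abstract above can be illustrated with a minimal sketch. It assumes a generic sinusoidal oscillator per joint, which is not necessarily the exact formulation used in the paper; the function name, the two-joint setup, and all parameter values are hypothetical and chosen only for illustration.

```python
import math


def oscillator_targets(t, amplitudes, frequency_hz, phase_offsets, biases):
    """Return one desired joint position per joint at time t (in seconds).

    Assumed form for each joint: bias + amplitude * sin(2*pi*f*t + phase).
    """
    omega = 2.0 * math.pi * frequency_hz
    return [
        bias + amp * math.sin(omega * t + phase)
        for amp, phase, bias in zip(amplitudes, phase_offsets, biases)
    ]


# Hypothetical two-joint setup: the second joint moves in anti-phase.
amplitudes = [0.5, 0.5]
phase_offsets = [0.0, math.pi]
biases = [0.0, 0.1]
frequency_hz = 1.0

# Open loop: the commands depend only on time, never on sensor readings.
trajectory = [
    oscillator_targets(step * 0.05, amplitudes, frequency_hz, phase_offsets, biases)
    for step in range(40)
]
```

Each joint here has three tunable parameters plus one shared frequency, which gives a sense of how small the parameter count of such a controller can be. Because no observation enters the computation, sensor noise or sensor failure cannot change the commanded motion.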

Antonin Raffin, Daniel Seidel, Jens Kober, Alin Albu-Schäffer, Joao Silvério, Freek Stulp

October 2022 SoftRobot 2024

Learning to Exploit Elastic Actuators for Quadruped Locomotion

Spring-based actuators in legged locomotion provide energy-efficiency and improved performance, but increase the difficulty of controller design. While previous work has focused on extensive modeling and simulation to find optimal controllers for such systems, we propose to learn model-free controllers directly on the real robot. In our approach, gaits are first synthesized by central pattern generators (CPGs), whose parameters are optimized to quickly obtain an open-loop controller that achieves efficient locomotion. Then, to make this controller more robust and further improve the performance, we use reinforcement learning to close the loop, to learn corrective actions on top of the CPGs. We evaluate the proposed approach on the DLR elastic quadruped bert. Our results in learning trotting and pronking gaits show that exploitation of the spring actuator dynamics emerges naturally from optimizing for dynamic motions, yielding high-performing locomotion, particularly the fastest walking gait recorded on bert, despite being model-free. The whole process takes no more than 1.5 hours on the real robot and results in natural-looking gaits.

Preprint Slides Video
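The idea of closing the loop with corrective actions on top of a pattern generator, as described in the abstract above, can be sketched as follows. This is an illustration under stated assumptions, not the authors' controller: the sine-wave generator, the placeholder policy (a fixed proportional rule standing in for a learned one), the correction bound, and the action limit are all hypothetical.

```python
import math


def cpg_action(t, amplitude=0.5, frequency_hz=1.0):
    """Open-loop pattern generator: a plain sine wave (assumed form)."""
    return amplitude * math.sin(2.0 * math.pi * frequency_hz * t)


def placeholder_policy(observation):
    """Stand-in for a learned RL policy; here a fixed proportional rule."""
    return -0.2 * observation


def closed_loop_action(t, observation, max_correction=0.1, limit=1.0):
    """Add a bounded corrective term to the open-loop action."""
    correction = placeholder_policy(observation)
    # Assumption: keep the correction small so the open-loop gait dominates.
    correction = max(-max_correction, min(max_correction, correction))
    action = cpg_action(t) + correction
    # Assumption: clip the final command to a symmetric actuator limit.
    return max(-limit, min(limit, action))
```

With a zero observation the command equals the open-loop pattern, so the learned part only has to model deviations from an already working gait. Bounding the correction is one possible design choice for keeping exploration safe on real hardware.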

Recent Publications

The 37 Implementation Details of Proximal Policy Optimization

Shengyi Huang, Rousslan Fernand Julien Dossa, Antonin Raffin, Anssi Kanervisto, Weixun Wang

PDF Code Video

Stable-Baselines3: Reliable Reinforcement Learning Implementations

Antonin Raffin, Ashley Hill, Adam Gleave, Anssi Kanervisto, Maximilian Ernestus, Noah Dormann

PDF Code Project

Smooth Exploration for Robotic Reinforcement Learning

Antonin Raffin, Jens Kober, Freek Stulp

Preprint PDF Code Project Poster Video

Recent & Upcoming Talks

Recent Advances in RL for Continuous Control

A presentation on recent advances in RL, in terms of algorithms, software, and simulators.

May 21, 2025 11:00 — 12:00 CERN, Geneva, Switzerland

Antonin Raffin

Slides

Enabling Reinforcement Learning on Real Robots

Invited talk while visiting the INRIA Willow team in Paris.

Jan 31, 2025 13:30 — 14:15 Paris, France

Antonin Raffin

Slides

Ingredients for Learning Locomotion Directly on Real Hardware

Invited talk for the Soccer Robots workshop at the Humanoids 2024 conference.

Nov 22, 2024 13:30 — 14:15 Nancy, France

Antonin Raffin

Slides

Designing and Running Real-World RL Experiments

Talk at the Reinforcement Learning for Autonomous Accelerators workshop (RL4AA). The idea is to walk through the different steps of RL experimentation (task design, choosing the right algorithm, implementing safety layers) and also provide practical advice on how to run experiments and troubleshoot common problems.

Feb 5, 2024 08:30 – 10:00 Salzburg, Austria

Antonin Raffin

Slides Video

Experience

Researcher

German Aerospace Center (DLR)

October 2018 – Present Munich

Machine Learning for Robots.

Research Engineer

ENSTA ParisTech - U2IS robotics lab

October 2017 – October 2018 Palaiseau

Working on Reinforcement Learning and State Representation Learning for the DREAM project.

Research Intern

Riminder

April 2017 – September 2017 Paris

Deep Learning for Human Resources.

Research Intern

TU Berlin - RBO lab

May 2016 – August 2016 Berlin

Research internship in representation and reinforcement learning.

Recent Posts

Getting SAC to Work on a Massive Parallel Simulator: Tuning for Speed (Part II)

Automatic Hyperparameter Tuning - In Practice (Part 2)

Getting SAC to Work on a Massive Parallel Simulator: An RL Journey With Off-Policy Algorithms (Part I)

Automatic Hyperparameter Tuning - A Visual Guide (Part 1)
