Sim2Real

Train in simulation, deploy in real world (with real-time adaptation)

Why simulators for robot learning?

Most RL-based algos are very sample inefficient

They are cheap/fast/scalable/safe/labeled

Problems of Sim2Real

Non-parametric mismatches (simulator doesn’t consider some effects at all)

complex aerodynamics, fluid dynamics, tire dynamics, etc

Parametric mismatches (simulator uses different parameters than real)

robot mass/friction,etc

Domain Randomization

Randomize $e$ in $x_{t + 1} = f (x_{t}, u_{t}, e)$

Train a single RL policy $π (x)$ that works for the whole distribution of $e$

Approximation of robust control

What is randomized?

Physics parameters (mass, gravity, friction, etc)

Sensor noise (camera blur, pixel noise, quantization, etc)

Rendering (lighting, textures, backgrounds)

Learning to Adapt (via Privileged Information)

Randomize $e$ in $x_{t + 1} = f (x_{t}, u_{t}, e)$

Train an adaptive RL policy $π (x, e)$ that works for many $e$

approximation of adaptive control

Issue! $e$ is often unknown in real world

Solution! Learning from a privileged teacher

Sim: First Train a teacher policy with privileged information $π (x, e)$

Sim: Student policy $π_{s} (x, available info in the real)$ learns from $π (x, e)$

Real: Deploy student policy $π_{s} (x, available info in the real)$

Basically becomes an Imitation Learning problem

Brayden Zhang

Explorer

Sim2Real

Graph View

Backlinks