Sim2Real

Train in simulation, deploy in real world (with real-time adaptation)

Why simulators for robot learning?

most RL-based algos are very sample inefficient

They are cheap/fast/scalable

Problems of Sim2Real

non-parametric mismatches (simulator doesn’t consider some effects at all)

complex aerodynamics, fluid dynamics, tire dynamics, etc

Parametric mismatches (simulator uses different parameters than real)

robot mass/friction,etc

Domain Randomization

Randomize $e$ in $x_{t + 1} = f (x_{t}, u_{t}, e)$

Train a single RL policy $π (x)$ that works for many $e$

Approximation of robust control

Learning to Adapt

Randomize $e$ in $x_{t + 1} = f (x_{t}, u_{t}, e)$

Train an adaptive RL policy $π (x, e)$ that works for many $e$

approximation of adaptive control

Issue! $e$ is often unknown in real

Solution! Learning from a privileged teacher

Sim: First Train a teacher policy with privileged information $π (x, e)$

Sim: Student policy $π_{s} (x, available info in the real)$ learns from $π (x, e)$

Real: Deploy student policy $π_{s} (x, available info in the real)$

Basically an Imitation Learning problem

Brayden Zhang

Explorer

Sim2Real

Recent Notes

HOME - Deep Reinforcement Learning

HOME - Deep Learning

HOME - Robotics

Graph View

Backlinks