Poster
in
Workshop: 6th Robot Learning Workshop: Pretraining, Fine-Tuning, and Generalization with Large Scale Models
Causal Influence Aware Counterfactual Data Augmentation
Núria Armengol Urpí · Georg Martius
Keywords: [ Data Augmentation ] [ Deep Reinforcement Learning ] [ learning from demonstrations ] [ out-of-distribution generalization ]
Pre-recorded data and human demonstrations are practical resources for teaching robots complex behaviors.However, the combinatorial nature of real-world scenarios requires a huge amount of data to prevent neural network policies from picking up on spurious and non-causal factors.We propose CAIAC, a data augmentation method that creates synthetic samples from a fixed dataset without the need to perform new environment interactions.Motivated by the fact that an agent may only modify the environment through its actions, we swap causally action-unaffected parts of the state-space from different observed trajectories.In several environment benchmarks, we observe an increase in generalization capabilities and sample efficiency.