NeurIPS Causal Influence Aware Counterfactual Data Augmentation

Poster in Workshop: 6th Robot Learning Workshop: Pretraining, Fine-Tuning, and Generalization with Large Scale Models

Causal Influence Aware Counterfactual Data Augmentation

Núria Armengol Urpí · Georg Martius

Keywords: [ Data Augmentation ] [ Deep Reinforcement Learning ] [ learning from demonstrations ] [ out-of-distribution generalization ]

Abstract:

Pre-recorded data and human demonstrations are practical resources for teaching robots complex behaviors. However, the combinatorial nature of real-world scenarios requires a huge amount of data to prevent neural network policies from picking up on spurious and non-causal factors. We propose CAIAC, a data augmentation method that creates synthetic samples from a fixed dataset without the need to perform new environment interactions. Motivated by the fact that an agent may only modify the environment through its actions, we swap causally action-unaffected parts of the state-space from different observed trajectories. In several environment benchmarks, we observe an increase in generalization capabilities and sample efficiency.
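
The swapping idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how action influence is detected or how states are represented, so the sketch assumes states are already factored into `K` entities and that a boolean mask `influenced` is given. The function name `counterfactual_swap`, the array layout, and the conservative rule of swapping only when the entity is uninfluenced in both the original and the donor transition are all assumptions made for illustration.

```python
import numpy as np


def counterfactual_swap(states, next_states, influenced, rng):
    """Create counterfactual transitions by swapping action-unaffected entities.

    states, next_states: float arrays of shape (N, K, D), i.e. N transitions,
        K entities per state, D features per entity (hypothetical layout).
    influenced: bool array of shape (N, K); True where the action is assumed
        to affect entity k in transition n. How this mask is obtained is
        outside the scope of this sketch.
    rng: a numpy.random.Generator used to pick donor transitions.
    """
    n = states.shape[0]
    # Pick one random donor transition for every original transition.
    donors = rng.integers(0, n, size=n)
    # Assumption of this sketch: swap an entity only if it is uninfluenced
    # in both the original and the donor transition.
    swap = ~influenced & ~influenced[donors]
    # Replace the entity in s and s' with the donor's values; actions and
    # influenced entities are left untouched.
    aug_states = np.where(swap[..., None], states[donors], states)
    aug_next_states = np.where(swap[..., None], next_states[donors], next_states)
    return aug_states, aug_next_states
```

The augmented transitions reuse the original actions, so they can be appended to the fixed dataset without any new environment interaction. Using the same donor for `states` and `next_states` keeps each swapped entity's own dynamics consistent within a transition.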
