

Poster in Workshop: 5th Workshop on Self-Supervised Learning: Theory and Practice

Uncovering RL Integration in SSL Loss: Objective-Specific Implications for Data-Efficient RL

Ömer Çağatan · Baris Akgun


Abstract:

In this study, we examine the impact of different self-supervised learning (SSL) objectives within the Self-Predictive Representations (SPR) framework. Specifically, we explore SSL modifications such as terminal state masking and prioritized replay weighting, which were not explicitly discussed in the original framework. These modifications are specific to reinforcement learning (RL) but are not applicable to all RL algorithms. It is therefore of interest to gauge their impact on performance and to examine other SSL objectives. We evaluate six SPR variants on the Atari 100k benchmark, including versions without these modifications and others that incorporate feature decorrelation methods such as Barlow Twins and VICReg, which cannot accommodate these specific adjustments. Additionally, we assess the performance of these objectives on the DeepMind Control Suite, a setting that does not feature these modifications. Our findings show that the SSL modifications within SPR significantly influence performance, underscoring the critical importance of both the choice of SSL objective and its accompanying modifications in data-efficient, self-predictive reinforcement learning.
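
To make the two modifications named in the abstract concrete, the following is a minimal NumPy sketch of how terminal state masking and prioritized replay weighting could enter a self-predictive cosine loss. It is an illustration under stated assumptions, not the authors' implementation: the function names, the tensor shapes, the convention that `done[b, k]` flags the first prediction step whose target is no longer valid, and the use of replay importance weights as a per-sample multiplier are all hypothetical choices made for this example.

```python
import numpy as np

def negative_cosine(pred, target, eps=1e-8):
    """Negative cosine similarity along the last (feature) axis."""
    pred_n = pred / (np.linalg.norm(pred, axis=-1, keepdims=True) + eps)
    target_n = target / (np.linalg.norm(target, axis=-1, keepdims=True) + eps)
    return -np.sum(pred_n * target_n, axis=-1)

def spr_style_loss(pred, target, done=None, is_weights=None):
    """Illustrative self-predictive loss (hypothetical helper).

    pred, target: arrays of shape (B, K, D), predicted and target latents
                  for K future steps.
    done:         optional bool array (B, K); assumption: a True entry
                  marks the first step whose target is invalid, and every
                  later step is masked as well.
    is_weights:   optional array (B,); assumption: importance weights
                  supplied by a prioritized replay buffer.
    """
    per_step = negative_cosine(pred, target)            # shape (B, K)
    if done is not None:
        # Terminal state masking: keep only steps before the first flag.
        valid = (np.cumsum(done, axis=1) == 0).astype(per_step.dtype)
        per_step = per_step * valid
    per_sample = per_step.sum(axis=1)                   # shape (B,)
    if is_weights is not None:
        # Prioritized replay weighting: scale each sample's loss.
        per_sample = per_sample * is_weights
    return per_sample.mean()

# Small usage example with perfectly matching predictions.
rng = np.random.default_rng(0)
latents = rng.normal(size=(2, 3, 4))
done = np.array([[False, True, False],
                 [False, False, False]])
weights = np.array([0.5, 1.0])

plain = spr_style_loss(latents, latents)
masked = spr_style_loss(latents, latents, done=done)
masked_weighted = spr_style_loss(latents, latents, done=done, is_weights=weights)
```

Passing `done=None` and `is_weights=None` gives the plain variant without either modification. Objectives such as Barlow Twins and VICReg are computed from batch-level feature statistics rather than from a per-step, per-sample loss like `per_step` above, which is one way to see why per-sample masking and weighting do not carry over to them directly.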
