NeurIPS Robust Self-Supervised Learning for Adversarial Attack Detection

Poster
in
Workshop: 5th Workshop on Self-Supervised Learning: Theory and Practice

Robust Self-Supervised Learning for Adversarial Attack Detection

Yi Li · Plamen P Angelov · Neeraj Suri

[ Abstract ] [ Project Page ]

[ OpenReview]

Abstract:

In this paper, we propose a self-supervised representation learning framework for the adversarial attack detection task to address this drawback. Firstly, we map the pixels of augmented input images into an embedding space. Then, we employ the prototype-wise contrastive estimation loss to cluster prototypes as latent variables. Additionally, drawing inspiration from the concept of memory banks, we introduce a discrimination bank to distinguish and learn representations for each individual instance that shares the same or a similar prototype, establishing a connection between instances and their associated prototypes. We propose a parallel axial-attention (PAA)-based encoder to facilitate the training process by parallel training over height- and width-axis of attention maps. Experimental results show that, compared to various benchmark self-supervised vision learning models and supervised adversarial attack detection methods, the proposed model achieves state-of-the-art performance on the adversarial attack detection task across a wide range of images.

Chat is not available.

Poster in Workshop: 5th Workshop on Self-Supervised Learning: Theory and Practice

Robust Self-Supervised Learning for Adversarial Attack Detection

Yi Li · Plamen P Angelov · Neeraj Suri

Poster
in
Workshop: 5th Workshop on Self-Supervised Learning: Theory and Practice