NeurIPS Poisoning Generative Models to Promote Catastrophic Forgetting

Poster
in
Workshop: Trustworthy and Socially Responsible Machine Learning

Poisoning Generative Models to Promote Catastrophic Forgetting

Siteng Kang · Xinhua Zhang

[ Abstract ] [ Project Page ]

[ Poster] [ OpenReview]

Abstract:

Generative models have grown into the workhorse of many state-of-the-art machine learning methods. However, their vulnerability under poisoning attacks has been largely understudied. In this work, we investigate this issue in the context of continual learning, where generative replayers are utilized to tackle catastrophic forgetting. By developing a novel customization of dirty-label input-aware backdoor to the online setting, our attacker manages to stealthily promote forgetting while retaining high accuracy at the current task and sustaining strong defenders. Our approach taps into an intriguing property of generative models, namely that they cannot well capture input-dependent triggers. Experiments on four standard datasets corroborate the poisoner’s effectiveness.

Chat is not available.

Poster in Workshop: Trustworthy and Socially Responsible Machine Learning

Poisoning Generative Models to Promote Catastrophic Forgetting

Siteng Kang · Xinhua Zhang

Poster
in
Workshop: Trustworthy and Socially Responsible Machine Learning