NeurIPS Provable Robustness of (Graph) Neural Networks Against Data Poisoning and Backdoor Attacks

Poster
in
Workshop: 3rd Workshop on New Frontiers in Adversarial Machine Learning (AdvML-Frontiers)

Provable Robustness of (Graph) Neural Networks Against Data Poisoning and Backdoor Attacks

Lukas Gosch · Mahalakshmi Sabanayagam · Debarghya Ghoshdastidar · Stephan Günnemann

Keywords: [ graph neural networks ] [ Neural Tangent Kernel ] [ certificates ] [ provable robustness ] [ mixed-integer linear programming ] [ poisoning ] [ backdoor attacks ] [ Neural Networks ] [ Support vector machines ] [ robustness ]

[ Abstract ] [ Project Page ]

[ OpenReview]

Abstract:

Generalization of machine learning models can be severely compromised by data poisoning, where adversarial changes are applied to the training data. This vulnerability has led to interest in certifying (i.e., proving) that such changes up to a certain magnitude do not affect test predictions. We, for the first time, certify Graph Neural Networks (GNNs) against poisoning attacks, including backdoors, targeting the node features of a given graph. Our certificates are white-box and based upon (i) the neural tangent kernel, which characterizes the training dynamics of sufficiently wide networks; and (ii) a novel reformulation of the bilevel optimization describing poisoning as a mixed-integer linear program. We note that our framework is more general and constitutes the first approach to derive white-box poisoning certificates for NNs, which can be of independent interest beyond graph-related tasks.

Chat is not available.

Poster in Workshop: 3rd Workshop on New Frontiers in Adversarial Machine Learning (AdvML-Frontiers)

Provable Robustness of (Graph) Neural Networks Against Data Poisoning and Backdoor Attacks

Lukas Gosch · Mahalakshmi Sabanayagam · Debarghya Ghoshdastidar · Stephan Günnemann

Poster
in
Workshop: 3rd Workshop on New Frontiers in Adversarial Machine Learning (AdvML-Frontiers)