NeurIPS Poster Data-faithful Feature Attribution: Mitigating Unobservable Confounders via Instrumental Variables

Qiheng Sun · Haocheng Xia · Jinfei Liu

East Exhibit Hall A-C #3203

Fri 13 Dec 11 a.m. PST — 2 p.m. PST

Abstract:

State-of-the-art feature attribution methods often neglect the influence of unobservable confounders, which risks misinterpretation, especially when the interpretation must remain faithful to the data. To counteract this, we propose a new approach, data-faithful feature attribution, which trains a confounder-free model using instrumental variables. In a model trained this way, the effects of unobservable confounders are decoupled from the input features, so the model's output aligns with the contribution of the input features to the target feature in the data-generating process. Furthermore, the feature attribution results produced by our method are more robust when attributions are considered from the perspective of data generation. Our experiments on both synthetic and real-world datasets demonstrate the effectiveness of our approach.
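The abstract does not spell out the training procedure, so the following is only a minimal sketch of the general idea behind instrumental variables, using textbook two-stage least squares on synthetic data rather than the authors' method. The data-generating process, the variable names, and the helpers `fit_ols` and `two_stage_least_squares` are all assumptions made for illustration. It shows how a model fitted directly on a confounded feature absorbs the confounder's effect into its slope, while the instrumented fit stays closer to the feature's true contribution.

```python
import numpy as np

# Hypothetical synthetic setup (not from the paper): one feature, one
# unobserved confounder, and one instrument that affects the target only
# through the feature.
rng = np.random.default_rng(0)
n_samples = 20_000
true_effect = 2.0

instrument = rng.normal(size=n_samples)
confounder = rng.normal(size=n_samples)  # never shown to the model
feature = instrument + confounder + 0.1 * rng.normal(size=n_samples)
target = true_effect * feature + 3.0 * confounder + 0.1 * rng.normal(size=n_samples)


def fit_ols(regressor, response):
    """Least-squares fit with an intercept; returns (intercept, slope)."""
    design = np.column_stack([np.ones(len(regressor)), regressor])
    coef, *_ = np.linalg.lstsq(design, response, rcond=None)
    return coef


def two_stage_least_squares(instrument, feature, target):
    """Textbook 2SLS for a single feature and a single instrument."""
    # Stage 1: keep only the part of the feature explained by the instrument.
    intercept_1, slope_1 = fit_ols(instrument, feature)
    feature_hat = intercept_1 + slope_1 * instrument
    # Stage 2: regress the target on that confounder-free part.
    return fit_ols(feature_hat, target)


# Naive fit: the slope mixes the feature's effect with the confounder's.
ols_slope = fit_ols(feature, target)[1]
# Instrumented fit: the slope should land near true_effect.
iv_slope = two_stage_least_squares(instrument, feature, target)[1]

print(f"true effect: {true_effect}")
print(f"naive OLS slope: {ols_slope:.3f}")
print(f"2SLS slope: {iv_slope:.3f}")
```

In this linear toy case, any attribution computed from the naive model would inherit the inflated slope, whereas one computed from the instrumented model would reflect the feature's role in the data generation more closely. The sketch relies on the instrument being independent of the confounder, which is assumed here by construction.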