Skip to yearly menu bar Skip to main content


Oral Poster

Bayesian-Guided Label Mapping for Visual Reprogramming

Chengyi Cai · Zesheng Ye · Lei Feng · Jianzhong Qi · Feng Liu

[ ]
Fri 13 Dec 4:30 p.m. PST — 7:30 p.m. PST
 
Oral presentation: Oral Session 6B
Fri 13 Dec 3:30 p.m. PST — 4:30 p.m. PST

Abstract: $\textit{Visual reprogramming}$ (VR) leverages the intrinsic capabilities of pretrained vision models by adapting their input or output interfaces to solve downstream tasks whose labels (i.e., downstream labels) might be totally different from the labels associated with the pretrained models (i.e., pretrained labels). When adapting the output interface of the pretrained model, label mapping methods are proposed to map the pretrained labels to downstream labels via a gradient-free one-to-one label mapping function. However, in this paper, we reveal that one-to-one mappings may overlook the complex relationship between pretrained and downstream labels. Motivated by this observation, we propose a $\textit{\textbf{B}ayesian-guided \textbf{L}abel \textbf{M}apping}$ (BLM) method. BLM constructs iteratively-updated probabilistic label mapping matrices, with each element quantifying pairwise relationship between pretrained and downstream labels. The assignment of values to the constructed matrices is guided by Bayesian conditional probability, considering the joint distribution of the downstream labels and the labels predicted by the pretrained model on downstream samples. Experiments conducted on both pretrained vision models (e.g., ResNeXt) and vision-language models (e.g., CLIP) demonstrate the superior performance of BLM over existing label mapping methods. The success of BLM also offers a probabilistic lens through which to understand and analyze the effectiveness of VR.

Live content is unavailable. Log in and register to view live content