Poster
Customized Multiple Clustering via Multi-Modal Subspace Proxy Learning
Jiawei Yao · Qi Qian · Juhua Hu
Multiple clustering aims to discover multiple latent structures of data from different aspects. Deep multiple clustering methods have achieved remarkable results by exploiting complex patterns and relationships in data. However, existing works struggle to flexibly adapt to diverse user-specific needs in data grouping, which may require manual understanding of each clustering. To overcome these limitations, we introduce Multi-Sub, a novel end-to-end multiple clustering approach that incorporates a multi-modal subspace proxy learning framework. Utilizing the synergistic capabilities of CLIP and GPT-4, Multi-Sub aligns textual prompts expressing user preferences with corresponding visual representations. This is achieved by automatically generating proxy words from large language models that act as subspace bases, thus allowing for the customized representation of data in terms specific to the user’s interests. Our method consistently outperforms existing baselines across a broad set of datasets in visual multiple clustering tasks.
Live content is unavailable. Log in and register to view live content