Skip to yearly menu bar Skip to main content


Poster Session
in
Workshop: Scientific Methods for Understanding Neural Networks

Are Capsule Networks Texture or Shape Biased?

Riccardo Renzulli · Dominik Vranay · Marco Grangetto

[ ] [ Project Page ]
Sun 15 Dec 4:30 p.m. PST — 5:30 p.m. PST

Abstract:

Capsule networks (CapsNets) have been proposed as an alternative to traditional convolutional neural networks (CNNs), with the promise of better capturing part-whole relationships and spatial hierarchies. While CNNs are known to exhibit a strong bias towards texture in visual recognition tasks, human perception is more shape-biased. In this paper, we aim to investigate whether CapsNets, by design, demonstrate a stronger bias toward shape than texture, compared to CNNs. We conducted a series of experiments across multiple capsule architectures on images with a texture-shape cue conflict. Contrary to theoretical expectations, our results show that CapsNets do not consistently exhibit a stronger shape bias than CNNs. Although certain capsule models demonstrate promising shape recognition, they still rely significantly on texture, and their overall performance remains closer to that of CNNs than to human perception. These findings highlight the need for further research and architectural improvements to fully realize the potential of CapsNets in shape-based recognition.

Chat is not available.