Poster
Unsupervised Hierarchy-Agnostic Segmentation: Parsing Semantic Image Structure
Simone Rossetti · fiora pirri
Unsupervised semantic segmentation aims to discover groupings within images, capturing objects' view-invariance without external supervision. This task is inherently ambiguous due to the variable levels of granularity in natural groupings. Existing methods often bypass this ambiguity using dataset-specific priors. In our research, we address this ambiguity head-on and provide a universal tool for pixel-level semantic parsing of images guided by the latent representations encoded in self-supervised models. We introduce a novel algebraic methodology for unsupervised image segmentation. The innovative approach identifies scene-conditioned primitives within a dataset and creates a hierarchy-agnostic semantic region tree of the image pixels. The method leverages deep feature extractors and graph partitioning techniques to recursively identify latent semantic regions, dynamically estimating the number of components and ensuring smoothness in the partitioning process. In this way, the model captures fine and coarse semantic details, producing a more nuanced and unbiased segmentation. We present a new metric for estimating the quality of the semantic segmentation of elements discovered on the different levels of the hierarchy. The metric is beneficial because it validates the intrinsic nature of the compositional relations among parts, objects, and scenes in a hierarchy-agnostic domain.Our results unequivocally demonstrate the power of this methodology. It uncovers detailed and hierarchical semantic regions without prior definitions and scales effectively across various datasets. This robust framework for unsupervised image segmentation provides richer and more accurate semantic hierarchical relationships between scene elements than traditional algorithms. The experiments underscore its potential for broad applicability in image analysis tasks, showcasing its ability to deliver a detailed and unbiased segmentation that surpasses existing unsupervised methods.
Live content is unavailable. Log in and register to view live content