NeurIPS Multimodal Relational VAE

Poster
in
Workshop: Bayesian Deep Learning

Multimodal Relational VAE

Thomas Sutter · Julia Vogt

[ Abstract ] [ Project Page ]

[ OpenReview]

Abstract:

In this work, we propose a new formulation for multimodal VAEs to model and learn the relationship between data types. Despite their recent progress, current multimodal generative methods are based on simplistic assumptions regarding the relation between data types, which leads to a trade-off between coherence and quality of generated samples - even for simple toy datasets. The proposed method learns the relationship between data types instead of relying on pre-defined and limiting assumptions. Based on the principles of variational inference, we change the posterior approximation to explicitly include information about the relation between data types. We show empirically that the simplified assumption of a single shared latent space leads to inferior performance for a dataset with additional pairwise shared information.

Chat is not available.

Poster in Workshop: Bayesian Deep Learning

Multimodal Relational VAE

Thomas Sutter · Julia Vogt

Poster
in
Workshop: Bayesian Deep Learning