Poster in Workshop: Interpretable Inductive Biases and Physically Structured Learning
16 - An Image is Worth 16 × 16 Tokens: Visual Priors for Efficient Image Synthesis with Transformers
Robin Rombach
Abstract: