Poster in Workshop: Interpretable Inductive Biases and Physically Structured Learning

16 - An Image is Worth 16 × 16 Tokens: Visual Priors for Efficient Image Synthesis with Transformers

Robin Rombach


Abstract: