

Poster

ActAnywhere: Subject-Aware Video Background Generation

Boxiao Pan · Zhan Xu · Chun-Hao Huang · Krishna Kumar Singh · Yang Zhou · Leonidas Guibas · Jimei Yang

[ Project Page ]
Wed 11 Dec 11 a.m. PST — 2 p.m. PST

Abstract:

We study the novel problem of automatically generating a video background tailored to foreground subject motion. This problem is important to the movie industry and the visual effects community, where it traditionally requires tedious manual effort to solve. To this end, we propose ActAnywhere, a video diffusion model that takes as input a sequence of foreground subject segmentations and an image of a novel background, and generates a video of the subject interacting with this background. We train our model on a large-scale dataset of 2.4M videos of human-scene interactions. Through extensive evaluation, we show that our model produces videos with realistic foreground-background interaction while strictly following the guidance of the condition image. Our model generalizes to diverse scenarios, including non-human subjects, gaming and animation clips, and videos with multiple moving subjects. Both quantitative and qualitative comparisons demonstrate that our model significantly outperforms existing methods, which fail to accomplish this task.
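To make the conditioning interface concrete, below is a minimal, hypothetical sketch of how a video diffusion denoiser might receive the per-frame subject segmentations described in the abstract. This is not the authors' code: the tensor shapes, the channel-concatenation scheme, and all names are assumptions based on common video-diffusion conditioning designs (the novel-background image would typically be injected separately, e.g. via cross-attention on an image embedding).

    import torch

    # Toy shapes for illustration only (batch, frames, channels, height, width).
    B, T, C, H, W = 1, 16, 4, 32, 32

    noisy_latents = torch.randn(B, T, C, H, W)  # noisy video latents at one denoising step
    fg_frames     = torch.randn(B, T, 3, H, W)  # masked foreground RGB frames (assumed input)
    fg_masks      = torch.rand(B, T, 1, H, W)   # subject segmentation masks (assumed input)

    # Per-frame conditions are concatenated with the latents along the channel
    # axis, so the denoiser sees the subject's silhouette and motion at every frame.
    unet_input = torch.cat([noisy_latents, fg_frames, fg_masks], dim=2)
    print(unet_input.shape)  # torch.Size([1, 16, 8, 32, 32])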
