Skip to yearly menu bar Skip to main content


Short Presentation
in
Session: Creative AI Session 1

Secure & Personalized Music-to-Video Generation via CHARCHA

Mehul Agarwal · Gauri Agarwal · Santiago Benoit · Andrew Lippman · Jean Oh

East Ballroom C
[ ]
Tue 10 Dec 9 a.m. PST — noon PST

Abstract:

Music is a deeply personal experience and our aim is to enhance this with a fully-automated pipeline for personalized music video generation. Our work allows listeners to not just be consumers but co-creators in the music video generation process by creating personalized, consistent and context-driven visuals based on lyrics, rhythm and emotion in the music. The pipeline combines multimodal translation and generation techniques and utilizes low-rank adaptation on listeners' images to create immersive music videos that reflect both the music and the individual. To ensure the ethical use of users' identity, we also introduce CHARCHA, a facial identity verification protocol that protects people against unauthorized use of their face while at the same time collecting authorized images from users for personalizing their videos. This paper thus provides a secure and innovative framework for creating deeply personalized music videos.

Chat is not available.