Given two input images, our approach can generate a smooth and natural transition video between them. This is achieved purely by leveraging the prior knowledge of a pre-trained diffusion model, i.e., Stable Diffusion.
Abstract
Diffusion models have achieved remarkable image generation quality surpassing previous generative models. However, a notable limitation of diffusion models, in comparison to GANs, is their difficulty in smoothly interpolating between two image samples, due to their highly unstructured latent space. Such a smooth interpolation is intriguing as it naturally serves as a solution for the image morphing task with many applications. In this work, we present DiffMorpher, the first approach enabling smooth and natural image interpolation using diffusion models. Our key idea is to capture the semantics of the two images by fitting two LoRAs to them respectively, and interpolate between both the LoRA parameters and the latent noises to ensure a smooth semantic transition, where correspondence automatically emerges without the need for annotation. In addition, we propose an attention interpolation and injection technique, an adaptive normalization adjustment method, and a new sampling schedule to further enhance the smoothness between consecutive images. Extensive experiments demonstrate that DiffMorpher achieves starkly better image morphing effects than previous methods across a variety of object categories, bridging a critical functional gap that distinguished diffusion models from GANs.
Paper: arxiv.org/abs/2312.07409
Code: arxiv.org/abs/2312.07409 (coming soon)
Project Page:kevin-thu.github.io/DiffMorpher_page/
tagginator@utter.online [bot] 1 year ago
New Lemmy Post: DiffMorpher: Unleashing the Capability of Diffusion Models for Image Morphing (https://lemmy.dbzer0.com/post/10304469)
Tagging: #StableDiffusion
(Replying in the OP of this thread (NOT THIS BOT!) will appear as a comment in the lemmy discussion.)
I am a FOSS bot. Check my README: https://github.com/db0/lemmy-tagginator/blob/main/README.md