DreamLoop: Controllable Cinemagraph Generation from a Single Photograph
Cinemagraph Creation from Single Photograph with Motion Control
DreamLoop enables users to generates cinemagraphs from a single photograph with intuitive and easy-to-use, but precise motion control.
Comparison of Our method with Baselines for Controllable General Domain Cinemagraphs
We compare our method (both with internal model and Wan2.2-5B) with baselines which include: I) Image-to-Video Models - CogVideoX and Wan2.2-5B (pretrained), II) Motion Controllable Image-to-Video Models - DragAnything, III) Our method with only first-last-frame conditioning. For I) and II), we use post-processing to make the videos looping.
Importance of Timing Control over Object Motion
DreamLoop provides additional control over the timing of object motion in cinemagraphs, precisely letting users define the duration of the motion for each action.
Full vs Partial Control over Object Motion
DreamLoop also works in cases where user does to provide the full motion path for all objects in the scene, but just the intial motion hint. The rest is inferred by the model.
Limitations of DreamLoop
Despite the capabilities of DreamLoop and its ability to generate cinemagraphs from a single photograph with intuitive and easy-to-use, but precise motion control, it still has some limitations.
Performance on VBench
DreamLoop outperforms baselines on custom domain cinemagraphs on controllable generation for general domain. across all VBench metrics.
| Method | Aesthetic Quality ↑ | Imaging Quality ↑ | Temporal Flickering ↑ | Motion Smoothness ↑ | Background Consistency ↑ | Subject Consistency ↑ |
|---|---|---|---|---|---|---|
| CogVideoX* | 0.5353 | 0.6419 | 0.9888 | 0.9904 | 0.9741 | 0.9787 |
| Wan2.2-5B* (pretrained) | 0.5442 | 0.6018 | 0.9881 | 0.9922 | 0.9598 | 0.9605 |
| DragAnything* | 0.4986 | 0.5535 | 0.9674 | 0.9789 | 0.9613 | 0.9474 |
| Ours (Wan2.2-5B) [only first-last] | 0.5897 | 0.6756 | 0.9950 | 0.9959 | 0.9746 | 0.9941 |
| Ours (internal) [only first-last] | 0.5872 | 0.6615 | 0.9899 | 0.9929 | 0.9685 | 0.9566 |
| Ours (Wan2.2-5B) | 0.5998 | 0.6762 | 0.9955 | 0.9964 | 0.9800 | 0.9868 |
| Ours (internal) | 0.5997 | 0.6771 | 0.9965 | 0.9959 | 0.9849 | 0.9858 |
Human Preference Study
User Study with Amazon Mechanical Turk involving 50 participants confirms DreamLoop achieves higher user preference for controllable cinemagraph generation compared with contemporary open-source methods. Each comparison is done by 3 participants.
Comparison of Our method with Baselines for Controllable Natural Domain Cinemagraphs
We compare our method with baselines for controllable natural domain cinemagraphs with 1 and 5 hint points.
Different Directions
We can control the direction of the motion by providing different directions as hint points.
Different number of track points
We find that our method is robust to the number of Track Points, and can generate consistent results with different number of track points.