Intro
The project stands as a hybrid of AI-driven visuals and human craftsmanship. Instead of relying on AI as an automatic generator, we used it as a starting point, with each image manipulated, edited, and reshaped to serve the story.
The result is a film that retains a handcrafted, intentional aesthetic, proving that technology is a tool, not a replacement for creative vision. Many images and movements were created using hybrid techniques, allowing us to maintain full control over the narrative.
Composition work
Pose Extraction
The editing of the film was more like a visual collage, layering different elements to create a unique aesthetic. While AI tools like MidJourney played a role in generating imagery, these images were heavily altered and reworked. For example, MidJourney originally provided polished, Disney-like characters with big expressive eyes. The eyes were edited out and replaced with small black dots, a deliberate symbol of the “soulless” nature of this world. Instead of depicting a camp with 20 tents, we show thousands of identical tents stretching endlessly. This reinforces the idea of a world where individuality is erased and experiences are generic and pre-defined.
Runway Gen-2 was limited to producing a maximum of four seconds of coherent body movement.
Video-to-video generation with WAN 2.2 VACE
WAN 2.2 VACE (VACE stands for Video All-in-one Creation and Editing) is well suited to frame-stable generation: it keeps frames temporally consistent, so the style remains locked while motion evolves smoothly. In my workflow, the first frame of the sequence was used as a style reference, ensuring that the yellow Cinema4D hand’s appearance stayed identical throughout. The DW-Pose sequence then guided motion, so what you see is my real hand movement faithfully translated into the stylized model.
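To make the pairing of inputs concrete, here is a minimal Python sketch of how the two conditioning signals described above could be bundled before generation: one style reference taken from the first frame, plus one pose frame per output frame. It is only an illustration, not the actual ComfyUI graph. The names `VideoToVideoJob` and `build_job` are hypothetical, and frames are represented by file names rather than real image data.

```python
from dataclasses import dataclass
from typing import List, Sequence

# Stand-in for an image. A real pipeline would hold pixel arrays instead.
Frame = str


@dataclass
class VideoToVideoJob:
    """Hypothetical container for the two inputs described in the text."""

    style_reference: Frame    # first frame of the sequence, fixes the look
    pose_frames: List[Frame]  # one pose map per output frame, drives motion

    @property
    def num_frames(self) -> int:
        # The pose sequence determines how many frames are generated.
        return len(self.pose_frames)


def build_job(
    sequence_frames: Sequence[Frame], pose_frames: Sequence[Frame]
) -> VideoToVideoJob:
    """Pick the first frame as style reference and attach the pose sequence."""
    if not sequence_frames:
        raise ValueError("need at least one frame to use as style reference")
    if not pose_frames:
        raise ValueError("need at least one pose frame to guide motion")
    return VideoToVideoJob(
        style_reference=sequence_frames[0],
        pose_frames=list(pose_frames),
    )


if __name__ == "__main__":
    job = build_job(
        ["hand_000.png", "hand_001.png"],
        ["pose_000.png", "pose_001.png", "pose_002.png"],
    )
    print(job.style_reference, job.num_frames)
```

Keeping the style reference separate from the pose frames mirrors the division of labour in the workflow: appearance comes from a single image, while motion comes entirely from the recorded hand.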
Miro World
Outro
Everything is generated end-to-end in ComfyUI. Thanks to WAN 2.2 VACE’s frame stability, the output avoids jitter and drifting style, while the real-world motion input keeps the animation expressive. The result is a natural hand animation built entirely from a lightweight AI pipeline—no traditional rigging required.
Shortfilm
Helpful Resources
- Gen-3 Alpha Prompting Guide
- WAN 2.2 VACE (video generation model used here for stable, coherent frame-to-frame output)
- ControlNet: DW Pose (extracts skeletal keypoints from video for precise motion guidance)
- YouTube: Purz and Machine Delusions explain WAN 2.2 VACE