Josh – Joshua Glunz

Vueling x Le Tour De France

Posted on 5. April 2026, updated 6. April 2026 by Josh

A commercial motion project built entirely with generative AI. The process began with storyboard and character design: defining the cyclist, the bicycle, and the overall narrative arc. The critical step before any image generation was writing a master prompt, a single, precise description locking in camera model, lens behavior, motion blur, lighting conditions, and visual tone. This master prompt acted as a style contract across the entire project, ensuring consistency from the first styleframe to the last generated shot. Video production was then executed across Grok, Kling, and Seedance.
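The idea of a master prompt as a style contract can be sketched in a few lines of Python. This is only an illustration: the class `MasterPrompt`, the function `build_shot_prompt`, and all field values are hypothetical and not the prompts used in the project. The point is that the style part is written once and every shot prompt reuses it unchanged.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MasterPrompt:
    """Hypothetical container for the style fields named in the text."""
    camera_model: str
    lens_behavior: str
    motion_blur: str
    lighting: str
    visual_tone: str

    def render(self) -> str:
        # Join the fixed style fields into one reusable style string.
        return ", ".join([
            self.camera_model,
            self.lens_behavior,
            self.motion_blur,
            self.lighting,
            self.visual_tone,
        ])


def build_shot_prompt(master: MasterPrompt, shot_description: str) -> str:
    """Append the unchanged master style to a per-shot description."""
    return f"{shot_description.strip()}. Style: {master.render()}"


# Placeholder values for illustration only (not the project's real prompt).
example_master = MasterPrompt(
    camera_model="cinema camera",
    lens_behavior="shallow depth of field",
    motion_blur="natural motion blur",
    lighting="late afternoon sun",
    visual_tone="warm, saturated colors",
)

example_shots = [
    "Cyclist climbs a mountain road",
    "Close-up of the front wheel",
]
example_prompts = [build_shot_prompt(example_master, s) for s in example_shots]
```

Because the dataclass is frozen, the style fields cannot be changed by accident halfway through a project, which is the "contract" part of the idea.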

How did we teach plastic to remember music?

Posted on 5. April 2026, updated 6. April 2026 by Josh

This project explores what happens when scientific archive material becomes the foundation for a generative visual world. The central question was not how to create something entirely new, but how to look at something already existing from an angle conventional technology cannot reach. The starting point was a simple fascination: vinyl is a physical medium that stores information. The grooves pressed into the plastic surface carry remarkably complex acoustic information. That physical relationship between material and memory became the conceptual anchor for everything that followed.

Initial visual assets were generated using Google Nano Banana 2, with additional work in Qwen Image Edit and Flux2.Dev in ComfyUI. This combination allowed for gradual control over composition, color palette, and material quality — specifically simulating liquid chrome, polished gold, and high-gloss silver, referencing the material language of physical retail environments without ever entering one. Animation was handled primarily through Grok (image-to-video, first-frame input), chosen for its strength in dynamic movement and prompt responsiveness. The final shot was produced in Kling 2.6 using a first/last-frame method, which anchors the motion at both ends and lets the AI interpolate the transformation between them — keeping geometry stable and reducing visual drift.

TOUS 40 Years

Posted on 4. April 2026, updated 13. April 2026 by Josh

Initial visual assets were generated using Google Nano Banana 1. The process focused on simulating specific physical properties (liquid chrome, polished gold, and high-gloss silver) that mimic the material language found in physical retail environments. To animate the static outputs, the images were processed through Runway and Kling.

Image-to-Video: The generated images served as the structural reference to maintain material integrity across time.

Frame Constraints: A first/last-frame method was used to anchor the motion. By defining both start and end points, the AI calculates the transformation between them, which minimizes "melting" artifacts and keeps the subject’s geometry stable.

Windy Days

Posted on 4. April 2026, updated 6. April 2026 by Josh

By combining street photography in Barcelona with advanced AI motion control and custom sound design, the goal was to introduce a subtle, surrealist layer to static historic architecture without losing the structural integrity of the subject. The approach tries not to use AI for cheap, unintentional effects but rather to gain control and intention. The project began with a series of architectural studies of the Santa Maria del Mar church and the surrounding El Born and Gothic quarters in Barcelona. To ground the digital manipulation in reality, I recorded ambient city sounds, processed them in Ableton Live, and added a harmonic layer.

Unlike standard text-to-video generation, which often results in unpredictable "hallucinations," this workflow uses Trajectory Control within ComfyUI. It gives control over:

Path Animation: Using a spline-based editor, I manually drew the "wind" paths across the architectural facades. This allows precise control over the direction in which pixels move and at what velocity [01:26].

Pinning & Static Constraints: To maintain the subtlety described in the concept, specific architectural anchors (like the church towers) were "pinned" in the workflow [01:18]. This ensures the camera and core structure remain stationary while only the atmospheric elements, such as light reflections and shadows, are influenced by the trajectory paths.

Temporal Interpolation: Using start/stop percentages and easing functions (Ease-In/Out) [02:11], the motion was synchronized to the BPM of the Ableton soundscape, creating a cohesive link between the visual "wind" and the audio environment.
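The temporal interpolation step can be made concrete with a small sketch. This is not the code of the ComfyUI trajectory nodes: the function names are hypothetical, and smoothstep is assumed as one common ease-in/out curve (the actual node may use a different one). It shows how a start/stop percentage window and an easing function turn a frame number into a position along a drawn path, and how a BPM value translates into frames per beat.

```python
def ease_in_out(t: float) -> float:
    """Smoothstep easing: maps 0 to 0 and 1 to 1 with zero slope at both ends."""
    t = min(max(t, 0.0), 1.0)  # clamp so values outside the window hold still
    return t * t * (3.0 - 2.0 * t)


def frames_per_beat(bpm: float, fps: float) -> float:
    """Number of video frames that span one beat at the given tempo."""
    if bpm <= 0 or fps <= 0:
        raise ValueError("bpm and fps must be positive")
    return fps * 60.0 / bpm


def path_progress(frame: int, total_frames: int,
                  start_pct: float, stop_pct: float) -> float:
    """Eased position along a path (0.0 to 1.0) for a given frame.

    The motion rests at 0.0 before start_pct of the clip and at 1.0
    after stop_pct, and eases in and out in between.
    """
    if total_frames < 2:
        raise ValueError("need at least two frames")
    if not 0.0 <= start_pct < stop_pct <= 1.0:
        raise ValueError("expected 0 <= start_pct < stop_pct <= 1")
    timeline = frame / (total_frames - 1)          # 0.0 at first frame, 1.0 at last
    local = (timeline - start_pct) / (stop_pct - start_pct)
    return ease_in_out(local)
```

For example, at 120 BPM and 24 fps one beat spans 12 frames, so choosing start and stop percentages that fall on multiples of 12 frames would place the beginning and end of a wind movement on the beat.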

AI Oldtimer Flux.dev 1

Posted on 25. December 2025, updated 6. April 2026 by Josh

My younger brother once told me his favorite photo I had ever taken was of an abandoned Mercedes in a forest, surrounded by tall grass. Maybe he sensed something simple and true in it. Lately, I’ve been deep into experimentation with ComfyUI: exploring workflows, making mistakes, and slowly learning a new tool. This small series was built using FLUX1.DEV inside ComfyUI, paired with a few custom LoRAs to push toward a gritty, analog 35mm look. ComfyUI is open-source software for the visual AI space and can run fully locally on your computer. It is different from the big plug-and-play AI platforms. It’s node-based and open, more like a modular synthesizer than an app. You’re not clicking presets. You’re building a visual process. I’d definitely recommend trying it out yourself.

"d1g1cam, amateur photo, low-lit, overexposure, Low-resolution photo, shot on a mobile phone, noticeable noise in dark areas. (Trigger words for LoRa) A poison green Lamborghini sits abandoned in the middle of a dense, overgrown forest. The car is nearly swallowed by wild vegetation — very tall grasses rise halfway up the doors, and creeping vines curl over the headlights and rear spoiler. The vivid green paint pops unnaturally against the earthy tones of the forest, but is streaked with dirt and leaf debris. The windshield is fogged, and small plants grow from cracks in the soil beneath the tires. Trees surround the scene tightly, with filtered light casting uneven shadows. The photo is messy and raw, with heavy digital grain, clipped highlights, and soft focus — like a forgotten snapshot taken on a cheap mobile phone, lost in the depths of the camera roll. "

Tales Beyond Order

Posted on 2. October 2025, updated 5. April 2026 by Josh

The editing of the film was more like a visual collage, layering different elements to create a unique aesthetic. While AI tools like MidJourney played a role in generating imagery, these images were heavily altered and reworked. For example, MidJourney originally provided polished, Disney-like characters with big expressive eyes. These eyes were edited out and replaced with small black dots, a deliberate symbol of the “soulless” nature of this world. Instead of depicting a camp with 20 tents, we show thousands of identical tents stretching endlessly. This reinforces the idea of a world where individuality is erased, and experiences are generic and pre-defined.

AI Hand

Posted on 29. April 2024, updated 6. April 2026 by Josh

[09. September 2025]

What is ComfyUI?

ComfyUI is an open-source, node-based interface for Stable Diffusion that lets you build custom image and video generation workflows by connecting modular nodes for full control and flexibility.

What is WAN 2.2?

WAN 2.2 is an open-source video diffusion model for text-to-video and image-to-video generation. Paired with VACE (All-in-One Video Creation and Editing), it also supports video-to-video workflows in which reference images and control videos guide the output. This helps keep style stable and motion coherent across frames, making it especially suited for workflows that combine visual consistency with external motion guidance.

Helpful Resources

ComfyUI Documentation
— Node-based workflow interface for building and experimenting with AI pipelines.
WAN 2.2 VACE
— Video generation model paired with VACE (All-in-One Video Creation and Editing) for stable, coherent frame-to-frame output.
ControlNet: DW Pose
— Extracts skeletal keypoints from video for precise motion guidance.
YouTube: Purz and Machine Delusions explain WAN 2.2 VACE

Context

Final Video

Context

Research

Tools

Final Video / Desktop

Sampling as Method

Context

Asset Generation

Implementation

Final Video / Mobile

Context

Video

Workflow

Intro

Prompt Example

Detail Daemon

Intro

Composition work

Production

Runway Gen-2 could produce at most four seconds of coherent body movement.

Miro World

Outro

Intro

Motion Capture

Pose Extraction

Comfy

Video-to-video generation with WAN 2.2 VACE.

Result

Outro