If you've ever watched a product commercial where a motorcycle floats dramatically through space, orbiting slowly while meteors streak past, you probably assumed it was made by a team of 3D artists using Blender, Cinema 4D, or expensive motion graphics software.
What if you could create the same cinematic quality using just AI? No 3D modeling. No keyframe animation. No render farms.
This guide walks you through the exact process of creating a professional-grade animated product video using Masonry — from placing a real motorcycle into a cinematic space environment to generating smooth 360-degree orbit shots that look like they came straight out of a Hollywood VFX studio.
We'll cover:
- Multi-image referencing (using one image as your environment, another as your subject)
- Subject replacement techniques that maintain product integrity
- Prompt engineering for preventing AI deformation
- Camera movement prompting (gimbal stabilization, orbits, zooms)
- Frame-to-frame video generation
Let's build something cinematic.
The Tools You'll Need
For this project, we're using Masonry, which gives you access to multiple AI image and video models in one workspace. The specific models we'll be using:
For Image Generation:
- Nano Banana Pro (Gemini 3 Pro) — Best for subject integrity and multi-image compositing
- SeedDream 4 — Great for photorealistic outputs with dramatic lighting
- SeedDream 4.5 — Enhanced version with better detail preservation
- GPT Image 1.5 — Strong for creative compositing and perspective changes
For Video Generation:
- Seedance 1.5 Pro — The powerhouse for cinematic AI video with frame-to-frame control
Phase 1: Gathering Your Source Materials
Every great composite starts with great source images. For this project, you need:
- Your Subject Image — A clean, high-resolution product photo (we're using a Benelli TRK-502 motorcycle)
- Your Environment Image — The background/scene you want to place your subject into (Earth from space with atmospheric glow)


Pro tip: The cleaner your source images, the better your AI composites will be. Remove busy backgrounds from your subject if possible.
Phase 2: Initial Compositing — Bike in Space
This is where the magic begins. We're going to take our motorcycle and seamlessly place it into a cinematic space environment.
The Core Prompt Template
Here's the prompt structure that consistently produces clean composites:
Use Image 1 as the environmental reference and Image 2 as the subject reference. Place the motorcycle from Image 2 into the same Earth-from-space environment shown in Image 1. The bike should appear naturally floating above Earth's curved horizon, matching the scale, perspective, lighting, and camera angle of Image 1. Scene requirements: • Background must exactly resemble the Earth atmosphere, clouds, and curvature from Image 1 • Space should be deep black with subtle stars • Soft blue atmospheric glow along Earth's edge • Realistic cinematic lighting on the bike consistent with sunlight from space • Accurate shadows and reflections on the bike • Bike must remain completely intact — no deformation, no transformation, no design changes • Preserve original colors, logo, body structure, wheels, and proportions Composition: • Vertical 9:16 aspect ratio • Bike positioned slightly right-of-center • Camera angle should feel cinematic and realistic, as if shot from orbit • Bike appears suspended or floating naturally in space (no motion blur) Constraints: • Do NOT include any play button, UI elements, overlays, icons, or controls • Do NOT add text, watermarks, or graphics • Do NOT stylize or cartoonize • Ultra-realistic, photoreal, high-detail output
Model Comparison: Same Prompt, Different Results
Here's what each model produces with the exact same prompt:



Why This Prompt Structure Works
Let's break down the key elements:
- Clear image role assignment — "Image 1 as environmental reference and Image 2 as subject reference" tells the AI exactly how to use each input
- Explicit integrity constraints — "Bike must remain completely intact — no deformation, no transformation" prevents the AI from "creatively" modifying your product
- Specific visual requirements — Instead of saying "make it look good," we define exactly what "good" means: lighting direction, reflection behavior, atmospheric effects
- Negative constraints — Telling the AI what NOT to do is often as important as telling it what to do
Phase 3: Creating Scene Variations
Once you have your base composite, you can create dramatic scene variations by adding environmental elements.
Dynamic Action Scene with Meteors and Jets
Use Image 1 as the motorcycle reference and Image 2 as the scene and composition reference. Replace the car in Image 2 with the motorcycle from Image 1. The motorcycle must take the exact role, scale, and position of the car in the space scene. Scene description: A cinematic deep-space environment above Earth. The motorcycle is drifting dynamically in space, slightly tilted, as if caught in zero gravity. Earth's curved horizon is visible below with glowing blue atmosphere and cloud details. Fiery meteors and debris streak through space around the bike. Modern fighter jets fly nearby, adding scale, speed, and intensity. Visual realism: • Ultra-realistic lighting with strong highlights and shadows • Reflections of Earth glow and meteors on the motorcycle body • Motion blur only on meteors and jets, NOT on the motorcycle • Motorcycle remains sharp, intact, and fully detailed • Preserve exact bike design, colors, logos, wheels, and proportions • No deformation, no transformation, no redesign Composition: • Dynamic cinematic angle • Bike placed centrally as the hero subject • Strong depth of field • Epic sci-fi movie poster aesthetic • High contrast, dramatic lighting


Phase 4: Close-Up and Detail Shots
Professional product videos aren't just wide shots — they include dramatic close-ups that show off the product's details.
Front Close-Up Shot
Replace the car in the second image with the motorcycle from the first image. Create a dramatic close-up cinematic shot of the motorcycle in outer space. Match the camera angle, framing, and intensity of the second image. The shot should focus on the front and side details of the motorcycle. Scene details: Deep space environment with Earth's curved horizon partially visible. Subtle meteors and space debris in the background. Soft blue atmospheric glow reflecting on the motorcycle. High-contrast cinematic lighting. Camera and composition: • Extreme close-up perspective • Dynamic cinematic angle • Shallow depth of field • Bike fills most of the frame • Slight background motion, bike remains sharp Bike constraints (VERY IMPORTANT): • Keep the motorcycle completely intact • No shape change, no deformation or morphing • Preserve exact geometry, proportions, wheels, frame, exhaust, and logos • Maintain original colors and materials

Scene with Satellite Background
Replace the car in the second image with the motorcycle from the first image. Keep the entire background exactly the same as the second image: • Outer space environment • Earth's curved horizon • Satellite in the background • Lighting, shadows, debris, and perspective unchanged Place the motorcycle in the exact position, scale, and orientation of the original car. The motorcycle should appear naturally floating in space, matching the scene's lighting and camera angle. Bike integrity (CRITICAL): • Keep the motorcycle completely intact • Preserve all physical and minute details • No deformation, no morphing, no transformation • No stretching or warping • Preserve exact geometry, proportions, wheels, frame, exhaust, mirrors, cables, bolts, textures, and logos • Maintain original colors and materials

Rear View Shots
Use the first image as the base scene. Use the second image as the exact reference for the bike. Replace the car in the first image with the bike from the second image, using a rear/back view of the bike. The bike must be oriented to match the same angle, perspective, and framing as the original car's backshot. The bike's structure, proportions, geometry, colors, textures, logos, decals, and branding must remain exactly identical to the reference image. The rear details of the bike must be clearly visible and intact. Keep the entire space environment completely unchanged, including the stars, asteroids, lighting, reflections, depth, and cinematic atmosphere. Do not stylize, redesign, or transform the bike in any way.


Distance Shot
Sometimes simpler prompts work when you're iterating on an existing composition:
make the bike look really far from the camera angle and make it look small

Phase 5: Component Showcase Shots
For premium product videos, you'll want isolated shots of key components.
Tyre Close-Up in Space
Use the first image as the base background scene. Use the second image as the exact reference for the bike's tyre. Place a close-up, zoomed-in view of the bike's tyre into the space environment. The visible area should focus on the tyre tread, rim, brake disc, and surrounding mechanical details, filling a large portion of the frame. The tyre must match the reference exactly in structure, proportions, tread pattern, rim design, brake disc shape, textures, materials, and colors. Do not alter, stylize, or redesign any part of the tyre. Integrate the tyre naturally into the space scene, matching the lighting direction, reflections, and contrast so it appears as if the tyre is floating in space. Keep the entire background completely unchanged, including Earth's curvature, clouds, stars, and overall atmosphere.


Engine Close-Up in Space
Use the first image as the base background scene. Use the second image as the exact reference for the bike's engine section. Place only the bike engine and surrounding mechanical components into the space environment. The visible area should include the engine casing, frame, exhaust components, and mounting details. No other parts of the bike should be visible. Keep the engine's structure, geometry, proportions, textures, materials, logos, text, bolts, and surface details exactly identical to the reference image. Do not alter, stylize, or redesign the engine in any way. Integrate the engine naturally into the space scene, adjusting scale and perspective so it appears to be floating above Earth. Match lighting direction, highlights, reflections, and shadows to the space environment for a realistic composite.


Phase 6: Generating Cinematic Videos
This is where Seedance 1.5 Pro transforms your static images into cinematic motion. Here are the key video types you can create:
Video Type 1: Free Fall Deceleration
This creates a dramatic shot where the bike falls rapidly through space, then smoothly decelerates to a stop.
Frame 1: The motorcycle is in rapid free fall in outer space above Earth. High-speed downward motion. Strong sense of velocity. Background shows Earth's curved horizon with blue atmospheric glow. Meteors and space debris streak past with motion blur. The motorcycle remains perfectly intact and sharp. Transition (Frame 1 → Frame 2): The motorcycle smoothly decelerates from high speed to zero. No sudden braking. Motion eases naturally. Camera follows the bike on a gimbal-stabilized rig, reducing motion progressively. Background motion slows gradually. Frame 2: The motorcycle is completely stationary, suspended in space. The bike is perfectly still. Camera is fully stabilized. Earth rotates extremely slowly in the background. Camera behavior: Cinematic gimbal stabilization. Smooth easing in and easing out. No shake. No jitter. No snap movements. Bike integrity constraints (CRITICAL): • Motorcycle geometry must remain exactly the same across frames • No deformation, no morphing, no transformation • No stretching or bending • Preserve exact proportions, wheels, frame, exhaust, and logos • Maintain original colors and materials
Video Type 2: 360-Degree Orbit Shot
The classic product showcase — a smooth camera orbit around the stationary subject.
Use the first image as the start frame and the second image as the end frame. Scene description: The motorcycle is floating calmly in outer space above Earth. Earth's curved horizon with soft blue atmospheric glow is visible in the background. Small meteors and space debris drift slowly, adding depth and scale. The environment feels vast, silent, and cinematic. Motion and transition: The motorcycle remains suspended in zero gravity throughout the shot. No movement from the motorcycle itself. The transition from the first frame to the second frame happens smoothly and naturally. Camera behavior: The camera is mounted on a stabilized cinematic gimbal. The camera performs a slow, smooth 360-degree orbit around the motorcycle. The orbit is steady, controlled, and elegant. No shake, no jitter, no sudden acceleration. Camera motion eases in and eases out naturally. Bike integrity constraints (VERY IMPORTANT): • The motorcycle must remain completely intact • No deformation, no morphing, no transformation • Preserve exact geometry, proportions, wheels, frame, exhaust, and logos • Maintain original colors, materials, and textures • No parts added or removed
Video Type 3: Subtle Floating Animation
For when you want minimal movement — just enough to show it's a video, not a still image.
Animate this exact frame without changing the scene or objects. The motorcycle is hovering calmly in outer space. Very slow, subtle floating motion as if suspended in zero gravity. Minimal vertical drift and extremely slight micro-rotation. No lateral movement across the frame. Background motion: Small meteors and debris drift slowly and naturally. Earth rotates extremely slowly and subtly. Camera behavior: Cinematic camera on a stabilized gimbal. Camera remains steady with a slight cinematic drift. No shake, no jitter, no zoom, no fast motion. Focus stays locked on the motorcycle. Bike integrity (CRITICAL): • Motorcycle remains 100% intact • No deformation, no morphing, no transformation • No stretching, bending, or warping • Preserve exact geometry, proportions, wheels, frame, exhaust, mirrors, cables, and logos • Colors, textures, and materials remain unchanged
Video Type 4: Component Animation — Tyre Rotation
Use the attached image as the starting frame. From this exact start frame, animate the bike tyre so that it begins to rotate slowly and smoothly around its central axle. The rotation is calm, controlled, and cinematic. The tyre remains a single rigid object at all times. There is no wobble, no deformation, no bending, no scaling. No change to the tyre's shape, tread pattern, rim geometry, brake disc, or mechanical structure. All logos, markings, text, and surface details remain sharp, readable, and completely unchanged throughout the video. The camera is mounted on a stabilized gimbal and performs a gentle cinematic pan, subtly drifting sideways and slightly downward while keeping the tyre centered and in sharp focus. The space background remains stable and realistic, with Earth visible below for depth and scale. Lighting stays consistent, enhancing rubber texture and metallic reflections naturally.
Advanced Techniques
The Subject Integrity Problem
One of the biggest challenges in AI image generation is maintaining subject integrity. AI models love to "interpret" your subject, which often means adding extra wheels, morphing proportions, or changing colors.
The solution: Be extremely explicit in your constraints. Always include:
Bike integrity constraints (CRITICAL): • Keep the motorcycle completely intact • No deformation, no morphing, no transformation • No stretching or warping • Preserve exact geometry, proportions, wheels, frame, exhaust, mirrors, cables, bolts, textures, and logos • Maintain original colors and materials
Multi-Model Comparison Workflow
For each key shot, we recommend generating with multiple models simultaneously:
- Nano Banana Pro — Often best for subject integrity
- SeedDream 4.5 — Great for photorealistic lighting
- SeedDream 4 — Sometimes produces more dramatic results
- GPT Image 1.5 — Good for creative compositions
Then pick the best result from each model for your final video edit.
Frame-to-Frame Video Control
When using Seedance 1.5 Pro, you can control the video by providing:
- Single starting frame — The AI generates motion from this frame
- Start and end frames — The AI interpolates between them
The second method gives you much more control over the final result.
Model Comparison: Which AI to Use When
- Initial compositing → Nano Banana Pro (Best subject preservation)
- Dramatic lighting → SeedDream 4.5 (Superior photorealistic rendering)
- Creative perspectives → GPT Image 1.5 (Handles unusual angles well)
- Component isolation → SeedDream 4 (Clean edge detection)
- Video generation → Seedance 1.5 Pro (Cinematic motion with frame control)
Final Thoughts
What we just built would have required:
- 3D modeling skills (Blender, Maya, or Cinema 4D)
- Texturing and material setup
- Lighting and rendering knowledge
- Motion graphics and animation expertise
- Hours (or days) of render time
With AI, the same cinematic quality takes a fraction of the effort. The key is understanding how to communicate with these models through detailed, structured prompts.
Key Takeaways
- Be explicit about subject integrity — AI will modify your product unless you tell it not to
- Use multi-image referencing — Separate your subject from your environment
- Describe camera behavior — "Gimbal-stabilized," "smooth easing," "no jitter"
- Use frame-to-frame control — For precise video generation
- Generate multiple options — Different models have different strengths
Ready to Create Your Own?
Head to Masonry and start building your cinematic AI videos today. The platform gives you access to all the models we used in this guide, plus a workspace to organize your generations.


