Image-to-video is a mode where the model takes your existing photo as the first frame, then builds out the next 5 seconds of motion. It's way more predictable than text-to-video: you know what's in the shot beforehand.

Gemini Veo 3.1 is one of the best models for this task: it keeps faces intact, details don't get blurry, and it understands object physics. Here's how to use it.

1Pick Your Source Photo

Works well for: landscapes (water, clouds, foliage), portraits with clear emotion, product shots, cityscapes. Not so good for: small crowds, text screenshots, very dark frames. Ideal size: 1024×1024 or larger, 16:9 or 9:16 aspect ratio.

2Upload and Describe the Motion

In the bot: «🎬 Video» → «Veo 3.1» → «Image-to-video». Attach your photo and describe in the caption what you want to animate. The more specific, the better.

Slow wind moves the woman's hair gently to the right, her eyes blink naturally, subtle smile forms, soft shift of light from clouds passing overhead, camera stays still

3Control the Camera

Veo understands three camera movement types: static camera (still shot), slow zoom in/out (zoom in/out), pan left/right (horizontal shift). For faces, a static camera is usually best — otherwise, the model distorts features.

Static camera. The cat in the photo slowly turns its head from left toward the lens, ears twitching, eyes blinking once, subtle tail movement

4Suggest the Physics

Sometimes Veo isn't sure what should move and what shouldn't. Tell it: only the leaves move, branches and trunk stay still, the woman is the only moving subject, background remains static. This prevents a blurry background.

5Didn't Get It Right? Try This.

Veo 3.1 supports seed — a number that locks in a specific motion "variant." If your first version is almost there but one thing's off, save the seed and regenerate with a tweaked prompt. That saves you credits.

What's Next

Image-to-video is the most predictable and cheapest way to get a working clip. It's especially great for social media: one beautiful photo + Veo = a lively Reel.

Ready to try it? 20 free credits are enough for 3–5 generations.

Open @quantium_aibot →

Related Tutorials