Stable Video Diffusion

Video Model

Stable Video Diffusion

Stable Video Diffusion: open-source video model by Stability AI, 576p, 4s clips

VideoStability AI·Updated 2026-05-12
Create with Stable Video Diffusion

Stable Video Diffusion is an open-source video generation model by Stability AI that produces clips up to 4 seconds at 576x1024 resolution. It supports text-to-video, image-to-video, and custom training workflows. Stable Video Diffusion is free to use locally or via API. On VivifyAll, it is positioned at economy-tier pricing with 5 base credits per generation.

What can Stable Video Diffusion do?

text-to-videoimage-to-videocustom-training

What is Stable Video Diffusion best for?

  • Development
  • Custom workflows
  • Self-hosting
  • Research

What is Stable Video Diffusion not ideal for?

  • High-resolution output
  • Beginners
  • Commercial-grade production

What are Stable Video Diffusion's limitations?

  • Low default resolution (576x1024)
  • Maximum 4-second clips
  • Requires technical skill to self-host
  • Slower without GPU optimization

See What's Possible

Real outputs from real prompts. Click to try them yourself.

What features does Stable Video Diffusion offer?

576x1024 output

up to 4s clips

open-source

self-hostable

How much does Stable Video Diffusion cost?

Free and open-source, can be run locally or via API

Related Models

Frequently Asked Questions

Yes, Stable Video Diffusion is open-source and free to use.
Yes, you can run the model locally with adequate GPU resources.
Default resolution is 576x1024, can be scaled with custom training.
Yes, the open-source license allows commercial use with proper attribution.

Try Stable Video Diffusion Now

Stable Video Diffusion: open-source video model by Stability AI, 576p, 4s clips

Start Creating