Video Model
Grok Imagine
Aurora-powered video generation with native audio
Grok Imagine is xAI's video generation model powered by the Aurora engine. It supports text-to-video and image-to-video with native audio output, delivering cinematic clips at an unbeatable price point.
Capabilities
Best For
- ✓Videos with audio
- ✓Social media content
- ✓Creative clips
- ✓Budget-conscious creators
Not Best For
- ✗4K output
- ✗Long-form content
- ✗Ultra-detailed VFX
Known Limitations
- ⚠720p max resolution
- ⚠10-second maximum clips
- ⚠Economy-tier visual fidelity
See What's Possible
Real outputs from real prompts. Click to try them yourself.
“A tuxedo cat sitting at a grand piano in a dimly lit jazz club, paws moving across keys, warm amber spotlight, cigarette smoke catching light beams, audio of gentle jazz piano, 720p cinematic film noir”
Try this prompt“A lone dancer performing contemporary moves on a rain-soaked rooftop at dusk, city skyline behind, each step splashing in puddles, slow motion with audio of rain and thunder, moody blue-gray palette, cinematic”
Try this promptFeatures
720p output
up to 10s clips
native audio
Aurora engine
Pricing
Economy tier pricing