v5.5/image-to-video
The model generates high-quality video clips from text combined with an image, delivering smooth motion and sharp visual detail.
Set Up Your API Key
If you don’t have an API key for the AI/ML API yet, feel free to use our Quickstart guide.
How to Make a Call
API Schemas
Create a video generation task and send it to the server
This endpoint creates a video generation task, sends it to the server, and returns a generation ID. In the basic setup, you only need a reference image and a prompt.
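As a rough illustration, the request body for such a task could be assembled as below. This is a sketch under assumptions, not the API's confirmed schema: the field names (`model`, `prompt`, `image_url`, `resolution`, `duration`, `negative_prompt`, `seed`) and the `MODEL_ID` value are hypothetical placeholders chosen to mirror the parameter descriptions in this section. The defaults (720p, 5 seconds) and the 1080p restriction on 8-second videos are taken from those descriptions.

```python
import json

# Placeholder taken from the page title; the real model ID may differ
# (for example, it may carry a vendor prefix).
MODEL_ID = "v5.5/image-to-video"


def build_task_payload(prompt, image_url, resolution="720p", duration=5,
                       negative_prompt=None, seed=None):
    """Assemble a JSON-serializable body for a video generation task.

    All field names are assumptions for illustration only.
    """
    if not prompt or not image_url:
        raise ValueError("both a prompt and a reference image URL are required")
    # Restriction stated in the parameter descriptions:
    # the 1080p option does not support 8-second videos.
    if resolution == "1080p" and duration == 8:
        raise ValueError("1080p does not support 8-second videos")
    payload = {
        "model": MODEL_ID,
        "prompt": prompt,
        "image_url": image_url,
        "resolution": resolution,
        "duration": duration,
    }
    # Optional parameters are included only when the caller sets them.
    if negative_prompt is not None:
        payload["negative_prompt"] = negative_prompt
    if seed is not None:
        payload["seed"] = seed
    return payload


body = build_task_payload(
    prompt="Mona Lisa puts on glasses with her hands.",
    image_url="https://example.com/mona-lisa.jpg",  # placeholder URL
)
print(json.dumps(body, indent=2))
```

Checking the resolution and duration combination on the client avoids submitting a task the server is documented to reject.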
The text description of the scene, subject, or action to generate in the video.
URL of the image to be used as the first frame of the video.
An enumeration where the short side of the video frame determines the resolution.
Default: 720p.
The output video length in seconds. The 1080p quality option does not support 8-second videos.
Default: 5.
The description of elements to avoid in the generated video.
The style of the generated video.
Varying the seed integer produces different results for otherwise identical request parameters. Using the same value for an identical request will produce similar results. If unspecified, a random number is chosen.
Enable audio generation.
- true: Audio on.
- false: Audio off.
Default: false.
Enable multi-clip generation with dynamic camera changes.
- true: Multi-clip.
- false: Single-clip.
Default: false.
Prompt reasoning enhancement mode.
- "enabled": Turn on prompt optimization.
- "disabled": Turn off prompt optimization.
- "auto" or omitted: Let the model decide automatically.
Default: enabled.
Retrieve the generated video from the server
After you send a video generation request, the task is added to the queue. This endpoint lets you check the status of a video generation task using its generation ID, obtained from the endpoint described above.
If the video generation task status is complete, the response will include the final result — with the generated video URL and additional metadata.
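The status check described above could be interpreted on the client roughly as follows. This is only a sketch: the response shape (`status`, `video.url`, `error`) and the status strings `"completed"` and `"error"` are assumptions for illustration, since the response schema is not shown in this section.

```python
def extract_video_url(task):
    """Return the video URL if the task is complete, otherwise None.

    `task` is the decoded JSON body of a status response. The keys and
    status strings used here are hypothetical.
    """
    status = task.get("status")
    if status == "error":
        raise RuntimeError(f"video generation failed: {task.get('error')}")
    if status != "completed":
        # Any other status is treated as "still queued or in progress".
        return None
    return task["video"]["url"]


# Example with hand-written responses (not real API output).
print(extract_video_url({"id": "demo-id", "status": "queued"}))
print(extract_video_url({
    "id": "demo-id",
    "status": "completed",
    "video": {"url": "https://example.com/video.mp4"},
}))
```

Returning `None` for unfinished tasks lets the caller decide how long to keep waiting, while a failed task raises immediately instead of being retried.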
Successfully generated video
Full Example: Generating and Retrieving the Video From the Server
The code below creates a video generation task, then automatically polls the server every 15 seconds until it finally receives the video URL.
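A minimal sketch of this create-then-poll flow is shown here with the HTTP calls abstracted behind two caller-supplied functions, so the loop runs without network access. The names `create_task` and `fetch_status`, the response keys (`id`, `status`, `video.url`), and the status strings are all assumptions; only the 15-second polling interval comes from the description above. The 600-second timeout is an arbitrary safeguard added for the sketch.

```python
import time


def generate_and_wait(create_task, fetch_status, payload,
                      interval=15.0, timeout=600.0, sleep=time.sleep):
    """Create a task, then poll until the video URL is available.

    create_task(payload) -> dict with an "id" key (hypothetical shape)
    fetch_status(generation_id) -> dict with a "status" key (hypothetical)
    """
    generation_id = create_task(payload)["id"]
    waited = 0.0
    while True:
        task = fetch_status(generation_id)
        status = task.get("status")
        if status == "completed":
            return task["video"]["url"]
        if status == "error":
            raise RuntimeError(f"video generation failed: {task.get('error')}")
        if waited >= timeout:
            raise TimeoutError(f"no result after {waited:.0f} s")
        sleep(interval)  # wait before asking the server again
        waited += interval


# Demo with canned responses in place of real HTTP calls.
_responses = iter([
    {"status": "queued"},
    {"status": "generating"},
    {"status": "completed", "video": {"url": "https://example.com/video.mp4"}},
])
naps = []  # records requested sleep durations instead of actually sleeping

url = generate_and_wait(
    create_task=lambda payload: {"id": "demo-id"},
    fetch_status=lambda generation_id: next(_responses),
    payload={"prompt": "Mona Lisa puts on glasses with her hands."},
    sleep=naps.append,
)
print(url)
```

In real use, `create_task` and `fetch_status` would wrap the two endpoints described on this page with your HTTP client and API key; injecting them keeps the polling logic easy to test.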
Processing time: ~50 s.
Original: 864x1280
Low-res GIF preview:

"Mona Lisa puts on glasses with her hands."