Wan 2.6 (Text-to-Video)
This model enables text-to-video generation with consistent characters, synchronized audio, and cinematic multi-shot storytelling in a single workflow. Compared to earlier versions, Wan 2.6 delivers stronger instruction following, higher visual fidelity, and dramatically improved sound generation.
Setup your API Key
If you don’t have an API key for the AI/ML API yet, feel free to use our Quickstart guide.
How to Make a Call
API Schemas
Video Generation
This endpoint creates and sends a video generation task to the server — and returns a generation ID.
The text description of the scene, subject, or action to generate in the video.
The URL of the audio file. The model will use this audio to generate the video.
The aspect ratio of the generated video.
16:9Possible values: An enumeration where the short side of the video frame determines the resolution.
1080pPossible values: The length of the output video in seconds.
10Possible values: The description of elements to avoid in the generated video.
Specifies the shot type of the generated video, that is, whether the video consists of a single continuous shot or multiple switched shots. This parameter takes effect only when "prompt_extend" is set to 'true':
- single: (default) Outputs a single-shot video.
- multi: Outputs a multi-shot video.
singlePossible values: Specifies whether to automatically add audio to the generated video. This parameter takes effect only when 'audio_url' is not provided.
trueVarying the seed integer is a way to get different results for the same other request parameters. Using the same value for an identical request will produce similar results. If unspecified, a random number is chosen.
Whether to enable prompt expansion.
trueFetch the video
After sending a request for video generation, this task is added to the queue. This endpoint lets you check the status of a video generation task using its id, obtained from the endpoint described above.
If the video generation task status is completed, the response will include the final result — with the generated video URL and additional metadata.
Bearer key
<REPLACE_WITH_YOUR_GENERATION_ID>Full Example: Generating and Retrieving the Video From the Server
The code below creates a video generation task, then automatically polls the server every 10 seconds until it finally receives the video URL.
Processing time: ~ 3 min 25 sec.
Generated video (1920x1080, with sound):
Last updated
Was this helpful?