Wan 2.1 (Text-to-Video)
This documentation applies to the following model:
wan/v2.1/1.3b/text-to-video
Overview
Wan 2.1 is a video foundation model designed for generative video tasks. This variant supports Text-to-Video (T2V) generation and is built to deliver high-quality output with good computational efficiency.
Key Features:
Visual text generation: Generates text in both Chinese and English within videos.
Output quality: Produces videos at resolutions up to 720P with a frame rate of approximately 16 FPS.
Set Up Your API Key
If you don’t have an API key for the AI/ML API yet, feel free to use our Quickstart guide.
How to Make a Call
Generating a video with this model requires calling two endpoints in sequence:
The first creates a video generation task, sends it to the server, and returns a generation ID.
The second requests the generated video from the server using the generation ID returned by the first endpoint.
Below, you can find two corresponding API schemas and examples for both endpoint calls.
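The two-step flow described above can be sketched as a submit-then-poll loop. This is a minimal illustration rather than the official client: the `submit` and `fetch` callables stand in for the two endpoint calls, and the response field names (`id`, `status`) and status values (`completed`, `failed`) are assumptions that should be checked against the API schemas.

```python
import time
from typing import Any, Callable, Dict

MODEL_ID = "wan/v2.1/1.3b/text-to-video"  # model ID listed on this page


def generate_video(
    submit: Callable[[Dict[str, Any]], Dict[str, Any]],
    fetch: Callable[[str], Dict[str, Any]],
    prompt: str,
    poll_interval: float = 10.0,
    max_polls: int = 60,
    sleep: Callable[[float], None] = time.sleep,
) -> Dict[str, Any]:
    """Submit a generation task, then poll until it reaches a terminal state."""
    # Step 1: create the task; the response is expected to carry a generation ID.
    task = submit({"model": MODEL_ID, "prompt": prompt})
    generation_id = task["id"]  # assumed field name

    # Step 2: request the result repeatedly until it is ready.
    for _ in range(max_polls):
        result = fetch(generation_id)
        if result.get("status") in ("completed", "failed"):  # assumed status values
            return result
        sleep(poll_interval)

    raise TimeoutError(
        f"generation {generation_id} not finished after {max_polls} polls"
    )
```

Passing the two calls in as functions keeps the polling logic independent of the HTTP library, so it can be exercised with stubs before being wired to the real endpoints.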
API Schemas
Video Generation
This endpoint creates a video generation task, sends it to the server, and returns a generation ID.
Fetch the video
This endpoint lets you request the generated video from the server using the generation ID received from the first endpoint.
Examples
Video generation
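A minimal sketch of building the task-creation request with Python's standard library. The URL below is a placeholder, not the real endpoint; the Bearer authorization scheme and the `model` and `prompt` field names are also assumptions, so substitute the values from the API schema.

```python
import json
import urllib.request

# Placeholder only: replace with the endpoint URL from the API schema.
GENERATION_URL = "https://api.example.com/video/generation"


def build_generation_request(
    api_key: str, prompt: str, url: str = GENERATION_URL
) -> urllib.request.Request:
    """Build (but do not send) the POST request that creates a generation task."""
    body = json.dumps(
        {"model": "wan/v2.1/1.3b/text-to-video", "prompt": prompt}
    ).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
            "Content-Type": "application/json",
        },
    )
```

The request can then be sent with `urllib.request.urlopen(request)`; the JSON response is expected to contain the generation ID needed for the second call.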
Fetch the video
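A minimal sketch of building the fetch request with Python's standard library. The URL is a placeholder, and the `generation_id` query parameter name and the Bearer authorization scheme are assumptions; check them against the API schema before use.

```python
import urllib.parse
import urllib.request

# Placeholder only: replace with the endpoint URL from the API schema.
FETCH_URL = "https://api.example.com/video/generation"


def build_fetch_request(
    api_key: str, generation_id: str, url: str = FETCH_URL
) -> urllib.request.Request:
    """Build (but do not send) the GET request that retrieves a generation."""
    # Assumed parameter name; urlencode escapes the ID safely for use in a URL.
    query = urllib.parse.urlencode({"generation_id": generation_id})
    return urllib.request.Request(
        f"{url}?{query}",
        method="GET",
        headers={"Authorization": f"Bearer {api_key}"},  # assumed auth scheme
    )
```

Because generation takes time, this request typically needs to be repeated at intervals until the response reports that the video is ready.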