v1-pro/image-to-video

This documentation applies to the following model:

  • kling-video/v1/pro/image-to-video

Model Overview

This model transforms static images into dynamic video clips. It offers more advanced camera controls than the v1 Standard model, including tilt, pan, zoom, and roll movements. It produces sharper, more detailed videos with more stable camera movement, and its enhanced animation makes elements such as water flow and character movement appear more natural.

Set Up Your API Key

If you don’t have an API key for the AI/ML API yet, feel free to use our Quickstart guide.

How to Make a Call

Step-by-Step Instructions

Generating a video using this model involves sequentially calling two endpoints:

  • The first one is for creating and sending a video generation task to the server (returns a generation ID).

  • The second one is for requesting the generated video from the server using the generation ID received from the first endpoint.

Below, you can find both corresponding API schemas.

API Schemas

Create a video generation task and send it to the server

POST /v2/generate/video/kling/generation

Authorization: Bearer <YOUR_AIMLAPI_KEY>

Body

model · enum · Required

tail_image_url · string (uri) · Optional

URL of the image to be used as the last frame of the video.

static_mask_url · string (uri) · Optional

URL of the mask image that defines the static brush area (a mask created by the user with the motion brush).

image_url · string (uri) · Required

A direct link to an online image or a Base64-encoded local image that will serve as the visual base or the first frame for the video.

ratio · string (enum) · Optional · Deprecated

aspect_ratio · string (enum) · Optional · Deprecated

prompt · string · Required

The text description of the scene, subject, or action to generate in the video.

duration · integer (enum) · Optional

The length of the output video in seconds.

negative_prompt · string · Optional

The description of elements to avoid in the generated video.

cfg_scale · number (max: 1) · Optional

The CFG (Classifier Free Guidance) scale controls how closely the model follows your prompt.
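The constraints in the parameter list above can be checked on the client before a request is sent. The following is a minimal sketch, not part of the API: the helper name `validate_payload` is hypothetical, and it checks only what this page states (three required fields and the `cfg_scale` maximum of 1).

```python
def validate_payload(payload: dict) -> list:
    """Return a list of problems found in a task-creation body (empty if none).

    Hypothetical client-side helper; it encodes only the rules listed in
    the schema above and does not contact the server.
    """
    problems = []

    # Fields marked "Required" in the schema.
    for field in ("model", "image_url", "prompt"):
        if not payload.get(field):
            problems.append(f"missing required field: {field}")

    # The schema lists a maximum of 1 for cfg_scale; no minimum is stated,
    # so none is enforced here.
    cfg_scale = payload.get("cfg_scale")
    if cfg_scale is not None and cfg_scale > 1:
        problems.append("cfg_scale must not exceed 1")

    return problems


if __name__ == "__main__":
    body = {
        "model": "kling-video/v1/pro/image-to-video",
        "image_url": "https://example.com",
        "prompt": "text",
        "cfg_scale": 0.5,
    }
    print(validate_payload(body))
```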

Responses

201 Success

Example request
POST /v2/generate/video/kling/generation HTTP/1.1
Host: api.aimlapi.com
Authorization: Bearer <YOUR_AIMLAPI_KEY>
Content-Type: application/json
Accept: */*
Content-Length: 310

{
  "model": "kling-video/v1/pro/image-to-video",
  "tail_image_url": "https://example.com",
  "static_mask_url": "https://example.com",
  "dynamic_masks": [
    {
      "mask_url": "https://example.com",
      "trajectories": [
        {
          "x": 1,
          "y": 1
        }
      ]
    }
  ],
  "image_url": "https://example.com",
  "prompt": "text",
  "duration": 5,
  "negative_prompt": "text",
  "cfg_scale": 1
}
201 Success

No content
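The same request can be built in Python. This is a minimal sketch using only the standard library; the function names `build_create_request` and `create_task` are hypothetical, and `create_task` assumes the response body is JSON. The name of the field that carries the generation ID is not shown on this page, so the sketch returns the parsed body as-is.

```python
import json
import urllib.request

# Endpoint taken from the HTTP example above.
API_URL = "https://api.aimlapi.com/v2/generate/video/kling/generation"


def build_create_request(api_key, image_url, prompt, duration=5):
    """Build (but do not send) the POST request that creates a generation task."""
    body = {
        "model": "kling-video/v1/pro/image-to-video",
        "image_url": image_url,
        "prompt": prompt,
        "duration": duration,  # 5 is the value used in the example above
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def create_task(api_key, image_url, prompt):
    """Send the request and return the parsed response (assumed to be JSON)."""
    request = build_create_request(api_key, image_url, prompt)
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))
```

Keeping request construction separate from sending makes it possible to inspect the headers and body without making a network call.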

Retrieve the generated video from the server

After you send a video generation request, the task is added to a queue. Depending on the service load, generation may finish within seconds or take longer.

GET /v2/generate/video/kling/generation

Authorization: Bearer <YOUR_AIMLAPI_KEY>

Query parameters

generation_id · string · Required

Responses

200 Success

Example request
GET /v2/generate/video/kling/generation?generation_id=<GENERATION_ID> HTTP/1.1
Host: api.aimlapi.com
Authorization: Bearer <YOUR_AIMLAPI_KEY>
Accept: */*
200 Success

No content
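Because the task is queued, a client typically has to request the result more than once. The sketch below shows one way to poll, under stated assumptions: `result_url` and `poll_generation` are hypothetical helper names, and since this page does not document the response body, the caller supplies both the `fetch` function (which performs the authorized GET and returns the parsed response) and the `is_done` check.

```python
import time
import urllib.parse

# Endpoint taken from the HTTP example above.
RESULT_URL = "https://api.aimlapi.com/v2/generate/video/kling/generation"


def result_url(generation_id):
    """Return the retrieval URL with generation_id as a query parameter."""
    query = urllib.parse.urlencode({"generation_id": generation_id})
    return f"{RESULT_URL}?{query}"


def poll_generation(fetch, generation_id, is_done,
                    interval=10.0, max_attempts=60, sleep=time.sleep):
    """Call fetch(url) until is_done(result) is true, then return the result.

    fetch and is_done are supplied by the caller because the response
    format is not documented here. Raises TimeoutError after max_attempts.
    """
    url = result_url(generation_id)
    for attempt in range(max_attempts):
        result = fetch(url)
        if is_done(result):
            return result
        # Wait before the next attempt, but not after the final one.
        if attempt < max_attempts - 1:
            sleep(interval)
    raise TimeoutError(f"generation {generation_id} not ready after {max_attempts} attempts")
```

The interval and attempt limit are illustrative defaults, not values recommended by the service.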
