music-cover

This documentation is valid for the following list of our models:

  • minimax/music-cover

An audio-to-audio music generation model that creates cover versions from reference audio. Supports style transformation while preserving the original structure and melody.

How to Make a Call

Step-by-Step Instructions

Generating an audio using this model involves sequentially calling two endpoints:

  • The first one is for creating and sending a video generation task to the server (returns a generation ID).

  • The second one is for requesting the generated video from the server using the generation ID received from the first endpoint.

Below, you can find two corresponding API schemas and examples for both endpoint calls.


If you want to learn how to call AI models via API from the very basics, feel free to use our Quickstart guide.

API Schemas

Generate music sample

This endpoint generates a music piece based on the prompt (which includes style instructions) and the provided lyrics. It returns a generation task ID, its status, and related metadata.

post
Body
modelstring · enumRequiredPossible values:
promptstring · min: 10 · max: 2000Required

A description of the music, specifying style, mood, and scenario. Length: 10–2000 characters.

reference_audio_urlstring · uriRequired

A URL or a Base64-encoded of the reference audio. Reference audio constraints:

  • Duration: 6 seconds to 6 minutes
  • Size: max 50 MB
  • Format: common audio formats (mp3, wav, flac, etc.)
lyricsstring · min: 10 · max: 3000Optional
Responses
200Success
application/json
idstringRequired

The ID of the generated audio.

Example: 60ac7c34-3224-4b14-8e7d-0aa0db708325
statusstring · enumRequired

The current status of the generation task.

Example: completedPossible values:
post
/v2/generate/audio
200Success

Retrieve the generated music sample from the server

After sending a request for music generation, this task is added to the queue. This endpoint lets you check the status of a audio generation task using its id, obtained from the endpoint described above. If the video generation task status is complete, the response will include the final result — with the generated audio URL and additional metadata.

get
Authorizations
AuthorizationstringRequired

Bearer key

Query parameters
generation_idstringRequiredExample: <REPLACE_WITH_YOUR_GENERATION_ID>
Responses
200Success
application/json
idstringRequired

The ID of the generated audio.

Example: 60ac7c34-3224-4b14-8e7d-0aa0db708325
statusstring · enumRequired

The current status of the generation task.

Example: completedPossible values:
get
/v2/generate/audio
200Success

Quick Code Examples

This model allows you to modify both the arrangement and the lyrics of a track. While both can be updated in a single request, we demonstrate them as separate examples to keep the process more controlled.

Arrangement Replacement

Generate a new musical arrangement while preserving the original melody and lyrics of the reference audio file.

Response

Listen to the track we generated.

Replacing the Lyrics

Generate a version with new lyrics while preserving the original melody, structure, and arrangement of the reference audio file.

Response

Listen to the track we generated.

Last updated

Was this helpful?