music-01
Model Overview
An advanced AI model that generates diverse, high-quality audio compositions by analyzing and reproducing the musical patterns, rhythms, and vocal styles of a reference track. You can refine the generation with a text prompt.
Set up your API key
If you don’t have an API key for the AI/ML API yet, feel free to use our Quickstart guide.
API Schemas
Generate a music sample
This endpoint creates a music generation task, sends it to the server, and returns a generation ID and the task status.
Local path to the audio file. WAV and MP3 formats are supported. The audio must be longer than 10 seconds and no longer than 10 minutes.
- If purpose is song:
  - You need to upload a music file containing both acapella (vocals) and accompaniment.
  - The acapella must be in singing form; normal speech is not supported.
  - Outputs: voice_id and instrumental_id.
- If purpose is voice:
  - You need to upload a file containing only acapella in singing form (normal speech audio is not supported).
  - Output: voice_id.
- If purpose is instrumental:
  - You need to upload a file containing only accompaniment.
  - Output: instrumental_id.
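The rules above can be checked on the client before uploading. The following is a minimal sketch, not part of the API: the names `EXPECTED_OUTPUTS` and `check_upload` are hypothetical, the format check looks only at the file extension, and the caller is assumed to already know the audio duration in seconds.

```python
# Illustrative client-side check; all names here are hypothetical helpers.

# IDs the docs say each purpose returns.
EXPECTED_OUTPUTS = {
    "song": ("voice_id", "instrumental_id"),
    "voice": ("voice_id",),
    "instrumental": ("instrumental_id",),
}

MIN_SECONDS = 10        # duration must be longer than this
MAX_SECONDS = 10 * 60   # and no longer than this


def check_upload(path: str, purpose: str, duration_seconds: float) -> tuple:
    """Validate an upload against the documented rules.

    Returns the names of the IDs the response is expected to contain.
    """
    if purpose not in EXPECTED_OUTPUTS:
        raise ValueError(f"unknown purpose: {purpose!r}")
    # Assumption: the extension reflects the real format (WAV or MP3).
    if not path.lower().endswith((".wav", ".mp3")):
        raise ValueError("only WAV and MP3 files are supported")
    if not (MIN_SECONDS < duration_seconds <= MAX_SECONDS):
        raise ValueError("duration must be longer than 10 s and at most 10 minutes")
    return EXPECTED_OUTPUTS[purpose]
```

For example, `check_upload("reference.mp3", "song", 42.0)` returns `("voice_id", "instrumental_id")`, while a 10-second file is rejected because the duration must be strictly longer than 10 seconds.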
{
  "voice_id": "vocal-2025011003141025-d5ZEMxmp",
  "instrumental_id": "instrumental-2025011003141125-Akz9eWnD",
  "base_resp": {
    "status_code": 1,
    "status_msg": "text"
  }
}
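As a rough illustration of consuming a response shaped like the one above, the sketch below pulls out whichever IDs are present. The helper name `extract_ids` is hypothetical, and the code deliberately does not interpret `status_code`, since its meaning is not specified here.

```python
import json

# Sample body copied from the response example above.
sample = """
{
  "voice_id": "vocal-2025011003141025-d5ZEMxmp",
  "instrumental_id": "instrumental-2025011003141125-Akz9eWnD",
  "base_resp": {"status_code": 1, "status_msg": "text"}
}
"""


def extract_ids(response: dict) -> dict:
    """Return the voice/instrumental IDs present in the response (hypothetical helper)."""
    return {
        key: response[key]
        for key in ("voice_id", "instrumental_id")
        if response.get(key)  # skip keys that are missing or empty
    }


ids = extract_ids(json.loads(sample))
```

With purpose set to voice or instrumental, only one of the two keys is expected, so the returned dictionary would contain a single entry.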
Retrieve the generated music sample from the server
After you send a music generation request, the task is added to a queue. Depending on the service's load, generation usually completes in 50-60 seconds but can take longer.
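Because the task is queued, a client typically has to poll for the result. The sketch below shows one possible polling loop under stated assumptions: `fetch` stands in for whatever call retrieves the task, and `is_done` is a caller-supplied predicate, because the status values that mean "finished" are not defined in this section. Both names, and the default interval and timeout, are hypothetical.

```python
import time
from typing import Callable


def poll_until_done(
    fetch: Callable[[], dict],
    is_done: Callable[[dict], bool],
    interval: float = 10.0,
    timeout: float = 300.0,
    sleep: Callable[[float], None] = time.sleep,
) -> dict:
    """Call fetch() every `interval` seconds until is_done(result) is true.

    Raises TimeoutError if the next wait would exceed `timeout` seconds.
    """
    deadline = time.monotonic() + timeout
    while True:
        result = fetch()
        if is_done(result):
            return result
        # Stop rather than sleep past the deadline.
        if time.monotonic() + interval > deadline:
            raise TimeoutError("generation did not finish within the timeout")
        sleep(interval)
```

Given the 50-60 second estimate, an interval of around 10 seconds keeps the number of requests low without adding much latency; the `sleep` parameter exists only so the loop can be exercised in tests without real waiting.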
{
  "data": {
    "status": 1,
    "audio": "text"
  },
  "extra_info": {
    "audio_length": 1,
    "audio_size": 1,
    "audio_bitrate": 1,
    "audio_sample_rate": 1
  },
  "trace_id": "text",
  "base_resp": {
    "status_code": 1,
    "status_msg": "text"
  }
}