music-01
An advanced AI model that generates diverse, high-quality audio compositions by analyzing and reproducing the musical patterns, rhythms, and vocal styles of a reference track. You can steer the generation with a text prompt.
Set up your API key
If you don’t have an API key for the AI/ML API yet, feel free to use our Quickstart guide.
API Schemas
Upload a reference sample
This endpoint uploads a reference music piece to the server, analyzes it, and returns identifiers for the voice and/or instrumental patterns to use later.
Bearer key
Local path to the audio file. WAV and MP3 formats are supported. The audio must be longer than 10 seconds and no longer than 10 minutes.
- If purpose is song:
  - You need to upload a music file containing both acapella (vocals) and accompaniment.
  - The acapella must be in singing form; normal speech is not supported.
  - Outputs: voice_id and instrumental_id.
- If purpose is voice:
  - You need to upload a file containing only acapella in singing form (normal speech audio is not supported).
  - Output: voice_id.
- If purpose is instrumental:
  - You need to upload a file containing only accompaniment.
  - Output: instrumental_id.
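The rules above can be checked locally before uploading. The following is a minimal sketch, not part of the API: the helper name `check_reference` is hypothetical, and the caller is assumed to already know the clip duration (for example, from an audio library of their choice).

```python
from pathlib import Path

# Identifiers the upload endpoint returns for each purpose, as listed above.
EXPECTED_OUTPUTS = {
    "song": ("voice_id", "instrumental_id"),
    "voice": ("voice_id",),
    "instrumental": ("instrumental_id",),
}

SUPPORTED_EXTENSIONS = {".wav", ".mp3"}
MIN_SECONDS = 10        # duration must be strictly longer than this
MAX_SECONDS = 10 * 60   # duration must not exceed this


def check_reference(path, duration_seconds, purpose):
    """Hypothetical local pre-check; returns the identifier names to expect."""
    if purpose not in EXPECTED_OUTPUTS:
        raise ValueError(f"unknown purpose: {purpose!r}")
    if Path(path).suffix.lower() not in SUPPORTED_EXTENSIONS:
        raise ValueError("only WAV and MP3 files are supported")
    if not (MIN_SECONDS < duration_seconds <= MAX_SECONDS):
        raise ValueError("duration must be over 10 seconds and at most 10 minutes")
    return EXPECTED_OUTPUTS[purpose]
```

This only checks the file extension and the duration you pass in. It cannot tell whether the audio actually contains singing or accompaniment, so the server may still reject a file that passes.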
Generate music sample
This endpoint generates a new music piece based on the voice and/or instrumental pattern identifiers returned by the upload endpoint above. Generation typically completes in 50-60 seconds but can take a little longer.
Bearer key
Quick Code Example
Here is an example of generating an audio file from a sample and a prompt using the music-01 model.
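As a rough sketch of the generation step, the code below builds and sends a request using only the Python standard library. Several details are assumptions rather than documented facts: `GENERATE_URL` is a placeholder, the JSON field names (`model`, `prompt`, `voice_id`, `instrumental_id`) are illustrative, and the response is assumed to be JSON. Check the API schema for the real endpoint and field names before using it.

```python
import json
import urllib.request

# Assumption: placeholder URL, not the real endpoint.
GENERATE_URL = "https://api.example.com/music/generate"


def build_generate_request(api_key, prompt, voice_id=None, instrumental_id=None):
    """Build (but do not send) a generation request for music-01."""
    if voice_id is None and instrumental_id is None:
        raise ValueError("provide voice_id, instrumental_id, or both")
    # Assumption: these JSON field names are illustrative only.
    payload = {"model": "music-01", "prompt": prompt}
    if voice_id is not None:
        payload["voice_id"] = voice_id
    if instrumental_id is not None:
        payload["instrumental_id"] = instrumental_id
    return urllib.request.Request(
        GENERATE_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",  # bearer key authentication
            "Content-Type": "application/json",
        },
        method="POST",
    )


def generate(request, timeout_seconds=180):
    """Send the request; the generous timeout allows for generation
    that takes a minute or somewhat longer."""
    with urllib.request.urlopen(request, timeout=timeout_seconds) as response:
        # Assumption: the endpoint answers with a JSON body.
        return json.loads(response.read().decode("utf-8"))
```

A call might look like `generate(build_generate_request(api_key, "warm acoustic ballad", voice_id=voice_id))`, where `voice_id` comes from the upload step. Building and sending are kept separate so the request can be inspected without calling the network.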