music-01

This documentation is valid for the following list of our models:

  • music-01

An advanced AI model that generates diverse, high-quality audio compositions by analyzing and reproducing the musical patterns, rhythms, and vocal styles of a reference track. A text prompt can be used to refine the result.

Set up your API key

If you don’t have an API key for the AI/ML API yet, follow our Quickstart guide to get one.

API Schemas

Upload a reference sample

This endpoint uploads a reference music piece to the server, analyzes it, and returns identifiers for the voice and/or instrumental patterns to use later.

POST
Authorizations
Authorization · string · Required

Bearer key

Body
file · string · binary · Required

Local path to the audio file. WAV and MP3 formats are supported. The audio must be longer than 10 seconds and no longer than 10 minutes.

purpose · string · enum · Required
  1. If purpose is song:
     • You need to upload a music file containing both acapella (vocals) and accompaniment.
     • The acapella must be in singing form; normal speech is not supported.
     • Outputs: voice_id and instrumental_id.
  2. If purpose is voice:
     • You need to upload a file containing only acapella in singing form (normal speech audio is not supported).
     • Output: voice_id.
  3. If purpose is instrumental:
     • You need to upload a file containing only accompaniment.
     • Output: instrumental_id.
Possible values: song, voice, instrumental
Responses
default
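
The constraints on file and purpose can be checked locally before uploading. The following is a minimal sketch, not part of the official API: the helper names wav_duration_seconds and check_upload are hypothetical, and the duration check covers WAV files only, since the Python standard library cannot read MP3 durations.

```python
import wave

# Identifiers each purpose returns, as listed in the parameter description.
EXPECTED_IDS = {
    "song": ("voice_id", "instrumental_id"),
    "voice": ("voice_id",),
    "instrumental": ("instrumental_id",),
}

MIN_SECONDS = 10        # the audio must be longer than this
MAX_SECONDS = 10 * 60   # and no longer than this


def wav_duration_seconds(path):
    """Return the duration of a WAV file in seconds (hypothetical helper)."""
    with wave.open(path, "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def check_upload(purpose, duration_seconds):
    """Validate the inputs and return the identifier names the upload should yield."""
    if purpose not in EXPECTED_IDS:
        raise ValueError(f"purpose must be one of {sorted(EXPECTED_IDS)}, got {purpose!r}")
    if not (MIN_SECONDS < duration_seconds <= MAX_SECONDS):
        raise ValueError("audio must be longer than 10 seconds and no longer than 10 minutes")
    return EXPECTED_IDS[purpose]
```

For example, check_upload("voice", wav_duration_seconds("vocals.wav")) returns ("voice_id",) for a valid file and raises ValueError otherwise, which avoids a round trip to the server for inputs that would be rejected.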

Generate music sample

This endpoint generates a new music piece based on the voice and/or instrumental pattern identifiers returned by the upload endpoint above. Generation typically takes 50-60 seconds but can take longer.

POST
Authorizations
Authorization · string · Required

Bearer key

Body
Responses
default
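
A generation request might be assembled as follows. This is a sketch under stated assumptions: GENERATE_URL is a placeholder rather than the real endpoint address, and the body field names (model, prompt, voice_id, instrumental_id) are hypothetical, because the body schema is not reproduced in this section. Only the Bearer authorization scheme and the model name come from the text above.

```python
import json
import urllib.request

# Placeholder: substitute the generation endpoint URL from the API reference.
GENERATE_URL = "https://api.example.com/music/generate"


def build_generate_request(api_key, prompt, voice_id=None, instrumental_id=None):
    """Build (but do not send) a POST request for the generation endpoint."""
    if voice_id is None and instrumental_id is None:
        raise ValueError("pass at least one of voice_id / instrumental_id")
    # Hypothetical body field names; check the actual schema before use.
    payload = {"model": "music-01", "prompt": prompt}
    if voice_id is not None:
        payload["voice_id"] = voice_id
    if instrumental_id is not None:
        payload["instrumental_id"] = instrumental_id
    return urllib.request.Request(
        GENERATE_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request, timeout=180):
    """Send the request and return the raw response body as bytes.

    The timeout leaves headroom over the typical 50-60 second generation time.
    """
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read()
```

Separating request construction from sending keeps the payload easy to inspect, and the generous timeout reflects that generation can run past a minute.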

Quick Code Example

Here is an example of generating an audio file based on a sample and a prompt using the music-01 model.
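
The overall flow, upload a sample and then generate from the returned identifiers, can be sketched as below. The function generate_from_sample and the two callables it accepts are hypothetical: upload and generate stand for whatever code wraps the two HTTP endpoints, and the upload result is assumed to be a dictionary keyed by voice_id and instrumental_id. Stand-in callables are used here so the flow can be followed without network access.

```python
def generate_from_sample(upload, generate, sample_path, prompt, purpose="song"):
    """Upload a reference sample, then generate a track from the returned identifiers.

    `upload(path, purpose)` and `generate(prompt, voice_id, instrumental_id)` are
    hypothetical callables wrapping the two endpoints described in this document.
    """
    ids = upload(sample_path, purpose)
    # A voice-only or instrumental-only upload yields just one identifier,
    # so the missing one is passed on as None.
    return generate(prompt, ids.get("voice_id"), ids.get("instrumental_id"))


# Stand-in callables so the flow can be exercised without network access.
def fake_upload(path, purpose):
    return {"voice_id": "voice-123", "instrumental_id": "instr-456"}


def fake_generate(prompt, voice_id, instrumental_id):
    return {"prompt": prompt, "voice_id": voice_id, "instrumental_id": instrumental_id}


result = generate_from_sample(fake_upload, fake_generate, "sample.mp3", "A mellow chorus")
```

In real use, the two stand-ins would be replaced by functions that call the upload and generation endpoints with your API key.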

Response

