This documentation is valid for the following list of our models:
minimax-music
Model Overview
An advanced AI model that generates diverse high-quality audio compositions by analyzing and reproducing musical patterns, rhythms, and vocal styles from the reference track. Refine the process using a text prompt.
Setup your API Key
If you donโt have an API key for the AI/ML API yet, feel free to use our Quickstart guide.
Submit a request
API Schema
Quick Code Example
Here is an example of generation an audio file based on a sample and a prompt using the music model minimax-music.
Full example explanation
As an example, we will generate a song using the popular minimax-music model from the Chinese company MiniMax. As you can verify in its API Reference, this model accepts an audio sample as inputโextracting information about its vocals and instruments for use in the generation processโalong with a text prompt where we can provide lyrics for our song.
We used a publicly available sample from royalty-free sample database and generated some lyrics in Chat GPT:
Side by side, through thick and thin,With a laugh, we always win.Storms may come, but we stay true,Friends foreverโme and you!
To turn this into a model-friendly prompt (as a single string), we added hash symbols and line breaks.
'''\ ##Side by side, through thick and thin, \n\nWith a laugh, we always win. \n\n Storms may come, but we stay true, \n\nFriends foreverโme and you!##\ '''
A notable feature of minimax-music model is that sample uploading/voice analysis + music generation and retrieving the final audio file from the server are done through separate API calls. (AIML API tokens are only consumed during the first stepโi.e., the actual music generation.)
You can insert the contents of each of the two code blocks into a separate Python file in your preferred development environment (or, for example, place each part in a separate cell in Jupyter Notebook). Replace <YOUR_API_KEY> in both fragments with the AIML API Key obtained from your account.
Next, run the first code block. If everything is set up correctly, you will see the following line in the program output (the specific numbers, of course, will vary):
This indicates that the file upload and our generation has been queued on the server (which took 4.5 seconds in our case).
Now, copy this id value (without quotation marks) and insert it into the second code block, replacing <GENERATION_ID>. Now, we can execute the second code block to get our song from the server.
Processing the request on the server may take some time (usually less than a minute). If the requested file is not yet ready, the output will display the corresponding status. Try waiting a bit and rerun the second code block. (If you're comfortable with coding, you can modify the script to perform this request inside a loop.)
In our case, after three reruns of the second code block (waiting a total of about 20 seconds), we saw the following output: