octave-2

This documentation applies to the following model:

  • hume/octave-2

An advanced text-to-speech model with improved emotional understanding, support for 11 languages, and sub-200 ms audio generation. It provides more reliable pronunciation of complex and uncommon inputs.

Set Up Your API Key

If you don’t have an API key for the AI/ML API yet, follow our Quickstart guide.

API Schema

POST
Authorizations
Authorization · string · Required

Bearer key

Body
model · enum · Required
Possible values:
text · string · min: 1 · max: 500000 · Required

The text content to be converted to speech.

voice · string · enum · Optional

Name of the voice to be used.

Default: Vince Douglas
Possible values:
format · string · enum · Optional

Audio output format. MP3 provides good compression and compatibility, PCM offers uncompressed high quality, and FLAC provides lossless compression.

Possible values:
Responses
POST /v1/tts
201 Success

No content

Code Example
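The schema above can be turned into a request roughly as follows. This is a minimal sketch, not an official client. Several things are assumptions: the base URL `https://api.aimlapi.com`, the use of the listed model ID `hume/octave-2` as the `model` enum value, and the helper names `build_tts_payload` and `build_tts_request`, which are hypothetical. Only the `/v1/tts` path, the Bearer authorization, the field names, and the text length limits come from the schema. The accepted `format` values are not listed on this page, so the sketch leaves `format` unset unless the caller supplies one.

```python
import json
import urllib.request

# Assumption: base URL of the AI/ML API. Only the /v1/tts path is documented above.
BASE_URL = "https://api.aimlapi.com"
TTS_PATH = "/v1/tts"
# Assumption: the `model` enum accepts the model ID listed at the top of this page.
MODEL_ID = "hume/octave-2"
MAX_TEXT_LENGTH = 500_000  # documented upper bound for `text`


def build_tts_payload(text, voice=None, audio_format=None):
    """Build the JSON body for POST /v1/tts (hypothetical helper)."""
    # The schema requires between 1 and 500000 characters of text.
    if not 1 <= len(text) <= MAX_TEXT_LENGTH:
        raise ValueError("text must be between 1 and 500000 characters")
    payload = {"model": MODEL_ID, "text": text}
    # Optional fields are sent only when provided, so server defaults apply otherwise.
    if voice is not None:
        payload["voice"] = voice
    if audio_format is not None:
        payload["format"] = audio_format
    return payload


def build_tts_request(api_key, payload):
    """Wrap the payload in a POST request with Bearer authorization (hypothetical helper)."""
    return urllib.request.Request(
        BASE_URL + TTS_PATH,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


# Sending the request (requires a valid key and network access):
# request = build_tts_request("<YOUR_API_KEY>", build_tts_payload("Hello there!"))
# with urllib.request.urlopen(request) as response:
#     body = response.read()  # response layout is not documented on this page
```

Keeping payload construction separate from sending lets the length check and the field selection be verified without a network call. How the generated audio is returned is not described here, so the sketch stops at reading the raw response body.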

Response

Listen to the audio sample we generated (~ 1.8 s):
