aura 2
Model Overview
Aura 2 produces natural, human-like speech with accurate domain-specific pronunciation — covering drug names, legal terms, alphanumeric strings, and structured inputs such as dates, times, and currency. It also maintains sub-200 ms TTFB latency and offers cost-efficient scalability.
Setup your API Key
If you don’t have an API key for the AI/ML API yet, feel free to use our Quickstart guide.
API Schema
post
Authorizations
AuthorizationstringRequired
Bearer key
Body
modelundefined · enumRequiredPossible values:
textstringRequired
The text content to be converted to speech.
containerstringOptional
The file format wrapper for the output audio. The available options depend on the encoding type.
encodingstring · enumOptionalDefault:
Specifies the expected encoding of your audio output
linear16Possible values: sample_ratestringOptional
Audio sample rate in Hz.
Responses
201Success
application/json
post
/v1/tts201Success
Code Example
The generated audio:
Last updated
Was this helpful?