stt [legacy]

This is the legacy version of our STT API. It is known to fail when requests time out, so we recommend using the new endpoints described on the other pages of the Speech-to-Text category instead.

Migration should be quick: the input parameters of the main endpoint remain the same, and each model page includes detailed code examples.

This service uses per-second billing. The cost of audio transcription is based on the number of seconds in the input audio file, not the processing time.
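As a rough illustration of per-second billing, the sketch below estimates a cost from the audio duration alone. The rate is a made-up placeholder, not a published price, and rounding partial seconds up is an assumption rather than documented behavior.

```python
import math
from decimal import Decimal

# Hypothetical rate, for illustration only; not an actual price.
EXAMPLE_RATE_PER_SECOND = Decimal("0.0001")


def estimate_cost(audio_seconds: float,
                  rate_per_second: Decimal = EXAMPLE_RATE_PER_SECOND) -> Decimal:
    """Estimate the cost from the audio length; processing time plays no part."""
    if audio_seconds < 0:
        raise ValueError("audio_seconds must be non-negative")
    # Assumption: a partial second is billed as a whole second.
    billed_seconds = math.ceil(audio_seconds)
    return billed_seconds * rate_per_second
```

With these placeholder values, a 90-second file costs the same whether it is transcribed in two seconds or twenty.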

POST /v1/stt

Authorizations

Body

any of (optional)

Responses

201 Success

application/json

POST /v1/stt HTTP/1.1
Host: api.aimlapi.com
Authorization: Bearer <YOUR_AIMLAPI_KEY>
Content-Type: application/json
Accept: */*
Content-Length: 593

{
  "model": "#g1_nova-2-general",
  "custom_intent": "text",
  "custom_topic": "text",
  "custom_intent_mode": "strict",
  "custom_topic_mode": "strict",
  "detect_language": true,
  "detect_entities": true,
  "detect_topics": true,
  "diarize": true,
  "dictation": true,
  "diarize_version": "text",
  "extra": "text",
  "filler_words": true,
  "intents": true,
  "keywords": "text",
  "language": "text",
  "measurements": true,
  "multi_channel": true,
  "numerals": true,
  "paragraphs": true,
  "profanity_filter": true,
  "punctuate": true,
  "search": "text",
  "sentiment": true,
  "smart_format": true,
  "summarize": "text",
  "tag": [
    "text"
  ],
  "topics": true,
  "utterances": true,
  "utt_split": 1
}
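The request above could be issued from Python roughly as follows. This is a minimal sketch using only the standard library, not an official client. The `https` scheme, the `url` field that points at the audio file, and the helper names `build_stt_request` and `transcribe` are assumptions that do not appear in the sample request. Because this legacy endpoint is prone to timeouts, the sketch passes an explicit client-side timeout.

```python
import json
import urllib.request

# Host and path come from the sample request; the https scheme is assumed.
API_URL = "https://api.aimlapi.com/v1/stt"


def build_stt_request(api_key: str, audio_url: str, **options) -> urllib.request.Request:
    """Build (but do not send) a POST request for the legacy STT endpoint."""
    payload = {
        "model": "#g1_nova-2-general",
        # Assumption: the audio is referenced through a "url" field,
        # which the sample request does not show.
        "url": audio_url,
        **options,  # optional flags from the sample, e.g. punctuate=True
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def transcribe(api_key: str, audio_url: str, timeout_s: float = 60.0, **options) -> dict:
    """Send the request and decode the JSON response body."""
    request = build_stt_request(api_key, audio_url, **options)
    # urlopen raises an exception if no response arrives within timeout_s.
    with urllib.request.urlopen(request, timeout=timeout_s) as response:
        return json.load(response)
```

A call such as `transcribe("<YOUR_AIMLAPI_KEY>", "https://example.com/audio.mp3", punctuate=True)` would send only the options you pass; the sample body above lists every available field, not a required set.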

201 Success

{
  "metadata": {
    "transaction_key": "text",
    "request_id": "text",
    "sha256": "text",
    "created": "2025-07-30T08:53:57.118Z",
    "duration": 1,
    "channels": 1,
    "models": [
      "text"
    ],
    "model_info": {
      "ANY_ADDITIONAL_PROPERTY": {
        "name": "text",
        "version": "text",
        "arch": "text"
      }
    }
  }
}
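To show how the response metadata might be consumed, here is a small parsing sketch. The `SttMetadata` class and `parse_metadata` function are hypothetical names, and treating `duration` as a length in seconds is an assumption; the sample response does not state the unit.

```python
import json
from dataclasses import dataclass
from datetime import datetime


@dataclass
class SttMetadata:
    request_id: str
    created: datetime
    duration: float  # assumed to be seconds; the sample does not state the unit
    channels: int
    models: list


def parse_metadata(response_body: str) -> SttMetadata:
    """Extract a few fields from the "metadata" object of a response body."""
    meta = json.loads(response_body)["metadata"]
    return SttMetadata(
        request_id=meta["request_id"],
        # Swap the trailing "Z" for an explicit offset so that
        # datetime.fromisoformat also accepts it on Python versions before 3.11.
        created=datetime.fromisoformat(meta["created"].replace("Z", "+00:00")),
        duration=float(meta["duration"]),
        channels=int(meta["channels"]),
        models=list(meta["models"]),
    )
```

Fields that the sketch does not read, such as `sha256` or `model_info`, are simply ignored, so it keeps working if the response carries more properties than shown here.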


Last updated 1 month ago
