# Voice Chat

## Overview

Voice chat models enable voice-based interaction with AI systems. Unlike text-only assistants, they can respond with natural-sounding speech, which makes conversations feel more immersive and human-like. Some models accept text input and respond with voice, while others process both speech and text, so users can either talk to the model or type messages, depending on the use case.

Depending on the model, you may be able to configure the bitrate, the output audio format (often including lossless options), and streaming or non-streaming output. Many models also offer a selection of voices and ways to customize or modify them.

## All Available Voice Chat Models

<table data-header-hidden="false" data-header-sticky data-full-width="true"><thead><tr><th width="284.4000244140625">Model ID</th><th width="132.79998779296875">Developer</th><th width="105.5999755859375">Context</th><th>Model Card</th></tr></thead><tbody><tr><td><a href="/pages/63lJ8tWmUHRo7JcPkoh0">elevenlabs/v3_alpha</a></td><td>ElevenLabs</td><td></td><td><a href="https://aimlapi.com/models/eleven-v3-alpha">Eleven v3 Alpha</a></td></tr><tr><td><a href="/pages/EfMplhNLN9DRrYQ2Urek">minimax/speech-2.5-turbo-preview</a></td><td>MiniMax</td><td></td><td><a href="https://aimlapi.com/models/minimax-speech-2-5-turbo">MiniMax Speech 2.5 Turbo</a></td></tr><tr><td><a href="/pages/F9biUyWK6QtqQomKPVGj">minimax/speech-2.5-hd-preview</a></td><td>MiniMax</td><td></td><td><a href="https://aimlapi.com/models/minimax-speech-2-5-hd">MiniMax Speech 2.5 HD</a></td></tr><tr><td><a href="/pages/hupRCZodzREiq39PZmtO">minimax/speech-2.6-turbo</a></td><td>MiniMax</td><td></td><td><a href="https://aimlapi.com/models/minimax-speech-2-6-turbo">MiniMax Speech 2.6 Turbo</a></td></tr><tr><td><a href="/pages/Urq1DcUAH37bw6HgTvWv">minimax/speech-2.6-hd</a></td><td>MiniMax</td><td></td><td><a href="https://aimlapi.com/models/minimax-speech-2-6-hd">MiniMax Speech 2.6 HD</a></td></tr><tr><td><a href="/pages/CSSHZtSpVfE39fQA1v4y">minimax/speech-2.8-turbo</a></td><td>MiniMax</td><td></td><td><a href="https://aimlapi.com/models/minimax-speech-2-8-turbo">Speech 2.8 Turbo</a></td></tr><tr><td><a href="/pages/5CyqOXqqP1iutKTWhzGL">minimax/speech-2.8-hd</a></td><td>MiniMax</td><td></td><td><a href="https://aimlapi.com/models/minimax-speech-2-8-hd">Speech 2.8 HD</a></td></tr></tbody></table>

***

Several models listed in our Text Models (LLM) section also belong to this category:

* [gpt-4o-audio-preview](/api-references/text-models-llm/openai/gpt-4o-audio-preview.md)
* [gpt-4o-mini-audio-preview](/api-references/text-models-llm/openai/gpt-4o-mini-audio-preview.md)


***

# Agent Instructions: Querying This Documentation

If you need information that is not directly available on this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.aimlapi.com/api-references/speech-models/voice-chat.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question, along with relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present on the current page, when you need clarification or additional context, or when you want to retrieve related documentation sections.
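
The `ask` request described above can be sketched in Python using only the standard library. This is a minimal illustration, not an official client: the helper names `build_ask_url` and `ask_docs` are hypothetical, and the code assumes that the question should be percent-encoded and that the response body is text that can be decoded directly.

```python
from urllib.parse import parse_qsl, quote, urlencode, urlsplit, urlunsplit
from urllib.request import urlopen

# URL of this page, as shown in the GET example above.
PAGE_URL = "https://docs.aimlapi.com/api-references/speech-models/voice-chat.md"


def build_ask_url(question: str, page_url: str = PAGE_URL) -> str:
    """Return page_url with the question attached as the `ask` query parameter.

    Hypothetical helper: percent-encoding the question is an assumption made
    here so that spaces and punctuation survive the request.
    """
    question = question.strip()
    if not question:
        raise ValueError("question must not be empty")

    parts = urlsplit(page_url)
    # Keep any existing query parameters, but drop a previous `ask` value.
    query = [(k, v) for k, v in parse_qsl(parts.query, keep_blank_values=True) if k != "ask"]
    query.append(("ask", question))
    # quote_via=quote encodes spaces as %20 rather than '+'.
    return urlunsplit(parts._replace(query=urlencode(query, quote_via=quote)))


def ask_docs(question: str, page_url: str = PAGE_URL, timeout: float = 30.0) -> str:
    """Perform the GET request and return the response body as text.

    Assumes a text response; falls back to UTF-8 if no charset is declared.
    """
    with urlopen(build_ask_url(question, page_url), timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset)


if __name__ == "__main__":
    # Example question: specific, self-contained, and in natural language.
    print(ask_docs("Which output audio formats does minimax/speech-2.8-hd support?"))
```

Keeping URL construction separate from the network call makes it easy to inspect or log the exact request before sending it.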
