# Text Models (LLM)

<details>

<summary>Overview</summary>

The AI/ML API provides access to text-based models, also known as **Large Language Models** (**LLM**s). You interact with them through natural language, which is why they are also commonly called **chat models**. These models can be applied to a wide range of tasks, so you can build many kinds of applications on top of our API. For example, text models can be used to:

* Create a system that searches your photos using text prompts.
* Provide psychological support.
* Play games with you through natural language.
* Assist you with coding.
* Run security assessments (pentests) to find vulnerabilities on servers.
* Write documentation for your services.
* Serve as a grammar corrector for multiple languages with deep context understanding.
* And much more.

</details>

<details>

<summary>Specific Capabilities</summary>

Several capabilities of text models are worth mentioning separately.

**Completion** allows the model to analyze a given text fragment and predict how it might continue based on the probabilities of the next possible tokens or characters. **Chat Completion** extends this functionality, enabling a simulated dialogue between the user and the model based on predefined roles (e.g., "strict language teacher" and "student"). A detailed description and examples can be found in our [Completion and Chat Completion](https://docs.aimlapi.com/capabilities/completion-or-chat-models) article.
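
As a rough illustration of the role-based dialogue described above, the sketch below assembles a `messages` list for a "strict language teacher" and a "student". The helper names `build_dialogue` and `ask` are hypothetical and exist only for this example; `ask` assumes it receives an OpenAI SDK client configured with the AI/ML API base URL, as in the Quick Code Example on this page.

```python
from typing import Dict, List

Message = Dict[str, str]


def build_dialogue(system_role: str, history: List[Message], user_text: str) -> List[Message]:
    """Return a new messages list: system prompt, earlier turns, then the new user turn."""
    return [
        {"role": "system", "content": system_role},
        *history,
        {"role": "user", "content": user_text},
    ]


# The system message sets the model's role; the user message is the student's line.
messages = build_dialogue(
    "You are a strict language teacher. Correct every mistake the student makes.",
    [],  # no earlier turns in this conversation yet
    "Yesterday I goed to the library.",
)


def ask(client, model: str, messages: List[Message]) -> str:
    """Send the dialogue to the chat completions endpoint and return the reply text.

    `client` is assumed to be an `openai.OpenAI` instance pointed at the AI/ML API.
    """
    response = client.chat.completions.create(model=model, messages=messages)
    return response.choices[0].message.content
```

To continue the conversation, append the model's reply as an `assistant` message to the history and call `build_dialogue` again with the student's next line.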

***

Chat completion is extended further by **Assistants** (preconfigured conversational agents with specific roles) and **Threads** (a mechanism for keeping conversation history as context). Examples of this functionality can be found in the [Managing Assistants & Threads](https://docs.aimlapi.com/solutions/openai/assistants) article.

***

**Function Calling** allows a chat model to invoke external programmatic tools (e.g., a function you have written) while generating a response. A detailed description and examples are available in the [Function Calling](https://docs.aimlapi.com/capabilities/function-calling) article.
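
A minimal sketch of the idea, assuming the model accepts tool definitions in the OpenAI-style `tools` format (check the API schema of the specific model before relying on this). The function `get_weather`, the `AVAILABLE_TOOLS` registry, and the `run_tool_call` dispatcher are hypothetical names invented for this illustration, and the weather data is canned.

```python
import json


def get_weather(city: str) -> dict:
    """Hypothetical local function; returns canned data instead of calling a real service."""
    return {"city": city, "temperature_c": 21}


# Tool description in the OpenAI-style format, passed as `tools=tools` in the request.
tools = [
    {
        "type": "function",
        "function": {
            "name": "get_weather",
            "description": "Get the current temperature for a city.",
            "parameters": {
                "type": "object",
                "properties": {"city": {"type": "string"}},
                "required": ["city"],
            },
        },
    }
]

# Registry that maps tool names the model may request to local Python callables.
AVAILABLE_TOOLS = {"get_weather": get_weather}


def run_tool_call(name: str, arguments_json: str) -> str:
    """Execute the requested tool and return its result as a JSON string.

    With the OpenAI SDK, `name` and `arguments_json` would typically come from
    `response.choices[0].message.tool_calls[i].function`.
    """
    func = AVAILABLE_TOOLS[name]
    result = func(**json.loads(arguments_json))
    return json.dumps(result)
```

The returned string is usually sent back to the model in a follow-up message with the `tool` role, so that the model can phrase the final answer.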

</details>

<details>

<summary>Endpoint</summary>

All text and chat models use the same endpoint:

`https://api.aimlapi.com/v1/chat/completions`

The parameters may vary (especially for models from different developers), so it’s best to check the API schema on each model’s page for details. Example: [**o4-mini**](https://docs.aimlapi.com/api-references/openai/o4-mini#api-schema).
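
For readers who prefer to call the endpoint without an SDK, here is a hedged sketch that uses only the Python standard library. It assumes the endpoint accepts a JSON `POST` body with `model` and `messages` and a bearer token in the `Authorization` header, which is what the OpenAI SDK sends. The helper names `build_request` and `send` are hypothetical.

```python
import json
import urllib.request

ENDPOINT = "https://api.aimlapi.com/v1/chat/completions"


def build_request(api_key: str, model: str, prompt: str) -> urllib.request.Request:
    """Prepare (but do not send) a POST request with a single user message."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        ENDPOINT,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request: urllib.request.Request) -> dict:
    """Send the prepared request and decode the JSON response body."""
    with urllib.request.urlopen(request) as response:
        return json.load(response)


# Building the request needs no network access; call send(request) to execute it.
request = build_request("<YOUR_AIMLAPI_KEY>", "gpt-4o", "Tell me, why is the sky blue?")
```

Model-specific parameters, where a model's API schema lists them, would be added to `payload` alongside `model` and `messages`.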

</details>

<details>

<summary><span data-gb-custom-inline data-tag="emoji" data-code="2705">✅</span> Quick Code Example</summary>

We will call the [**gpt-4o**](https://docs.aimlapi.com/api-references/text-models-llm/openai/gpt-4o) model using the Python programming language and the OpenAI SDK.

{% hint style="info" %}
If you need a more detailed explanation of how to call a model's API in code, check out our [<mark style="color:blue;">QUICKSTART</mark>](https://github.com/aimlapi/api-docs/blob/main/docs/api-references/text-models-llm/broken-reference/README.md) section.
{% endhint %}

{% code overflow="wrap" %}

```python
# Install the SDK first (run in a terminal): pip install openai
from openai import OpenAI

client = OpenAI(
    base_url="https://api.aimlapi.com/v1",

    # Replace <YOUR_AIMLAPI_KEY> with your AI/ML API key:
    api_key="<YOUR_AIMLAPI_KEY>",
)

response = client.chat.completions.create(
    model="gpt-4o",
    messages=[
        {
            "role": "system",
            "content": "You are an AI assistant who knows everything.",
        },
        {
            "role": "user",
            "content": "Tell me, why is the sky blue?"
        },
    ],
)

message = response.choices[0].message.content

print(f"Assistant: {message}")
```

{% endcode %}

Running this code example produced the following response from the chat model:

{% code overflow="wrap" %}

```text
Assistant: The sky appears blue due to a phenomenon called Rayleigh scattering. When sunlight enters Earth's atmosphere, it collides with gas molecules and small particles. Sunlight is made up of different colors, each with different wavelengths. Blue light has a shorter wavelength and is scattered in all directions by the gas molecules in the atmosphere more than other colors with longer wavelengths, such as red or yellow.
As a result, when you look up at the sky during the day, you see this scattered blue light being dispersed in all directions, making the sky appear blue to our eyes. During sunrise and sunset, the sun's light passes through a greater thickness of Earth's atmosphere, scattering the shorter blue wavelengths out of your line of sight and leaving the longer wavelengths, like red and orange, more dominant, which is why the sky often turns those colors at those times.
```

{% endcode %}

</details>

<details>

<summary>Complete Text Model List</summary>

<table data-full-width="true"><thead><tr><th width="297.4000244140625">Model ID + API Reference link</th><th width="134.20001220703125">Developer</th><th width="105.79998779296875">Context</th><th>Model Card</th></tr></thead><tbody><tr><td><a href="text-models-llm/openai/gpt-3.5-turbo">gpt-3.5-turbo</a></td><td>Open AI</td><td>16,000</td><td><a href="https://aimlapi.com/models/chat-gpt-3-5">Chat GPT 3.5 Turbo</a></td></tr><tr><td><a href="text-models-llm/openai/gpt-3.5-turbo">gpt-3.5-turbo-0125</a></td><td>Open AI</td><td>16,000</td><td><a href="https://aimlapi.com/models/chat-gpt-3-5-turbo-0125">Chat GPT-3.5 Turbo 0125</a></td></tr><tr><td><a href="text-models-llm/openai/gpt-3.5-turbo">gpt-3.5-turbo-1106</a></td><td>Open AI</td><td>16,000</td><td><a href="https://aimlapi.com/models/chat-gpt-3-5-turbo-1106">Chat GPT-3.5 Turbo 1106</a></td></tr><tr><td><a href="text-models-llm/openai/gpt-4o">gpt-4o</a></td><td>Open AI</td><td>128,000</td><td><a href="https://aimlapi.com/models/chat-gpt-4-omni">Chat GPT-4o</a></td></tr><tr><td><a href="text-models-llm/openai/gpt-4o">gpt-4o-2024-08-06</a></td><td>Open AI</td><td>128,000</td><td><a href="https://aimlapi.com/models/gpt-4o-2024-08-06-api">GPT-4o-2024-08-06</a></td></tr><tr><td><a href="text-models-llm/openai/gpt-4o">gpt-4o-2024-05-13</a></td><td>Open AI</td><td>128,000</td><td><a href="https://aimlapi.com/models/gpt-4o-2024-05-13-api">GPT-4o-2024-05-13</a></td></tr><tr><td><a href="text-models-llm/openai/gpt-4o-mini">gpt-4o-mini</a></td><td>Open AI</td><td>128,000</td><td><a href="https://aimlapi.com/models/chat-gpt-4o-mini">Chat GPT 4o mini</a></td></tr><tr><td><a href="text-models-llm/openai/gpt-4o-mini">gpt-4o-mini-2024-07-18</a></td><td>Open AI</td><td>128,000</td><td><a href="https://aimlapi.com/models/chat-gpt-4o-mini">GPT 4o mini</a></td></tr><tr><td><a href="text-models-llm/openai/gpt-4o-audio-preview">gpt-4o-audio-preview</a></td><td>Open AI</td><td>128,000</td><td><a 
href="https://aimlapi.com/models/gpt-4o-audio-preview-api">GPT-4o Audio Preview</a></td></tr><tr><td><a href="text-models-llm/openai/gpt-4o-mini-audio-preview">gpt-4o-mini-audio-preview</a></td><td>Open AI</td><td>128,000</td><td><a href="https://aimlapi.com/models/gpt-4o-mini-audio-api">GPT-4o mini Audio</a></td></tr><tr><td><a href="text-models-llm/openai/gpt-4o-search-preview">gpt-4o-search-preview</a></td><td>Open AI</td><td>128,000</td><td><a href="https://aimlapi.com/models/gpt-4o-search-preview-api">GPT-4o Search Preview</a></td></tr><tr><td><a href="text-models-llm/openai/gpt-4o-mini-search-preview">gpt-4o-mini-search-preview</a></td><td>Open AI</td><td>128,000</td><td><a href="https://aimlapi.com/models/gpt-4o-mini-search-preview-api">GPT-4o Mini Search Preview</a></td></tr><tr><td><a href="text-models-llm/openai/gpt-4-turbo">gpt-4-turbo</a></td><td>Open AI</td><td>128,000</td><td><a href="https://aimlapi.com/models/chat-gpt-4-turbo">Chat GPT 4 Turbo</a></td></tr><tr><td><a href="text-models-llm/openai/gpt-4-turbo">gpt-4-turbo-2024-04-09</a></td><td>Open AI</td><td>128,000</td><td>-</td></tr><tr><td><a href="text-models-llm/openai/gpt-4">gpt-4</a></td><td>Open AI</td><td>8,000</td><td><a href="https://aimlapi.com/models/chat-gpt-4">Chat GPT 4</a></td></tr><tr><td><a href="text-models-llm/openai/gpt-4-preview">gpt-4-0125-preview</a></td><td>Open AI</td><td>8,000</td><td>-</td></tr><tr><td><a href="text-models-llm/openai/gpt-4-preview">gpt-4-1106-preview</a></td><td>Open AI</td><td>8,000</td><td>-</td></tr><tr><td><a href="text-models-llm/openai/o1">o1</a></td><td>Open AI</td><td>200,000</td><td><a href="https://aimlapi.com/models/openai-o1-api">OpenAI o1</a></td></tr><tr><td><a href="text-models-llm/openai/o3">openai/o3-2025-04-16</a></td><td>Open AI</td><td>200,000</td><td><a href="https://aimlapi.com/models/o3">o3</a></td></tr><tr><td><a href="text-models-llm/openai/o3-mini">o3-mini</a></td><td>Open AI</td><td>200,000</td><td><a 
href="https://aimlapi.com/models/openai-o3-mini-api">OpenAI o3 mini</a></td></tr><tr><td><a href="text-models-llm/openai/o3-pro">openai/o3-pro</a></td><td>Open AI</td><td>200,000</td><td><a href="https://aimlapi.com/models/o3-pro">o3-pro</a></td></tr><tr><td><a href="text-models-llm/openai/gpt-4.1">openai/gpt-4.1-2025-04-14</a></td><td>Open AI</td><td>1,000,000</td><td><a href="https://aimlapi.com/models/gpt-4-1">GPT-4.1</a></td></tr><tr><td><a href="text-models-llm/openai/gpt-4.1-mini">openai/gpt-4.1-mini-2025-04-14</a></td><td>Open AI</td><td>1,000,000</td><td><a href="https://aimlapi.com/models/gpt-4-1-mini-api">GPT-4.1 Mini</a></td></tr><tr><td><a href="text-models-llm/openai/gpt-4.1-nano">openai/gpt-4.1-nano-2025-04-14</a></td><td>Open AI</td><td>1,000,000</td><td><a href="https://aimlapi.com/models/gpt-4-1-nano-api">GPT-4.1 Nano</a></td></tr><tr><td><a href="text-models-llm/openai/o4-mini">openai/o4-mini-2025-04-16</a></td><td>Open AI</td><td>200,000</td><td><a href="https://aimlapi.com/models/gpt-o4-mini-2025-04-16">GPT-o4-mini-2025-04-16</a></td></tr><tr><td><a href="text-models-llm/openai/gpt-oss-20b">openai/gpt-oss-20b</a></td><td>Open AI</td><td>128,000</td><td><a href="https://aimlapi.com/models/gpt-oss-20b">GPT OSS 20B</a></td></tr><tr><td><a href="text-models-llm/openai/gpt-oss-120b">openai/gpt-oss-120b</a></td><td>Open AI</td><td>128,000</td><td><a href="https://aimlapi.com/models/gpt-oss-120b">GPT OSS 120B</a></td></tr><tr><td><a href="text-models-llm/openai/gpt-5">openai/gpt-5-2025-08-07</a></td><td>Open AI</td><td>400,000</td><td><a href="https://aimlapi.com/models/gpt-5">GPT-5</a></td></tr><tr><td><a href="text-models-llm/openai/gpt-5-mini">openai/gpt-5-mini-2025-08-07</a></td><td>Open AI</td><td>400,000</td><td><a href="https://aimlapi.com/models/gpt-5-mini">GPT-5 Mini</a></td></tr><tr><td><a href="text-models-llm/openai/gpt-5-nano">openai/gpt-5-nano-2025-08-07</a></td><td>Open AI</td><td>400,000</td><td><a 
href="https://aimlapi.com/models/gpt-5-nano">GPT-5 Nano</a></td></tr><tr><td><a href="text-models-llm/openai/gpt-5-chat">openai/gpt-5-chat-latest</a></td><td>Open AI</td><td>400,000</td><td><a href="https://aimlapi.com/models/gpt-5-chat">GPT-5 Chat</a></td></tr><tr><td><a href="text-models-llm/openai/gpt-5-1">openai/gpt-5-1</a></td><td>Open AI</td><td>128,000</td><td><a href="https://aimlapi.com/models/gpt-5-1">GPT-5.1</a></td></tr><tr><td><a href="text-models-llm/openai/gpt-5-1-chat-latest">openai/gpt-5-1-chat-latest</a></td><td>Open AI</td><td>128,000</td><td><a href="https://aimlapi.com/models/gpt-5-1-chat-latest">GPT-5.1 Chat Latest</a></td></tr><tr><td><a href="text-models-llm/openai/gpt-5-1-codex">openai/gpt-5-1-codex</a></td><td>Open AI</td><td>400,000</td><td><a href="https://aimlapi.com/models/gpt-5-1-codex">GPT-5.1 Codex</a></td></tr><tr><td><a href="text-models-llm/openai/gpt-5-1-codex-mini">openai/gpt-5-1-codex-mini</a></td><td>Open AI</td><td>400,000</td><td><a href="https://aimlapi.com/models/gpt-5-1-codex-mini">GPT-5.1 Codex Mini</a></td></tr><tr><td><a href="text-models-llm/openai/gpt-5.2">openai/gpt-5-2</a></td><td>Open AI</td><td>400,000</td><td><a href="https://aimlapi.com/models/gpt-5-2">GPT-5.2</a></td></tr><tr><td><a href="text-models-llm/openai/gpt-5.2-chat-latest">openai/gpt-5-2-chat-latest</a></td><td>Open AI</td><td>400,000</td><td><a href="https://aimlapi.com/models/gpt-5-2-chat-latest">GPT-5.2 Chat Latest</a></td></tr><tr><td><a href="text-models-llm/openai/gpt-5.2-pro">openai/gpt-5-2-pro</a></td><td>Open AI</td><td>400,000</td><td><a href="https://aimlapi.com/models/gpt-5-2-pro">GPT-5.2 Pro</a></td></tr><tr><td><a href="text-models-llm/openai/gpt-5.2-codex">openai/gpt-5-2-codex</a></td><td>Open AI</td><td>400,000</td><td><a href="https://aimlapi.com/models/gpt-5-2-codex">GPT-5.2 Codex</a></td></tr><tr><td><a href="text-models-llm/openai/gpt-5.3-codex">openai/gpt-5-3-codex</a></td><td>Open AI</td><td>400,000</td><td><a 
href="https://aimlapi.com/models/gpt-5-3-codex">GPT-5.3 Codex</a></td></tr><tr><td><a href="text-models-llm/openai/gpt-5-4">openai/gpt-5-4</a></td><td>Open AI</td><td>1,000,000</td><td><a href="https://aimlapi.com/models/gpt-5-4">GPT-5.4</a></td></tr><tr><td><a href="text-models-llm/openai/gpt-5-4-pro">openai/gpt-5-4-pro</a></td><td>Open AI</td><td>1,000,000</td><td><a href="https://aimlapi.com/models/gpt-5-4-pro">GPT-5.4 Pro</a></td></tr><tr><td><a href="text-models-llm/anthropic/claude-3-haiku">claude-3-haiku-20240307</a></td><td>Anthropic</td><td>200,000</td><td>-</td></tr><tr><td><a href="text-models-llm/anthropic/claude-4-opus">anthropic/claude-opus-4</a></td><td>Anthropic</td><td>200,000</td><td><a href="https://aimlapi.com/models/claude-4-opus">Claude 4 Opus</a></td></tr><tr><td><a href="text-models-llm/anthropic/claude-opus-4.1">anthropic/claude-opus-4.1<br>claude-opus-4-1<br>claude-opus-4-1-20250805</a></td><td>Anthropic</td><td>200,000</td><td><a href="https://aimlapi.com/models/claude-opus-4-1">Claude Opus 4.1</a></td></tr><tr><td><a href="text-models-llm/anthropic/claude-4-sonnet">anthropic/claude-sonnet-4</a></td><td>Anthropic</td><td>200,000</td><td><a href="https://aimlapi.com/models/claude-4-sonnet">Claude 4 Sonnet</a></td></tr><tr><td><p><a href="text-models-llm/anthropic/claude-4-5-sonnet">claude-sonnet-4-5-20250929</a></p><p><a href="text-models-llm/anthropic/claude-4-5-sonnet">anthropic/claude-sonnet-4.5</a></p><p><a href="text-models-llm/anthropic/claude-4-5-sonnet">claude-sonnet-4-5</a></p></td><td>Anthropic</td><td>200,000</td><td><a href="https://aimlapi.com/models/claude-4-5-sonnet">Claude 4.5 Sonnet</a></td></tr><tr><td><p><a href="text-models-llm/anthropic/claude-4.5-haiku">anthropic/claude-haiku-4.5</a><br><a href="text-models-llm/anthropic/claude-4.5-haiku">claude-haiku-4-5</a></p><p><a href="text-models-llm/anthropic/claude-4.5-haiku">claude-haiku-4-5-20251001</a></p></td><td>Anthropic</td><td>200,000</td><td><a 
href="https://aimlapi.com/models/claude-4-5-haiku">Claude 4.5 Haiku</a></td></tr><tr><td><a href="text-models-llm/anthropic/claude-4.5-opus">anthropic/claude-opus-4-5<br>claude-opus-4-5<br>claude-opus-4-5-20251101</a></td><td>Anthropic</td><td>200,000</td><td><a href="text-models-llm/anthropic/claude-4.5-opus">Claude 4.5 Opus</a></td></tr><tr><td><a href="text-models-llm/anthropic/claude-4.6-opus">anthropic/claude-opus-4-6</a></td><td>Anthropic</td><td>200,000</td><td><a href="https://aimlapi.com/models/claude-opus-4-6">Claude 4.6 Opus</a></td></tr><tr><td><a href="text-models-llm/anthropic/claude-4.6-sonnet">anthropic/claude-sonnet-4.6<br>anthropic/claude-sonnet-4-6-20260218</a></td><td>Anthropic</td><td>200,000</td><td><a href="https://aimlapi.com/models/claude-sonnet-4-6">Claude Sonnet 4.6</a></td></tr><tr><td><a href="text-models-llm/alibaba-cloud/qwen2.5-7b-instruct-turbo">Qwen/Qwen2.5-7B-Instruct-Turbo</a></td><td>Alibaba Cloud</td><td>32,000</td><td><a href="https://aimlapi.com/models/qwen-2-5-7b-instruct-api">Qwen 2.5 7B Instruct Turbo</a></td></tr><tr><td><a href="text-models-llm/alibaba-cloud/qwen-max">qwen-max</a></td><td>Alibaba Cloud</td><td>32,000</td><td><a href="https://aimlapi.com/models/qwen-max-api">Qwen Max</a></td></tr><tr><td><a href="text-models-llm/alibaba-cloud/qwen-max">qwen-max-2025-01-25</a></td><td>Alibaba Cloud</td><td>32,000</td><td><a href="https://aimlapi.com/models/qwen-max-2025-01-25-api">Qwen Max 2025-01-25</a></td></tr><tr><td><a href="text-models-llm/alibaba-cloud/qwen-plus">qwen-plus</a></td><td>Alibaba Cloud</td><td>131,000</td><td><a href="https://aimlapi.com/models/qwen-plus-api">Qwen Plus</a></td></tr><tr><td><a href="text-models-llm/alibaba-cloud/qwen-turbo">qwen-turbo</a></td><td>Alibaba Cloud</td><td>1,000,000</td><td><a href="https://aimlapi.com/models/qwen-turbo-api">Qwen Turbo</a></td></tr><tr><td><a href="text-models-llm/alibaba-cloud/qwen3-32b">alibaba/qwen3-32b</a></td><td>Alibaba Cloud</td><td>131,000</td><td><a 
href="https://aimlapi.com/models/qwen3-32b">Qwen3-32B</a></td></tr><tr><td><a href="text-models-llm/alibaba-cloud/qwen3-coder-480b-a35b-instruct">alibaba/qwen3-coder-480b-a35b-instruct</a></td><td>Alibaba Cloud</td><td>262,000</td><td><a href="https://aimlapi.com/models/qwen3-coder-480b-a35b-instruct">Qwen3 Coder</a></td></tr><tr><td><a href="text-models-llm/alibaba-cloud/qwen3-235b-a22b-thinking-2507">alibaba/qwen3-235b-a22b-thinking-2507</a></td><td>Alibaba Cloud</td><td>262,000</td><td><a href="https://aimlapi.com/models/qwen3-235b-a22b">Qwen3 235B A22B Thinking</a></td></tr><tr><td><a href="text-models-llm/alibaba-cloud/qwen3-next-80b-a3b-instruct">alibaba/qwen3-next-80b-a3b-instruct</a></td><td>Alibaba Cloud</td><td>262,000</td><td><a href="https://aimlapi.com/models/qwen3-next-80b-a3b-instruct">Qwen3-Next-80B-A3B Instruct</a></td></tr><tr><td><a href="text-models-llm/alibaba-cloud/qwen3-next-80b-a3b-thinking">alibaba/qwen3-next-80b-a3b-thinking</a></td><td>Alibaba Cloud</td><td>262,000</td><td><a href="https://aimlapi.com/models/qwen3-next-80b-a3b-thinking">Qwen3-Next-80B-A3B Thinking</a></td></tr><tr><td><a href="text-models-llm/alibaba-cloud/qwen3-max-preview">alibaba/qwen3-max-preview</a></td><td>Alibaba Cloud</td><td>258,000</td><td><a href="text-models-llm/alibaba-cloud/qwen3-max-preview">Qwen3-Max Preview</a></td></tr><tr><td><a href="text-models-llm/alibaba-cloud/qwen3-max-instruct">alibaba/qwen3-max-instruct</a></td><td>Alibaba Cloud</td><td>262,000</td><td><a href="text-models-llm/alibaba-cloud/qwen3-max-instruct">Qwen3-Max Instruct</a></td></tr><tr><td><a href="text-models-llm/alibaba-cloud/qwen3-omni-30b-a3b-captioner">qwen3-omni-30b-a3b-captioner</a></td><td>Alibaba Cloud</td><td>65,000</td><td><a href="text-models-llm/alibaba-cloud/qwen3-omni-30b-a3b-captioner">qwen3-omni-30b-a3b-captioner</a></td></tr><tr><td><a href="text-models-llm/alibaba-cloud/qwen3-vl-32b-instruct">alibaba/qwen3-vl-32b-instruct</a></td><td>Alibaba 
Cloud</td><td>126,000</td><td><a href="https://aimlapi.com/models/qwen3-vl-32b-instruct">Qwen3 VL 32B Instruct</a></td></tr><tr><td><a href="text-models-llm/alibaba-cloud/qwen3-vl-32b-thinking">alibaba/qwen3-vl-32b-thinking</a></td><td>Alibaba Cloud</td><td>126,000</td><td><a href="https://aimlapi.com/models/qwen3-vl-32b-thinking">Qwen3 VL 32B Thinking</a></td></tr><tr><td><a href="text-models-llm/alibaba-cloud/qwen3.5-plus">alibaba/qwen3.5-plus-20260218</a></td><td>Alibaba Cloud</td><td>1,000,000</td><td><a href="https://aimlapi.com/models/qwen3-5-plus">Qwen3.5 Plus</a></td></tr><tr><td><a href="text-models-llm/anthracite/magnum-v4">anthracite-org/magnum-v4-72b</a></td><td>Anthracite</td><td>32,000</td><td><a href="https://aimlapi.com/models/magnum-v4-72b-api">Magnum v4 72B</a></td></tr><tr><td><a href="text-models-llm/baidu/ernie-4.5-8k-preview">baidu/ernie-4-5-8k-preview</a></td><td>Baidu</td><td>8,000</td><td><a href="https://aimlapi.com/models/ernie-4-5">ERNIE 4.5</a></td></tr><tr><td><a href="text-models-llm/baidu/ernie-4.5-0.3b">baidu/ernie-4.5-0.3b</a></td><td>Baidu</td><td>120,000</td><td><a href="https://aimlapi.com/models/ernie-4-5">ERNIE 4.5</a></td></tr><tr><td><a href="text-models-llm/baidu/ernie-4.5-21b-a3b">baidu/ernie-4.5-21b-a3b</a></td><td>Baidu</td><td>120,000</td><td><a href="https://aimlapi.com/models/ernie-4-5">ERNIE 4.5</a></td></tr><tr><td><a href="text-models-llm/baidu/ernie-4.5-21b-a3b-thinking">baidu/ernie-4.5-21b-a3b-thinking</a></td><td>Baidu</td><td>131,000</td><td><a href="https://aimlapi.com/models/ernie-4-5">ERNIE 4.5</a></td></tr><tr><td><a href="text-models-llm/baidu/ernie-4.5-vl-28b-a3b">baidu/ernie-4.5-vl-28b-a3b</a></td><td>Baidu</td><td>30,000</td><td><a href="https://aimlapi.com/models/ernie-4-5-vl">ERNIE 4.5 VL</a></td></tr><tr><td><a href="text-models-llm/baidu/ernie-4.5-vl-424b-a47b">baidu/ernie-4.5-vl-424b-a47b</a></td><td>Baidu</td><td>123,000</td><td><a href="https://aimlapi.com/models/ernie-4-5-vl">ERNIE 4.5 
VL</a></td></tr><tr><td><a href="text-models-llm/baidu/ernie-4.5-300b-a47b">baidu/ernie-4.5-300b-a47b</a></td><td>Baidu</td><td>123,000</td><td><a href="https://aimlapi.com/models/ernie-4-5">ERNIE 4.5</a></td></tr><tr><td><a href="text-models-llm/baidu/ernie-4.5-300b-a47b-paddle">baidu/ernie-4.5-300b-a47b-paddle</a></td><td>Baidu</td><td>123,000</td><td><a href="https://aimlapi.com/models/ernie-4-5">ERNIE 4.5</a></td></tr><tr><td><a href="text-models-llm/baidu/ernie-4.5-turbo-128k">baidu/ernie-4-5-turbo-128k</a></td><td>Baidu</td><td>128,000</td><td><a href="https://aimlapi.com/models/ernie-4-5">ERNIE 4.5</a></td></tr><tr><td><a href="text-models-llm/baidu/ernie-4.5-turbo-vl-32k">baidu/ernie-4-5-turbo-vl-32k</a></td><td>Baidu</td><td>32,000</td><td><a href="https://aimlapi.com/models/ernie-4-5-vl">ERNIE 4.5 VL</a></td></tr><tr><td><a href="text-models-llm/baidu/ernie-5.0-thinking-preview">baidu/ernie-5-0-thinking-preview</a></td><td>Baidu</td><td>128,000</td><td><a href="https://aimlapi.com/models/ernie-5-0">ERNIE 5.0</a></td></tr><tr><td><a href="text-models-llm/baidu/ernie-5.0-thinking-latest">baidu/ernie-5-0-thinking-latest</a></td><td>Baidu</td><td>128,000</td><td><a href="https://aimlapi.com/models/ernie-5-0">ERNIE 5.0</a></td></tr><tr><td><a href="text-models-llm/baidu/ernie-x1-turbo-32k">baidu/ernie-x1-turbo-32k</a></td><td>Baidu</td><td>32,000</td><td><em>Coming Soon</em></td></tr><tr><td><a href="text-models-llm/baidu/ernie-x1.1-preview">baidu/ernie-x1-1-preview</a></td><td>Baidu</td><td>64,000</td><td><em>Coming Soon</em></td></tr><tr><td><a href="text-models-llm/bytedance/seed-1.8">bytedance/seed-1-8</a></td><td>ByteDance</td><td>256,000</td><td><a href="https://aimlapi.com/models/seed-1-8">Seed 1.8</a></td></tr><tr><td><a href="text-models-llm/cohere/command-a">cohere/command-a</a></td><td>Cohere</td><td>256,000</td><td><a href="https://aimlapi.com/models/command-a">Command A</a></td></tr><tr><td><a 
href="text-models-llm/deepseek/deepseek-chat">deepseek-chat or<br>deepseek/deepseek-chat or<br>deepseek/deepseek-chat-v3-0324</a></td><td>DeepSeek</td><td>128,000</td><td><a href="https://aimlapi.com/models/deepseek-v3">DeepSeek V3</a></td></tr><tr><td><a href="text-models-llm/deepseek/deepseek-r1">deepseek/deepseek-r1 or<br>deepseek-reasoner</a></td><td>DeepSeek</td><td>128,000</td><td><a href="https://aimlapi.com/models/deepseek-r1-api">DeepSeek R1</a></td></tr><tr><td><a href="text-models-llm/deepseek/deepseek-chat-v3.1">deepseek/deepseek-chat-v3.1</a></td><td>DeepSeek</td><td>128,000</td><td><a href="https://aimlapi.com/models/deepseek-v3-1-chat">DeepSeek V3.1 Chat</a></td></tr><tr><td><a href="text-models-llm/deepseek/deepseek-reasoner-v3.1">deepseek/deepseek-reasoner-v3.1</a></td><td>DeepSeek</td><td>128,000</td><td><a href="https://aimlapi.com/models/deepseek-v3-1-reasoner">DeepSeek V3.1 Reasoner</a></td></tr><tr><td><a href="text-models-llm/deepseek/deepseek-reasoner-v3.2-exp-thinking">deepseek/deepseek-thinking-v3.2-exp</a></td><td>DeepSeek</td><td>128,000</td><td><a href="https://aimlapi.com/models/deepseek-v3-2-exp-thinking">DeepSeek V3.2-Exp Thinking</a></td></tr><tr><td><a href="text-models-llm/deepseek/deepseek-reasoner-v3.2-exp-non-thinking">deepseek/deepseek-non-thinking-v3.2-exp</a></td><td>DeepSeek</td><td>128,000</td><td><a href="https://aimlapi.com/models/deepseek-v3-2-exp-non-thinking">DeepSeek V3.2-Exp Non-Thinking</a></td></tr><tr><td><a href="text-models-llm/deepseek/deepseek-reasoner-v3.1-terminus">deepseek/deepseek-reasoner-v3.1-terminus</a></td><td>DeepSeek</td><td>128,000</td><td><a href="https://aimlapi.com/models/deepseek-v3-1-terminus-reasoning">DeepSeek V3.1 Terminus Reasoning</a></td></tr><tr><td><a href="text-models-llm/deepseek/deepseek-non-reasoner-v3.1-terminus">deepseek/deepseek-non-reasoner-v3.1-terminus</a></td><td>DeepSeek</td><td>128,000</td><td><a 
href="https://aimlapi.com/models/deepseek-v3-1-terminus-non-reasoning">DeepSeek V3.1 Terminus Non-Reasoning</a></td></tr><tr><td><a href="text-models-llm/deepseek/deepseek-v3.2-speciale">deepseek/deepseek-v3.2-speciale</a></td><td>DeepSeek</td><td>128,000</td><td><a href="https://aimlapi.com/models/deepseek-v3-2-speciale">DeepSeek V3.2 Speciale</a></td></tr><tr><td><a href="text-models-llm/google/gemini-2.0-flash">gemini-2.0-flash</a></td><td>Google</td><td>1,000,000</td><td><a href="https://aimlapi.com/models/gemini-2-0-flash-api">Gemini 2.0 Flash</a></td></tr><tr><td><a href="text-models-llm/google/gemini-2.5-flash-lite-preview">google/gemini-2.5-flash-lite-preview</a></td><td>Google</td><td>1,000,000</td><td>–</td></tr><tr><td><a href="text-models-llm/google/gemini-2.5-flash">google/gemini-2.5-flash</a></td><td>Google</td><td>1,000,000</td><td><a href="https://aimlapi.com/models/gemini-2-5-flash-api">Gemini 2.5 Flash</a></td></tr><tr><td><a href="text-models-llm/google/gemini-3-flash-preview">google/gemini-3-flash-preview</a></td><td>Google</td><td>1,000,000</td><td><a href="https://aimlapi.com/models/gemini-3-flash">Gemini 3 Flash</a></td></tr><tr><td><a href="text-models-llm/google/gemini-2.5-pro">google/gemini-2.5-pro</a></td><td>Google</td><td>1,000,000</td><td><a href="https://aimlapi.com/models/gemini-pro-2-5-api">Gemini 2.5 Pro</a></td></tr><tr><td><a href="text-models-llm/google/gemini-3-1-pro-preview">google/gemini-3-1-pro-preview</a></td><td>Google</td><td>1,000,000</td><td><a href="https://aimlapi.com/models/gemini-3-1-pro">Gemini 3.1 Pro</a></td></tr><tr><td><a href="text-models-llm/google/gemma-3">google/gemma-3-4b-it</a></td><td>Google</td><td>128,000</td><td><a href="https://aimlapi.com/models/gemma-3-4b-api">Gemma 3 (4B)</a></td></tr><tr><td><a href="text-models-llm/google/gemma-3">google/gemma-3-12b-it</a></td><td>Google</td><td>128,000</td><td><a href="https://aimlapi.com/models/gemma-3-12b-api">Gemma 3 (12B)</a></td></tr><tr><td><a 
href="text-models-llm/google/gemma-3">google/gemma-3-27b-it</a></td><td>Google</td><td>128,000</td><td><a href="https://aimlapi.com/models/gemma-3-27b-api">Gemma 3 (27B)</a></td></tr><tr><td><a href="text-models-llm/google/gemma-3n-4b">google/gemma-3n-e4b-it</a></td><td>Google</td><td>8,192</td><td><a href="https://aimlapi.com/models/gemma-3n-4b">Gemma 3n 4B</a></td></tr><tr><td><a href="text-models-llm/google/gemma-4-31b-it">google/gemma-4-31b-it</a></td><td>Google</td><td>262,000</td><td><a href="https://aimlapi.com/models/gemma-4-31b">Gemma 4 31B</a></td></tr><tr><td><a href="text-models-llm/gryphe/mythomax-l2-13b">gryphe/mythomax-l2-13b</a></td><td>Gryphe</td><td>4,000</td><td><a href="https://aimlapi.com/models/mythomax-l2-13b">MythoMax-L2 (13B)</a></td></tr><tr><td><a href="text-models-llm/meta/llama-3.3-70b-instruct-turbo">meta-llama/Llama-3.3-70B-Instruct-Turbo</a></td><td>Meta</td><td>128,000</td><td><a href="https://aimlapi.com/models/meta-llama-3-3-70b-instruct-turbo-api">Meta Llama 3.3 70B Instruct Turbo</a></td></tr><tr><td><a href="text-models-llm/meta/meta-llama-3-8b-instruct-lite">meta-llama/Meta-Llama-3-8B-Instruct-Lite</a></td><td>Meta</td><td>9,000</td><td><a href="https://aimlapi.com/models/llama-3-8b-instruct-lite-api">Llama 3 8B Instruct Lite</a></td></tr><tr><td><a href="text-models-llm/meta/llama-3.3-70b-versatile">meta-llama/llama-3.3-70b-versatile</a></td><td>Meta</td><td>131,000</td><td><a href="text-models-llm/meta/llama-3.3-70b-versatile">Llama 3.3 70B Versatile</a></td></tr><tr><td><a href="text-models-llm/minimax/text-01">MiniMax-Text-01</a></td><td>MiniMax</td><td>1,000,000</td><td><a href="https://aimlapi.com/models/minimax-text-01-api">MiniMax-Text-01</a></td></tr><tr><td><a href="text-models-llm/minimax/m1">minimax/m1</a></td><td>MiniMax</td><td>1,000,000</td><td><a href="https://aimlapi.com/models/minimax-m1">MiniMax M1</a></td></tr><tr><td><a 
href="text-models-llm/minimax/m2">minimax/m2</a></td><td>MiniMax</td><td>200,000</td><td><a href="https://aimlapi.com/models/minimax-m2">MiniMax M2</a></td></tr><tr><td><a href="text-models-llm/minimax/m2-1">minimax/m2-1</a></td><td>MiniMax</td><td>204,800</td><td><a href="https://aimlapi.com/models/minimax-m2-1">MiniMax M2.1</a></td></tr><tr><td><a href="text-models-llm/minimax/m2-5">minimax/m2-5-20260218</a></td><td>MiniMax</td><td>204,800</td><td><a href="https://aimlapi.com/models/minimax-m2-5">MiniMax M2.5</a></td></tr><tr><td><a href="text-models-llm/minimax/m2-5-highspeed">minimax/m2-5-highspeed-20260218</a></td><td>MiniMax</td><td>204,800</td><td><a href="https://aimlapi.com/models/minimax-m2-5">MiniMax M2.5</a></td></tr><tr><td><a href="text-models-llm/minimax/m2-7">minimax/m2-7-20260402</a></td><td>MiniMax</td><td>204,800</td><td><a href="https://aimlapi.com/models/minimax-m2-7">MiniMax M2.7</a></td></tr><tr><td><a href="text-models-llm/mistral-ai/mixtral-8x7b-instruct-v0.1">mistralai/Mixtral-8x7B-Instruct-v0.1</a></td><td>Mistral AI</td><td>64,000</td><td><a href="https://aimlapi.com/models/mixtral-8x7b-instruct-v01">Mixtral-8x7B Instruct v0.1</a></td></tr><tr><td><a href="text-models-llm/mistral-ai/mistral-nemo">mistralai/mistral-nemo</a></td><td>Mistral AI</td><td>128,000</td><td><a href="https://aimlapi.com/models/mistral-nemo-api">Mistral Nemo</a></td></tr><tr><td><a href="text-models-llm/moonshot/kimi-k2-preview">moonshot/kimi-k2-preview</a></td><td>Moonshot</td><td>131,000</td><td><a href="https://aimlapi.com/models/kimi-k2">Kimi-K2</a></td></tr><tr><td><a href="text-models-llm/moonshot/kimi-k2-preview">moonshot/kimi-k2-0905-preview</a></td><td>Moonshot</td><td>256,000</td><td><a href="https://aimlapi.com/models/kimi-k2">Kimi-K2</a></td></tr><tr><td><a href="text-models-llm/moonshot/kimi-k2-turbo-preview">moonshot/kimi-k2-turbo-preview</a></td><td>Moonshot</td><td>256,000</td><td><a href="https://aimlapi.com/models/kimi-k2-turbo-preview">Kimi K2 
Turbo Preview</a></td></tr><tr><td><a href="text-models-llm/moonshot/kimi-k2-5">moonshot/kimi-k2-5</a></td><td>Moonshot</td><td>262,000</td><td><a href="https://aimlapi.com/models/kimi-k2-5">Kimi K2.5</a></td></tr><tr><td><a href="text-models-llm/nousresearch/hermes-4-405b">nousresearch/hermes-4-405b</a></td><td>NousResearch</td><td>131,000</td><td><em>-</em></td></tr><tr><td><a href="text-models-llm/nvidia/llama-3.1-nemotron-70b">nvidia/llama-3.1-nemotron-70b-instruct</a></td><td>NVIDIA</td><td>128,000</td><td><a href="https://aimlapi.com/models/llama-3-1-nemotron-70b-instruct-api">Llama 3.1 Nemotron 70B Instruct</a></td></tr><tr><td><a href="text-models-llm/nvidia/nemotron-nano-9b-v2">nvidia/nemotron-nano-9b-v2</a></td><td>NVIDIA</td><td>128,000</td><td><a href="https://aimlapi.com/models/nemotron-nano-9b-v2">Nemotron Nano 9B V2</a></td></tr><tr><td><a href="text-models-llm/nvidia/nemotron-nano-12b-v2-vl">nvidia/nemotron-nano-12b-v2-vl</a></td><td>NVIDIA</td><td>128,000</td><td><a href="https://aimlapi.com/models/nemotron-nano-12b-v2-vl">Nemotron Nano 12B V2 VL</a></td></tr><tr><td><a href="text-models-llm/perplexity/sonar">perplexity/sonar</a></td><td>Perplexity</td><td>128,000</td><td><a href="https://aimlapi.com/models/perplexity-sonar">Sonar</a></td></tr><tr><td><a href="text-models-llm/perplexity/sonar-pro">perplexity/sonar-pro</a></td><td>Perplexity</td><td>200,000</td><td><a href="https://aimlapi.com/models/perplexity-sonar-pro">Sonar Pro</a></td></tr><tr><td><a href="text-models-llm/xai/grok-3-beta">x-ai/grok-3-beta</a></td><td>xAI</td><td>131,000</td><td><a href="https://aimlapi.com/models/grok-3-beta-api">Grok 3 Beta</a></td></tr><tr><td><a href="text-models-llm/xai/grok-3-mini-beta">x-ai/grok-3-mini-beta</a></td><td>xAI</td><td>131,000</td><td><a href="https://aimlapi.com/models/grok-3-beta-mini-api">Grok 3 Beta Mini</a></td></tr><tr><td><a href="text-models-llm/xai/grok-4">x-ai/grok-4-07-09</a></td><td>xAI</td><td>256,000</td><td><a 
href="https://aimlapi.com/models/grok-4">Grok 4</a></td></tr><tr><td><a href="text-models-llm/xai/grok-code-fast-1">x-ai/grok-code-fast-1</a></td><td>xAI</td><td>256,000</td><td><a href="https://aimlapi.com/models/grok-code-fast-1">Grok Code Fast 1</a></td></tr><tr><td><a href="text-models-llm/xai/grok-4-fast-non-reasoning">x-ai/grok-4-fast-non-reasoning</a></td><td>xAI</td><td>2,000,000</td><td><a href="https://aimlapi.com/models/grok-4-fast">Grok 4 Fast</a></td></tr><tr><td><a href="text-models-llm/xai/grok-4-fast-reasoning">x-ai/grok-4-fast-reasoning</a></td><td>xAI</td><td>2,000,000</td><td><a href="https://aimlapi.com/models/grok-4-fast-reasoning">Grok 4 Fast Reasoning</a></td></tr><tr><td><a href="text-models-llm/xai/grok-4-1-fast-non-reasoning">x-ai/grok-4-1-fast-non-reasoning</a></td><td>xAI</td><td>2,000,000</td><td><a href="https://aimlapi.com/models/grok-4-1-fast-non-reasoning">Grok 4.1 Fast Non-Reasoning</a></td></tr><tr><td><a href="text-models-llm/xai/grok-4-1-fast-reasoning">x-ai/grok-4-1-fast-reasoning</a></td><td>xAI</td><td>2,000,000</td><td><a href="https://aimlapi.com/models/grok-4-1-fast-reasoning">Grok 4.1 Fast Reasoning</a></td></tr><tr><td><a href="text-models-llm/zhipu/glm-4.5-air">zhipu/glm-4.5-air</a></td><td>Zhipu</td><td>128,000</td><td><a href="https://aimlapi.com/models/glm-4-5-air">GLM-4.5 Air</a></td></tr><tr><td><a href="text-models-llm/zhipu/glm-4.5">zhipu/glm-4.5</a></td><td>Zhipu</td><td>128,000</td><td><a href="https://aimlapi.com/models/glm-4-5">GLM-4.5</a></td></tr><tr><td><a href="text-models-llm/zhipu/glm-4.6">zhipu/glm-4.6</a></td><td>Zhipu</td><td>200,000</td><td><a href="text-models-llm/zhipu/glm-4.6">GLM-4.6</a></td></tr><tr><td><a href="text-models-llm/zhipu/glm-4.7">zhipu/glm-4.7</a></td><td>Zhipu</td><td>200,000</td><td><a href="https://aimlapi.com/models/glm-4-7">GLM-4.7</a></td></tr><tr><td><a href="text-models-llm/zhipu/glm-5">zhipu/glm-5</a></td><td>Zhipu</td><td>200,000</td><td><a 
href="https://aimlapi.com/models/glm-5">GLM-5</a></td></tr><tr><td><a href="text-models-llm/zhipu/glm-5.1">zhipu/glm-5-1</a></td><td>Zhipu</td><td>200,000</td><td><em>Coming Soon</em></td></tr></tbody></table>

</details>
