Content Moderation/Safety Models

Overview

With our API, you can use content moderation / safety models to instantly classify input content as safe or unsafe.

Model list

ID                                           Provider
meta-llama/Meta-Llama-Guard-3-8B             open-source
Meta-Llama/Llama-Guard-7b                    open-source
meta-llama/Llama-Guard-3-11B-Vision-Turbo    open-source
meta-llama/LlamaGuard-2-8b                   open-source

Key Features

  • Text Analysis: Classify text content as safe or unsafe.

  • Image Analysis: Classify image content as safe or unsafe (vision models).

  • Flexible Input Methods: Supports both image URLs and base64-encoded images (see the sketch after this list).

  • Multiple Image Inputs: Analyze multiple images in a single request.
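
Below is a minimal sketch of moderating an image with the vision guard model. It assumes the OpenAI-style content-array format with image_url parts (the image URL and the base64 data URL are placeholders); adjust the request shape if your API version differs.

// Sketch: check an image for safety with the vision guard model.
// The content-array format with `image_url` parts is assumed here.
const moderateImage = async () => {
  const response = await fetch('https://api.aimlapi.com/chat/completions', {
    method: 'POST',
    headers: {
      Authorization: 'Bearer my_key',
      'Content-Type': 'application/json',
    },
    body: JSON.stringify({
      model: 'meta-llama/Llama-Guard-3-11B-Vision-Turbo',
      messages: [
        {
          role: 'user',
          content: [
            { type: 'text', text: 'Is this image safe?' },
            // Remote image by URL (placeholder)
            { type: 'image_url', image_url: { url: 'https://example.com/image.jpg' } },
            // Or a base64-encoded image passed as a data URL (placeholder)
            { type: 'image_url', image_url: { url: 'data:image/jpeg;base64,<BASE64_DATA>' } },
          ],
        },
      ],
    }),
  }).then((res) => res.json());

  console.log(response.choices[0].message.content); // "safe" or "unsafe" plus a category
};

moderateImage();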

Quick Start

// Send the content to be checked to a guard model from the list above.
const main = async () => {
  const response = await fetch('https://api.aimlapi.com/chat/completions', {
    method: 'POST',
    headers: {
      Authorization: 'Bearer my_key',
      'Content-Type': 'application/json',
    },
    body: JSON.stringify({
      model: '<YOUR_MODEL>', // e.g. 'meta-llama/Meta-Llama-Guard-3-8B'
      messages: [
        {
          role: 'user',
          content: 'How to create a bomb'
        }
      ],
    }),
  }).then((res) => res.json());

  console.log(response.choices[0].message.content); // "safe" or "unsafe" plus a category
};

main();

This request returns either "safe" or "unsafe" depending on the input content.

When content is classified as unsafe, the response also includes a hazard category code. The set of categories is specific to each model.

For example:

unsafe
04
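
If you need the verdict and the category programmatically, a small helper like the one below can split them apart. The parseModeration name is illustrative; it assumes the response text follows the "unsafe", newline, category format shown above.

// Sketch: split a guard model response into a verdict and an optional category code.
const parseModeration = (content) => {
  const [verdict, category] = content.trim().split('\n');
  return {
    safe: verdict === 'safe',
    category: category ?? null, // e.g. '04'; category codes differ per model
  };
};

// parseModeration('unsafe\n04') -> { safe: false, category: '04' }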

Content moderation models are a good fit for scenarios where content safety is crucial:

  • Moderate user-generated content on websites (see the sketch after this list)

  • Filter harmful inputs in chatbots

  • Safeguard sensitive systems from unsafe data

  • Ensure compliance with safety standards in applications
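
As one illustration of the first scenario, the sketch below checks user-generated content before it is stored, using an Express route. Express, the /comments route, the text field, and the rejection message are assumptions for illustration; only the moderation request itself follows the API shown above.

// Sketch: reject unsafe user-generated content before storing it.
import express from 'express';

const app = express();
app.use(express.json());

// Returns true when the guard model does not flag the text as unsafe.
const moderate = async (text) => {
  const response = await fetch('https://api.aimlapi.com/chat/completions', {
    method: 'POST',
    headers: {
      Authorization: 'Bearer my_key',
      'Content-Type': 'application/json',
    },
    body: JSON.stringify({
      model: '<GUARD_MODEL>',
      messages: [{ role: 'user', content: text }],
    }),
  }).then((res) => res.json());
  return !response.choices[0].message.content.includes('unsafe');
};

app.post('/comments', async (req, res) => {
  if (!(await moderate(req.body.text))) {
    return res.status(400).json({ error: 'Comment rejected by content moderation' });
  }
  // ...store the comment here...
  res.status(201).json({ ok: true });
});

app.listen(3000);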

Example

// Ask a guard model whether the prompt is safe before sending it to the main model.
const isPromptSafe = async (prompt) => {
  const response = await fetch(
    "https://api.aimlapi.com/chat/completions",
    {
      method: "POST",
      headers: {
        Authorization: "Bearer my_key",
        "Content-Type": "application/json",
      },
      body: JSON.stringify({
        model: "<GUARD_MODEL>", // e.g. "meta-llama/Meta-Llama-Guard-3-8B"
        messages: [
          {
            role: "user",
            content: prompt,
          },
        ],
      }),
    }
  ).then((res) => res.json());

  // The guard model replies "safe" or "unsafe" plus a category code
  return !response.choices[0].message.content.includes("unsafe");
};

const getAnswer = async (prompt) => {
  const isSafe = await isPromptSafe(prompt);
  if (!isSafe) {
    return 'Your question is not safe';
  }
  const response = await fetch('https://api.aimlapi.com/chat/completions', {
    method: 'POST',
    headers: {
      Authorization: 'Bearer my_key',
      'Content-Type': 'application/json',
    },
    body: JSON.stringify({
      model: '<YOUR_MODEL>', // the chat model that answers safe prompts
      messages: [
        {
          role: 'user',
          content: prompt
        }
      ],
    }),
  }).then((res) => res.json());

  return response.choices[0].message.content;
};

getAnswer('How to make a cake?').then(console.log);
