Stable Diffusion v3 Medium

This documentation applies to the following model:

  • stable-diffusion-v3-medium

Model Overview

Stable Diffusion v3 Medium is a text-to-image generation model that uses a Multimodal Diffusion Transformer (MMDiT) architecture to produce high-quality images from textual descriptions.

Set Up Your API Key

If you don’t have an API key for the AI/ML API yet, see our Quickstart guide.

API Schema

Body
model · string · enum · Required

Possible values: stable-diffusion-v3-medium

prompt · string · max: 4000 · Required

The text prompt describing the content, style, or composition of the image to be generated.

num_images · number · min: 1 · max: 4 · Optional

The number of images to generate.

Default: 1

seed · integer · min: 1 · Optional

The same seed and the same prompt given to the same version of the model will output the same image every time.

image_size · any of · Optional · Default: square_hd
or
string · enum · Optional

The size of the generated image.

Possible values:

negative_prompt · string · Optional

The description of elements to avoid in the generated image.

prompt_expansion · boolean · Optional

If set to true, the prompt is upsampled (expanded with more detail) before generation.

guidance_scale · number · min: 1 · max: 20 · Optional

The CFG (classifier-free guidance) scale controls how closely the model follows your prompt when generating the image.

num_inference_steps · integer · min: 1 · max: 50 · Optional

The number of inference steps to perform.

enable_safety_checker · boolean · Optional

If set to true, the safety checker is enabled.

Default: true
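
The limits listed above can be checked on the client before a request is sent. The following is a minimal sketch, not part of the official API: build_payload and check_range are hypothetical helper names, the model name is taken from the model list at the top of this page, and the 4000 limit on prompt is assumed to count characters.

```python
from typing import Any, Dict, Optional

MODEL = "stable-diffusion-v3-medium"
MAX_PROMPT_LENGTH = 4000  # assumption: the documented "max: 4000" counts characters


def check_range(name: str, value: float, low: float, high: float) -> None:
    """Raise ValueError if value lies outside the inclusive range [low, high]."""
    if not low <= value <= high:
        raise ValueError(f"{name} must be between {low} and {high}, got {value}")


def build_payload(
    prompt: str,
    num_images: int = 1,
    seed: Optional[int] = None,
    guidance_scale: Optional[float] = None,
    num_inference_steps: Optional[int] = None,
) -> Dict[str, Any]:
    """Hypothetical helper: validate arguments and return the JSON body as a dict."""
    if not prompt:
        raise ValueError("prompt is required")
    if len(prompt) > MAX_PROMPT_LENGTH:
        raise ValueError(f"prompt must not exceed {MAX_PROMPT_LENGTH} characters")
    check_range("num_images", num_images, 1, 4)

    payload: Dict[str, Any] = {
        "model": MODEL,
        "prompt": prompt,
        "num_images": num_images,
    }
    # Optional fields are only included when the caller sets them,
    # so the server-side defaults apply otherwise.
    if seed is not None:
        if seed < 1:
            raise ValueError(f"seed must be at least 1, got {seed}")
        payload["seed"] = seed
    if guidance_scale is not None:
        check_range("guidance_scale", guidance_scale, 1, 20)
        payload["guidance_scale"] = guidance_scale
    if num_inference_steps is not None:
        check_range("num_inference_steps", num_inference_steps, 1, 50)
        payload["num_inference_steps"] = num_inference_steps
    return payload
```

Rejecting out-of-range values locally avoids a round trip for requests the schema would not accept anyway; the server remains the authority on what is valid.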

Responses

200 · Success · application/json

POST /v1/images/generations
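
Because a fixed seed makes generation repeatable for a given prompt and model version, one way to study the effect of guidance_scale is to hold the seed constant and vary only the scale. The sketch below only builds the request bodies; guidance_sweep is a hypothetical helper name, and it assumes that all other parameters are left at their defaults.

```python
from typing import Any, Dict, List


def guidance_sweep(prompt: str, seed: int, scales: List[float]) -> List[Dict[str, Any]]:
    """Hypothetical helper: one request body per guidance_scale, all sharing one seed."""
    if seed < 1:
        raise ValueError(f"seed must be at least 1, got {seed}")
    bodies = []
    for scale in scales:
        # Documented range for guidance_scale is 1 to 20.
        if not 1 <= scale <= 20:
            raise ValueError(f"guidance_scale must be between 1 and 20, got {scale}")
        bodies.append(
            {
                "model": "stable-diffusion-v3-medium",
                "prompt": prompt,
                "seed": seed,
                "guidance_scale": scale,
            }
        )
    return bodies
```

Each body can then be sent to the endpoint in turn; since only guidance_scale changes between requests, differences between the resulting images can be attributed to it.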

Quick Example

Let's generate an image using a simple prompt.
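
A request for this example could look roughly like the sketch below. It assumes the base URL https://api.aimlapi.com and Bearer-token authentication, neither of which is stated in this section, and make_request and generate are hypothetical helper names. Only the Python standard library is used.

```python
import json
import urllib.request

# Assumption: the base URL is not given in this section; only the path is.
API_URL = "https://api.aimlapi.com/v1/images/generations"


def make_request(api_key: str, prompt: str) -> urllib.request.Request:
    """Build the POST request without sending it."""
    body = {
        "model": "stable-diffusion-v3-medium",
        "prompt": prompt,
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            # Assumption: the API key is passed as a Bearer token.
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def generate(api_key: str, prompt: str) -> dict:
    """Send the request and return the parsed JSON response as-is."""
    with urllib.request.urlopen(make_request(api_key, prompt), timeout=120) as response:
        return json.load(response)
```

Calling generate(api_key, "A T-Rex relaxing on a beach, lying on a sun lounger and wearing sunglasses.") with a valid key sends the request. The shape of the JSON response is not described in this section, so the sketch returns it unmodified.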

Response

We obtained the following 1024x576 image by running this code example:

"A T-Rex relaxing on a beach, lying on a sun lounger and wearing sunglasses."
