Stable Diffusion v3 Medium
Model Overview
An advanced text-to-image generation model that utilizes a Multimodal Diffusion Transformer (MMDiT) architecture to produce high-quality images from textual descriptions.
Setup your API Key
If you don’t have an API key for the AI/ML API yet, feel free to use our Quickstart guide.
API Schema
The text prompt describing the content, style, or composition of the image to be generated.
The number of images to generate.
1The same seed and the same prompt given to the same version of the model will output the same image every time.
square_hdThe size of the generated image.
The description of elements to avoid in the generated image.
If set to True, prompt will be upsampled with more details.
The CFG (Classifier Free Guidance) scale is a measure of how close you want the model to stick to your prompt when looking for a related image to show you.
The number of inference steps to perform.
If set to True, the safety checker will be enabled.
trueQuick Example
Let's generate an image using a simple prompt.
We obtained the following 1024x576 image by running this code example:

"A T-Rex relaxing on a beach, lying on a sun lounger and wearing sunglasses."Last updated
Was this helpful?