Stable Diffusion 3 Medium
Stable Diffusion 3 Medium is a Multimodal Diffusion Transformer (MMDiT) text-to-image model that features greatly improved performance in image quality, typography, complex prompt understanding, and resource-efficiency.
Demo Stable Diffusion 3 Medium
https://huggingface.co/stabilityai/stable-diffusion-3-medium
https://huggingface.co/stabilityai/stable-diffusion-3-medium-diffusers
Training Dataset
We used synthetic data and filtered publicly available data to train our models. The model was pre-trained on 1 billion images. The fine-tuning data includes 30M high-quality aesthetic images focused on specific visual content and style, as well as 3M preference data images.