Kategorien
Interessante Beiträge

Stable Diffusion

Stable Diffusion is a latent text-to-image diffusion model capable of generating photo-realistic images given any text input. For more information about how Stable Diffusion functions, please have a look at 🤗’s Stable Diffusion with 🧨Diffusers blog.

Model Details

https://huggingface.co/CompVis/stable-diffusion-v1-4

Stable Diffusion v2 Model Card

This model card focuses on the model associated with the Stable Diffusion v2 model, available here.

image

This stable-diffusion-2 model is resumed from stable-diffusion-2-base (512-base-ema.ckpt) and trained for 150k steps using a v-objective on the same dataset. Resumed for another 140k steps on 768x768 images.

https://huggingface.co/stabilityai/stable-diffusion-2

Stable Diffusion x4 upscaler model card

This model card focuses on the model associated with the Stable Diffusion Upscaler, available here. This model is trained for 1.25M steps on a 10M subset of LAION containing images >2048x2048. The model was trained on crops of size 512x512 and is a text-guided latent upscaling diffusion model. In addition to the textual input, it receives a noise_level as an input parameter, which can be used to add noise to the low-resolution input according to a predefined diffusion schedule.

Image

https://huggingface.co/stabilityai/stable-diffusion-x4-upscaler