SD1.5 & SDXL

implementation Analysis of Stable Diffusion Model 1.5 and SDXL

1 Stable Diffusion v1-5

从hugging face的model card里面看到Stable Diffusion主要有:

  1. unet
  2. text_encoder
  3. vae
  4. scheduler
  5. tokenize
    几个部分

Very detailed description can be found here->SD1.5
picture/Pasted image 20240405121018.png

Model Architecture

Latent diffusion model combine a autoencoder

2 SDXL

两个tokenizer 两个那啥