Stable Diffusion 系列

Stability AI 的 open-weight 圖像生成 diffusion 模型家族（SD 1.5、SDXL、SD3、SD 3.5），是 open-source AI 藝術生態的基礎。

Stable Diffusion 是 Stability AI 的 open-weight 圖像生成模型家族（早期版本有 Runway 跟 CompVis 等重要外部合作）。產品線：Stable Diffusion 1.4/1.5（2022，讓 AI 圖像生成普及的突破）、SDXL（2023，品質大幅提升）、SD3（2024，multimodal diffusion transformer 架構）、SD 3.5。它重要的原因是：Stable Diffusion 對圖像生成的意義就像 Llama 對 LLM——整個生態建立在這個 open foundation 上。Civitai 跟 Hugging Face 上幾百個 fine-tune 變體、整個 ControlNet 生態、AnimateDiff 做影片、大多數非商業 AI 藝術工具都追到 SD checkpoint。ComfyUI 跟 Automatic1111 web UI 之所以存在是因為 SD 的 open 權重。 SD 1.5 特別被廣泛客製——動漫模型、寫實模型、角色 LoRA、風格 LoRA。模型夠小（~4 GB）能在消費者 GPU 上跑，幾秒生成圖像。新的 closed-source 前緣圖像生成競爭者——DALL-E 3、Midjourney、Imagen、Flux Pro——通常產生更高的絕對品質。但在客製、fine-tune、可控生成、離線使用上，SD 家族仍是基礎。延伸閱讀：diffusion model、image generation、ControlNet、LoRA、Flux。