---
id: DIFFUSION-001
category: "10_Wiki/💡 Topics/AI"
confidence_score: 1.0
tags: [ai, generative-model, diffusion-model, image-generation, deep-learning]
last_reinforced: 2026-04-26
---
# Diffusion Models

## 📌 One-Line Insight (The Karpathy Summary)
> "Find order within chaos (noise) and create something from nothing." A recent class of generative model that learns the reverse (denoising) of a process that gradually adds noise to data, so that it can generate high-quality images or other data starting from simple noise.

## 📖 Structured Knowledge (Synthesized Content)
- **Extracted pattern:** Iterative refinement. Starting from random noise drawn from a normal distribution, the model progressively restores fine-grained structure in line with the data distribution it has learned.
- **How it works:**
  - **Forward Process:** Gaussian noise is added to the data step by step until the data becomes pure noise.
  - **Reverse Process (Denoising):** The model is trained to predict and remove the noise added at each step, so that the original data can be recovered.
  - **Sampling:** The trained model starts from pure noise and strips the noise away one step at a time to generate new data.
- **Significance:** Avoids the training instability that affects GANs while delivering high generation quality and diversity, which made it the foundation for systems such as Midjourney and Stable Diffusion.

## ⚠️ Contradictions and Updates (Contradictions & RL Update)
- **Conflict with past data:** The era in which GANs were regarded as the default answer for generative modeling has passed; more stable, higher-performing diffusion models have become the new standard for image and video generation.
- **Policy change:** When generating visual aids or mockup images for wiki documents, the Antigravity project uses APIs built on current diffusion models to produce high-quality output.

## 🔗 Knowledge Links (Graph)
- Generative-Adversarial-Networks-GAN, [[Variational-Autoencoders-VAE]], [[CLIP]], Computer-Vision-Mastery
- **Raw Source:** 10_Wiki/Topics/AI/Diffusion-Models.md
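
## 🧪 Worked Sketch (Forward Process and Sampling)
The Forward Process, Reverse Process, and Sampling steps in the Synthesized Content section can be sketched in a few lines of NumPy. This is a minimal illustration, not a definitive implementation. It assumes the standard DDPM formulation with a linear variance schedule, which this note does not itself specify. The names `make_schedule`, `forward_noise`, `training_loss`, `sample`, and `dummy_predictor` are hypothetical, and `dummy_predictor` is only a stand-in for a trained neural network.

```python
import numpy as np


def make_schedule(num_steps=1000, beta_start=1e-4, beta_end=0.02):
    """Linear variance schedule (an assumed, commonly used choice)."""
    betas = np.linspace(beta_start, beta_end, num_steps)
    alphas = 1.0 - betas
    alpha_bars = np.cumprod(alphas)  # cumulative product of (1 - beta)
    return betas, alphas, alpha_bars


def forward_noise(x0, t, alpha_bars, rng):
    """Forward process in closed form:
    x_t = sqrt(alpha_bar_t) * x0 + sqrt(1 - alpha_bar_t) * eps."""
    eps = rng.standard_normal(x0.shape)
    xt = np.sqrt(alpha_bars[t]) * x0 + np.sqrt(1.0 - alpha_bars[t]) * eps
    return xt, eps


def training_loss(predict_noise, x0, t, alpha_bars, rng):
    """Reverse-process training target: mean squared error between the
    noise that was actually added and the noise the model predicts."""
    xt, eps = forward_noise(x0, t, alpha_bars, rng)
    return float(np.mean((predict_noise(xt, t) - eps) ** 2))


def sample(predict_noise, shape, betas, alphas, alpha_bars, rng):
    """Sampling: start from pure Gaussian noise and denoise step by step."""
    x = rng.standard_normal(shape)
    for t in reversed(range(len(betas))):
        eps_hat = predict_noise(x, t)
        # Posterior mean: remove the predicted noise contribution.
        mean = (x - betas[t] / np.sqrt(1.0 - alpha_bars[t]) * eps_hat) / np.sqrt(alphas[t])
        # Fresh noise is added at every step except the last one.
        noise = rng.standard_normal(shape) if t > 0 else np.zeros(shape)
        # sigma_t^2 = beta_t is one of the variance choices used with DDPM.
        x = mean + np.sqrt(betas[t]) * noise
    return x


def dummy_predictor(x, t):
    """Hypothetical placeholder for a trained network: predicts zero noise."""
    return np.zeros_like(x)


rng = np.random.default_rng(0)
betas, alphas, alpha_bars = make_schedule(num_steps=50)
x0 = np.ones((4, 4))                               # toy "data"
loss = training_loss(dummy_predictor, x0, 10, alpha_bars, rng)
generated = sample(dummy_predictor, (4, 4), betas, alphas, alpha_bars, rng)
```

With the placeholder predictor the output of `sample` is still noise. In practice `predict_noise` would be a network (typically a U-Net) trained by minimizing `training_loss` over random timesteps `t`, and only then would sampling produce data-like outputs.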