# ํ™•์‚ฐ ๋ชจ๋ธ (Diffusion Models) ## ๐Ÿ“Œ Brief Summary ํ™•์‚ฐ ๋ชจ๋ธ(Diffusion Models)์€ ํ…์ŠคํŠธ ํ”„๋กฌํ”„ํŠธ๋ฅผ ๋ฐ”ํƒ•์œผ๋กœ ๋ฌด์ž‘์œ„ ๋…ธ์ด์ฆˆ์—์„œ ์‹œ์ž‘ํ•ด ์ ์ง„์ ์œผ๋กœ ๋…ธ์ด์ฆˆ๋ฅผ ์ œ๊ฑฐ(Denoising)ํ•ด ๋‚˜๊ฐ€๋ฉฐ ์ตœ์ข… ์ด๋ฏธ์ง€๋ฅผ ์ƒ์„ฑํ•˜๋Š” ์ƒ์„ฑํ˜• AI ์•„ํ‚คํ…์ฒ˜์ž…๋‹ˆ๋‹ค [1, 2]. ํ›ˆ๋ จ ๊ณผ์ •์—์„œ ์›๋ณธ ๋ฐ์ดํ„ฐ์— ๊ฐ€์šฐ์‹œ์•ˆ ๋…ธ์ด์ฆˆ๋ฅผ ์ถ”๊ฐ€ํ•˜๋Š” '์ˆœ๋ฐฉํ–ฅ ํ™•์‚ฐ'๊ณผ ์ด๋ฅผ ๋‹ค์‹œ ๋ณต์›ํ•˜๋Š” '์—ญ๋ฐฉํ–ฅ ํ™•์‚ฐ' ๊ณผ์ •์„ ๊ฑฐ์ณ ๋ฐ์ดํ„ฐ ์ƒ์„ฑ ๋ฐฉ๋ฒ•์„ ํ•™์Šตํ•ฉ๋‹ˆ๋‹ค [2, 3]. Midjourney, DALL-E, Stable Diffusion ๋“ฑ ํ˜„๋Œ€์˜ ์ฃผ์š” AI ์ด๋ฏธ์ง€ ์ƒ์„ฑ ๋„๊ตฌ๋“ค์˜ ํ•ต์‹ฌ ๊ธฐ๋ฐ˜ ๊ธฐ์ˆ ์ž…๋‹ˆ๋‹ค [4, 5]. ## ๐Ÿ“– Core Content * **ํ•ต์‹ฌ ์ž‘๋™ ๋ฉ”์ปค๋‹ˆ์ฆ˜** - **์ˆœ๋ฐฉํ–ฅ ํ™•์‚ฐ (Forward Diffusion)**: ์›๋ณธ ๋ฐ์ดํ„ฐ์— ๊ฐ€์šฐ์‹œ์•ˆ ๋…ธ์ด์ฆˆ(Gaussian Noise)๋ฅผ ์—ฌ๋Ÿฌ ๋‹จ๊ณ„์— ๊ฑธ์ณ ์ ์ง„์ ์œผ๋กœ ์ถ”๊ฐ€ํ•˜์—ฌ ๋ฐ์ดํ„ฐ๊ฐ€ ์ˆœ์ˆ˜ ๋…ธ์ด์ฆˆ ์ƒํƒœ๋กœ ์ €ํ•˜๋˜๋Š” ๊ณผ์ •์„ ๋ชจ๋ธ์ด ํ•™์Šตํ•ฉ๋‹ˆ๋‹ค [1, 2]. - **์—ญ๋ฐฉํ–ฅ ํ™•์‚ฐ (Reverse Diffusion)**: ๋…ธ์ด์ฆˆ๊ฐ€ ์ถ”๊ฐ€๋œ ๊ณผ์ •์„ ์—ญ์œผ๋กœ ๊ฑฐ์Šฌ๋Ÿฌ ์˜ฌ๋ผ๊ฐ€๋ฉฐ, ๋…ธ์ด์ฆˆ๋ฅผ ์ฒด๊ณ„์ ์œผ๋กœ ์ œ๊ฑฐํ•˜์—ฌ ์›๋ž˜์˜ ์ž…๋ ฅ์„ ์žฌ๊ตฌ์„ฑํ•˜๋Š” ๋ฐฉ๋ฒ•์„ ํ•™์Šตํ•ฉ๋‹ˆ๋‹ค [2, 3]. - **์ด๋ฏธ์ง€ ์ƒ์„ฑ (Generation)**: ์‹ค์ œ ์ƒ์„ฑ ์‹œ์—๋Š” ๋ฌด์ž‘์œ„ ๋…ธ์ด์ฆˆ์—์„œ ์ถœ๋ฐœํ•˜์—ฌ ํ•™์Šต๋œ ๋””๋…ธ์ด์ง• ๋‹จ๊ณ„๋ฅผ ๋ฐ˜๋ณต์ ์œผ๋กœ ์ ์šฉ, ํ…์ŠคํŠธ ํ”„๋กฌํ”„ํŠธ์˜ ์ง€์‹œ์— ๋ถ€ํ•ฉํ•˜๋Š” ์ผ๊ด€๋œ ์‹œ๊ฐ์  ๊ฒฐ๊ณผ๋ฌผ๋กœ ๋ณ€ํ™˜ํ•ฉ๋‹ˆ๋‹ค [2, 4]. * **ํ”„๋กฌํ”„ํŠธ์™€์˜ ์ƒํ˜ธ์ž‘์šฉ (์กฐ๊ฑด๋ถ€ ์ƒ์„ฑ)** ํ…์ŠคํŠธ ํ”„๋กฌํ”„ํŠธ๋Š” ๋…ธ์ด์ฆˆ๊ฐ€ ์ตœ์ข… ์ด๋ฏธ์ง€๋กœ ํ˜•ํƒœ๋ฅผ ๊ฐ–์ถฐ๊ฐ€๋Š” ๊ณผ์ • ์ „๋ฐ˜์— ์ง€์นจ(Guidance)์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค [1]. ์ตœ์‹  ๋ชจ๋ธ๋“ค์€ ํ…์ŠคํŠธ ์ธ์ฝ”๋”์™€ ์ž ์žฌ ๊ณต๊ฐ„(Latent Space)์„ ๊ธด๋ฐ€ํ•˜๊ฒŒ ์ •๋ ฌํ•˜์—ฌ ํ”„๋กฌํ”„ํŠธ์˜ ๋ฏธ์„ธํ•œ ๋‰˜์•™์Šค๊นŒ์ง€ ํ”ฝ์…€ ๋‹จ์œ„๋กœ ๊ตฌํ˜„ํ•ฉ๋‹ˆ๋‹ค [4, 6]. ๋ชจ๋ธ์€ ๊ธ์ •์ /๋ถ€์ •์  ์กฐ๊ฑด์„ ํ•จ๊ป˜ ์ธ์ฝ”๋”ฉํ•˜๋ฉฐ, ์ƒ˜ํ”Œ๋Ÿฌ(Sampler)๊ฐ€ ์ƒ์„ฑ ์ค‘์— ์ด ๋‘˜ ์‚ฌ์ด์˜ ๊ท ํ˜•์„ ๋งž์ถ”๊ณ  CFG ์Šค์ผ€์ผ์„ ํ†ตํ•ด ์ง€์นจ์˜ ๊ฐ•๋„๋ฅผ ์กฐ์ ˆํ•ฉ๋‹ˆ๋‹ค [6, 7]. ## โš–๏ธ Trade-offs & Caveats * **์žฅ์ **: GAN(์ƒ์„ฑ์  ์ ๋Œ€ ์‹ ๊ฒฝ๋ง)์— ๋น„ํ•ด ํ•™์Šต์ด ์•ˆ์ •์ ์ด๋ฉฐ, ๊ณ ํ’ˆ์งˆ์˜ ์„ธ๋ฐ€ํ•˜๊ณ  ๋‹ค์–‘ํ•œ ๊ฒฐ๊ณผ๋ฌผ์„ ์ถœ๋ ฅํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. ๋˜ํ•œ ์ ์ง„์  ์ƒ์„ฑ ๊ณผ์ •์„ ๊ฑฐ์น˜๋ฏ€๋กœ ๋‹ค์–‘ํ•œ ๋‹จ๊ณ„์—์„œ ์„ธ๋ฐ€ํ•œ ์ œ์–ด(Fine-Grained Control)๊ฐ€ ๊ฐ€๋Šฅํ•ฉ๋‹ˆ๋‹ค [2]. * **๋‹จ์ **: ๋ฐ˜๋ณต์ ์ธ ๋…ธ์ด์ฆˆ ์ œ๊ฑฐ ๊ณผ์ •์œผ๋กœ ์ธํ•ด ์—ฐ์‚ฐ ์ž์› ์†Œ๋ชจ(Computational Intensity)๊ฐ€ ์‹ฌํ•˜๋ฉฐ, GAN ๋ชจ๋ธ์— ๋น„ํ•ด ์ƒ์„ฑ ์†๋„๊ฐ€ ์ƒ๋Œ€์ ์œผ๋กœ ๋А๋ฆฝ๋‹ˆ๋‹ค [5, 9]. ๋˜ํ•œ ๋กœ์ปฌ ํ™˜๊ฒฝ ์„ค์ • ์‹œ ์ƒ๋‹นํ•œ ์ „๋ฌธ ์ง€์‹์ด ์š”๊ตฌ๋˜๋Š” ๊ตฌ์กฐ์  ๋ณต์žก์„ฑ์ด ์กด์žฌํ•ฉ๋‹ˆ๋‹ค [5, 9]. ## ๐Ÿ”— Knowledge Connections - **Related Topics**: [[แ„‘แ…ณแ„…แ…ฉแ†ทแ„‘แ…ณแ„แ…ณ แ„‹แ…ฆแ†ซแ„Œแ…ตแ„‚แ…ตแ„‹แ…ฅแ„…แ…ตแ†ผ|ํ”„๋กฌํ”„ํŠธ ์—”์ง€๋‹ˆ์–ด๋ง]], ์ž ์žฌ ๊ณต๊ฐ„(Latent Space), CFG Scale, ๋…ธ์ด์ฆˆ ์ œ๊ฑฐ(Denoising, [[แ„‡แ…ฎแ„Œแ…ฅแ†ผ แ„‘แ…ณแ„…แ…ฉแ†ทแ„‘แ…ณแ„แ…ณ (Negative Prompt)|๋ถ€์ • ํ”„๋กฌํ”„ํŠธ(Negative Prompt]] - **Projects/Contexts**: [[Midjourney|Midjourney]], Stable Diffusion, DALL-E --- *Last updated: 2026-04-30*