--- category: Unified tags: [auto-consolidated, technical-documentation] title: [[Variational Autoencoders (VAE)|Variational Autoencoders (VAE)]] last_updated: 2026-05-02 --- # [[Variational Autoencoders (VAE)|Variational Autoencoders (VAE)]] ## ๐Ÿ“Œ Brief Summary > "๋ฐ์ดํ„ฐ๋ฅผ ๊ตฌ๋ฆ„ ์†์— ๊ฐ€๋‘๊ณ  ๋‹ค์‹œ ๋นš๊ธฐ: ํ˜„์‹ค์˜ ๋ฐ์ดํ„ฐ๋ฅผ ์••์ถ•๋œ '์ž ์žฌ ๊ณต๊ฐ„(Latent Space)'์ด๋ผ๋Š” ํ™•๋ฅ  ๋ถ„ํฌ๋กœ ๋ณ€ํ™˜ํ•œ ๋’ค, ๊ทธ ๊ตฌ๋ฆ„์—์„œ ์ƒˆ๋กœ์šด ํ‘œ๋ณธ์„ ์ƒ˜ํ”Œ๋งํ•˜์—ฌ ํ˜„์‹ค์— ์กด์žฌํ•œ ์  ์—†๋Š” ์ƒˆ๋กœ์šด ๋ฐ์ดํ„ฐ๋ฅผ ์ฐฝ์กฐํ•ด๋‚ด๋Š” ์ƒ์„ฑ์˜ ์ •์„." --- > "๋ฐ์ดํ„ฐ๋ฅผ ํ™•๋ฅ  ๋ถ„ํฌ๋กœ ์••์ถ•ํ•˜์—ฌ ๋ฌดํ•œํ•œ ๋ณ€์ด๋ฅผ ์ƒ์„ฑํ•˜๋ผ" โ€” ์ž…๋ ฅ ๋ฐ์ดํ„ฐ๋ฅผ ํŠน์ • ์ˆ˜์น˜๊ฐ€ ์•„๋‹Œ 'ํ‰๊ท '๊ณผ '๋ถ„์‚ฐ'์„ ๊ฐ€์ง„ ํ™•๋ฅ  ๋ถ„ํฌ๋กœ ์ธ์ฝ”๋”ฉํ•จ์œผ๋กœ์จ, ์ž ์žฌ ๊ณต๊ฐ„(Latent Space)์—์„œ ์ƒˆ๋กœ์šด ๋ฐ์ดํ„ฐ๋ฅผ ์ƒ˜ํ”Œ๋งํ•˜์—ฌ ์ƒ์„ฑํ•  ์ˆ˜ ์žˆ๊ฒŒ ํ•˜๋Š” ๋ชจ๋ธ. ## ๐Ÿ“– Core Content ๋ณ€์ดํ˜• ์˜คํ† ์ธ์ฝ”๋”(Variational Autoencoder, VAE)๋Š” ๋ฐ์ดํ„ฐ์˜ ์ž ์žฌ์ ์ธ ๊ตฌ์กฐ๋ฅผ ํ•™์Šตํ•˜์—ฌ ์ƒˆ๋กœ์šด ์œ ์‚ฌ ๋ฐ์ดํ„ฐ๋ฅผ ์ƒ์„ฑํ•ด๋‚ผ ์ˆ˜ ์žˆ๋Š” ๋”ฅ๋Ÿฌ๋‹ ๊ธฐ๋ฐ˜์˜ ์ƒ์„ฑ ๋ชจ๋ธ์ž…๋‹ˆ๋‹ค. 1. **๊ตฌ์กฐ์™€ ๋งค์ปค๋‹ˆ์ฆ˜**: * **Encoder**: ์ž…๋ ฅ ๋ฐ์ดํ„ฐ(์ด๋ฏธ์ง€ ๋“ฑ)๋ฅผ ์ €์ฐจ์›์˜ '์ž ์žฌ ๋ณ€์ˆ˜(Latent Variable)' ๋ถ„ํฌ(ํ‰๊ท ๊ณผ ๋ถ„์‚ฐ)๋กœ ์••์ถ•. * **Latent Space**: ๋ฐ์ดํ„ฐ๋ฅผ ํ•˜๋‚˜์˜ ์ ์ด ์•„๋‹Œ 'ํ™•๋ฅ  ๋ถ„ํฌ'์˜ ์˜์—ญ์œผ๋กœ ํ‘œํ˜„ํ•˜์—ฌ, ๊ทธ ์˜์—ญ ๋‚ด์˜ ์–ด๋–ค ์ ์—์„œ๋„ ๊ทธ๋Ÿด์‹ธํ•œ ๋ฐ์ดํ„ฐ๊ฐ€ ๋‚˜์˜ค๊ฒŒ ํ•จ (์—ฐ์†์„ฑ ํ™•๋ณด). * **Decoder**: ์ž ์žฌ ๊ณต๊ฐ„์—์„œ ์ƒ˜ํ”Œ๋งํ•œ ๋ฒกํ„ฐ๋ฅผ ๋‹ค์‹œ ์›๋ž˜์˜ ๊ณ ์ฐจ์› ๋ฐ์ดํ„ฐ ํ˜•์‹์œผ๋กœ ๋ณต์› ๋ฐ ์ƒ์„ฑ. 2. **ํ•ต์‹ฌ ๊ธฐ๋ฒ• - Re[[Parameter|Parameter]]ization Trick**: * ์ƒ˜ํ”Œ๋ง ๊ณผ์ •์€ ๋ฏธ๋ถ„์ด ๋ถˆ๊ฐ€๋Šฅํ•˜์—ฌ ์˜ค์ฐจ ์—ญ์ „ํŒŒ๊ฐ€ ์•ˆ ๋˜๋Š”๋ฐ, ์ด๋ฅผ ์ˆ˜ํ•™์  ํŠธ๋ฆญ์œผ๋กœ ์šฐํšŒํ•˜์—ฌ ์‹ ๊ฒฝ๋ง ์ „์ฒด๊ฐ€ ํ•™์Šต ๊ฐ€๋Šฅํ•˜๊ฒŒ ๋งŒ๋“ฆ. 3. **์šฉ๋„**: * ๋ฐ์ดํ„ฐ ์ฆ๊ฐ•, ๋…ธ์ด์ฆˆ ์ œ๊ฑฐ(Denosing), ์ด๋ฏธ์ง€ ์ƒ์„ฑ, ๋ถ„์ž ๊ตฌ์กฐ ์„ค๊ณ„ ๋“ฑ. --- - **์ถ”์ถœ๋œ ํŒจํ„ด:** ์›์‹œ ๋ฐ์ดํ„ฐ๋ฅผ ์˜๋ฏธ ์žˆ๋Š” ์ €์ฐจ์› ํ™•๋ฅ  ๋ถ„ํฌ๋กœ ์š”์•ฝ(Encoder)ํ•˜๊ณ , ์ด ๋ถ„ํฌ๋กœ๋ถ€ํ„ฐ ์ƒ˜ํ”Œ๋ง๋œ ๊ฐ’์„ ๋‹ค์‹œ ์›์‹œ ๋ฐ์ดํ„ฐ ํ˜•ํƒœ๋กœ ๋ณต์›(Decoder)ํ•˜๋Š” ์ƒ์„ฑ์  ์ถ”๋ก  ํŒจํ„ด. - **์„ธ๋ถ€ ๋‚ด์šฉ:** - **Latent Space:** ๋ฐ์ดํ„ฐ์˜ ํ•ต์‹ฌ ํŠน์ง•๋“ค์ด ์••์ถ•๋œ ๋‹ค์ฐจ์› ๊ณต๊ฐ„. VAE๋Š” ์ด ๊ณต๊ฐ„์ด ์ •๊ทœ ๋ถ„ํฌ๋ฅผ ๋”ฐ๋ฅด๋„๋ก ๊ฐ•์ œํ•จ. - **Re[[Parameter|Parameter]]ization Trick:** ์ƒ˜ํ”Œ๋ง ๊ณผ์ •์—์„œ ๋ฏธ๋ถ„ ๊ฐ€๋Šฅ์„ฑ์„ ์œ ์ง€ํ•˜์—ฌ ์—ญ์ „ํŒŒ([[Backpropagation|Backpropagation]])๊ฐ€ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ•˜๋Š” ํ•ต์‹ฌ ์ˆ˜ํ•™์  ๊ธฐ๋ฒ•. - **Kullback-Leibler (KL) Divergence:** ํ•™์Šต๋œ ์ž ์žฌ ๋ถ„ํฌ๊ฐ€ ํ‘œ์ค€ ์ •๊ทœ ๋ถ„ํฌ์™€ ๋„ˆ๋ฌด ๋ฉ€์–ด์ง€์ง€ ์•Š๋„๋ก ๊ทœ์ œํ•˜๋Š” ์†์‹ค ํ•จ์ˆ˜ ํ•ญ. - **Applications:** ์ด๋ฏธ์ง€ ์ƒ์„ฑ, ๋ฐ์ดํ„ฐ ์••์ถ•, ์ด์ƒ์น˜ ํƒ์ง€(Anomaly Detection) ๋“ฑ. ## โš–๏ธ Trade-offs & Caveats - **๊ณผ๊ฑฐ ๋ฐ์ดํ„ฐ์™€์˜ ์ถฉ๋Œ**: ์ดˆ๊ธฐ ์ƒ์„ฑ ๋ชจ๋ธ ์ •์ฑ…์€ ๋‹จ์ˆœํ•œ ๋ณต์›(Autoencoder)์— ๊ทธ์น˜๊ฑฐ๋‚˜ GAN์˜ ๋ถˆ์•ˆ์ •ํ•œ ํ•™์Šต์— ๊ณ ์ „ํ–ˆ์œผ๋‚˜, VAE ์ •์ฑ…์€ ์ˆ˜ํ•™์ ์œผ๋กœ ์•ˆ์ •์ ์ธ ํ•™์Šต ๊ธฐ๋ฐ˜์„ ์ œ๊ณตํ•˜๋ฉฐ ์ƒ์„ฑ AI ์ •์ฑ…์˜ ๊ธฐํ‹€์„ ๋‹ฆ์Œ(RL Update). - **์ •์ฑ… ๋ณ€ํ™”(RL Update)**: ํ˜„๋Œ€์˜ ๊ณ ํ’ˆ์งˆ ์ด๋ฏธ์ง€ ์ƒ์„ฑ ์ •์ฑ…(Stable Diffusion ๋“ฑ)์—์„œ, VAE๋Š” ์ด๋ฏธ์ง€๋ฅผ ํšจ์œจ์ ์ธ ์ž ์žฌ ๊ณต๊ฐ„์œผ๋กœ ์˜ฎ๊ฒจ ์—ฐ์‚ฐ ๋ถ€ํ•˜๋ฅผ ์ค„์ด๋Š” 'Latent Diffusion' ์ •์ฑ…์˜ ํ•ต์‹ฌ ๋ถ€ํ’ˆ(Encoder/Decoder)์œผ๋กœ ์žฌ๋ฐฐ์น˜๋˜์–ด ์ œ2์˜ ์ „์„ฑ๊ธฐ๋ฅผ ๋ˆ„๋ฆผ. --- - **๊ณผ๊ฑฐ ๋ฐ์ดํ„ฐ์™€์˜ ์ถฉ๋Œ:** ๋‹จ์ˆœํžˆ ๋ฐ์ดํ„ฐ๋ฅผ ๋ณต์›๋งŒ ํ•˜๋˜ ์ผ๋ฐ˜ ์˜คํ† ์ธ์ฝ”๋”(AE)์™€ ๋‹ฌ๋ฆฌ, ์ž ์žฌ ๊ณต๊ฐ„์˜ ์—ฐ์†์„ฑ์„ ํ™•๋ณดํ•จ์œผ๋กœ์จ '์ƒˆ๋กœ์šด' ๋ฐ์ดํ„ฐ๋ฅผ ์ƒ์„ฑํ•  ์ˆ˜ ์žˆ๋Š” ๋Šฅ๋ ฅ์„ ๊ฐ–์ถค. - **์ •์ฑ… ๋ณ€ํ™”:** Antigravity ํ”„๋กœ์ ํŠธ๋Š” ์œ„ํ‚ค ๋ฌธ์„œ์˜ ์˜๋ฏธ์  ์œ ์‚ฌ์„ฑ ๋ถ„์„ ๋ฐ ๋ฌธ์„œ ๊ฐ„ '๋ˆ„๋ฝ๋œ ์—ฐ๊ฒฐ ๊ณ ๋ฆฌ'๋ฅผ ์ƒ์„ฑ์  ์ถ”๋ก ์œผ๋กœ ์ฐพ๊ธฐ ์œ„ํ•ด VAE ๊ธฐ๋ฐ˜์˜ ์ž ์žฌ ๊ณต๊ฐ„ ๋ถ„์„ ๊ธฐ๋ฒ•์„ ํ™œ์šฉํ•จ. ## ๐Ÿ”— Knowledge Connections - [[Self-Supervised Learning (SSL)|Self-Supervised Learning (SSL)]], Foundational Models, [[Straightening|Straightening]], [[Probability Theory|Probability Theory]], [[Style-Transfer|Style-Transfer]] - **Modern Tech/Tools**: Stable Diffusion VAE, Beta-VAE, PyTorch VAE, Keras Generative. --- --- - Autoencoder, [[Generative-Adversarial-Networks|Generative-Adversarial-Networks]]-GAN, [[Representation-Learning|Representation-Learning]], [[Uncertainty-Quantification|Uncertainty-Quantification]] - **Raw Source:** 10_Wiki/Topics/AI/Variational-Autoencoders-VAE.md