---
id: MATH-OPT-SA-001
category: "10_Wiki/💡 Topics/AI"
confidence_score: 1.0
tags: [math, optimization, simulated-annealing, heuristics, global-optimum, algorithm, stochastic-process]
last_reinforced: 2026-04-26
---

# Simulated Annealing

## 📌 One-Line Insight (The Karpathy Summary)
> "Early on, use intense heat (randomness) to leap over the traps of local optima; then, through the wisdom of gradual cooling (the cooling schedule), form the perfect crystal that is the global optimum." A probabilistic optimization technique that mimics annealing in metallurgy and aims to escape local optima and find the global optimum in a complex search space.

## 📖 Structured Knowledge (Synthesized Content)
- **Extracted pattern:** "Stochastic Exploration and Gradual Convergence". Even a solution worse than the current one is accepted with a probability that depends on the temperature ($T$), which widens the search; as time passes the temperature is lowered and the search settles on an answer with increasing precision.
- **Core mechanisms:**
  - **Temperature ($T$):** The variable that controls how random the search is. It starts high and decreases gradually.
  - **Metropolis Criterion:** The probability of accepting a worse solution is $P = \exp(-\Delta E / T)$, where $\Delta E > 0$ is the increase in cost relative to the current solution; improving moves are always accepted. The higher the temperature and the smaller the cost increase, the more readily a worse solution is accepted.
  - **Cooling Schedule:** The function that determines how quickly the temperature is lowered. It largely determines whether the search succeeds: cooling too fast freezes the search in a local optimum, while cooling too slowly wastes computation.
- **Significance:** A powerful metaheuristic that keeps a global view of the search space in combinatorial optimization problems that are hard to solve analytically (e.g., TSP) and in model training with highly complex loss functions.
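The mechanisms above (temperature, Metropolis criterion, cooling schedule) can be sketched in a few lines of Python. This is a minimal illustration and not a reference implementation. The geometric cooling rule $T \leftarrow \alpha T$, the test function `bumpy`, the proposal function `step`, and all parameter defaults are assumptions chosen for the example; none of them come from this note.

```python
import math
import random


def acceptance_probability(delta_e: float, temperature: float) -> float:
    """Metropolis criterion: always accept improvements, else exp(-dE / T)."""
    if delta_e <= 0:
        return 1.0
    return math.exp(-delta_e / temperature)


def simulated_annealing(cost, neighbor, start, t_start=10.0, t_end=1e-3,
                        alpha=0.95, steps_per_temp=20, seed=0):
    """Minimize `cost` starting from `start`; returns (best_state, best_cost)."""
    rng = random.Random(seed)  # seeded so that runs are reproducible
    current, current_cost = start, cost(start)
    best, best_cost = current, current_cost
    temperature = t_start
    while temperature > t_end:
        for _ in range(steps_per_temp):
            candidate = neighbor(current, rng)
            candidate_cost = cost(candidate)
            delta_e = candidate_cost - current_cost
            # Worse candidates are accepted with probability exp(-dE / T).
            if rng.random() < acceptance_probability(delta_e, temperature):
                current, current_cost = candidate, candidate_cost
                if current_cost < best_cost:
                    best, best_cost = current, current_cost
        temperature *= alpha  # assumed geometric cooling: T <- alpha * T
    return best, best_cost


def bumpy(x: float) -> float:
    """Hypothetical 1-D test function with several local minima."""
    return x * x + 10.0 * math.sin(3.0 * x)


def step(x: float, rng: random.Random) -> float:
    """Hypothetical proposal: move to a nearby point using uniform noise."""
    return x + rng.uniform(-1.0, 1.0)


if __name__ == "__main__":
    best_x, best_cost = simulated_annealing(bumpy, step, start=8.0)
    print(f"best x = {best_x:.3f}, cost = {best_cost:.3f}")
```

With $\Delta E = 1$, the acceptance probability is $\exp(-0.1) \approx 0.905$ at $T = 10$ but only $\exp(-10) \approx 4.5 \times 10^{-5}$ at $T = 0.1$, which is why the search explores freely early on and becomes nearly greedy as it cools. The sketch tracks the best state seen so far separately from the current state, so the returned cost is never worse than the starting cost.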

## ⚠️ Contradictions and Updates (Contradictions & RL Update)
- **Conflict with earlier data:** Because it is computationally slow, simulated annealing seemed to be losing ground to gradient descent. It remains valuable where gradient information is unavailable, such as discrete (non-continuous) search spaces and hyperparameter optimization for reinforcement learning.
- **Policy change:** The Antigravity project applies the probabilistic search logic of simulated annealing to avoid local-optimum traps when optimizing agent task scheduling and when choosing initial values for clustering complex knowledge graphs.

## 🔗 Knowledge Links (Graph)
- [[Optimization-Algorithms]], [[Randomized-Algorithms]], [[Reinforcement-Learning]], Algorithm-Complexity-Analysis
- **Raw Source:** 10_Wiki/Topics/AI/Simulated-Annealing.md