--- id: [[P-Reinforce|P-Reinforce]]-AUTO-LOSE-001 category: Unified confidence_score: 0.93 tags: [auto-reinforced, local-[[Search|Search]], [[Optimization|Optimization]], hill-climbing, algorithms, [[Search-Strategy|Search-Strategy]]] last_reinforced: 2026-04-20 --- # [[Local-Search|Local-Search]] ## ๐Ÿ“Œ ํ•œ ์ค„ ํ†ต์ฐฐ (The Karpathy Summary) > "ํ˜„์žฌ๋ณด๋‹ค ๋‚˜์€ ๋‚ด์ผ์„ ์œ„ํ•ด: ์ „์ฒด ์ง€๋„๋ฅผ ๋‹ค ๋ณด์ง€ ์•Š์•„๋„, ์ง€๊ธˆ ์„œ ์žˆ๋Š” ๊ณณ์—์„œ ์ฃผ๋ณ€์„ ํ›‘์–ด๋ณด๊ณ  ์กฐ๊ธˆ์ด๋ผ๋„ ๋” ๋†’์€ ๊ณณ(ํ˜น์€ ๋‚ฎ์€ ๊ณณ)์œผ๋กœ ํ•œ ๋ฐœ์ž๊ตญ์”ฉ ์˜ฎ๊ธฐ๋ฉฐ ์ตœ์ ์˜ ๋ชฉํ‘œ๋ฅผ ์ฐพ์•„๊ฐ€๋Š” ํ˜„์‹ค์ ์ด๊ณ  ๋ฏผ์ฒฉํ•œ ํƒ์ƒ‰ ๊ธฐ๋ฒ•." ## ๐Ÿ“– ๊ตฌ์กฐํ™”๋œ ์ง€์‹ (Synthesized Content) ๊ตญ์†Œ ํƒ์ƒ‰(Local-Search)์€ ํ˜„์žฌ์˜ ํ•ด๋ฅผ ์กฐ๊ธˆ์”ฉ ์ˆ˜์ •ํ•˜์—ฌ ๋” ๋‚˜์€ ํ•ด๋ฅผ ์ฐพ์•„๊ฐ€๋Š” ์•Œ๊ณ ๋ฆฌ์ฆ˜์ž…๋‹ˆ๋‹ค. 1. **์ฃผ์š” ์•Œ๊ณ ๋ฆฌ์ฆ˜**: * **Hill Climbing**: ์ง€๊ธˆ๋ณด๋‹ค ๋” ๋†’์€ ๊ณณ์ด ๋ณด์ด๋ฉด ๋ฌด์กฐ๊ฑด ์ด๋™. (ํ•˜์ง€๋งŒ ์ •์ƒ์ธ์ง€ ์–ธ๋•์ธ์ง€ ๋ชจ๋ฆ„) * **Simulated Annealing**: ๊ฐ€๋”์€ ๋‚ฎ์€ ๊ณณ์œผ๋กœ๋„ ๊ฐ€๋ณด๋ฉฐ ๋” ํฐ ์ •์ƒ์„ ์ฐพ์Œ (๊ธˆ์† ๋‹ด๊ธˆ์งˆ ์›๋ฆฌ). ([[Search-Optimization|Search-Optimization]]์™€ ์—ฐ๊ฒฐ) * **Tabu Search**: ํ•œ ๋ฒˆ ๊ฐ€๋ณธ ๊ณณ์€ ๋ชฉ๋ก์— ์ ์–ด๋‘๊ณ  ๋‹ค์‹œ ๊ฐ€์ง€ ์•Š์Œ. 2. **์™œ ์ค‘์š”ํ•œ๊ฐ€?**: * ๋ฌธ์ œ์˜ ๊ทœ๋ชจ๊ฐ€ ๋„ˆ๋ฌด ์ปค์„œ ์ „์ฒด๋ฅผ ๋‹ค ํƒ์ƒ‰(Global Search)ํ•  ์ˆ˜ ์—†์„ ๋•Œ, '์ถฉ๋ถ„ํžˆ ์ข‹์€ ๋‹ต'์„ ๋น ๋ฅด๊ฒŒ ๋‚ด๋†“๋Š” ์ตœ์„ ์˜ ๋ฐฉ๋ฒ•์ž„. ## โš ๏ธ ๋ชจ์ˆœ ๋ฐ ์—…๋ฐ์ดํŠธ (Contradictions & RL Update) - **๊ณผ๊ฑฐ ๋ฐ์ดํ„ฐ์™€์˜ ์ถฉ๋Œ**: ๊ณผ๊ฑฐ์—๋Š” ๋™๋„ค ๋’ท์‚ฐ(Local Optima)์— ๊ฐ‡ํ˜€๋ฒ„๋ฆฐ๋‹ค๋Š” ๊ฒƒ์ด ์น˜๋ช…์  ์•ฝ์  ์ •์ฑ…์ด์—ˆ์œผ๋‚˜, ํ˜„๋Œ€ ์ •์ฑ…์€ ๋ฌด์ž‘์œ„์„ฑ(Randomness) ์ •์ฑ…์„ ์ ์ ˆํžˆ ์„ž์–ด ์ด๋ฅผ ํƒˆ์ถœํ•˜๋Š” ์ •๊ตํ•œ ๊ธฐ๋ฒ•๋“ค์ด ํ‘œ์ค€ ์ •์ฑ…์ด ๋จ(RL Update). - **์ •์ฑ… ๋ณ€ํ™”(RL Update)**: ๋”ฅ๋Ÿฌ๋‹ ๊ฐ€์ค‘์น˜๋ฅผ ์ฐพ๋Š” ๊ฒฝ์‚ฌ ํ•˜๊ฐ•๋ฒ•([[Gradient-Descent|Gradient-Descent]]) ์ž์ฒด๊ฐ€ ์ผ์ข…์˜ ์—ฐ์† ๊ณต๊ฐ„์—์„œ์˜ ๊ตญ์†Œ ํƒ์ƒ‰ ์ •์ฑ…์ด๋ฉฐ, ์ด๋ฅผ ๊ฐ€์†ํ™”ํ•˜๊ณ  ํƒˆ์ถœํ•˜๊ธฐ ์œ„ํ•œ ๋‹ค์–‘ํ•œ ๋ชจ๋ฉ˜ํ…€(Momentum) ์ •์ฑ…์ด ํ˜„๋Œ€ AI์˜ ํ•ต์‹ฌ ์—”์ง„์ด ๋จ. ## ๐Ÿ”— ์ง€์‹ ์—ฐ๊ฒฐ (Graph) - [[Search-Optimization|Search-Optimization]], [[Gradient-Descent|Gradient-Descent]], [[Combinatorial-Optimization|Combinatorial-Optimization]], [[Heuristics|Heuristics]], [[Optimization|Optimization]] - **Modern Tech/Tools**: Genetic algorithms, GRASP, Local solver, Meta-[[Heuristics|Heuristics]]. ---