--- id: P-REINFORCE-AUTO-SEOP-001 category: "10_Wiki/๐Ÿ’ก Topics/AI" confidence_score: 0.95 tags: [auto-reinforced, search-optimization, algorithms, pathfinding, heuristic-search, efficiency] last_reinforced: 2026-04-20 --- # [[Search-Optimization]] ## ๐Ÿ“Œ ํ•œ ์ค„ ํ†ต์ฐฐ (The Karpathy Summary) > "์ตœ๋‹จ ๊ฒฝ๋กœ๋ฅผ ํ–ฅํ•œ ๋์—†๋Š” ํƒ์ƒ‰: ์ˆ˜์—†์ด ๋งŽ์€ ์„ ํƒ์ง€์˜ ์ˆฒ(State Space)์—์„œ, ๋ชฉํ‘œ ์ง€์ ๊นŒ์ง€์˜ ๋น„์šฉ์„ ์ตœ์†Œํ™”ํ•˜๊ธฐ ์œ„ํ•ด ํœด๋ฆฌ์Šคํ‹ฑ(Heuristic)์ด๋ผ๋Š” ๋‚˜์นจ๋ฐ˜์„ ๋“ค๊ณ  ๊ฐ€์žฅ ์œ ๋งํ•œ ๋ฐฉํ–ฅ์œผ๋กœ ๋ฐœ์„ ๋“ค์ด๋Š” ์˜๋ฆฌํ•œ ๊ธธ ์ฐพ๊ธฐ." ## ๐Ÿ“– ๊ตฌ์กฐํ™”๋œ ์ง€์‹ (Synthesized Content) ํƒ์ƒ‰ ์ตœ์ ํ™”(Search-Optimization)๋Š” ๋ฌธ์ œ์˜ ํ•ด๋‹ต์„ ์ฐพ๊ธฐ ์œ„ํ•ด ๊ฐ€๋Šฅํ•œ ๋ชจ๋“  ์ƒํƒœ๋ฅผ ํšจ์œจ์ ์œผ๋กœ ์กฐ์‚ฌํ•˜๋Š” ๊ธฐ๋ฒ•์ž…๋‹ˆ๋‹ค. (Grail-Search์  ๊ด€์  ํฌํ•จ) 1. **์ฃผ์š” ์•Œ๊ณ ๋ฆฌ์ฆ˜**: * **Uninformed Search**: ์ •๋ณด ์—†์ด ๋‹ค ๋’ค์ง€๋Š” ๋ฐฉ์‹ (BFS, DFS). (Brute-force์™€ ์—ฐ๊ฒฐ) * **Informed Search (Heuristic)**: ๋ชฉํ‘œ๊นŒ์ง€ ๋‚จ์€ ๊ฑฐ๋ฆฌ๋ฅผ '์ถ”์ •'ํ•ด์„œ ํƒ์ƒ‰ (A* Algorithm). * **Local Search**: ํ˜„์žฌ๋ณด๋‹ค ๋‚˜์€ ์ฃผ๋ณ€์œผ๋กœ๋งŒ ์ด๋™ (Hill Climbing, Simulated Annealing). 2. **์™œ ์ค‘์š”ํ•œ๊ฐ€?**: * ๊ฒŒ์ž„ AI์˜ ๊ฒฝ๋กœ ์ฐพ๊ธฐ, ๋ฌผ๋ฅ˜ ๋ฐฐ์†ก ์ตœ์ ํ™”, ํผ์ฆ ํ’€์ด, ๊ทธ๋ฆฌ๊ณ  ์‹ ๊ฒฝ๋ง์˜ ๊ฐ€์ค‘์น˜๋ฅผ ์ฐพ๋Š” ๊ณผ์ •(Gradient-Descent) ์ž์ฒด๊ฐ€ ๊ฑฐ๋Œ€ํ•œ ํƒ์ƒ‰ ์ตœ์ ํ™” ๋ฌธ์ œ์ž„. ## โš ๏ธ ๋ชจ์ˆœ ๋ฐ ์—…๋ฐ์ดํŠธ (Contradictions & RL Update) - **๊ณผ๊ฑฐ ๋ฐ์ดํ„ฐ์™€์˜ ์ถฉ๋Œ**: ๊ณผ๊ฑฐ์—๋Š” '์™„์ „ ํƒ์ƒ‰ ์ •์ฑ…'์œผ๋กœ ์ •๋‹ต์„ ๋ณด์žฅํ•˜๋ ค ํ–ˆ์œผ๋‚˜, ํ˜„๋Œ€ ์ •์ฑ…์€ ์ •๋‹ต๋ณด๋‹ค '์ถฉ๋ถ„ํžˆ ์ข‹์€ ํ•ด ์ •์ฑ…(Satisficing)'์„ ์ œํ•œ๋œ ์‹œ๊ฐ„ ๋‚ด์— ์ฐพ๋Š” ํšจ์œจ์„ฑ ์ •์ฑ…์„ ์šฐ์„ ์‹œํ•จ(RL Update). (Bounded-Rationality์™€ ์—ฐ๊ฒฐ) - **์ •์ฑ… ๋ณ€ํ™”(RL Update)**: ๊ฑฐ๋Œ€ ๋ชจ๋ธ์˜ ์ถ”๋ก  ์ •์ฑ…์—์„œ, ์ˆ˜๋งŽ์€ ๋‹ต๋ณ€ ํ›„๋ณด ์ค‘ ๊ฐ€์žฅ ๋…ผ๋ฆฌ์ ์ธ ๊ฒฝ๋กœ๋ฅผ ํƒ์ƒ‰ํ•˜๋Š” 'MCTS(Monte Carlo Tree Search)' ๊ธฐ๋ฐ˜์˜ ์‚ฌ๊ณ  ํ๋ฆ„ ํƒ์ƒ‰ ์ •์ฑ…์ด ์ƒˆ๋กœ์šด ์„ฑ๋Šฅ ํ–ฅ์ƒ์˜ ๋ŒํŒŒ๊ตฌ๊ฐ€ ๋จ. ## ๐Ÿ”— ์ง€์‹ ์—ฐ๊ฒฐ (Graph) - [[Brute-force]], [[Optimization]], [[Heuristics]], [[Combinatorial-Optimization]], [[Gradient-Descent]] - **Modern Tech/Tools**: A* Search, MCTS, Beam Search (in NLP), AlphaGo's search engine. ---