--- id: P-REINFORCE-AI-COMBO-OPT category: "[[10_Wiki/๐Ÿ’ก Topics/AI]]" confidence_score: 0.99 tags: [Optimization, Combinatorial, NP-Hard, Algorithm] last_reinforced: 2026-04-20 --- # [[Combinatorial-Optimization]] (์กฐํ•ฉ ์ตœ์ ํ™”) ## ๐Ÿ“Œ ํ•œ ์ค„ ํ†ต์ฐฐ (The Karpathy Summary) > ๋ฌดํ•œ์— ๊ฐ€๊นŒ์šด ์„ ํƒ์ง€ ์†์—์„œ '๊ฐ€์žฅ ์‹ธ๊ฑฐ๋‚˜', '๊ฐ€์žฅ ๋น ๋ฅด๊ฑฐ๋‚˜', '๊ฐ€์žฅ ํšจ์œจ์ ์ธ' ๋‹จ ํ•˜๋‚˜์˜ ์กฐํ•ฉ์„ ์ฐพ์•„๋‚ด๋Š” ๊ณตํ•™์˜ ๊ทนํ•œ์ด๋‹ค. ## ๐Ÿ“– ๊ตฌ์กฐํ™”๋œ ์ง€์‹ (Synthesized Content) - **NP-Hard Problems**: - **์™ธํŒ์› ๋ฌธ์ œ (TSP)**: ๋ชจ๋“  ๋„์‹œ๋ฅผ ํ•œ ๋ฒˆ์”ฉ ๋ฐฉ๋ฌธํ•˜๊ณ  ๋Œ์•„์˜ค๋Š” ์ตœ๋‹จ ๊ฒฝ๋กœ ์ฐพ๊ธฐ. - **๋ฐฐ๋‚ญ ๋ฌธ์ œ (Knapsack)**: ๋ฌด๊ฒŒ ์ œํ•œ ๋‚ด์— ๊ฐ€์น˜๊ฐ€ ์ตœ๋Œ€๊ฐ€ ๋˜๋„๋ก ์ง ์‹ธ๊ธฐ. - **Heuristics & Meta-heuristics**: - ์ตœ์ ํ•ด๋ฅผ ์ฐพ๋Š” ๊ฒƒ์ด ๋ถˆ๊ฐ€๋Šฅ์— ๊ฐ€๊นŒ์šธ ๋•Œ, '์ ๋‹นํžˆ ์ข‹์€ ํ•ด'๋ฅผ ๋น ๋ฅด๊ฒŒ ์ฐพ๋Š” ๊ธฐ๋ฒ•. (์˜ˆ: ์œ ์ „ ์•Œ๊ณ ๋ฆฌ์ฆ˜, ๋‹ด๊ธˆ์งˆ ๊ธฐ๋ฒ•(Simulated Annealing)). - **Integer Programming**: - ๋ณ€์ˆ˜๊ฐ€ ์ •์ˆ˜์—ฌ์•ผ ํ•˜๋Š” ์ œ์•ฝ ์กฐ๊ฑด ํ•˜์—์„œ ์ตœ์ ์˜ ํ•ด๋ฅผ ๊ตฌํ•˜๋Š” ์ˆ˜ํ•™์  ๊ธฐ๋ฒ•. ๋ฌผ๋ฅ˜ ์ตœ์ ํ™”, ์Šค์ผ€์ค„๋ง ๋“ฑ์— ํ•„์ˆ˜์ ์ด๋‹ค. ## โš ๏ธ ๋ชจ์ˆœ ๋ฐ ์—…๋ฐ์ดํŠธ (RL Update) - ์ตœ๊ทผ์—๋Š” ๊ฐ•ํ™”ํ•™์Šต ์—์ด์ „ํŠธ๊ฐ€ ์กฐํ•ฉ ์ตœ์ ํ™” ๋ฌธ์ œ๋ฅผ ์Šค์Šค๋กœ ํ•™์Šตํ•˜์—ฌ ํ‘ธ๋Š” ์—ฐ๊ตฌ๊ฐ€ ํ™œ๋ฐœํ•˜๋‹ค. ํŠนํžˆ ์นฉ ์„ค๊ณ„(Chip Layout)๋‚˜ ๋ฐ์ดํ„ฐ ์„ผํ„ฐ ์—๋„ˆ์ง€ ์ตœ์ ํ™” ๋“ฑ์—์„œ AI๊ฐ€ ์ธ๊ฐ„ ์„ค๊ณ„์ž๋ฅผ ๋›ฐ์–ด๋„˜๋Š” ์„ฑ๊ณผ๋ฅผ ๋‚ด๊ณ  ์žˆ๋‹ค. ## ๐Ÿ”— ์ง€์‹ ์—ฐ๊ฒฐ (Graph) - Related: [[Distributed-Systems-Engineering]] , [[Reinforcement Learning]] - Foundation: [[Computational Thinking]]