--- id: [[P-Reinforce|P-Reinforce]]-AUTO-BORA-001 category: "10_Wiki/๐Ÿ’ก Topics/AI" confidence_score: 0.98 tags: [auto-reinforced, bounded-rationality, decision-theory, [[Heuristics|Heuristics]], cognitive-limitations, her[[BERT|BERT]]-simon] last_reinforced: 2026-04-20 --- # [[Bounded-Rationality|Bounded-Rationality]] ## ๐Ÿ“Œ ํ•œ ์ค„ ํ†ต์ฐฐ (The Karpathy Summary) > "ํ˜„์‹ค์ ์ธ ๋˜‘๋˜‘ํ•จ: ์ธ๊ฐ„์˜ ์ธ์ง€ ๋Šฅ๋ ฅ, ์‹œ๊ฐ„, ์ •๋ณด๋Š” ๋ชจ๋‘ ์œ ํ•œํ•˜๊ธฐ ๋•Œ๋ฌธ์—, ๋ชจ๋“  ๋Œ€์•ˆ์„ ์™„๋ฒฝํžˆ ๊ณ„์‚ฐํ•ด ์ตœ์ (Optimizing)์„ ์ฐพ๋Š” ๋Œ€์‹  ํ˜„์žฌ ์ƒํ™ฉ์—์„œ '์ ๋‹นํžˆ ๋งŒ์กฑ์Šค๋Ÿฌ์šด(Satisficing)' ํ•ด๊ฒฐ์ฑ…์„ ์„ ํƒํ•˜๋Š” ์‹ค์งˆ์ ์ธ ํ•ฉ๋ฆฌ์„ฑ." ## ๐Ÿ“– ๊ตฌ์กฐํ™”๋œ ์ง€์‹ (Synthesized Content) ์ œํ•œ๋œ ํ•ฉ๋ฆฌ์„ฑ(Bounded-Rationality)์€ ํ—ˆ๋ฒ„ํŠธ ์‚ฌ์ด๋จผ์ด ์ œ์•ˆํ•œ ๊ฐœ๋…์œผ๋กœ, ์ธ๊ฐ„์ด ์˜์‚ฌ๊ฒฐ์ •์„ ๋‚ด๋ฆด ๋•Œ ์ง๋ฉดํ•˜๋Š” ํ˜„์‹ค์ ์ธ ์ œ์•ฝ๋“ค์„ ์ธ์ •ํ•˜๋Š” ์ด๋ก ์ž…๋‹ˆ๋‹ค. 1. **3๋Œ€ ์ œ์•ฝ ์กฐ๊ฑด**: * **Limited Information**: ๋ชจ๋“  ์ •๋ณด๋ฅผ ๋‹ค ์•Œ ์ˆ˜ ์—†์Œ. * **Cognitive Limitations**: ๋‘๋‡Œ์˜ ์ •๋ณด ์ฒ˜๋ฆฌ ์šฉ๋Ÿ‰์— ํ•œ๊ณ„๊ฐ€ ์žˆ์Œ. * **Time Constraints**: ๊ฒฐ์ •์— ๋ฌดํ•œํ•œ ์‹œ๊ฐ„์„ ์“ธ ์ˆ˜ ์—†์Œ. 2. **ํ•ด๊ฒฐ ์ „๋žต - ํœด๋ฆฌ์Šคํ‹ฑ (Heuristics)**: * ๋ณต์žกํ•œ ์—ฐ์‚ฐ ๋Œ€์‹  '๊ฒฝํ—˜์˜ ๋ฒ•์น™'์ด๋‚˜ ์ง๊ด€์„ ์‚ฌ์šฉํ•˜์—ฌ ๋น ๋ฅด๊ณ  ์ถฉ๋ถ„ํžˆ ๊ดœ์ฐฎ์€ ๊ฒฐ๋ก ์— ๋„๋‹ฌํ•จ. (Satisficing) ## โš ๏ธ ๋ชจ์ˆœ ๋ฐ ์—…๋ฐ์ดํŠธ (Contradictions & RL Update) - **๊ณผ๊ฑฐ ๋ฐ์ดํ„ฐ์™€์˜ ์ถฉ๋Œ**: ๊ณผ๊ฑฐ ๊ฒฝ์ œํ•™ ์ •์ฑ…์€ ์ธ๊ฐ„์„ ๋ชจ๋“  ๊ฒƒ์„ ๊ณ„์‚ฐํ•˜๋Š” 'ํ˜ธ๋ชจ ์—์ฝ”๋…ธ๋ฏธ์ฟ ์Šค(ํ•ฉ๋ฆฌ์  ์ธ๊ฐ„)' ์ •์ฑ…์œผ๋กœ ์ •์˜ํ–ˆ์œผ๋‚˜, ํ˜„๋Œ€ ์ •์ฑ…์€ ์ธ๊ฐ„์˜ ์ธ์ง€์  ํ•œ๊ณ„๋ฅผ ์ธ์ •ํ•œ ์ œํ•œ๋œ ํ•ฉ๋ฆฌ์„ฑ ์ •์ฑ…์„ ๋ฐ”ํƒ•์œผ๋กœ ํ•œ ํ–‰๋™ ๊ฒฝ์ œํ•™ ์ •์ฑ…์„ ์ฃผ๋ฅ˜๋กœ ์ˆ˜์šฉํ•จ(RL Update). - **์ •์ฑ… ๋ณ€ํ™”(RL Update)**: AI ์„ค๊ณ„ ์ •์ฑ…์—์„œ, ๋ฌดํ•œ์ • ๋งŽ์€ ์ปดํ“จํŒ… ์ž์›์„ ์จ์„œ ์ •๋‹ต์„ ์ฐพ๋Š” '[[Brute-force|Brute-force]]' ๋ฐฉ์‹๋ณด๋‹ค ์ œํ•œ๋œ ์ž์› ํ•˜์—์„œ ํšจ์œจ์ ์œผ๋กœ ์ถ”๋ก ํ•˜๋Š” '๊ฒฝ๋Ÿ‰ํ™” ๋ฐ ์กฐ๊ฑด๋ถ€ ์ถ”๋ก  ์ •์ฑ…'์ด ์—์ง€ ๋””๋ฐ”์ด์Šค์šฉ ์ง€๋Šฅ์˜ ํ•ต์‹ฌ ์•„ํ‚คํ…์ฒ˜๊ฐ€ ๋จ. ## ๐Ÿ”— ์ง€์‹ ์—ฐ๊ฒฐ (Graph) - Rationality, [[Decision Theory|Decision Theory]], [[Bayesian-Updating|Bayesian-Updating]], [[Heuristics|Heuristics]], [[Optimization|Optimization]] - **Modern Tech/Tools**: Heuristic-based algorithms, Multi-armed bandit (MAB) [[Optimization|Optimization]]. ---