--- id: P-REINFORCE-AUTO-PRKN-001 category: "10_Wiki/๐Ÿ’ก Topics/AI" confidence_score: 0.95 tags: [auto-reinforced, procedural-knowledge, knowing-how, skill-acquisition, implicit-knowledge, expertise] last_reinforced: 2026-04-20 --- # [[Procedural-Knowledge|Procedural-Knowledge]] ## ๐Ÿ“Œ ํ•œ ์ค„ ํ†ต์ฐฐ (The Karpathy Summary) > "๋ชธ์ด ๊ธฐ์–ตํ•˜๋Š” ์ง€๋Šฅ: '๋ฌด์—‡(What)'์„ ์•„๋Š” ์„ ์–ธ์  ์ง€์‹์„ ๋„˜์–ด, ์‹ค์ œ๋กœ ์ผ์„ ์ฒ˜๋ฆฌํ•˜๋Š” ์ˆœ์„œ์™€ ๋ฐฉ๋ฒ•์ธ '์–ด๋–ป๊ฒŒ(How)'๊ฐ€ ์ฒด๋“๋œ ์ƒํƒœ์ด์ž, ์ˆ˜๋งŽ์€ ๋ฐ˜๋ณต์„ ํ†ตํ•ด ๋ฌด์˜์‹์  ์ž๋™ํ™” ๋‹จ๊ณ„์— ์ด๋ฅธ ์ง„์ •ํ•œ ์‹ค๋ ฅ." ## ๐Ÿ“– ๊ตฌ์กฐํ™”๋œ ์ง€์‹ (Synthesized Content) ์ ˆ์ฐจ์  ์ง€์‹(Procedural-Knowledge)์€ ์–ด๋–ค ๊ณผ์ œ๋ฅผ ์ˆ˜ํ–‰ํ•˜๋Š” ๋ฐฉ๋ฒ•์— ๋Œ€ํ•œ ์ง€์‹์ž…๋‹ˆ๋‹ค. 1. **ํŠน์ง•**: * **Knowing-How**: ์ž์ „๊ฑฐ ํƒ€๊ธฐ, ํƒ€์ดํ•‘, ์ฝ”๋“œ ๋””๋ฒ„๊น…์ฒ˜๋Ÿผ 'ํ–‰๋™'์œผ๋กœ ๋ฐœํ˜„๋จ. * **Implicit (์•”๋ฌต์ )**: ๋ง์ด๋‚˜ ๊ธ€๋กœ ์™„๋ฒฝํžˆ ์„ค๋ช…ํ•˜๊ธฐ ์–ด๋ ต๊ณ , ์ง์ ‘ ํ•ด๋ณด๋ฉฐ ์ตํ˜€์•ผ ํ•จ. * **Automation**: ์ˆ™๋ จ๋˜๋ฉด ์ธ์ง€ ์—๋„ˆ์ง€๋ฅผ ๊ฑฐ์˜ ์“ฐ์ง€ ์•Š๊ณ  ์ฒ˜๋ฆฌ ๊ฐ€๋Šฅ. (Efficiency์™€ ์—ฐ๊ฒฐ) 2. **์™œ ์ค‘์š”ํ•œ๊ฐ€?**: * AI๊ฐ€ ๋ฐฉ๋Œ€ํ•œ ํ…์ŠคํŠธ(์„ ์–ธ์  ์ง€์‹)๋ฅผ ์•Œ๋”๋ผ๋„, ์‹ค์ œ ํ™˜๊ฒฝ์—์„œ ์ƒํ™ฉ์— ๋งž๊ฒŒ ํˆด์„ ์“ฐ๊ณ  ๋ฌธ์ œ๋ฅผ ํ‘ธ๋Š” '์ ˆ์ฐจ์  ๋Šฅ๋ ฅ'์ด ๊ฒฐํ•ฉ๋˜์–ด์•ผ๋งŒ ๋น„๋กœ์†Œ ๊ฐ€์น˜ ์žˆ๋Š” ์—์ด์ „ํŠธ๊ฐ€ ๋˜๊ธฐ ๋•Œ๋ฌธ์ž„. (Mastery์™€ ์—ฐ๊ฒฐ) ## โš ๏ธ ๋ชจ์ˆœ ๋ฐ ์—…๋ฐ์ดํŠธ (Contradictions & RL Update) - **๊ณผ๊ฑฐ ๋ฐ์ดํ„ฐ์™€์˜ ์ถฉ๋Œ**: ๊ณผ๊ฑฐ์—๋Š” ์ธ๊ฐ„ ์ „๋ฌธ๊ฐ€์˜ ์ ˆ์ฐจ๋ฅผ ์ฝ”๋“œ๋กœ ์ง์ ‘ ์งœ์ฃผ๋Š” ์ •์ฑ…(Hard-coding)์ด์—ˆ์œผ๋‚˜, ํ˜„๋Œ€ ์ •์ฑ…์€ AI๊ฐ€ ์‹œ๋ฎฌ๋ ˆ์ด์…˜์ด๋‚˜ ๊ฐ•ํ™”ํ•™์Šต์„ ํ†ตํ•ด ์Šค์Šค๋กœ ์ตœ์ ์˜ ์ ˆ์ฐจ๋ฅผ ํ„ฐ๋“ํ•˜๋Š” '์ž์œจ์  ์ ˆ์ฐจ ํš๋“ ์ •์ฑ…'์œผ๋กœ ๋ณ€ํ™”ํ•จ(RL Update). - **์ •์ฑ… ๋ณ€ํ™”(RL Update)**: ๋ณธ ์‹œ์Šคํ…œ์˜ 'SOP(Standard Operating Procedure)' ๋˜ํ•œ ์ •์ ์ธ ๋ฌธ์„œ ์ •์ฑ…์„ ๋„˜์–ด, ์ƒํ™ฉ์— ๋”ฐ๋ผ AI๊ฐ€ ๋™์ ์œผ๋กœ ์ ˆ์ฐจ๋ฅผ ์ƒ์„ฑํ•˜๊ณ  ์ตœ์ ํ™”ํ•˜๋Š” '์ง€๋Šฅํ˜• ์ ˆ์ฐจ ์ •์ฑ…'์œผ๋กœ ์ง„ํ™” ์ค‘์ž„. (Reinforcement Learning (RL)์™€ ์—ฐ๊ฒฐ) ## ๐Ÿ”— ์ง€์‹ ์—ฐ๊ฒฐ (Graph) - [[Mastery|Mastery]], [[Efficiency|Efficiency]], [[Reinforcement Learning (RL)|Reinforcement Learning (RL)]], [[Mental-Models|Mental-Models]], [[Master-of-Information-Management|Master-of-Information-Management]] - **Modern Tech/Tools**: Workflow automation, Robotics (Action policy), Skill-based learning. ---