---
id: P-Reinforce-PSYCH-004
category: Dev
confidence_score: 0.94
tags: [Psychology, Behavior, conditioning, skinner]
last_reinforced: 2026-04-20
github_commit: "batch-reinforce-03"
---

# Operant Conditioning

## 📌 One-Line Insight (The Karpathy Summary)
> A classical mechanism that explains adaptive behavioral change in organisms through one principle: the consequences of a behavior determine how often it occurs in the future.

## 📖 Structured Knowledge (Synthesized Content)
- **Extracted pattern:** An environmental-control pattern that shapes behavior (Shaping) by combining positive/negative reinforcement (Reinforcement) with punishment (Punishment).
- **Details:**
  - The Skinner box experiments established the foundations of behavior analysis.
  - Intermittent reinforcement schedules affect how behavior is maintained and how it is extinguished.
  - Operant conditioning is the psychological origin of the reinforcement learning (RL) algorithms used in modern intelligent agents.

## ⚠️ Contradictions and Updates (Contradictions & RL Update)
- **Conflict with past data:** The view expanded from behaviorism, which focused only on the external consequences of behavior, to a cognitive-behavioral model that includes internal cognitive processes.
- **Policy change:** Guidance on the ethical application of "reward schedules" in user experience (UX) design (w3) has been strengthened.

## 🔗 Knowledge Links (Graph)
- **Parent:** 10_Wiki/💡 Topics/Psychology
- **Related:** [[ABA|ABA]], Behavioral-Economics, [[Reinforcement-Learning|Reinforcement-Learning]]
- **Raw Source:** 00_Raw/2026-04-20/Operant Conditioning.md
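
## 🧪 Illustrative Sketch (Operant Learning as RL)

The note links intermittent reinforcement to reinforcement learning, so a small simulation may make that link concrete. The sketch below is a minimal illustration under stated assumptions, not a model taken from the source: the two-action task (`press` vs. `idle`), the function name `simulate`, and the parameters `reward_prob`, `epsilon`, and `alpha` are all hypothetical. The agent starts out preferring `idle`, occasionally explores `press`, and is reinforced on only a fraction of presses. A simple incremental value update is then enough for pressing to become the dominant behavior.

```python
import random


def simulate(trials=2000, reward_prob=0.3, epsilon=0.1, alpha=0.1, seed=0):
    """Hypothetical two-action operant task (all names are illustrative).

    'press' is reinforced intermittently with probability reward_prob;
    'idle' is never reinforced.
    """
    rng = random.Random(seed)
    # Learned estimate of each behavior's consequence. 'idle' is listed
    # first, so ties resolve to 'idle' and 'press' must be discovered
    # through exploration.
    values = {"idle": 0.0, "press": 0.0}
    counts = {"idle": 0, "press": 0}  # how often each behavior was emitted

    for _ in range(trials):
        # Epsilon-greedy choice: usually repeat the behavior with the best
        # estimated outcome, sometimes try a random one.
        if rng.random() < epsilon:
            action = rng.choice(["idle", "press"])
        else:
            action = max(values, key=values.get)

        # Intermittent reinforcement: only some presses pay off.
        reward = 1.0 if action == "press" and rng.random() < reward_prob else 0.0

        # Consequence-driven update: move the estimate toward the outcome.
        values[action] += alpha * (reward - values[action])
        counts[action] += 1

    return values, counts


if __name__ == "__main__":
    values, counts = simulate()
    print("estimated values:", values)
    print("behavior counts:", counts)
```

Lowering `reward_prob` thins the schedule, and setting it to `0.0` partway through a run would be one way to explore extinction. Both are left as variations rather than claims about what the source material covers.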