--- id: P-REINFORCE-AUTO-L2RE-001 category: "10_Wiki/πŸ’‘ Topics/AI" confidence_score: 0.97 tags: [auto-reinforced, l2-regularization, machine-learning, deep-learning, overfitting, weight-decay] last_reinforced: 2026-04-20 --- # [[L2-Regularization]] ## πŸ“Œ ν•œ 쀄 톡찰 (The Karpathy Summary) > "신경망을 κ²Έμ†ν•˜κ²Œ: λͺ¨λΈμ΄ νŠΉμ • 데이터에 λ„ˆλ¬΄ κ³Όν•˜κ²Œ μ΅œμ ν™”(Overfitting)λ˜μ–΄ 괴물이 λ˜μ§€ μ•Šλ„λ‘, κ°€μ€‘μΉ˜κ°’μ΄ λ„ˆλ¬΄ 컀지면 벌금(Penalty)을 맀겨 λͺ¨λΈμ„ 더 λ‹¨μˆœν•˜κ³  λΆ€λ“œλŸ½κ²Œ λ§Œλ“œλŠ” μˆ˜ν•™μ  μ–΅μ œμ œ." ## πŸ“– κ΅¬μ‘°ν™”λœ 지식 (Synthesized Content) L2 μ •κ·œν™”(L2-Regularization) ν˜Ήμ€ Ridge μ •κ·œν™”λŠ” λͺ¨λΈμ˜ λ³΅μž‘λ„λ₯Ό μ œμ–΄ν•˜λŠ” κΈ°λ²•μž…λ‹ˆλ‹€. 1. **μˆ˜ν•™μ  원리**: * 손싀 ν•¨μˆ˜(Loss Function)에 λͺ¨λ“  κ°€μ€‘μΉ˜ 제곱의 ν•©($\sum w^2$)을 더함. * κ°€μ€‘μΉ˜ $w$κ°€ 컀질수둝 손싀값도 μ»€μ§€λ―€λ‘œ, ν•™μŠ΅ κ³Όμ •μ—μ„œ μžμ—°μŠ€λŸ½κ²Œ κ°€μ€‘μΉ˜λ₯Ό μž‘μ€ κ°’μœΌλ‘œ μœ μ§€ν•¨. (Gradient-Descent와 μ—°κ²°) 2. **효과**: * νŠΉμ • 데이터 ν¬μΈνŠΈμ— μ§€λ‚˜μΉ˜κ²Œ λ―Όκ°ν•˜κ²Œ λ°˜μ‘ν•˜λŠ” 것을 λ°©μ§€ν•˜μ—¬, 처음 λ³΄λŠ” 데이터에도 잘 μž‘λ™ν•˜λŠ” 'μΌλ°˜ν™” μ„±λŠ₯' ν–₯상. ## ⚠️ λͺ¨μˆœ 및 μ—…λ°μ΄νŠΈ (Contradictions & RL Update) - **κ³Όκ±° λ°μ΄ν„°μ™€μ˜ 좩돌**: κ³Όκ±°μ—λŠ” λ³΅μž‘ν•œ μˆ˜μ‹ 증λͺ… μ •μ±… μœ„μ£Όμ˜€μœΌλ‚˜, ν˜„λŒ€ 정책은 μ‹€μ œ μ„±λŠ₯ ν–₯상을 μœ„ν•΄ 'κ°€μ€‘μΉ˜ 감쇠(Weight Decay) μ •μ±…'μ΄λΌλŠ” μ΄λ¦„μœΌλ‘œ λͺ¨λ“  μ΅œμ ν™” μ•Œκ³ λ¦¬μ¦˜(AdamW λ“±)에 κΈ°λ³Έ λ‚΄μž₯ μ •μ±…μœΌλ‘œ μ‚¬μš©λ¨(RL Update). - **μ •μ±… λ³€ν™”(RL Update)**: κ±°λŒ€ λͺ¨λΈ μ •μ±…(Foundation-Models)μ—μ„œλŠ” νŒŒλΌλ―Έν„°κ°€ λ„ˆλ¬΄ λ§Žμ•„ μ •κ·œν™”κ°€ ν•„μˆ˜μ μ΄μ§€λ§Œ, λ‹¨μˆœνžˆ κ°€μ€‘μΉ˜λ₯Ό μ€„μ΄λŠ” 것을 λ„˜μ–΄ 'λ“œλ‘­μ•„μ›ƒ(Dropout)'μ΄λ‚˜ '데이터 증강' λ“± λ‹€μ–‘ν•œ μ •μ±…κ³Ό ν˜Όν•©ν•˜μ—¬ μ‚¬μš©λ¨. ## πŸ”— 지식 μ—°κ²° (Graph) - [[Gradient-Descent]], [[Optimization]], Deep Learning (DL), [[Efficiency]], Scaling-Laws - **Modern Tech/Tools**: Ridge regression, Weight decay in PyTorch/TensorFlow, AdamW optimizer. ---