---
id: wiki-2026-0508-l2-regularization
title: L2 Regularization
category: 10_Wiki/Topics
status: needs_review
canonical_id: self
aliases: [P-Reinforce-AUTO-L2RE-001]
duplicate_of: none
source_trust_level: A
confidence_score: 0.97
tags: [auto-reinforced, l2-Regularization, machine-learning, Deep-Learning, Overfitting, weight-decay]
raw_sources: []
last_reinforced: 2026-04-20
github_commit: pending
inferred_by: Claude Opus 4.7 (auto-normalize 2026-05-08)
---

# [[L2-Regularization|L2-Regularization]]

## 📌 One-Line Insight (The Karpathy Summary)

> "Keep neural networks humble: a mathematical restraint that fines (penalizes) weights that grow too large, so the model stays simpler and smoother instead of over-optimizing to specific data (overfitting) and turning into a monster."

## 📖 Structured Knowledge (Synthesized Content)

L2 regularization (L2-Regularization), also called Ridge regularization, is a technique for controlling model complexity.

1. **Mathematical principle**:
    * The sum of all squared weights ($\sum w^2$), typically scaled by a coefficient $\lambda$, is added to the loss function.
    * Because the loss grows as a weight $w$ grows, training naturally keeps the weights small. (Connected to [[Gradient-Descent|Gradient-Descent]].)
2. **Effect**:
    * Prevents the model from reacting too sensitively to individual data points, which improves generalization performance: the model also works well on data it has never seen.

## ⚠️ Contradictions & Updates

- **Conflict with earlier material**: Earlier treatments centered on formal mathematical derivations. Current practice, aimed at practical performance gains, exposes the technique under the name "weight decay" as a built-in option of most framework optimizers, AdamW among them (RL Update). In plain SGD, weight decay and an L2 penalty yield the same update; in adaptive optimizers such as Adam they differ, and AdamW applies the decay separately from the gradient step.
- **Practice shift (RL Update)**: In very large models ([[Foundation-Models|Foundation-Models]]) the parameter count is so high that regularization is essential, but practice goes beyond simply shrinking weights and combines it with other techniques such as dropout and data augmentation.
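
The penalty and its effect on a gradient-descent step can be sketched in a few lines. This is a minimal illustration, assuming a hypothetical three-weight model; the function names, the coefficient `lam`, and all numeric values are made up for the example and do not come from the source. It shows that adding `lam * sum(w^2)` to the loss gives the same step as shrinking each weight slightly before the usual gradient step, which is where the name "weight decay" comes from.

```python
def l2_penalty(weights, lam):
    """Return lam * sum(w^2), the term added to the data loss."""
    return lam * sum(w * w for w in weights)


def sgd_step_l2(weights, grads, lr, lam):
    """One gradient-descent step on data_loss + lam * sum(w^2).

    The penalty contributes 2 * lam * w to each gradient component.
    """
    return [w - lr * (g + 2.0 * lam * w) for w, g in zip(weights, grads)]


def sgd_step_weight_decay(weights, grads, lr, lam):
    """The same update rewritten as 'shrink the weight, then step'."""
    shrink = 1.0 - 2.0 * lr * lam
    return [shrink * w - lr * g for w, g in zip(weights, grads)]


# Hypothetical values chosen only for illustration.
weights = [3.0, -2.0, 0.5]   # current weights
grads = [0.1, -0.4, 0.0]     # gradients of the data loss alone
lr, lam = 0.1, 0.01          # learning rate and penalty strength

penalty = l2_penalty(weights, lam)
step_a = sgd_step_l2(weights, grads, lr, lam)
step_b = sgd_step_weight_decay(weights, grads, lr, lam)

print("penalty:", penalty)
print("L2 step:          ", step_a)
print("weight-decay step:", step_b)
```

The third weight has a zero data gradient, so its change comes from the penalty alone: it moves toward zero. The agreement between the two step functions relies on the update being plain gradient descent; the sketch does not model adaptive optimizers.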

## 🔗 Knowledge Links (Graph)

- [[Gradient-Descent|Gradient-Descent]], [[Optimization|Optimization]], Deep Learning (DL), [[Efficiency|Efficiency]], Scaling-Laws
- **Modern Tech/Tools**: Ridge regression, Weight decay in PyTorch/TensorFlow, AdamW optimizer.

---

## 🤖 LLM Usage Hints (How to Use This Knowledge)

**When to use this knowledge:**
- *(TODO)*

**When not to use it:**
- *(TODO)*

## 🧪 Verification Status (Validation)

- **Information status:** needs_review
- **Source trust level:** A
- **Review reason:** *(P-Reinforce Phase 1 automatic normalization. Body text needs verification.)*

## 🧬 Duplicate Check

- **Existing similar documents:** *(TODO: see the indexer cluster report)*
- **Handling:** UPDATE (automatic normalization)
- **Handling reason:** Phase 1 normalization: filled in the old template and missing fields.

## 🕓 Change History (Changelog)

| Date | Change | Handling | Trust level |
|------|--------|----------|-------------|
| 2026-05-08 | P-Reinforce Phase 1 normalization (frontmatter + header standardization) | UPDATE | A |