---
id: ML-REG-001
category: "10_Wiki/💡 Topics/AI"
confidence_score: 1.0
tags: [machine-learning, regularization, l1-norm, l2-norm, overfitting, optimization]
last_reinforced: 2026-04-26
---

# L1 and L2 Regularization

## 📌 One-Line Insight (The Karpathy Summary)

> "Penalize the model's greed (its weights) and escape the swamp of overfitting through the aesthetics of simplicity."
>
> L1 and L2 regularization add a penalty on the magnitude of the weights to the loss function, which keeps the model from fitting too closely to particular training data and improves generalization.

## 📖 Structured Knowledge (Synthesized Content)

- **Extracted pattern:** "Weight Decay". The total loss grows as the weights grow, so the model is pushed toward the smallest weight values that still fit the data. This numerical suppression keeps model complexity under control.
- **Main types:**
  - **L1 Regularization (Lasso):** Penalizes the sum of the absolute values of the weights. It drives unimportant weights to 0, which produces a feature selection effect.
  - **L2 Regularization (Ridge):** Penalizes the sum of the squared weights. It makes the weights small and evenly spread overall, which suppresses abrupt changes.
- **Significance:** On high-dimensional data, regularization keeps the model from learning the noise along with the signal, which helps keep prediction performance stable in practice.

## ⚠️ Contradictions and Updates (Contradictions & RL Update)

- **Conflict with earlier data:** Shrinking the weights is not a cure-all. Depending on the characteristics of the data, the usual approach in modern deep learning is to look for the best balance by combining techniques such as Elastic Net (L1 and L2 together) or Dropout.
- **Policy change:** The core inference models of the Antigravity project apply L2 regularization by default to prevent weights from concentrating excessively during training, and use L1 regularization strategically in modules that need to extract sparse knowledge features.

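
The difference between the L1 and L2 penalties described above can be sketched in a few lines of plain Python. This is an illustration under stated assumptions, not a reference implementation: the weight values, the strength `lam`, and all function names are hypothetical, and the penalties are written as `lam * sum(|w|)` and `lam * sum(w^2)` (some texts scale the L2 term by 1/2). The two `*_shrink` helpers are the standard proximal updates for each penalty, brought in here only to show why L1 yields exact zeros while L2 shrinks every weight proportionally.

```python
def l1_penalty(weights, lam):
    """L1 (Lasso) penalty: lam times the sum of absolute weight values."""
    return lam * sum(abs(w) for w in weights)


def l2_penalty(weights, lam):
    """L2 (Ridge) penalty: lam times the sum of squared weights."""
    return lam * sum(w * w for w in weights)


def l1_shrink(weights, step):
    """Soft-thresholding, the proximal update for step * sum(|w|).

    Every weight moves toward 0 by `step`; any weight whose magnitude
    is at most `step` becomes exactly 0.
    """
    shrunk = []
    for w in weights:
        if w > step:
            shrunk.append(w - step)
        elif w < -step:
            shrunk.append(w + step)
        else:
            shrunk.append(0.0)
    return shrunk


def l2_shrink(weights, step):
    """Proximal update for step * sum(w^2): scale each weight by 1 / (1 + 2 * step).

    Weights get smaller but a nonzero weight never becomes exactly 0.
    """
    return [w / (1.0 + 2.0 * step) for w in weights]


# Hypothetical weights: two large, two small.
weights = [3.0, -0.2, 0.05, -1.5]
lam = 0.5  # assumed regularization strength

print("L1 penalty:", l1_penalty(weights, lam))
print("L2 penalty:", l2_penalty(weights, lam))
print("after L1 shrink:", l1_shrink(weights, lam))
print("after L2 shrink:", l2_shrink(weights, lam))
```

With these sample values the L1 update zeroes the two small weights and keeps the two large ones, which is the feature selection effect, while the L2 update halves every weight and leaves all of them nonzero.
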
## 🔗 Knowledge Links (Graph)

- [[Supervised-Learning-Foundations|Supervised-Learning-Foundations]], [[Generalization-in-AI|Generalization-in-AI]], [[Hyperparameter-Optimization|Hyperparameter-Optimization]], [[Loss-Functions-Foundations|Loss-Functions-Foundations]]
- **Raw Source:** 10_Wiki/Topics/AI/L1-and-L2-Regularization.md