--- id: REG-TECH-001 category: "10_Wiki/πŸ’‘ Topics/AI" confidence_score: 1.0 tags: [machine-learning, optimization, overfitting, regularization] last_reinforced: 2026-04-26 --- # Regularization Techniques (κ·œμ œν™” 기법) ## πŸ“Œ ν•œ 쀄 톡찰 (The Karpathy Summary) > "λͺ¨λΈμ΄ ν•™μŠ΅ λ°μ΄ν„°λ§Œ 달달 μ™Έμš°μ§€ λͺ»ν•˜κ²Œ λ°©ν•΄ν•˜λΌ" β€” 과적합(Overfitting)을 λ°©μ§€ν•˜κ³  λͺ¨λΈμ˜ μΌλ°˜ν™” μ„±λŠ₯을 높이기 μœ„ν•΄ λ³΅μž‘μ„±μ— νŽ˜λ„ν‹°λ₯Ό μ£Όκ±°λ‚˜ ν•™μŠ΅ 과정에 μ˜λ„μ μΈ λ…Έμ΄μ¦ˆλ₯Ό μΆ”κ°€ν•˜λŠ” 기법듀. ## πŸ“– κ΅¬μ‘°ν™”λœ 지식 (Synthesized Content) - **μΆ”μΆœλœ νŒ¨ν„΄:** λͺ¨λΈμ΄ νŠΉμ • κ°€μ€‘μΉ˜μ— κ³Όν•˜κ²Œ μ˜μ‘΄ν•˜κ±°λ‚˜ λ°μ΄ν„°μ˜ 지엽적인 νŠΉμ§•μ— λ§€λͺ°λ˜μ§€ μ•Šλ„λ‘ μΈμœ„μ μΈ μ œμ•½ 쑰건을 κ°€ν•˜λŠ” νŒ¨ν„΄. - **μ£Όμš” 기법:** - **L1/L2 Regularization:** κ°€μ€‘μΉ˜μ˜ 크기λ₯Ό 손싀 ν•¨μˆ˜μ— ν¬ν•¨μ‹œμΌœ κ°€μ€‘μΉ˜κ°€ λ„ˆλ¬΄ 컀지지 μ•Šλ„λ‘ μ œν•œ. L1은 ν¬μ†Œ(Sparse) λͺ¨λΈμ„ λ§Œλ“¦. - **Dropout:** ν•™μŠ΅ 쀑 일뢀 λ‰΄λŸ°μ„ λ¬΄μž‘μœ„λ‘œ λΉ„ν™œμ„±ν™”ν•˜μ—¬ νŠΉμ • κ²½λ‘œμ—λ§Œ μ˜μ‘΄ν•˜λŠ” ν˜„μƒ λ°©μ§€. - **Early Stopping:** 검증 μ„±λŠ₯이 더 이상 μ’‹μ•„μ§€μ§€ μ•Šμ„ λ•Œ ν•™μŠ΅μ„ 쑰기에 μ’…λ£Œ. - **Data Augmentation:** ν•™μŠ΅ 데이터λ₯Ό λ³€ν˜•(νšŒμ „, λ…Έμ΄μ¦ˆ μΆ”κ°€ λ“±)ν•˜μ—¬ λͺ¨λΈμ΄ 더 λ‹€μ–‘ν•œ μΌ€μ΄μŠ€μ— λŒ€μ‘ν•˜κ²Œ 함. ## ⚠️ λͺ¨μˆœ 및 μ—…λ°μ΄νŠΈ (Contradictions & RL Update) - **κ³Όκ±° λ°μ΄ν„°μ™€μ˜ 좩돌:** μ΄ˆκΈ°μ—λŠ” μˆ˜ν•™μ  μ œμ•½(L1/L2) μœ„μ£Όμ˜€μœΌλ‚˜, ν˜„λŒ€μ—λŠ” λͺ¨λΈ μ•„ν‚€ν…μ²˜ μžμ²΄μ— λ…Ήμ•„λ“  기법(Dropout, Layer Norm λ“±)κ³Ό 데이터 μ°¨μ›μ˜ κ·œμ œν™”κ°€ 더 널리 μ‚¬μš©λ¨. - **μ •μ±… λ³€ν™”:** Antigravity μ—μ΄μ „νŠΈμ˜ λ‚΄λΆ€ μš”μ•½ λͺ¨λΈ ν•™μŠ΅ μ‹œ, 과적합 λ°©μ§€λ₯Ό μœ„ν•΄ 0.1 λΉ„μœ¨μ˜ Dropoutκ³Ό μ—„κ²©ν•œ Early Stopping 정책을 μ μš©ν•¨. ## πŸ”— 지식 μ—°κ²° (Graph) - [[Overfitting|Overfitting]], [[Optimization|Optimization]], [[Machine-Learning-Lifecycle|Machine-Learning-Lifecycle]], [[Layer-Normalization|Layer-Normalization]] - **Raw Source:** 10_Wiki/Topics/AI/Regularization-Techniques.md