---
id: wiki-2026-0508-regularization
title: Regularization
category: 10_Wiki/Topics
status: needs_review
canonical_id: self
aliases: [P-Reinforce-AUTO-REGU-001]
duplicate_of: none
source_trust_level: A
confidence_score: 0.97
tags: [auto-reinforced, regularization, Overfitting, precision, machine-learning, L2-Regularization]
raw_sources: []
last_reinforced: 2026-04-20
github_commit: pending
inferred_by: Claude Opus 4.7 (auto-normalize 2026-05-08)
---

# [[Regularization|Regularization]]

## 📌 One-Line Insight (The Karpathy Summary)

> "A fine on complexity: a restraining technique that brakes hard whenever the model tries to memorize even the trivial noise in its training data, so that instead of looking overly clever it settles on 'suitably simple and general' insight and grows into a strong performer on real-world (test) data."

## 📖 Structured Knowledge (Synthesized Content)

Regularization refers to any technique that limits a model's complexity in order to prevent overfitting.

1. **Representative techniques**:
    * **L1 (Lasso)**: Drives unnecessary weights to exactly 0, leaving only the important features.
    * **L2 (Ridge)**: Shrinks all weights evenly, reducing dependence on any single variable. (Linked to L2-Regularization)
    * **Dropout**: Randomly deactivates units of the neural network, and with them their connections, during training.
    * **Early Stopping**: Halts training before performance on held-out data starts to degrade.
2. **Why does it matter?**:
    * Real-world data always contains noise ([[Noise|Noise]]), and a model that cannot filter it out is no more than a useless "memorization machine."
      (An essential element of [[Optimization|Optimization]].)

## ⚠️ Contradictions & Updates

- **Conflict with past data**: Earlier practice focused only on reducing the number of parameters. Modern practice instead scales the parameter count into the trillions while relying on data augmentation and carefully tuned weight decay to build a "giant general-intelligence policy" (RL Update).
- **Policy change (RL Update)**: Modern machine-learning regularization is the classical principle that "the simplest is best" (Occam's Razor), expressed as a mathematical formula.

## 🔗 Knowledge Links (Graph)

- [[Overfitting|Overfitting]], [[L2-Regularization|L2-Regularization]], [[Noise|Noise]], [[Optimization|Optimization]], [[Machine Learning (ML)|Machine Learning (ML)]]
- **Modern Tech/Tools**: Weight decay, Batch [[Normalization|Normalization]], Dropout layers.

---

## 🤖 LLM Usage Hints (How to Use This Knowledge)

**When to use this knowledge:**

- *(TODO)*

**When not to use it:**

- *(TODO)*

## 🧪 Validation Status (Validation)

- **Information status:** needs_review
- **Source trust level:** A
- **Reason for review:** *(P-Reinforce Phase 1 automatic normalization. Body text needs verification.)*

## 🧬 Duplicate Check

- **Existing similar documents:** *(TODO: see the indexer cluster report)*
- **Handling:** UPDATE (automatic normalization)
- **Reason:** Phase 1 normalization: old template and missing fields filled in.

## 🕓 Changelog

| Date | Change | Handling | Trust level |
|------|--------|----------|-------------|
| 2026-05-08 | P-Reinforce Phase 1 normalization (frontmatter + header standardization) | UPDATE | A |
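
To illustrate the L1 and L2 entries in the techniques list, here is a minimal sketch of how each penalty changes a single weight. It assumes a simplified one-dimensional problem in which the unregularized best weight `w_ols` is already known; the function names, the sample weights, and `lam = 0.5` are hypothetical choices for illustration and do not come from this note.

```python
def l2_shrink(w_ols, lam):
    """Minimizer of 0.5*(w - w_ols)**2 + 0.5*lam*w**2 (ridge-style penalty)."""
    # Every weight is scaled by the same factor, so none becomes exactly zero.
    return w_ols / (1.0 + lam)


def l1_shrink(w_ols, lam):
    """Minimizer of 0.5*(w - w_ols)**2 + lam*abs(w) (lasso-style penalty)."""
    # Soft-thresholding: weights with magnitude at most lam are set to exactly zero.
    magnitude = max(abs(w_ols) - lam, 0.0)
    if magnitude == 0.0:
        return 0.0
    return magnitude if w_ols > 0 else -magnitude


# Hypothetical unregularized weights: two large, two small.
weights = [3.0, -0.4, 0.2, -2.0]
lam = 0.5  # assumed penalty strength

ridge = [l2_shrink(w, lam) for w in weights]
lasso = [l1_shrink(w, lam) for w in weights]

print("ridge:", [round(w, 3) for w in ridge])
print("lasso:", lasso)
```

Under these assumptions the L2 rule shrinks every weight by the same factor and keeps all of them nonzero, while the L1 rule removes the two small weights entirely, which is the feature-selection behaviour the list attributes to Lasso.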