--- id: wiki-2026-0508-overfitting title: Overfitting category: 10_Wiki/Topics status: needs_review canonical_id: self aliases: [P-Reinforce-AUTO-OVER-001] duplicate_of: none source_trust_level: A confidence_score: 0.98 tags: [auto-reinforced, overfitting, machine-learning, model-evaluation, generalization, Deep-Learning] raw_sources: [] last_reinforced: 2026-04-20 github_commit: pending inferred_by: Claude Opus 4.7 (auto-normalize 2026-05-08) --- # [[Overfitting|Overfitting]] ## ๐Ÿ“Œ ํ•œ ์ค„ ํ†ต์ฐฐ (The Karpathy Summary) > "๋ฐ์ดํ„ฐ๋ฅผ ์™ธ์šฐ๋Š” ๋ชจ๋ธ์˜ ๋น„๊ทน: ํ›ˆ๋ จ์šฉ ๋ฐ์ดํ„ฐ์—๋งŒ ๋„ˆ๋ฌด ์™„๋ฒฝํ•˜๊ฒŒ ๋งž์ถฐ์ง„ ๋‚˜๋จธ์ง€, ์ •์ž‘ ์‹ค์ „(Test data)์—์„œ๋Š” ์ž‘์€ ๋ณ€๋™์กฐ์ฐจ ๊ฒฌ๋””์ง€ ๋ชปํ•˜๊ณ  ์„ฑ๋Šฅ์ด ๊ณค๋‘๋ฐ•์งˆ์น˜๋Š” '์‘์šฉ๋ ฅ ์ œ๋กœ'์˜ ์ƒํƒœ." ## ๐Ÿ“– ๊ตฌ์กฐํ™”๋œ ์ง€์‹ (Synthesized Content) ๊ณผ์ ํ•ฉ(Overfitting)์€ ๋ชจ๋ธ์ด ํ•™์Šต ๋ฐ์ดํ„ฐ์˜ ๋…ธ์ด์ฆˆ๋‚˜ ์„ธ๋ถ€ ์‚ฌํ•ญ์— ์ง€๋‚˜์น˜๊ฒŒ ์ ์‘ํ•˜์—ฌ, ์ƒˆ๋กœ์šด ๋ฐ์ดํ„ฐ์— ๋Œ€ํ•œ ์ผ๋ฐ˜ํ™” ๋Šฅ๋ ฅ์„ ์žƒ๋Š” ํ˜„์ƒ์ž…๋‹ˆ๋‹ค. 1. **์›์ธ**: * **High Complexity**: ๋ฐ์ดํ„ฐ์— ๋น„ํ•ด ๋ชจ๋ธ ํŒŒ๋ผ๋ฏธํ„ฐ๊ฐ€ ๋„ˆ๋ฌด ๋งŽ์Œ. ([[L2-Regularization|L2-Regularization]]๊ณผ ์—ฐ๊ฒฐ) * **Lack of Data**: ํ›ˆ๋ จ ๋ฐ์ดํ„ฐ๊ฐ€ ๋„ˆ๋ฌด ์ ์–ด ํŠน์ˆ˜ํ•œ ์ผ€์ด์Šค๋ฅผ ์ผ๋ฐ˜์  ๋ฒ•์น™์œผ๋กœ ์˜คํ•ดํ•จ. * **[[Noise|Noise]] learning**: ๋ฐ์ดํ„ฐ ์†์˜ ๋ฌด์˜๋ฏธํ•œ ์žก์Œ๊นŒ์ง€ ๋ฒ•์น™์œผ๋กœ ํ•™์Šตํ•จ. (Noise์™€ ์—ฐ๊ฒฐ) 2. **ํ•ด๊ฒฐ์ฑ… (๋ฐฉ์—ญ ๊ธฐ๋ฒ•)**: * **[[Regularization|Regularization]]**: ๊ฐ€์ค‘์น˜์— ๋ฒŒ๊ธˆ์„ ๋งค๊ฒจ ๋ชจ๋ธ์„ ๋‹จ์ˆœํ™”. (L2-Regularization์™€ ์—ฐ๊ฒฐ) * **Cross Validation**: ๋ฐ์ดํ„ฐ๋ฅผ ์—ฌ๋Ÿฌ ๋ญ‰์น˜๋กœ ๋‚˜๋ˆ  ๊ต์ฐจ ๊ฒ€์ฆ. * **Early Stopping**: ์‹ค์ „ ์„ฑ๋Šฅ์ด ๋–จ์–ด์ง€๊ธฐ ์ง์ „์— ํ•™์Šต์„ ๋ฉˆ์ถค. * **Dropout**: ํ•™์Šต ์‹œ ์‹ ๊ฒฝ๋ง์˜ ์ผ๋ถ€ ๋…ธ๋“œ๋ฅผ ๋ฌด์ž‘์œ„๋กœ ๊บผ์„œ ์˜์กด์„ฑ ๋ถ„์‚ฐ. ## โš ๏ธ ๋ชจ์ˆœ ๋ฐ ์—…๋ฐ์ดํŠธ (Contradictions & Updates) - **๊ณผ๊ฑฐ ๋ฐ์ดํ„ฐ์™€์˜ ์ถฉ๋Œ**: ๊ณผ๊ฑฐ์—๋Š” ๊ณผ์ ํ•ฉ์„ ๋ฌด์กฐ๊ฑด ํ”ผํ•ด์•ผ ํ•  ์ •์ฑ…์œผ๋กœ ๋ณด์•˜์œผ๋‚˜, ํ˜„๋Œ€ ์ •์ฑ…์€ ์ถฉ๋ถ„ํžˆ ๊ณผ์ ํ•ฉ๋œ ๋ชจ๋ธ์—์„œ '๊ทธ๋กœํ‚น(Grokking)'์ด๋ผ๋Š” ๊ฐ‘์ž‘์Šค๋Ÿฌ์šด ์ผ๋ฐ˜ํ™” ์ •์ฑ…์ด ๋‚˜ํƒ€๋‚œ๋‹ค๋Š” ์ ์„ ๋ฐœ๊ฒฌํ•˜์—ฌ ์ด๋ฅผ ์—ฐ๊ตฌํ•จ(RL Update). - **์ •์ฑ… ๋ณ€ํ™”(RL Update)**: ๊ฑฐ๋Œ€ ๋ชจ๋ธ ์ •์ฑ…(LLM ๋“ฑ)์—์„œ๋Š” ํŒŒ๋ผ๋ฏธํ„ฐ๊ฐ€ ์••๋„์ ์œผ๋กœ ๋งŽ์Œ์—๋„ ๋ถˆ๊ตฌํ•˜๊ณ  ๋ฐ์ดํ„ฐ๊ฐ€ ์›Œ๋‚™ ๋ฐฉ๋Œ€ํ•˜์—ฌ ๊ณผ์ ํ•ฉ๋ณด๋‹ค๋Š” ์˜คํžˆ๋ ค ์ง€์‹์ด ๋ถ€์กฑํ•œ '๊ณผ์†Œ์ ํ•ฉ(Underfitting)'์ด๋‚˜ ๋ฐ์ดํ„ฐ ๋ฐ”๋‹ฅ๋‚จ ์ •์ฑ…์„ ๊ฑฑ์ •ํ•˜๋Š” ์‹œ๋Œ€๋กœ ๋ณ€ํ™”ํ•จ. ## ๐Ÿ”— ์ง€์‹ ์—ฐ๊ฒฐ (Graph) - [[L2-Regularization|L2-Regularization]], [[Noise|Noise]], [[Machine Learning (ML)|Machine Learning (ML)]], Deep Learning (DL), [[Optimization|Optimization]] - **Modern Tech/Tools**: Dropout, Weight decay, Augmentation, Cross-validation. --- ## ๐Ÿค– LLM ํ™œ์šฉ ํžŒํŠธ (How to Use This Knowledge) **์–ธ์ œ ์ด ์ง€์‹์„ ์“ฐ๋Š”๊ฐ€:** - *(TODO)* **์–ธ์ œ ์“ฐ๋ฉด ์•ˆ ๋˜๋Š”๊ฐ€:** - *(TODO)* ## ๐Ÿงช ๊ฒ€์ฆ ์ƒํƒœ (Validation) - **์ •๋ณด ์ƒํƒœ:** needs_review - **์ถœ์ฒ˜ ์‹ ๋ขฐ๋„:** A - **๊ฒ€ํ†  ์ด์œ :** *(P-Reinforce Phase 1 ์ž๋™ ์ •๊ทœํ™”. ๋ณธ๋ฌธ ๊ฒ€์ฆ ํ•„์š”.)* ## ๐Ÿงฌ ์ค‘๋ณต ๊ฒ€์‚ฌ (Duplicate Check) - **๊ธฐ์กด ์œ ์‚ฌ ๋ฌธ์„œ:** *(TODO: ์ธ๋ฑ์„œ ํด๋Ÿฌ์Šคํ„ฐ ๋ฆฌํฌํŠธ ์ฐธ์กฐ)* - **์ฒ˜๋ฆฌ ๋ฐฉ์‹:** UPDATE (์ž๋™ ์ •๊ทœํ™”) - **์ฒ˜๋ฆฌ ์ด์œ :** Phase 1 ์ •๊ทœํ™” โ€” ์˜› ํ…œํ”Œ๋ฆฟ/๋ˆ„๋ฝ ํ•„๋“œ ๋ณด๊ฐ•. ## ๐Ÿ•“ ๋ณ€๊ฒฝ ์ด๋ ฅ (Changelog) | ๋‚ ์งœ | ๋ณ€๊ฒฝ ๋‚ด์šฉ | ์ฒ˜๋ฆฌ ๋ฐฉ์‹ | ์‹ ๋ขฐ๋„ | |------|-----------|-----------|--------| | 2026-05-08 | P-Reinforce Phase 1 ์ •๊ทœํ™” (frontmatter + ํ—ค๋” ํ‘œ์ค€ํ™”) | UPDATE | A |