--- id: AI-OPT-REG-001 category: "10_Wiki/๐Ÿ’ก Topics/AI" confidence_score: 1.0 tags: [ai, deep-learning, regularization, overfitting, l1-lasso, l2-ridge, dropout, early-stopping] last_reinforced: 2026-04-26 --- # Regularization Strategies (๊ทœ์ œ ์ „๋žต) ## ๐Ÿ“Œ ํ•œ ์ค„ ํ†ต์ฐฐ (The Karpathy Summary) > "๋ชจ๋ธ์˜ ์ง€๋Šฅ์ด ํŠน์ • ๋ฐ์ดํ„ฐ์—๋งŒ ๋งค๋ชฐ๋˜์ง€ ์•Š๋„๋ก ๊ฐ€์ค‘์น˜์— '๋ฒŒ๊ธˆ'์„ ๋งค๊ธฐ๊ฑฐ๋‚˜ ๊ตฌ์กฐ์  '๊ฒฐํ•'์„ ๋ถ€์—ฌํ•˜์—ฌ, ์–ด๋–ค ์ƒํ™ฉ์—์„œ๋„ ์œ ์—ฐํ•˜๊ฒŒ ๋Œ€์‘ํ•˜๋Š” ์ผ๋ฐ˜ํ™” ๋Šฅ๋ ฅ์„ ํ™•๋ณดํ•˜๋ผ" โ€” ํ•™์Šต ์˜ค์ฐจ๋ฅผ ์ค„์ด๋Š” ๊ฒƒ๊ณผ ๋ชจ๋ธ์˜ ๋ณต์žก๋„๋ฅผ ๋‚ฎ์ถ”๋Š” ๊ฒƒ ์‚ฌ์ด์˜ ๊ท ํ˜•์„ ๋งž์ถ”์–ด ๊ณผ์ ํ•ฉ(Overfitting)์„ ๋ฐฉ์ง€ํ•˜๋Š” ๊ธฐ์ˆ ์  ์ˆ˜๋‹จ๋“ค. ## ๐Ÿ“– ๊ตฌ์กฐํ™”๋œ ์ง€์‹ (Synthesized Content) - **์ถ”์ถœ๋œ ํŒจํ„ด:** "Complexity Penalty and Stochastic Variation" โ€” ์†์‹ค ํ•จ์ˆ˜์— ๋ชจ๋ธ ํฌ๊ธฐ์— ๋น„๋ก€ํ•˜๋Š” ํ•ญ์„ ์ถ”๊ฐ€ํ•˜๊ฑฐ๋‚˜(L1/L2), ํ•™์Šต ์‹œ ๋ฌด์ž‘์œ„์„ฑ์„ ์ฃผ์ž…ํ•˜์—ฌ(Dropout) ํŠน์ • ๊ฒฝ๋กœ์—๋งŒ ์˜์กดํ•˜์ง€ ์•Š๊ฒŒ ํ•จ์œผ๋กœ์จ ๋ชจ๋ธ์˜ ๊ฐ•๊ฑด์„ฑ(Robustness)์„ ๋†’์ด๋Š” ํŒจํ„ด. - **์ฃผ์š” ์ „๋žต:** - **L1 (Lasso):** ์ค‘์š”ํ•˜์ง€ ์•Š์€ ๊ฐ€์ค‘์น˜๋ฅผ 0์œผ๋กœ ๋งŒ๋“ค์–ด ๋ณ€์ˆ˜ ์„ ํƒ ํšจ๊ณผ ์ œ๊ณต. - **L2 (Ridge):** ๊ฐ€์ค‘์น˜๋“ค์„ ์ „๋ฐ˜์ ์œผ๋กœ ์ž‘๊ฒŒ ์œ ์ง€ํ•˜์—ฌ ๊ธ‰๊ฒฉํ•œ ๋ณ€ํ™” ์–ต์ œ. - **Dropout:** ํ•™์Šต ์‹œ ๋‰ด๋Ÿฐ์„ ๋ฌด์ž‘์œ„๋กœ ์ƒ๋žตํ•˜์—ฌ ํŠน์ • ๋‰ด๋Ÿฐ์—์˜ ์˜์กด๋„ ๊ฐ์†Œ. - **Early Stopping:** ๊ฒ€์ฆ ์˜ค์ฐจ๊ฐ€ ์˜ค๋ฅด๊ธฐ ์‹œ์ž‘ํ•˜๋Š” ์‹œ์ ์— ํ•™์Šต์„ ์ค‘๋‹จ. - **์˜์˜:** AI ๋ชจ๋ธ์ด ํ›ˆ๋ จ ๋ฐ์ดํ„ฐ์˜ ๋…ธ์ด์ฆˆ๊นŒ์ง€ ์™ธ์›Œ๋ฒ„๋ฆฌ๋Š” ๋ถ€์ž‘์šฉ์„ ๋ง‰๊ณ , ๋ณธ์งˆ์ ์ธ ํŒจํ„ด๋งŒ์„ ํ•™์Šตํ•˜๊ฒŒ ์œ ๋„ํ•˜๋Š” '์ตœ์ ํ™”์˜ ์œค๋ฆฌ'์™€ ๊ฐ™์€ ์—ญํ• . ## โš ๏ธ ๋ชจ์ˆœ ๋ฐ ์—…๋ฐ์ดํŠธ (Contradictions & RL Update) - **๊ณผ๊ฑฐ ๋ฐ์ดํ„ฐ์™€์˜ ์ถฉ๋Œ:** ๊ทœ์ œ๊ฐ€ ๊ฐ•ํ• ์ˆ˜๋ก ์ •ํ™•๋„๊ฐ€ ๋–จ์–ด์ง„๋‹ค๋Š” ์šฐ๋ ค๋Š” ์ด์ œ '๊ฒ€์ฆ ๋ฐ์ดํ„ฐ'์— ๋Œ€ํ•œ ์ผ๋ฐ˜ํ™” ์„ฑ๋Šฅ ํ–ฅ์ƒ์œผ๋กœ ์ƒ์‡„๋˜๋ฉฐ, ํ˜„๋Œ€ ๋”ฅ๋Ÿฌ๋‹์—์„œ๋Š” ๋“œ๋กญ์•„์›ƒ๊ณผ ๋ฐฐ์น˜ ์ •๊ทœํ™”(Batch Norm)๋ฅผ ํ•จ๊ป˜ ์‚ฌ์šฉํ•˜๋Š” ๊ฒƒ์ด ์‚ฌ์‹ค์ƒ์˜ ํ‘œ์ค€ ์•„ํ‚คํ…์ฒ˜๊ฐ€ ๋จ. - **์ •์ฑ… ๋ณ€ํ™”:** Antigravity ํ”„๋กœ์ ํŠธ๋Š” ์—์ด์ „ํŠธ์˜ ๋ฏธ์„ธ ์กฐ์ • ์‹œ, ์†Œ๋Ÿ‰์˜ ๋ฐ์ดํ„ฐ๋กœ๋„ ๋ฒ”์šฉ์ ์ธ ์„ฑ๋Šฅ์„ ์œ ์ง€ํ•˜๊ธฐ ์œ„ํ•ด ๊ฐ€์ค‘์น˜ ๊ฐ์‡ (Weight Decay)์™€ ์กฐ๊ธฐ ์ข…๋ฃŒ ํ”„๋กœํ† ์ฝœ์„ ํ•„์ˆ˜์ ์œผ๋กœ ๊ฐ€๋™ํ•จ. ## ๐Ÿ”— ์ง€์‹ ์—ฐ๊ฒฐ (Graph) - [[Overfitting-and-Underfitting|Overfitting-and-Underfitting]], [[Optimization-in-AI|Optimization-in-AI]], [[Normalization-Strategies|Normalization-Strategies]], [[Performance-Metrics-in-AI|Performance-Metrics-in-AI]] - **Raw Source:** 10_Wiki/Topics/AI/Regularization-Strategies.md