--- id: P-REINFORCE-AUTO-BIVA-001 category: "10_Wiki/๐Ÿ’ก Topics/AI" confidence_score: 1.00 tags: [auto-reinforced, bias-variance, machine-learning-foundations, overfitting, underfitting, model-performance] last_reinforced: 2026-04-20 --- # [[Bias vs Variance|Bias vs Variance]] ## ๐Ÿ“Œ ํ•œ ์ค„ ํ†ต์ฐฐ (The Karpathy Summary) > "ํ•™์Šต์˜ ์˜์›ํ•œ ์ค„๋‹ค๋ฆฌ๊ธฐ: ๋„ˆ๋ฌด ๋‹จ์ˆœํ•ด์„œ ์ง„์‹ค์„ ๋†“์น˜๋Š” 'ํŽธํ–ฅ(Bias)'์˜ ํ•จ์ •๊ณผ, ๋„ˆ๋ฌด ์˜ˆ๋ฏผํ•ด์„œ ํ›ˆ๋ จ ๋ฐ์ดํ„ฐ์˜ ์‚ฌ์†Œํ•œ ์†Œ์Œ๊นŒ์ง€ ๋‹ค ๋ฏฟ์–ด๋ฒ„๋ฆฌ๋Š” '๋ถ„์‚ฐ(Variance)'์˜ ์—ญ์„ค ์‚ฌ์ด์—์„œ ํ™ฉ๊ธˆ ๊ท ํ˜•(Sweet Spot)์„ ์ฐพ๋Š” ๊ณผ์ •." ## ๐Ÿ“– ๊ตฌ์กฐํ™”๋œ ์ง€์‹ (Synthesized Content) ํŽธํ–ฅ๊ณผ ๋ถ„์‚ฐ์˜ ํŠธ๋ ˆ์ด๋“œ์˜คํ”„๋Š” ๋จธ์‹ ๋Ÿฌ๋‹ ๋ชจ๋ธ์˜ ์ผ๋ฐ˜ํ™”(Generalization) ์„ฑ๋Šฅ์„ ๊ฒฐ์ •์ง“๋Š” ํ•ต์‹ฌ ๊ฐœ๋…์ž…๋‹ˆ๋‹ค. 1. **High Bias (Underfitting)**: * ๋ชจ๋ธ์ด ๋„ˆ๋ฌด ๋‹จ์ˆœํ•ด์„œ ๋ฐ์ดํ„ฐ์˜ ๋‚ด์žฌ์  ํŒจํ„ด์„ ์ถฉ๋ถ„ํžˆ ์žก์•„๋‚ด์ง€ ๋ชปํ•จ. ํ›ˆ๋ จ ๋ฐ์ดํ„ฐ์™€ ํ…Œ์ŠคํŠธ ๋ฐ์ดํ„ฐ ๋ชจ๋‘์—์„œ ์ ์ˆ˜๊ฐ€ ๋‚ฎ์Œ. 2. **High Variance (Overfitting)**: * ๋ชจ๋ธ์ด ๋„ˆ๋ฌด ๋ณต์žกํ•ด์„œ ํ›ˆ๋ จ ๋ฐ์ดํ„ฐ์—๋งŒ ์™„๋ฒฝํžˆ ์ ์‘ํ•จ. ํ›ˆ๋ จ ์ ์ˆ˜๋Š” ๋†’์œผ๋‚˜ ์ƒˆ๋กœ์šด ๋ฐ์ดํ„ฐ(Test set)์— ๋Œ€ํ•œ ์˜ˆ์ธก๋ ฅ์ด ๊ธ‰๊ฒฉํžˆ ๋–จ์–ด์ง. 3. **Total Error**: * ๋ชจ๋ธ์˜ ์ „์ฒด ์˜ค์ฐจ = $Bias^2 + Variance + Irreducible Error(๋…ธ์ด์ฆˆ)$. * ๋ชฉํ‘œ๋Š” ์ „์ฒด ์˜ค์ฐจ๋ฅผ ์ตœ์†Œํ™”ํ•˜๋Š” ๋ณต์žก๋„์˜ ์ตœ์  ์ง€์ ์„ ์ฐพ๋Š” ๊ฒƒ์ž„. ## โš ๏ธ ๋ชจ์ˆœ ๋ฐ ์—…๋ฐ์ดํŠธ (Contradictions & RL Update) - **๊ณผ๊ฑฐ ๋ฐ์ดํ„ฐ์™€์˜ ์ถฉ๋Œ**: ๊ณผ๊ฑฐ์—๋Š” ๋ชจ๋ธ ๋งค๊ฐœ๋ณ€์ˆ˜๊ฐ€ ๋งŽ์•„์ง€๋ฉด ๋ฌด์กฐ๊ฑด Variance๊ฐ€ ์ปค์ง„๋‹ค๊ณ  ๋ฏฟ์—ˆ์œผ๋‚˜(U-shape curve), ํ˜„๋Œ€ ๊ฑฐ๋Œ€ ๋ชจ๋ธ ์ •์ฑ…์€ ๋งค๊ฐœ๋ณ€์ˆ˜๊ฐ€ ์ž„๊ณ„์น˜ ์ด์ƒ์œผ๋กœ ๋งŽ์•„์ง€๋ฉด ์˜ค์ฐจ๊ฐ€ ์˜คํžˆ๋ ค ๋‹ค์‹œ ์ค„์–ด๋“œ๋Š” 'Double Descent(์ด์ค‘ ํ•˜๊ฐ•) ์ •์ฑ…'์„ ๋ฐœ๊ฒฌํ•˜์—ฌ ๊ณ ์ „์  ํ†ต๊ณ„ํ•™ ์ •์ฑ…์˜ ํ•œ๊ณ„๋ฅผ ํ™•์žฅํ•จ(RL Update). - **์ •์ฑ… ๋ณ€ํ™”(RL Update)**: ๋ณด์ƒ ํ•จ์ˆ˜ ์„ค๊ณ„ ์ •์ฑ…์—์„œ, ๋ชจ๋ธ์˜ ๋ถ„์‚ฐ์„ ์ค„์ด๊ธฐ ์œ„ํ•ด ๋ฐ์ดํ„ฐ ์ฆ๊ฐ•(Augmentation)์ด๋‚˜ ๊ทœ์ œํ™”(Regularization)๋ฅผ ๊ฐ•์ œํ•˜๋Š” '์•ˆ์ •์„ฑ ์ง€ํ–ฅ์  ํ•™์Šต ์ •์ฑ…'์ด ํ•„์ˆ˜์ ์œผ๋กœ ์ ์šฉ๋จ. ## ๐Ÿ”— ์ง€์‹ ์—ฐ๊ฒฐ (Graph) - [[Standardization vs Innovation|Standardization vs Innovation]], [[stochastic gradient descent|stochastic gradient descent]], Foundational Models, Pattern Recognition, [[Stability vs Flexibility|Stability vs Flexibility]] - **Modern Tech/Tools**: Cross-validation, Early stopping, Dropout, L1/L2 Regularization. ---