--- id: P-REINFORCE-AI-BIAS-VAR category: "10_Wiki/๐Ÿ’ก Topics/AI" confidence_score: 1.0 tags: [Bias Variance Tradeoff, Overfitting, Underfitting, Machine Learning] last_reinforced: 2026-04-20 --- # [[Bias-Variance-Tradeoff]] (ํŽธํ–ฅ-๋ถ„์‚ฐ ํŠธ๋ ˆ์ด๋“œ์˜คํ”„) ## ๐Ÿ“Œ ํ•œ ์ค„ ํ†ต์ฐฐ (The Karpathy Summary) > "๋„ˆ๋ฌด ๋‹จ์ˆœํ•ด๋„, ๋„ˆ๋ฌด ๋ณต์žกํ•ด๋„ ๋งํ•œ๋‹ค." ๋ชจ๋ธ์ด ๋ฐ์ดํ„ฐ๋ฅผ ๋„ˆ๋ฌด ๋Œ€์ถฉ ๋ฐฐ์›Œ ์ƒ๊ธฐ๋Š” ์˜ค์ฐจ(Bias)์™€, ๋„ˆ๋ฌด ๊น๊นํ•˜๊ฒŒ ๋ฐฐ์›Œ ์ƒ๊ธฐ๋Š” ์˜ค์ฐจ(Variance) ์‚ฌ์ด์˜ ํ™ฉ๊ธˆ ๋ฐธ๋Ÿฐ์Šค๋ฅผ ์ฐพ๋Š” ๋จธ์‹ ๋Ÿฌ๋‹์˜ ์ˆ™๋ช…์  ๊ณผ์ œ๋‹ค. ## ๐Ÿ“– ๊ตฌ์กฐํ™”๋œ ์ง€์‹ (Synthesized Content) - **High Bias (Underfitting)**: - ๋ชจ๋ธ์ด ๋„ˆ๋ฌด ๋‹จ์ˆœํ•˜์—ฌ ๋ฐ์ดํ„ฐ์˜ ๋ณธ์งˆ์  ํŒจํ„ด์„ ์žก์ง€ ๋ชปํ•จ. (์˜ˆ: ๊ณก์„ ์„ ์ง์„ ์œผ๋กœ ์„ค๋ช…ํ•˜๋ ค ํ•  ๋•Œ) - **High Variance (Overfitting)**: - ๋ชจ๋ธ์ด ๋„ˆ๋ฌด ๋ณต์žกํ•˜์—ฌ ๋ฐ์ดํ„ฐ์˜ ๋…ธ์ด์ฆˆ๊นŒ์ง€ ๋‹ค ์™ธ์›Œ๋ฒ„๋ฆผ. ์ƒˆ๋กœ์šด ๋ฐ์ดํ„ฐ๋ฅผ ๋„ฃ์œผ๋ฉด ์—‰๋šฑํ•œ ๊ฒฐ๊ณผ๊ฐ€ ๋‚˜์˜ด. - **Total Error Reduction**: - ํŽธํ–ฅ๊ณผ ๋ถ„์‚ฐ์˜ ํ•ฉ์ด ์ตœ์†Œ๊ฐ€ ๋˜๋Š” ์ง€์ ์ด ๋ฐ”๋กœ ๋ชจ๋ธ์˜ ์ผ๋ฐ˜ํ™” ์„ฑ๋Šฅ(Generalization)์ด ๊ฐ€์žฅ ๋†’์€ ๊ตฌ๊ฐ„์ด๋‹ค. ## โš ๏ธ ๋ชจ์ˆœ ๋ฐ ์—…๋ฐ์ดํŠธ (RL Update) - ์ตœ๊ทผ์˜ ์ดˆ๊ฑฐ๋Œ€ ๋ชจ๋ธ(LLM)๋“ค์€ 'Double Descent' ํ˜„์ƒ์— ์˜ํ•ด, ๋ชจ๋ธ์„ ๊ทนํ•œ์œผ๋กœ ํ‚ค์šฐ๋ฉด ์˜คํžˆ๋ ค ๋ถ„์‚ฐ์ด ๋‹ค์‹œ ์ค„์–ด๋“ค๋ฉฐ ์„ฑ๋Šฅ์ด ์ข‹์•„์ง€๋Š” ๊ธฐ์ดํ•œ ํ˜„์ƒ์ด ๋ฐœ๊ฒฌ๋˜๊ณ  ์žˆ๋‹ค. ์ด๋Š” ์ „ํ†ต์ ์ธ ํŠธ๋ ˆ์ด๋“œ์˜คํ”„ ์ด๋ก ์„ ์žฌ์ •๋ฆฝํ•˜๊ฒŒ ๋งŒ๋“ค๊ณ  ์žˆ๋‹ค. ## ๐Ÿ”— ์ง€์‹ ์—ฐ๊ฒฐ (Graph) - Related: [[Regularization-Techniques]] , Model-Optimization-Strategies - Foundation: [[Computational Theory & Math/Information Theory]]