--- id: LOSS-001 category: "10_Wiki/๐Ÿ’ก Topics/AI" confidence_score: 1.0 tags: [math, deep-learning, loss-function, information-theory, cross-entropy] last_reinforced: 2026-04-26 --- # Cross-Entropy Loss (๊ต์ฐจ ์—”ํŠธ๋กœํ”ผ ์†์‹ค) ## ๐Ÿ“Œ ํ•œ ์ค„ ํ†ต์ฐฐ (The Karpathy Summary) > "๋ชจ๋ธ์˜ ์˜ˆ์ธก ๋ถ„ํฌ์™€ ์‹ค์ œ ์ •๋‹ต ๋ถ„ํฌ ์‚ฌ์ด์˜ ๊ฑฐ๋ฆฌ(๋†€๋žŒ์˜ ์ •๋„)๋ฅผ ์ธก์ •ํ•˜์—ฌ ์ขํ˜€๋ผ" โ€” ์ •๋ณด ์ด๋ก ์˜ ์—”ํŠธ๋กœํ”ผ ๊ฐœ๋…์„ ๋นŒ๋ ค์™€, ๋ชจ๋ธ์ด ์ถœ๋ ฅํ•œ ํ™•๋ฅ  ๋ถ„ํฌ๊ฐ€ ์ •๋‹ต ๋ถ„ํฌ์™€ ์–ผ๋งˆ๋‚˜ ๋‹ค๋ฅธ์ง€๋ฅผ ์ˆ˜์น˜ํ™”ํ•œ ์†์‹ค ํ•จ์ˆ˜๋กœ ๋ถ„๋ฅ˜ ๋ฌธ์ œ์˜ ํ‘œ์ค€. ## ๐Ÿ“– ๊ตฌ์กฐํ™”๋œ ์ง€์‹ (Synthesized Content) - **์ถ”์ถœ๋œ ํŒจํ„ด:** ์ •๋‹ต ํด๋ž˜์Šค์— ๋Œ€ํ•ด ๋ชจ๋ธ์ด ๋‚ฎ์€ ํ™•๋ฅ ์„ ์ค„์ˆ˜๋ก ๋” ํฐ ํŽ˜๋„ํ‹ฐ๋ฅผ ๋ถ€์—ฌํ•˜์—ฌ, ๋ชจ๋ธ์ด ์ •๋‹ต์— ๋Œ€ํ•ด ํ™•์‹ ์„ ๊ฐ–๋„๋ก ์œ ๋„ํ•˜๋Š” ํ™•๋ฅ  ์ •๋ ฌ ํŒจํ„ด. - **์ˆ˜ํ•™์  ์˜๋ฏธ:** - **Entropy:** ์‹œ์Šคํ…œ์˜ ๋ถˆํ™•์‹ค์„ฑ/์ •๋ณด๋Ÿ‰ ์ธก์ •. - **KL Divergence:** ๋‘ ํ™•๋ฅ  ๋ถ„ํฌ ์‚ฌ์ด์˜ ์ฐจ์ด ์ธก์ •. - **Cross-Entropy:** $H(p, q) = -\sum p(x) \log q(x)$. ์ •๋‹ต ๋ถ„ํฌ($p$)์™€ ์˜ˆ์ธก ๋ถ„ํฌ($q$) ์‚ฌ์ด์˜ ์œ ์‚ฌ๋„ ์ธก์ •. - **ํŠน์ง•:** ๋กœ๊ทธ ํ•จ์ˆ˜๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ์˜ˆ์ธก์ด ํ‹€๋ฆด์ˆ˜๋ก ์†์‹ค๊ฐ’์ด ๊ธฐํ•˜๊ธ‰์ˆ˜์ ์œผ๋กœ ์ปค์ง. ์ด๋ฅผ ํ†ตํ•ด ๊ฒฝ์‚ฌ ํ•˜๊ฐ•๋ฒ• ์‹œ ๊ฐ€์ค‘์น˜๋ฅผ ๊ฐ•๋ ฅํ•˜๊ฒŒ ์—…๋ฐ์ดํŠธํ•จ. ## โš ๏ธ ๋ชจ์ˆœ ๋ฐ ์—…๋ฐ์ดํŠธ (Contradictions & RL Update) - **๊ณผ๊ฑฐ ๋ฐ์ดํ„ฐ์™€์˜ ์ถฉ๋Œ:** ๋‹จ์ˆœํ•œ ์˜ค์ฐจ ์ œ๊ณฑํ•ฉ(MSE)์„ ๋ถ„๋ฅ˜ ๋ฌธ์ œ์— ์“ฐ๋˜ ๋ฐฉ์‹๋ณด๋‹ค ํ›จ์”ฌ ๋น ๋ฅธ ์ˆ˜๋ ด ์†๋„์™€ ๋†’์€ ์„ฑ๋Šฅ์„ ๋ณด์ž„์ด ์ž…์ฆ๋˜๋ฉฐ ์‚ฌ์‹ค์ƒ์˜ ํ‘œ์ค€์ด ๋จ. - **์ •์ฑ… ๋ณ€ํ™”:** Antigravity ํ”„๋กœ์ ํŠธ๋Š” ์—์ด์ „ํŠธ์˜ ๋ฌธ์„œ ๋ถ„๋ฅ˜ ๋ฐ ์˜๋„ ์ธ์‹ ๋ชจ๋ธ ํ•™์Šต ์‹œ ๊ต์ฐจ ์—”ํŠธ๋กœํ”ผ ์†์‹ค ํ•จ์ˆ˜๋ฅผ ๊ธฐ๋ณธ์œผ๋กœ ์‚ฌ์šฉํ•˜๋ฉฐ, ํด๋ž˜์Šค ๋ถˆ๊ท ํ˜• ํ•ด๊ฒฐ์„ ์œ„ํ•ด Focal Loss ๋“ฑ ๋ณ€ํ˜• ๊ธฐ๋ฒ•์„ ํ•จ๊ป˜ ๊ฒ€ํ† ํ•จ. ## ๐Ÿ”— ์ง€์‹ ์—ฐ๊ฒฐ (Graph) - [[Objective-Functions]], [[Gradient-Descent]], Machine-Learning, [[Deep-Learning]] - **Raw Source:** 10_Wiki/Topics/AI/Cross-Entropy Loss.md