--- id: P-REINFORCE-AI-BACKPROP category: "[[10_Wiki/๐Ÿ’ก Topics/AI]]" confidence_score: 1.0 tags: [Backpropagation, Deep Learning, Gradient Descent, Optimization] last_reinforced: 2026-04-20 --- # [[Backpropagation]] (์—ญ์ „ํŒŒ) ## ๐Ÿ“Œ ํ•œ ์ค„ ํ†ต์ฐฐ (The Karpathy Summary) > "์‹ค์ˆ˜๋ฅผ ๋’ค์—์„œ๋ถ€ํ„ฐ ๊ณ ์ณ ๋‚˜๊ฐ€๋Š” ์ง€ํ˜œ." ์ถœ๋ ฅ์ธต์—์„œ ๋ฐœ์ƒํ•œ ์˜ค์ฐจ(Loss)๊ฐ€ ๊ฐ ์‹ ๊ฒฝ๋ง ์ธต์˜ ๊ฐ€์ค‘์น˜(Weight)์— ์–ผ๋งˆ๋‚˜ ๊ธฐ์—ฌํ–ˆ๋Š”์ง€ ๊ฑฐ๊พธ๋กœ ๊ณ„์‚ฐํ•˜๋ฉฐ ํšจ์œจ์ ์œผ๋กœ ํ•™์Šต์‹œํ‚ค๋Š” ๋”ฅ๋Ÿฌ๋‹์˜ ํ•ต์‹ฌ ํ•™์Šต ๋ฉ”์ปค๋‹ˆ์ฆ˜์ด๋‹ค. ## ๐Ÿ“– ๊ตฌ์กฐํ™”๋œ ์ง€์‹ (Synthesized Content) - **Chain Rule (๋ฏธ๋ถ„์˜ ์—ฐ์‡„ ๋ฒ•์น™)**: - ์ „์ฒด ์˜ค์ฐจ๋ฅผ ๊ฐ ํŒŒ๋ผ๋ฏธํ„ฐ๋กœ ๋ฏธ๋ถ„ํ•˜๊ธฐ ์œ„ํ•ด, ๊ฐ ๋‹จ๊ณ„์˜ ๋ถ€๋ถ„ ๋ฏธ๋ถ„๊ฐ’์„ ๊ณฑํ•ด ๋‚˜๊ฐ€๋Š” ๋ฏธ์ ๋ถ„ํ•™์  ๊ณผ์ •. - **Gradient Computation**: - ๋ชจ๋“  ํŒŒ๋ผ๋ฏธํ„ฐ์— ๋Œ€ํ•œ ๊ฒฝ์‚ฌ๋„(Gradient)๋ฅผ ํ•œ ๋ฒˆ์— ๊ณ„์‚ฐํ•˜์—ฌ, ๊ฒฝ์‚ฌ ํ•˜๊ฐ•๋ฒ•(Gradient Descent)์„ ํ†ตํ•ด ์‹ ๊ฒฝ๋ง์„ ์ •๋‹ต์— ๊ฐ€๊น๊ฒŒ ์—…๋ฐ์ดํŠธํ•œ๋‹ค. - **Efficiency**: - ๋ชจ๋“  ํŒŒ๋ผ๋ฏธํ„ฐ๋ฅผ ๊ฐœ๋ณ„์ ์œผ๋กœ ๋ฏธ๋ถ„ํ•˜๋Š” ๊ฒƒ๋ณด๋‹ค ์ˆ˜๋ฐฑ๋งŒ ๋ฐฐ ๋น ๋ฅด๋ฉฐ, ์ด๋กœ ์ธํ•ด ์ˆ˜์กฐ ๊ฐœ์˜ ํŒŒ๋ผ๋ฏธํ„ฐ๋ฅผ ๊ฐ€์ง„ ๊ฑฐ๋Œ€ ๋ชจ๋ธ์˜ ํ•™์Šต์ด ๊ฐ€๋Šฅํ•ด์กŒ๋‹ค. ## โš ๏ธ ๋ชจ์ˆœ ๋ฐ ์—…๋ฐ์ดํŠธ (RL Update) - ์ธต์ด ๋„ˆ๋ฌด ๊นŠ์–ด์ง€๋ฉด ๋ฏธ๋ถ„๊ฐ’์ด 0์œผ๋กœ ์‚ฌ๋ผ์ง€๋Š” 'Vanishing Gradient' ๋ฌธ์ œ๊ฐ€ ๋ฐœ์ƒํ•œ๋‹ค. ์ด๋ฅผ ์œ„ํ•ด ReLU ํ™œ์„ฑํ™” ํ•จ์ˆ˜๋‚˜ ResNet ๊ฐ™์€ ์ž”์ฐจ ์—ฐ๊ฒฐ(Residual Connection) ๊ธฐ์ˆ ์ด ๋ณด์™„์ ์œผ๋กœ ์‚ฌ์šฉ๋œ๋‹ค. ## ๐Ÿ”— ์ง€์‹ ์—ฐ๊ฒฐ (Graph) - Related: [[Deep-Learning-Architecture-Patterns]] , [[Gradient-Descent]] - Foundation: [[Computational Theory & Math/Information Theory]]