--- id: P-REINFORCE-AUTO-GRDE-001 category: "[[10_Wiki/๐Ÿ’ก Topics/AI]]" confidence_score: 0.99 tags: [auto-reinforced, gradient-descent, optimization, deep-learning, machine-learning, backpropagation] last_reinforced: 2026-04-20 --- # [[Gradient-Descent]] ## ๐Ÿ“Œ ํ•œ ์ค„ ํ†ต์ฐฐ (The Karpathy Summary) > "์•ˆ๊ฐœ ๋‚€ ์‚ฐ ๋‚ด๋ ค์˜ค๊ธฐ: ๋ณต์žกํ•œ ์ˆ˜๋งŒ ๊ฐœ์˜ ๋ณ€์ˆ˜๋“ค๋กœ ์ด๋ฃจ์–ด์ง„ ์˜ค์ฐจ์˜ ์‚ฐ(Error Surface)์—์„œ, ํ˜„์žฌ ์œ„์น˜์˜ ๊ฒฝ์‚ฌ(Gradient)๋ฅผ ๋”ฐ๋ผ ๊ฐ€์žฅ ๊ฐ€ํŒŒ๋ฅด๊ฒŒ ๋‚ฎ์•„์ง€๋Š” ๋ฐฉํ–ฅ์œผ๋กœ ํ•œ ๊ฑธ์Œ์”ฉ ์ด๋™ํ•˜๋ฉฐ ์‹œ์Šคํ…œ์˜ ์˜ค์ฐจ๋ฅผ ์ตœ์†Œํ™”ํ•ด ๋‚˜๊ฐ€๋Š” ํ•™์Šต์˜ ํ–‰๋™ ๊ฐ•๋ น." ## ๐Ÿ“– ๊ตฌ์กฐํ™”๋œ ์ง€์‹ (Synthesized Content) ๊ฒฝ์‚ฌ ํ•˜๊ฐ•๋ฒ•(Gradient-Descent)์€ ๋ฏธ๋ถ„ ๊ฐ€๋Šฅํ•œ ํ•จ์ˆ˜์˜ ์ตœ์†Ÿ๊ฐ’์„ ์ฐพ๋Š” ์ตœ์ ํ™” ์•Œ๊ณ ๋ฆฌ์ฆ˜์ž…๋‹ˆ๋‹ค. ํ˜„๋Œ€ AI ํ•™์Šต์˜ ์‹ฌ์žฅ๋ถ€์ž…๋‹ˆ๋‹ค. 1. **์ž‘๋™ ์›๋ฆฌ**: * **Gradient**: ํ•จ์ˆ˜์˜ ๊ธฐ์šธ๊ธฐ. ์ด ๋ฐฉํ–ฅ์˜ ๋ฐ˜๋Œ€์ชฝ์œผ๋กœ ๊ฐ€์•ผ ์˜ค์ฐจ๊ฐ€ ์ค„์–ด๋“ฆ. * **Learning Rate ($\eta$)**: ํ•œ ๋ฐœ์ž๊ตญ์˜ ํฌ๊ธฐ. ๋„ˆ๋ฌด ํฌ๋ฉด ์‚ฐ์„ ๋›ฐ์–ด๋„˜๊ณ (๋ฐœ์‚ฐ), ๋„ˆ๋ฌด ์ž‘์œผ๋ฉด ๋‚ด๋ ค๊ฐ€๋Š” ๋ฐ ์˜๊ฒ์˜ ์‹œ๊ฐ„์ด ๊ฑธ๋ฆผ. (Optimization๊ณผ ์—ฐ๊ฒฐ) 2. **์ข…๋ฅ˜**: * **Batch GD**: ๋ชจ๋“  ๋ฐ์ดํ„ฐ๋ฅผ ๋‹ค ๋ณด๊ณ  ๋‚ด๋ฆผ (์ •ํ™•ํ•˜์ง€๋งŒ ๋А๋ฆผ). * **Stochastic GD (SGD)**: ๋ฐ์ดํ„ฐ ํ•˜๋‚˜ ๋ณผ ๋•Œ๋งˆ๋‹ค ํ•œ ๊ฑธ์Œ (๋น ๋ฅด์ง€๋งŒ ์š”๋™์นจ). * **Mini-batch GD**: ์ ๋‹นํ•œ ๋ฌถ์Œ์”ฉ ๋ณด๊ณ  ์ด๋™ (ํ˜„์‹ค์  ํƒ€ํ˜‘). ## โš ๏ธ ๋ชจ์ˆœ ๋ฐ ์—…๋ฐ์ดํŠธ (Contradictions & RL Update) - **๊ณผ๊ฑฐ ๋ฐ์ดํ„ฐ์™€์˜ ์ถฉ๋Œ**: ๊ณผ๊ฑฐ์—๋Š” ๊ตญ์†Œ ์ตœ์ ํ•ด(Local Minima)์— ๊ฐ‡ํ˜€ ์˜์˜ ๋น ์ ธ๋‚˜์˜ค์ง€ ๋ชปํ•  ๊ฒƒ์ด๋ผ๋Š” ๋น„๊ด€ ์ •์ฑ…์ด ๋งŽ์•˜์œผ๋‚˜, ํ˜„๋Œ€ ์‹ค๋ฌด ์ •์ฑ…์€ ๊ณ ์ฐจ์› ๊ณต๊ฐ„์—์„œ๋Š” ๋Œ€๋ถ€๋ถ„์ด '์•ˆ์žฅ์ (Saddle point)'์ด๋ฉฐ ์ ์ ˆํ•œ ์†Œ์Œ(Adam ๋“ฑ)์ด ์žˆ์œผ๋ฉด ๋ฒ—์–ด๋‚  ์ˆ˜ ์žˆ์Œ์„ ์ž…์ฆํ•จ(RL Update). - **์ •์ฑ… ๋ณ€ํ™”(RL Update)**: ๋‹จ์ˆœํžˆ ์˜ค์ฐจ๋ฅผ ์ค„์ด๋Š” ์ •์ฑ…์„ ๋„˜์–ด, ํ•™์Šต ๋„์ค‘ ๊ฒฝ์‚ฌ๊ฐ€ ์†Œ๋ฉธ(Vanishing)ํ•˜๊ฑฐ๋‚˜ ํญ์ฃผ(Exploding)ํ•˜๋Š” ์ •์ฑ…์„ ๋ง‰๊ธฐ ์œ„ํ•ด ๋ฐฐ์น˜ ์ •๊ทœํ™”(Batch Norm)๋‚˜ ์ž”์ฐจ ์—ฐ๊ฒฐ(Residual Connection) ๊ฐ™์€ ์•„ํ‚คํ…์ฒ˜์  ๋ณด์กฐ ์ •์ฑ…์ด ํ•„์ˆ˜ํ™”๋จ. ## ๐Ÿ”— ์ง€์‹ ์—ฐ๊ฒฐ (Graph) - [[Optimization]], [[Backpropagation]], [[Deep Learning (DL)]], [[Efficiency]], [[Error Prediction Error (RPE์™€ ์œ ์‚ฌ)]] - **Modern Tech/Tools**: Adam, RMSprop, Momentum, PyTorch/TensorFlow (Autograd). ---