--- id: LR-SCHED-001 category: "10_Wiki/๐Ÿ’ก Topics/AI" confidence_score: 1.0 tags: [machine-learning, optimization, learning-rate, training-strategy] last_reinforced: 2026-04-26 --- # Learning Rate Scheduling (ํ•™์Šต๋ฅ  ์Šค์ผ€์ค„๋ง) ## ๐Ÿ“Œ ํ•œ ์ค„ ํ†ต์ฐฐ (The Karpathy Summary) > "ํ•™์Šต์˜ ์†๋„๋ฅผ ์‹œ๊ฐ„์— ๋”ฐ๋ผ ์˜๋ฆฌํ•˜๊ฒŒ ์กฐ์ ˆํ•˜๋ผ" โ€” ๊ณ ์ •๋œ ํ•™์Šต๋ฅ  ๋Œ€์‹  ํ•™์Šต์˜ ์ง„ํ–‰ ์ •๋„์— ๋”ฐ๋ผ ์ตœ์ ์˜ ๋ณดํญ(Step size)์„ ๋™์ ์œผ๋กœ ๋ณ€๊ฒฝํ•˜์—ฌ, ์ „์—ญ ์ตœ์ ํ•ด(Global Optima)์— ๋” ๋น ๋ฅด๊ณ  ์ •ํ™•ํ•˜๊ฒŒ ๋„๋‹ฌํ•˜๊ฒŒ ๋งŒ๋“œ๋Š” ๊ธฐ๋ฒ•. ## ๐Ÿ“– ๊ตฌ์กฐํ™”๋œ ์ง€์‹ (Synthesized Content) - **์ถ”์ถœ๋œ ํŒจํ„ด:** ํ•™์Šต ์ดˆ๊ธฐ์—๋Š” ํฐ ๋ณดํญ์œผ๋กœ ๋น ๋ฅด๊ฒŒ ํƒ์ƒ‰ํ•˜๊ณ , ํ›„๊ธฐ์—๋Š” ์ž‘์€ ๋ณดํญ์œผ๋กœ ์ •๊ตํ•˜๊ฒŒ ์ˆ˜๋ ดํ•ด๊ฐ€๋Š” '์ ์ง„์  ๊ฐ์‡ (Decay)' ํŒจํ„ด. - **์ฃผ์š” ์ „๋žต:** - **Step Decay:** ์ผ์ • ์—ํฌํฌ๋งˆ๋‹ค ํ•™์Šต๋ฅ ์„ ๊ณ ์ • ๋น„์œจ๋กœ ๊ฐ์†Œ. - **Exponential Decay:** ๋งค ๋‹จ๊ณ„๋งˆ๋‹ค ์ง€์ˆ˜ ํ•จ์ˆ˜์ ์œผ๋กœ ๊ฐ์†Œ์‹œ์ผœ ๋ถ€๋“œ๋Ÿฌ์šด ์ˆ˜๋ ด ์œ ๋„. - **Cosine Annealing:** ์ฝ”์‚ฌ์ธ ํ•จ์ˆ˜๋ฅผ ๋”ฐ๋ผ ํ•™์Šต๋ฅ ์„ ์กฐ์ ˆ. ์ตœ๊ทผ ํŠธ๋žœ์Šคํฌ๋จธ ํ•™์Šต์˜ ๋Œ€์„ธ. - **Warm-up:** ํ•™์Šต ๊ทน์ดˆ๊ธฐ์— ์•„์ฃผ ๋‚ฎ์€ ํ•™์Šต๋ฅ ์—์„œ ์‹œ์ž‘ํ•˜์—ฌ ์ ์ง„์ ์œผ๋กœ ๋†’์—ฌ ๋ชจ๋ธ์ด ์ดˆ๊ธฐ์— ๋ฐœ์‚ฐํ•˜๋Š” ๊ฒƒ์„ ๋ฐฉ์ง€. - **ReduceLROnPlateau:** ์„ฑ๋Šฅ ํ–ฅ์ƒ์ด ๋ฉˆ์ท„์„ ๋•Œ๋งŒ ํ•™์Šต๋ฅ ์„ ๋‚ฎ์ถ”๋Š” ์ ์‘ํ˜• ์ „๋žต. ## โš ๏ธ ๋ชจ์ˆœ ๋ฐ ์—…๋ฐ์ดํŠธ (Contradictions & RL Update) - **๊ณผ๊ฑฐ ๋ฐ์ดํ„ฐ์™€์˜ ์ถฉ๋Œ:** ๋‹จ์ˆœํžˆ ์ž‘๊ฒŒ ์‹œ์ž‘ํ•˜๋˜ ๋ฐฉ์‹์—์„œ, ์ตœ๊ทผ์—๋Š” ๋Œ€๊ทœ๋ชจ ๋ชจ๋ธ์˜ ์•ˆ์ •์„ฑ์„ ์œ„ํ•ด '์›œ์—…'๊ณผ '์ฝ”์‚ฌ์ธ ์Šค์ผ€์ค„๋ง'์˜ ์กฐํ•ฉ์ด ํ•„์ˆ˜ ๊ณต์‹์œผ๋กœ ๊ตณ์–ด์ง. - **์ •์ฑ… ๋ณ€ํ™”:** Antigravity ์—์ด์ „ํŠธ์˜ ํŒŒ์ธํŠœ๋‹ ํ”„๋กœ์„ธ์Šค ์„ค๊ณ„ ์‹œ, ํ•™์Šต ํšจ์œจ ๊ทน๋Œ€ํ™”๋ฅผ ์œ„ํ•ด AdamW ์˜ตํ‹ฐ๋งˆ์ด์ €์™€ ์ฝ”์‚ฌ์ธ ์›œ์—… ์Šค์ผ€์ค„๋Ÿฌ๋ฅผ ๊ธฐ๋ณธ ์‚ฌ์–‘์œผ๋กœ ์„ค์ •ํ•จ. ## ๐Ÿ”— ์ง€์‹ ์—ฐ๊ฒฐ (Graph) - [[Optimization|Optimization]], AdamW-Optimizer, [[Deep-Learning|Deep-Learning]], [[Gradient-Descent|Gradient-Descent]] - **Raw Source:** 10_Wiki/Topics/AI/Learning-Rate-Scheduling.md