--- id: AI-OPT-CORE-001 category: "10_Wiki/πŸ’‘ Topics/AI" confidence_score: 1.0 tags: [ai, deep-learning, optimization, loss-function, training, convergence] last_reinforced: 2026-04-26 --- # Optimization in AI (AIμ—μ„œμ˜ μ΅œμ ν™”) ## πŸ“Œ ν•œ 쀄 톡찰 (The Karpathy Summary) > "λ°μ΄ν„°μ˜ λ°”λ‹€μ—μ„œ λͺ¨λΈμ˜ 'μ˜€λ‹΅'을 μ΅œμ†Œν™”ν•˜λŠ” 졜적의 κ°€μ€‘μΉ˜λ₯Ό λ°œκ΅΄ν•˜μ—¬, κΈ°κ³„μ˜ 계산을 μ§€λŠ₯의 ν†΅μ°°λ‘œ μŠΉν™”μ‹œμΌœλΌ" β€” 신경망 λͺ¨λΈμ˜ μ˜ˆμΈ‘κ°’κ³Ό μ‹€μ œκ°’ μ‚¬μ΄μ˜ 였차(Loss)λ₯Ό 쀄이기 μœ„ν•΄ λͺ¨λΈμ˜ νŒŒλΌλ―Έν„°λ₯Ό 반볡적으둜 μ‘°μ •ν•˜μ—¬ 졜적의 μ„±λŠ₯을 λŒμ–΄λ‚΄λŠ” κ³Όμ •. ## πŸ“– κ΅¬μ‘°ν™”λœ 지식 (Synthesized Content) - **μΆ”μΆœλœ νŒ¨ν„΄:** "Empirical Risk Minimization and Gradient Flow" β€” μ£Όμ–΄μ§„ ν•™μŠ΅ 데이터에 λŒ€ν•΄ 손싀 ν•¨μˆ˜μ˜ 기울기λ₯Ό 따라가며 μœ„ν—˜μ„ μ΅œμ†Œν™”ν•˜λŠ” λ™μ‹œμ—, 보지 λͺ»ν•œ 데이터에도 잘 μž‘λ™ν•˜λ„λ‘ μΌλ°˜ν™”(Generalization) μ„±λŠ₯을 ν™•λ³΄ν•˜λŠ” κ· ν˜• 작힌 μ΅œμ ν™” νŒ¨ν„΄. - **AI μ΅œμ ν™”μ˜ 3λŒ€ μš”μ†Œ:** - **Objective Function (Loss):** 쀄여야 ν•  λͺ©ν‘œ (예: MSE, Cross Entropy). - **Optimizer:** μ–΄λ–»κ²Œ 쀄일 것인가 (예: SGD, Adam, RMSProp). - **Regularization:** λ„ˆλ¬΄ μ§€λ‚˜μΉ˜κ²Œ ν•™μŠ΅ν•˜μ§€ μ•Šλ„λ‘ μ œμ–΄ (예: Dropout, Weight Decay). - **의의:** AI λͺ¨λΈμ΄ λ‹¨μˆœν•œ μˆ˜μ‹μ˜ λ‚˜μ—΄μ—μ„œ ν•™μŠ΅μ„ 톡해 'λŠ₯λ ₯'을 νšλ“ν•˜κ²Œ λ§Œλ“œλŠ” μ‹€μ§ˆμ μΈ μ§€λŠ₯ κ΅¬ν˜„μ˜ 심μž₯. ## ⚠️ λͺ¨μˆœ 및 μ—…λ°μ΄νŠΈ (Contradictions & RL Update) - **κ³Όκ±° λ°μ΄ν„°μ™€μ˜ 좩돌:** λ‹¨μˆœνžˆ ν•™μŠ΅ 였차λ₯Ό 0으둜 λ§Œλ“œλŠ” 것이 λͺ©ν‘œμ˜€λ˜ μ‹œμ ˆμ„ μ§€λ‚˜, μ΄μ œλŠ” 'ν‰ν‰ν•œ 졜적점(Flat Minima)'을 μ°Ύμ•„μ•Ό λͺ¨λΈμ˜ μΌλ°˜ν™” μ„±λŠ₯이 μ’‹μ•„μ§„λ‹€λŠ” 관점이 μ •λ¦½λ˜μ–΄ 이λ₯Ό μœ λ„ν•˜λŠ” μ΅œμ ν™” 기법(SAM λ“±)이 μ£Όλͺ©λ°›κ³  있음. - **μ •μ±… λ³€ν™”:** Antigravity ν”„λ‘œμ νŠΈλŠ” λŒ€κ·œλͺ¨ μ–Έμ–΄ λͺ¨λΈ ν•™μŠ΅ μ‹œ, 수렴 속도와 μ΅œμ’… μ„±λŠ₯의 κ· ν˜•μ„ μœ„ν•΄ ν•™μŠ΅λ₯  μŠ€μΌ€μ€„λ§(Learning Rate Scheduling)κ³Ό AdamW μ΅œμ ν™” 도ꡬλ₯Ό κ²°ν•©ν•œ ν‘œμ€€ νŒŒμ΄ν”„λΌμΈμ„ 가동함. ## πŸ”— 지식 μ—°κ²° (Graph) - [[Optimization-Algorithms|Optimization-Algorithms]], Gradient-Descent-Foundations, [[Loss-Functions-Foundations|Loss-Functions-Foundations]], [[Hyperparameter-Optimization|Hyperparameter-Optimization]] - **Raw Source:** 10_Wiki/Topics/AI/Optimization-in-AI.md