--- id: P-REINFORCE-AI-HYPERPARAM category: "[[10_Wiki/πŸ’‘ Topics/AI]]" confidence_score: 0.96 tags: [AI, MachineLearning, Hyperparameter, Optimization] last_reinforced: 2026-04-20 --- # [[Hyperparameter-Optimization]] (ν•˜μ΄νΌνŒŒλΌλ―Έν„° μ΅œμ ν™”) ## πŸ“Œ ν•œ 쀄 톡찰 (The Karpathy Summary) > "ν•™μŠ΅μ„ λ°°μš°λŠ” 기술, λͺ¨λΈμ˜ 성격을 νŠœλ‹ν•˜λŠ” κ³Όμ •." λͺ¨λΈμ΄ 슀슀둜 ν•™μŠ΅ν•˜λŠ” κ°€μ€‘μΉ˜(Weight)κ°€ μ•„λ‹ˆλΌ, ν•™μŠ΅ 방식 자체λ₯Ό κ²°μ •ν•˜λŠ” μ„€μ •κ°’(배치 크기, ν•™μŠ΅λ₯  λ“±)의 졜적 쑰합을 μ°ΎλŠ” 과정이닀. ## πŸ“– κ΅¬μ‘°ν™”λœ 지식 (Synthesized Content) - **Key Parameters**: - **Learning Rate**: κ°€μ€‘μΉ˜ μ—…λ°μ΄νŠΈμ˜ 크기. - **Batch Size**: ν•œ λ²ˆμ— ν•™μŠ΅ν•  데이터 λ­‰μΉ˜μ˜ 크기. - **Number of Layers/Neurons**: μ‹ κ²½λ§μ˜ ꡬ쑰적 크기. - **Methods**: - **Grid Search**: λͺ¨λ“  쑰합을 격자무늬둜 λ‹€ μ‹œλ„ν•¨ (느림). - **Random Search**: λ¬΄μž‘μœ„λ‘œ 쑰합을 골라 μ‹œλ„ν•¨ (μ€κ·Όνžˆ 효과적). - **Bayesian Optimization**: 이전 μ‹œλ„ κ²°κ³Όλ₯Ό λ°”νƒ•μœΌλ‘œ μœ λ§ν•œ 쑰합을 μ˜ˆμΈ‘ν•˜λ©° 탐색 (Gausean Process λ“± ν™œμš©). - **Goal**: 검증 데이터(Validation set)에 λŒ€ν•΄ 졜고의 μ„±λŠ₯을 λ‚΄λŠ” 섀정을 ν™•λ³΄ν•˜λŠ” 것. ## ⚠️ λͺ¨μˆœ 및 μ—…λ°μ΄νŠΈ (RL Update) - ν•˜μ΄νΌνŒŒλΌλ―Έν„° νŠœλ‹ μžμ²΄μ— λ„ˆλ¬΄ λ§Žμ€ μ»΄ν“¨νŒ… μžμ›μ„ μ“°λŠ” 것은 '주객전도'κ°€ 될 수 μžˆλ‹€. μ΅œκ·Όμ—λŠ” **AutoML**μ΄λ‚˜ **Population Based Training (PBT)** 등을 톡해 ν•™μŠ΅ 쀑간에 μ‹€μ‹œκ°„μœΌλ‘œ ν•˜μ΄νΌνŒŒλΌλ―Έν„°λ₯Ό μ§„ν™”μ‹œν‚€λŠ” 방식이 λŒ€ν˜• λͺ¨λΈ ν•™μŠ΅μ—μ„œ ν‘œμ€€μœΌλ‘œ 쓰이고 μžˆλ‹€. ## πŸ”— 지식 μ—°κ²° (Graph) - Related: [[Gradient-Descent]] , [[AutoML]] - Concept: [[Overfitting-vs-Underfitting]]