--- id: DL-ACT-RELU-001 category: "10_Wiki/πŸ’‘ Topics/AI" confidence_score: 1.0 tags: [ai, deep-learning, activation-function, relu, vanishing-gradient, neural-networks, optimization] last_reinforced: 2026-04-26 --- # ReLU Activation Functions (ReLU ν™œμ„±ν™” ν•¨μˆ˜) ## πŸ“Œ ν•œ 쀄 톡찰 (The Karpathy Summary) > "0보닀 μž‘μœΌλ©΄ 과감히 버리고, 0보닀 크면 κ·ΈλŒ€λ‘œ ν†΅κ³Όμ‹œμΌœ μ‹ κ²½λ§μ˜ '기울기 μ†Œμ‹€'μ΄λΌλŠ” 동λ§₯κ²½ν™”λ₯Ό μΉ˜λ£Œν•˜λΌ" β€” λ”₯λŸ¬λ‹μ—μ„œ κ°€μž₯ 널리 μ“°μ΄λŠ” λΉ„μ„ ν˜• ν™œμ„±ν™” ν•¨μˆ˜λ‘œ, μ—°μ‚°μ˜ λ‹¨μˆœν•¨κ³Ό ν•™μŠ΅μ˜ νš¨μœ¨μ„±μ„ λ™μ‹œμ— μž‘μ•„ ν˜„λŒ€ μ‹ κ²½λ§μ˜ 깊이λ₯Ό κ°€λŠ₯μΌ€ ν•œ 핡심 도ꡬ. ## πŸ“– κ΅¬μ‘°ν™”λœ 지식 (Synthesized Content) - **μΆ”μΆœλœ νŒ¨ν„΄:** "Linear Rectification and Sparsity Inducement" β€” $f(x) = \max(0, x)$ λΌλŠ” λ‹¨μˆœν•œ μˆ˜μ‹μ„ 톡해 μ–‘μˆ˜ μ˜μ—­μ—μ„œλŠ” 기울기λ₯Ό μΌμ •ν•˜κ²Œ μœ μ§€ν•˜μ—¬ κ·Έλž˜λ””μ–ΈνŠΈ μ „νŒŒλ₯Ό 돕고, 음수 μ˜μ—­μ—μ„œλŠ” λ‰΄λŸ°μ„ λΉ„ν™œμ„±ν™”(Sparsity)ν•˜μ—¬ μ—°μ‚° νš¨μœ¨μ„ λ†’μ΄λŠ” νŒ¨ν„΄. - **핡심 μž₯점:** - **Vanishing Gradient Solution:** μ‹œκ·Έλͺ¨μ΄λ“œ(Sigmoid)와 달리 큰 μ–‘μˆ˜ κ°’μ—μ„œλ„ κΈ°μšΈκΈ°κ°€ 1둜 μœ μ§€λ˜μ–΄ κΉŠμ€ 망 ν•™μŠ΅μ΄ κ°€λŠ₯. - **Computational Efficiency:** λ‹¨μˆœν•œ 비ꡐ μ—°μ‚°λ§ŒμœΌλ‘œ κ΅¬ν˜„ κ°€λŠ₯ν•˜μ—¬ ν•™μŠ΅ 속도가 맀우 빠름. - **Biological Plausibility:** λ‡Œμ„Έν¬μ˜ νŠΉμ • μž„κ³„μΉ˜ μ΄μƒμ—μ„œλ§Œ λ°˜μ‘ν•˜λŠ” νŠΉμ„±μ„ 일뢀 λͺ¨λ°©. - **의의:** λ”₯λŸ¬λ‹μ΄ 'ν•™μŠ΅ κ°€λŠ₯ν•œ μˆ˜μ€€'으둜 λ‚΄λ €μ˜€κ²Œ λ§Œλ“  결정적인 곡신 쀑 ν•˜λ‚˜μ΄λ©°, AlexNet 이후 μ‚¬μ‹€μƒμ˜ ν‘œμ€€(De facto standard)으둜 자리 작음. ## ⚠️ λͺ¨μˆœ 및 μ—…λ°μ΄νŠΈ (Contradictions & RL Update) - **κ³Όκ±° λ°μ΄ν„°μ™€μ˜ 좩돌:** 음수 μ˜μ—­μ—μ„œ κΈ°μšΈκΈ°κ°€ 0이 λ˜μ–΄ λ‰΄λŸ°μ΄ μ˜μ›νžˆ μ£½μ–΄λ²„λ¦¬λŠ” 'Dying ReLU' λ¬Έμ œμ— μ§λ©΄ν–ˆμœΌλ‚˜, 이λ₯Ό ν•΄κ²°ν•˜κΈ° μœ„ν•΄ Leaky ReLU, ELU, GeLU(BERTμ—μ„œ μ‚¬μš©) λ“± λ‹€μ–‘ν•œ λ³€ν˜• λͺ¨λΈμ΄ λ“±μž₯ν•˜λ©° 보완됨. - **μ •μ±… λ³€ν™”:** Antigravity ν”„λ‘œμ νŠΈλŠ” μ—μ΄μ „νŠΈμ˜ λ‚΄λΆ€ μΆ”λ‘  신경망 섀계 μ‹œ, ν•™μŠ΅ 속도와 μ•ˆμ •μ„±μ˜ κ· ν˜•μ„ μœ„ν•΄ 기본적으둜 ReLU ν˜Ήμ€ κ·Έ λ³€ν˜•μΈ GeLUλ₯Ό ν™œμ„±ν™” ν•¨μˆ˜λ‘œ 채택함. ## πŸ”— 지식 μ—°κ²° (Graph) - Deep-Learning-Foundations, Backpropagation-Foundations, [[Optimization-in-AI|Optimization-in-AI]], Neural-Architecture-Design - **Raw Source:** 10_Wiki/Topics/AI/ReLU-Activation-Functions.md