--- id: DL-ACT-NLIN-001 category: "10_Wiki/๐Ÿ’ก Topics/AI" confidence_score: 1.0 tags: [ai, deep-learning, activation-functions, non-linearity, relu, sigmoid] last_reinforced: 2026-04-26 --- # Non-linear Activation Functions (๋น„์„ ํ˜• ํ™œ์„ฑํ™” ํ•จ์ˆ˜) ## ๐Ÿ“Œ ํ•œ ์ค„ ํ†ต์ฐฐ (The Karpathy Summary) > "๋‹จ์กฐ๋กœ์šด ์ง์„ ์˜ ์„ธ๊ณ„์— '๊ตด๊ณก'์„ ๋ถ€์—ฌํ•˜์—ฌ, ์‹ ๊ฒฝ๋ง์ด ์„ธ์ƒ์˜ ๋ชจ๋“  ๋ณต์žกํ•œ ํ•จ์ˆ˜๋ฅผ ๊ทผ์‚ฌํ•  ์ˆ˜ ์žˆ๋Š” ๋ฌดํ•œํ•œ ํ‘œํ˜„๋ ฅ์„ ๊ฐ–๊ฒŒ ํ•˜๋ผ" โ€” ๊ฐ ๋‰ด๋Ÿฐ์˜ ์ถœ๋ ฅ์„ ๋น„์„ ํ˜•์ ์œผ๋กœ ๋ณ€ํ™˜ํ•จ์œผ๋กœ์จ ์‹ฌ์ธต ์‹ ๊ฒฝ๋ง์ด ์„ ํ˜•์ ์ธ ํ•œ๊ณ„๋ฅผ ๊ทน๋ณตํ•˜๊ณ  ๊ณ ์ฐจ์›์ ์ธ ํŒจํ„ด์„ ํ•™์Šตํ•˜๊ฒŒ ๋งŒ๋“œ๋Š” ํ•ต์‹ฌ ์žฅ์น˜. ## ๐Ÿ“– ๊ตฌ์กฐํ™”๋œ ์ง€์‹ (Synthesized Content) - **์ถ”์ถœ๋œ ํŒจํ„ด:** "Linear Combination and Non-linear Transformation" โ€” ์ž…๋ ฅ์„ ๊ฐ€์ค‘ํ•ฉํ•œ ๊ฒฐ๊ณผ๋ฅผ ๊ทธ๋Œ€๋กœ ๋‚ด๋ณด๋‚ด์ง€ ์•Š๊ณ  ํŠน์ • ์ž„๊ณ„๊ฐ’์—์„œ ๊บพ๊ฑฐ๋‚˜(ReLU), 0๊ณผ 1 ์‚ฌ์ด๋กœ ์••์ถ•(Sigmoid)ํ•˜๋Š” ๋ณ€ํ™˜์„ ํ†ตํ•ด ์ธต์„ ์Œ“์„์ˆ˜๋ก ๋ชจ๋ธ์˜ ์ง€๋Šฅ์  ๊นŠ์ด๊ฐ€ ๊นŠ์–ด์ง€๊ฒŒ ํ•˜๋Š” ํŒจํ„ด. - **์ฃผ์š” ํ•จ์ˆ˜:** - **ReLU (Rectified Linear Unit):** ์Œ์ˆ˜๋ฉด 0, ์–‘์ˆ˜๋ฉด ๊ทธ๋Œ€๋กœ. ์—ฐ์‚ฐ์ด ๋น ๋ฅด๊ณ  ๊ธฐ์šธ๊ธฐ ์†Œ์‹ค(Vanishing Gradient) ๋ฌธ์ œ๋ฅผ ํฌ๊ฒŒ ๊ฐœ์„ . - **Sigmoid:** 0๊ณผ 1 ์‚ฌ์ด์˜ ํ™•๋ฅ ๊ฐ’ ๋ฐ˜ํ™˜. ์ดˆ๊ธฐ ์‹ ๊ฒฝ๋ง์˜ ํ‘œ์ค€์ด์—ˆ์œผ๋‚˜ ํ˜„์žฌ๋Š” ์ถœ๋ ฅ์ธต์—์„œ ์ฃผ๋กœ ์‚ฌ์šฉ. - **Tanh:** -1๊ณผ 1 ์‚ฌ์ด๋กœ ์••์ถ•ํ•˜์—ฌ ๋ฐ์ดํ„ฐ์˜ ์ค‘์‹ฌ์„ 0์œผ๋กœ ๋งž์ถค. - **Leaky ReLU/GELU:** ReLU์˜ ๋‹จ์ (Dying ReLU)์„ ๋ณด์™„ํ•œ ์ตœ์‹  ๋ณ€์ข…๋“ค. - **์˜์˜:** ๋น„์„ ํ˜• ํ™œ์„ฑํ™” ํ•จ์ˆ˜๊ฐ€ ์—†๋‹ค๋ฉด ์•„๋ฌด๋ฆฌ ๊นŠ์€ ์‹ ๊ฒฝ๋ง๋„ ๋‹จ์ผ ๋ ˆ์ด์–ด์˜ ์„ ํ˜• ํšŒ๊ท€์™€ ์ˆ˜ํ•™์ ์œผ๋กœ ๋™์ผํ•ด์ง€๋ฉฐ, ๋”ฅ๋Ÿฌ๋‹์ด๋ผ๋Š” ํ•™๋ฌธ ์ž์ฒด๊ฐ€ ์„ฑ๋ฆฝํ•˜์ง€ ์•Š๊ฒŒ ๋จ. ## โš ๏ธ ๋ชจ์ˆœ ๋ฐ ์—…๋ฐ์ดํŠธ (Contradictions & RL Update) - **๊ณผ๊ฑฐ ๋ฐ์ดํ„ฐ์™€์˜ ์ถฉ๋Œ:** ์‹œ๊ทธ๋ชจ์ด๋“œ๊ฐ€ ๊ฐ€์žฅ ์ธ๊ฐ„์˜ ๋‡Œ์™€ ๋‹ฎ์•„ ์ตœ์„ ์ด๋ผ๋Š” ๋ฏฟ์Œ์—์„œ ๋ฒ—์–ด๋‚˜, ์ด์ œ๋Š” ํ•™์Šต์˜ ์•ˆ์ •์„ฑ๊ณผ ์†๋„๋ฅผ ์œ„ํ•ด ReLU ๊ณ„์—ด๊ณผ ํŠธ๋žœ์Šคํฌ๋จธ์—์„œ ์“ฐ์ด๋Š” GELU ๋“ฑ์ด ์‹ค์งˆ์ ์ธ ํ‘œ์ค€์œผ๋กœ ์ž๋ฆฌ ์žก์Œ. - **์ •์ฑ… ๋ณ€ํ™”:** Antigravity ํ”„๋กœ์ ํŠธ๋Š” ๋Œ€๊ทœ๋ชจ ์–ธ์–ด ๋ชจ๋ธ ์•„ํ‚คํ…์ฒ˜ ์„ค๊ณ„ ์‹œ, ์ˆ˜ํ•™์  ๋ถ€๋“œ๋Ÿฌ์›€๊ณผ ์„ฑ๋Šฅ ์ตœ์ ํ™”๊ฐ€ ๊ฒ€์ฆ๋œ SwiGLU ๋˜๋Š” GELU ํ™œ์„ฑํ™” ํ•จ์ˆ˜๋ฅผ ๊ธฐ๋ณธ ์‚ฌ์–‘์œผ๋กœ ์ฑ„ํƒํ•จ. ## ๐Ÿ”— ์ง€์‹ ์—ฐ๊ฒฐ (Graph) - Activation-Functions, [[Leaky-ReLU-and-Activations|Leaky-ReLU-and-Activations]], Deep-Learning-Foundations, Backpropagation-Foundations - **Raw Source:** 10_Wiki/Topics/AI/Non-linear-Activation-Functions.md