---
id: AI-OPT-PRUNE-001
category: "10_Wiki/💡 Topics/AI"
confidence_score: 1.0
tags: [ai, deep-learning, pruning, model-compression, optimization, inference-speedup, efficiency]
last_reinforced: 2026-04-26
---

# Pruning Techniques (가지치기 기법)

## 📌 One-Line Insight (The Karpathy Summary)
> "Boldly cut away surplus connections (weights) as long as the model's intelligence is not damaged, and let the model be reborn as a light, agile, 'field-ready' intelligence."

Pruning is a model compression technique that removes the parameters with the least influence on a neural network's output, reducing model size and compute cost while keeping the loss in performance to a minimum.

## 📖 Structured Knowledge (Synthesized Content)
- **Extracted pattern:** "Importance-based Sparsification and Fine-tuning". Parameters with a low contribution, judged by weight magnitude or gradient information, are either set to zero (masking) or removed outright, and the remaining parameters are then retrained to recover performance.
- **Main categories:**
  - **Unstructured Pruning:** Removes individual elements of the weight matrix at arbitrary positions, chosen by importance rather than by any structural pattern (high compression ratio, but hard to accelerate on hardware).
  - **Structured Pruning:** Removes whole filters, channels, or entire layers (translates directly into faster computation).
  - **Global vs Local:** Whether the bottom n% of parameters is selected across the whole model or separately within each layer.
- **Significance:** A core optimization strategy that lets deep learning models run in real time on mobile devices (On-device AI) and embedded systems without expensive servers.

## ⚠️ Contradictions and Updates (Contradictions & RL Update)
- **Conflict with past data:** The field is moving away from the "bigger is always better" view that more parameters are unconditionally an advantage. The Lottery Ticket Hypothesis, which holds that a dense network contains sparse subnetworks able to match the full network's accuracy when trained in isolation, is drawing attention, as is the observation that a suitably pruned model sometimes generalizes better because noise has been removed.
- **Policy change:** When deploying the agent's local execution module, the Antigravity project applies a structured pruning pipeline that shrinks the model to one quarter of its original size or less while maintaining accuracy.

## 🔗 Knowledge Links (Graph)
- [[Quantization-Foundations|Quantization-Foundations]], Knowledge-Distillation-Foundations, Model-Compression-and-Deployment, [[Optimization-in-AI|Optimization-in-AI]]
- **Raw Source:** 10_Wiki/Topics/AI/Pruning-Techniques.md
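
The categories in this note (unstructured vs structured, global vs local) can be made concrete with a minimal pure-Python sketch. It assumes weight magnitude as the importance score and a simple L1 row norm for the structured case; the function names (`threshold_for`, `prune_local`, `prune_global`, `prune_rows`) and the toy `layers` dictionary are hypothetical illustrations, not part of the Antigravity pipeline or any library. The fine-tuning step that would normally follow is omitted.

```python
from typing import Dict, List

Matrix = List[List[float]]


def threshold_for(values: List[float], sparsity: float) -> float:
    """Magnitude at or below which weights are pruned for a target sparsity."""
    k = int(len(values) * sparsity)  # number of weights to drop
    if k == 0:
        return -1.0  # no magnitude is <= -1, so nothing is pruned
    # Ties at the threshold are all pruned, so sparsity can exceed the target.
    return sorted(abs(v) for v in values)[k - 1]


def mask(w: Matrix, t: float) -> Matrix:
    """Zero out (mask) every weight whose magnitude is not above t."""
    return [[v if abs(v) > t else 0.0 for v in row] for row in w]


def prune_local(layers: Dict[str, Matrix], sparsity: float) -> Dict[str, Matrix]:
    """Unstructured, local: each layer drops its own lowest-magnitude fraction."""
    return {
        name: mask(w, threshold_for([v for row in w for v in row], sparsity))
        for name, w in layers.items()
    }


def prune_global(layers: Dict[str, Matrix], sparsity: float) -> Dict[str, Matrix]:
    """Unstructured, global: one threshold computed over all layers together."""
    everything = [v for w in layers.values() for row in w for v in row]
    t = threshold_for(everything, sparsity)
    return {name: mask(w, t) for name, w in layers.items()}


def prune_rows(w: Matrix, keep: int) -> Matrix:
    """Structured: keep the `keep` rows (e.g. output channels) with largest L1 norm."""
    ranked = sorted(range(len(w)), key=lambda i: sum(abs(v) for v in w[i]), reverse=True)
    return [w[i] for i in sorted(ranked[:keep])]  # preserve original row order


# Toy weights (made up for illustration): fc2 is much smaller in magnitude than fc1.
layers = {
    "fc1": [[0.9, -0.5], [0.4, 0.1]],
    "fc2": [[0.03, -0.01], [0.04, 0.02]],
}

local = prune_local(layers, 0.5)    # each layer loses half of its weights
global_ = prune_global(layers, 0.5)  # fc2 is zeroed entirely, fc1 is untouched
slim_fc1 = prune_rows(layers["fc1"], keep=1)  # the matrix itself gets smaller
```

With these toy values, global pruning wipes out `fc2` completely because all of its weights rank below those of `fc1`, which is one reason per-layer (local) budgets are sometimes preferred. Note also that masking leaves the matrix shape unchanged, whereas `prune_rows` returns a genuinely smaller matrix, which is why structured pruning maps more directly to faster computation.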