---
category: Unified
tags: [auto-consolidated, technical-documentation]
title: PEFT (Parameter-Efficient Fine-Tuning)
last_updated: 2026-05-02
---

# PEFT (Parameter-Efficient Fine-Tuning)

## 📌 Brief Summary

> "Swap the bulb, not the whole utility pole: an efficiency-maximizing technique that leaves nearly all of a large model's parameters untouched and trains only a tiny fraction (under 1%), injecting specialized knowledge without a heavy hardware burden."

---

> "Maximize the model's expertise without changing all of its weights": a tuning strategy that keeps most of a large model's weights frozen and trains only a small number of added parameters or a few layers, balancing performance against cost.

## 📖 Core Content

Parameter-Efficient Fine-Tuning (PEFT) is a family of methods that, when adapting a large language model (LLM) to a specific task, trains only a small number of additional parameters instead of updating all of the weights.

1. **Key techniques**:
    * **[[LoRA (Low-Rank Adaptation)|LoRA (Low-Rank Adaptation)]]**: Factors the change in a weight matrix into two low-dimensional matrices (A, B) and trains only those. It is the most widely used technique and sharply reduces the number of trainable parameters and the memory needed for training.
    * **Adapters**: Inserts small neural networks (adapters) between the existing model layers and trains only those parts.
    * **Prompt Tuning / Prefix Tuning**: Tunes the model by prepending trainable virtual "soft prompt" vectors to the model input.
2. **Core benefits**:
    * **GPU memory savings**: Large models can be tuned on consumer GPUs without a high-end server.
    * **Avoiding parameter silos**: There is no need to store a full copy of the large model for every task; only the small PEFT module (checkpoint) is saved and swapped in as needed.
    * **Mitigating catastrophic forgetting**: Because the original weights stay frozen, the model's base knowledge is largely preserved.
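The LoRA idea described above (freeze the weight matrix W, train only the two small matrices A and B) can be sketched in a few lines of plain Python. This is a minimal illustration, not the HuggingFace PEFT implementation: the helper names (`matvec`, `lora_forward`, `trainable_fraction`), the toy dimensions, the `alpha` scaling value, and the random seed are all assumptions chosen for the example. Initializing B to zero so that the update starts at zero follows the usual LoRA formulation.

```python
import random


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(m * v for m, v in zip(row, vector)) for row in matrix]


def lora_forward(x, w_frozen, lora_a, lora_b, alpha):
    """Return W x + (alpha / r) * B (A x); only A and B would be trained."""
    rank = len(lora_a)
    base = matvec(w_frozen, x)                   # frozen pre-trained path
    update = matvec(lora_b, matvec(lora_a, x))   # low-rank trainable path
    return [b + (alpha / rank) * u for b, u in zip(base, update)]


def trainable_fraction(d_in, d_out, rank):
    """Trainable LoRA parameters relative to one d_out x d_in weight matrix."""
    return rank * (d_in + d_out) / (d_in * d_out)


# Toy sizes, assumed for illustration only.
random.seed(0)
d_in, d_out, rank = 8, 6, 2
w_frozen = [[random.gauss(0, 1) for _ in range(d_in)] for _ in range(d_out)]
lora_a = [[random.gauss(0, 0.01) for _ in range(d_in)] for _ in range(rank)]
lora_b = [[0.0] * rank for _ in range(d_out)]    # zero init: update starts at 0
x = [random.gauss(0, 1) for _ in range(d_in)]

y = lora_forward(x, w_frozen, lora_a, lora_b, alpha=4.0)
print(y == matvec(w_frozen, x))            # True while B is still all zeros
print(trainable_fraction(4096, 4096, 8))   # 0.00390625
```

For a hypothetical 4096 x 4096 projection with rank 8, the trainable share comes out to about 0.39% of that matrix's parameters, which is consistent with the "under 1%" figure in the summary. The saved adapter consists only of A and B, which is why checkpoints stay small and can be swapped per task.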
---

- **Extracted pattern:** Preserve the model's core knowledge (pre-trained weights) while efficiently learning, and then deploying, only the small adjustments that a specific task needs.
- **Key techniques:**
    - **[[LoRA (Low-Rank Adaptation)|LoRA (Low-Rank Adaptation)]]:** Approximates the change in a weight matrix with a low-rank matrix product and trains only that.
    - **Prefix Tuning:** Prepends trainable virtual tokens (a prefix) to the input data to steer the model's behavior.
    - **Adapter Modules:** Inserts very small neural network layers between the existing layers and trains only those parts.
    - **Prompt Tuning:** Learns the prompt itself in vector form to find the most effective instruction.
- **Advantages:** Far less training compute, model storage savings (adapters on the order of megabytes), and adapters for different tasks that can be managed independently.

## ⚖️ Trade-offs & Caveats

- **Conflict with earlier data**: Early on there was concern that "training only a subset will hurt performance," but studies have reported that PEFT performs on par with full fine-tuning and, on some tasks, does better by curbing overfitting.
- **Policy change (RL Update)**: In environments where corporate security policy makes a cloud API hard to use, "on-premise PEFT", which tunes a local model on in-house data safely and at low cost, has emerged as a core data-governance strategy.

---

- **Conflict with earlier data:** The industry standard has shifted from full fine-tuning, which retrains every parameter, toward PEFT, which emphasizes resource efficiency.
- **Policy change:** The Antigravity project adopted PEFT (LoRA in particular) as its default for training on new wiki domains, cutting hardware costs by more than 90%.

## 🔗 Knowledge Connections

- **Related**: [[SFT (Supervised Fine-Tuning)|SFT (Supervised Fine-Tuning)]], Foundational Models, [[Transfer Learning|Transfer Learning]], [[Large Language Models (LLM)|Large Language Models (LLM)]]
- **Modern Tech/Tools**: HuggingFace PEFT library, LoRA, QLoRA.

---

- [[Low-Rank-Adaptation-LoRA|Low-Rank-Adaptation-LoRA]], [[Fine-tuning|Fine-Tuning]], [[LLM|LLM]], Transfer-Learning
- **Raw Source:** 10_Wiki/Topics/AI/Parameter-Efficient Fine-Tuning (PEFT).md