---
id: [[P-Reinforce|P-Reinforce]]-AUTO-PELR-001
category: Unified
confidence_score: 1.00
tags: [auto-reinforced, peft, lora, qlora, fine-tuning-optimization, vram-efficiency]
last_reinforced: 2026-05-04
---

# [[PEFT & LoRA|PEFT & LoRA]]

## 📌 One-Line Insight (The Karpathy Summary)

> "Maximum effect from minimal change: instead of touching all of a giant model's billions of parameters, train only a tiny adapter. This is efficiency taken to the extreme, and it makes tuning state-of-the-art AI possible even on a personal PC."

## 📖 Structured Knowledge (Synthesized Content)

PEFT (Parameter-Efficient Fine-Tuning) is the umbrella term for fine-tuning techniques that leave most of a model's weights untouched and train only a very small subset of parameters.

1. **LoRA (Low-Rank Adaptation)**:
    * **Principle**: Keeps the model's weight matrix ($W$) frozen and learns only the update ($\Delta W = BA$), expressed as the product of two small low-rank matrices ($A, B$).
    * **Advantage**: Cuts the number of trainable parameters by up to 10,000x (the figure reported for GPT-3 175B) while matching the quality of full fine-tuning. After training, the update can be merged into the base weights ($W + \Delta W$) with little effort.
2. **QLoRA (Quantized LoRA)**:
    * **Principle**: Quantizes the base model to 4 bits before loading it into VRAM, then applies LoRA on top of the frozen quantized weights.
    * **Significance**: Makes it possible to fine-tune a 65B (65 billion parameter) model on a single 48GB GPU, and models of around 33B on a single 24GB consumer GPU (RTX 3090/4090). The 4-bit weights of a 65B model alone take roughly 32.5GB, so they do not fit in 24GB.
3. **Other PEFT techniques**:
    * **Prefix Tuning**: Prepends trainable virtual tokens (a prefix) to the input, with trainable prefix vectors at each layer.
    * **Prompt Tuning**: Trains a small set of soft prompt embeddings at the input layer while the model itself stays frozen.
    * **Adapter Tuning**: Inserts small bottleneck layers between the existing transformer layers.

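
The LoRA update and the QLoRA memory argument above can be illustrated with a small, dependency-free sketch. It is not how Hugging Face PEFT or any other library implements LoRA; the helper names (`matmul`, `lora_param_counts`, `weight_memory_gb`), the toy matrix sizes, and the `alpha / r` scaling values are assumptions chosen for illustration. It checks that the unmerged adapter path and the merged weight give the same output, and compares parameter counts for one weight matrix.

```python
import random

def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]

def add(X, Y):
    """Element-wise sum of two same-shaped matrices."""
    return [[x + y for x, y in zip(rx, ry)] for rx, ry in zip(X, Y)]

def scale(X, s):
    """Multiply every entry of X by the scalar s."""
    return [[s * x for x in row] for row in X]

def rand_matrix(rows, cols, rng, std=0.02):
    """Random Gaussian matrix (illustrative initialisation only)."""
    return [[rng.gauss(0.0, std) for _ in range(cols)] for _ in range(rows)]

def lora_param_counts(d_out, d_in, r):
    """Trainable parameters for one matrix: (full fine-tuning, LoRA)."""
    return d_out * d_in, r * (d_in + d_out)

def weight_memory_gb(n_params, bits):
    """Memory for the weights alone, in GB (1e9 bytes).

    Ignores quantization constants, activations, adapters and optimizer state.
    """
    return n_params * bits / 8 / 1e9

rng = random.Random(0)
d_out, d_in, r, alpha = 6, 8, 2, 4     # toy sizes, assumed for the example

W = rand_matrix(d_out, d_in, rng)      # frozen pretrained weight
A = rand_matrix(r, d_in, rng)          # trainable, shape (r, d_in)
B = rand_matrix(d_out, r, rng)         # trainable, shape (d_out, r); random here
                                       # to mimic an already trained adapter
x = rand_matrix(d_in, 1, rng)          # one input column vector

# Low-rank update: delta_W = (alpha / r) * B A, same shape as W.
delta_W = scale(matmul(B, A), alpha / r)

# Unmerged inference: base path plus the extra low-rank path.
y_adapter = add(matmul(W, x), scale(matmul(B, matmul(A, x)), alpha / r))

# Merged inference: fold the update into W once, then a single matmul.
W_merged = add(W, delta_W)
y_merged = matmul(W_merged, x)

max_diff = max(abs(a[0] - b[0]) for a, b in zip(y_adapter, y_merged))
full, lora = lora_param_counts(4096, 4096, 8)

print("max |adapter - merged| =", max_diff)
print("trainable params (full vs LoRA):", full, lora, "ratio:", full // lora)
print("65B @ 4-bit weights (GB):", weight_memory_gb(65_000_000_000, 4))
print("33B @ 4-bit weights (GB):", weight_memory_gb(33_000_000_000, 4))
```

For a single 4096 x 4096 matrix at rank 8, this gives 65,536 trainable parameters instead of 16,777,216, a 256x reduction for that matrix. Model-wide reductions are larger when only some matrices receive adapters. Because the merged and unmerged outputs agree, merging removes the extra inference path without changing results.
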
## ⚖️ Trade-offs & Caveats

* **Inference latency**: Adapter-style methods add extra computation at inference time, which can slow generation slightly (LoRA avoids this by merging its update into the base weights).
* **Limits on complex tasks**: When a very large or complex body of new knowledge has to be injected, quality can fall somewhat short of full fine-tuning.

## 🔗 Knowledge Links (Graph)

* **Parent concept**: [[Fine-Tuning & Alignment|Fine-Tuning & Alignment]]
* **Related technologies**: [[Quantization|Quantization]], [[LLM Architecture|LLM Architecture]]
* **Key libraries**: Hugging Face PEFT, Unsloth, Axolotl

---
*Last updated: 2026-05-04*