---
id: P-REINFORCE-AI-LORA
category: "10_Wiki/💡 Topics/AI"
confidence_score: 1.00
tags: [AI, LLM, LoRA, FineTuning, Efficiency]
last_reinforced: 2026-04-20
---

# [[LoRA (Low-Rank Adaptation)|LoRA (Low-Rank Adaptation)]]

## 📌 One-Line Insight (The Karpathy Summary)

> "A revolution that leaves the mountain where it is and just slips one very thin insole into the shoe."

LoRA is a parameter-efficient fine-tuning technique: instead of touching an entire model with billions of parameters, it trains only a pair of very small additional matrices (A, B) to update what the model knows.

## 📖 Structured Knowledge (Synthesized Content)

- **The Core Idea**: Builds on the observation that the weight change a model undergoes during fine-tuning ($\Delta W$) has a low intrinsic rank, so it can be approximated well by a low-rank matrix.
- **Mechanism**:
  - The original weight $W \in \mathbb{R}^{d \times k}$ stays frozen, and two small trainable matrices, $A \in \mathbb{R}^{d \times r}$ and $B \in \mathbb{R}^{r \times k}$ with $r \ll \min(d, k)$, are placed alongside it.
  - $W_{new} = W + (A \times B)$, where $\times$ is the matrix product.
- **Unbelievable Efficiency**:
  - Training as little as about 0.01% of the total parameters can give performance comparable to full fine-tuning; the exact fraction depends on the rank $r$ and on how many weight matrices receive an adapter.
  - Instead of a multi-gigabyte model, only a 'LoRA weights file' of a few megabytes needs to be stored and shared.

## ⚠️ Contradictions and Updates (RL Update)

- LoRA is efficient, but for large-scale multimodal training or for acquiring fundamentally new base knowledge it can fall slightly short of full fine-tuning (Full Fine-tuning). Separately, on the memory side, **QLoRA** combines LoRA with quantization of the frozen base weights (stored in 4-bit precision), so that a large language model can be tuned on a single consumer-grade GPU. This is helping drive the 'democratization of AI'.

## 🔗 Knowledge Links (Graph)

- Related: [[Instruction-Tuning|Instruction-Tuning]], Quantization
- Variant: QLoRA (Quantized LoRA)
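
## 🧮 Worked Sketch (Mechanism in Code)

The Mechanism and Unbelievable Efficiency bullets can be made concrete with a small sketch. It is an illustration under stated assumptions, not a reference implementation: $W$ is taken to be $d \times k$, $A$ is $d \times r$, $B$ is $r \times k$, inputs are row vectors, and the helper names (`matmul`, `add`, `lora_param_counts`, `lora_forward`, `merge`) are hypothetical. The sizes 4096 and 8 are made-up examples, and the $\alpha / r$ scaling factor that practical implementations usually apply to the update is left out for brevity.

```python
def matmul(X, Y):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)]
            for row in X]


def add(X, Y):
    """Element-wise sum of two matrices with the same shape."""
    return [[x + y for x, y in zip(rx, ry)] for rx, ry in zip(X, Y)]


def lora_param_counts(d, k, r):
    """Trainable parameters for one d x k weight: (full fine-tuning, LoRA)."""
    full = d * k          # every entry of W is trainable
    lora = r * (d + k)    # A is d x r, B is r x k; W itself stays frozen
    return full, lora


def lora_forward(x, W, A, B):
    """Compute x W + (x A) B without ever building the d x k product A B."""
    base = matmul(x, W)                # frozen path
    update = matmul(matmul(x, A), B)   # low-rank trainable path
    return add(base, update)


def merge(W, A, B):
    """Fold the adapter into the base weight: W_new = W + A B."""
    return add(W, matmul(A, B))


# Parameter arithmetic for one hypothetical 4096 x 4096 layer at rank 8.
full, lora = lora_param_counts(4096, 4096, 8)
print(f"full: {full:,}  lora: {lora:,}  ratio: {lora / full:.4%}")

# Tiny hand-checkable example (d=3, k=2, r=1): both paths agree.
x = [[1.0, 2.0, 3.0]]
W = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
A = [[1.0], [0.0], [2.0]]
B = [[0.5, -1.0]]
print(lora_forward(x, W, A, B))
print(matmul(x, merge(W, A, B)))
```

For this single layer the adapter holds 65,536 trainable values against 16,777,216 in $W$, about 0.39%. The whole-model fraction is usually lower still, because adapters are typically attached to only a subset of the weight matrices and everything else stays frozen. The last two lines print the same vector, which shows why a trained adapter can either be kept as a separate small file or merged into $W$ so that inference costs nothing extra.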