---
id: AI-LORA-001
category: "10_Wiki/💡 Topics/AI"
confidence_score: 1.0
tags: [ai, llm, lora, peft, fine-tuning, model-optimization]
last_reinforced: 2026-04-26
---

# Low-Rank Adaptation (LoRA)

## 📌 One-Line Insight (The Karpathy Summary)
> "Leave the giant brain (the base model) untouched, and attach only a very thin bundle of nerves (low-rank matrices) to teach it new skills."
> LoRA freezes the original weights of a large language model and learns the weight change ($\Delta W$) as the product of two small matrices. This cuts the number of trainable parameters by as much as about 10,000x (the figure reported for GPT-3 175B) while still fine-tuning effectively.

## 📖 Structured Knowledge (Synthesized Content)
- **Extracted pattern:** "Efficient Parameter Update". This is a PEFT (Parameter-Efficient Fine-Tuning) pattern built on the insight that the change a model undergoes during adaptation has a low intrinsic dimension. Instead of retraining everything, it captures only the essential change and transfers the new knowledge efficiently.
- **How it works:**
  - **Freezing:** None of the pretrained model's weights are updated.
  - **Low-Rank Decomposition:** The update to a weight matrix $W_0 \in \mathbb{R}^{d \times k}$ is defined as $\Delta W = BA$, with $B \in \mathbb{R}^{d \times r}$, $A \in \mathbb{R}^{r \times k}$, and rank $r \ll \min(d, k)$. Only $A$ and $B$ are trained.
  - **Merging:** After training, the learned product is added into the base weights ($W = W_0 + BA$), so the model can be served with no additional inference latency.
- **Significance:** LoRA makes it possible to adapt large models to a specific domain on far more modest GPU hardware than full fine-tuning needs, which sharply lowers the barrier to building personalized AI and specialized enterprise models.
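The freeze, decompose, and merge steps above can be sketched in a few lines of plain Python. This is a minimal illustration rather than a training loop: the sizes `d`, `k`, `r`, the helper names `matmul`, `add`, and `lora_param_counts`, and the random values standing in for trained factors are all assumptions made for the example. A real setup would use a tensor library instead of nested lists.

```python
import random


def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]


def add(X, Y):
    """Element-wise sum of two matrices with the same shape."""
    return [[x + y for x, y in zip(rx, ry)] for rx, ry in zip(X, Y)]


def lora_param_counts(d, k, r):
    """Return (full, lora) trainable-parameter counts for one d x k weight matrix."""
    return d * k, r * (d + k)


random.seed(0)
d, k, r = 8, 6, 2  # illustrative sizes (hypothetical, not from the note)

# Freezing: the base weight W0 (d x k) is never updated during fine-tuning.
W0 = [[random.gauss(0, 1) for _ in range(k)] for _ in range(d)]

# Low-rank decomposition: only B (d x r) and A (r x k) would be trained.
# Random values stand in here for factors that have already been trained.
B = [[random.gauss(0, 0.1) for _ in range(r)] for _ in range(d)]
A = [[random.gauss(0, 0.1) for _ in range(k)] for _ in range(r)]

x = [[random.gauss(0, 1)] for _ in range(k)]  # one input column vector (k x 1)

# Unmerged forward pass: frozen base path plus the low-rank path.
h_unmerged = add(matmul(W0, x), matmul(B, matmul(A, x)))

# Merging: fold B A into the base weight once, then serve with a single matmul.
W_merged = add(W0, matmul(B, A))
h_merged = matmul(W_merged, x)

# Both paths compute the same output up to floating-point rounding.
max_diff = max(abs(u[0] - m[0]) for u, m in zip(h_unmerged, h_merged))
print(f"max difference between merged and unmerged outputs: {max_diff:.2e}")

full, lora = lora_param_counts(4096, 4096, 8)
print(f"trainable parameters: full={full}, lora={lora}")
```

For a single 4096 x 4096 matrix at rank 8, the count drops from 16,777,216 trainable values to 65,536, a 256x reduction for that matrix. Because merging yields one ordinary weight matrix, serving the adapted model costs the same as serving the base model.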
## ⚠️ Contradictions & RL Update
- **Conflict with prior data:** LoRA overturned the belief that full fine-tuning is required for good performance. It was shown to match or even exceed full fine-tuning on a range of tasks, and it has become a standard tuning technique in the modern LLM ecosystem.
- **Policy change:** When teaching an agent a user's particular coding style or document format, the Antigravity project uses LoRA by default so that the agent can learn efficiently without degrading the base model's capabilities.

## 🔗 Knowledge Links (Graph)
- [[LLM|LLM]], Transfer-Learning-Foundations, [[Inference-Optimization|Inference-Optimization]], [[Local-Brain-Management|Local-Brain-Management]]
- **Raw Source:** 10_Wiki/Topics/AI/Low-Rank-Adaptation-LoRA.md