--- id: LLM-PARAM-EFF-001 category: "10_Wiki/πŸ’‘ Topics/AI" confidence_score: 1.0 tags: [ai, llm, peft, lora, parameter-efficiency, fine-tuning, optimization] last_reinforced: 2026-04-26 --- # Parameter Efficiency in LLMs (LLMμ—μ„œμ˜ νŒŒλΌλ―Έν„° νš¨μœ¨μ„±) ## πŸ“Œ ν•œ 쀄 톡찰 (The Karpathy Summary) > "μˆ˜μ²œμ–΅ 개의 νŒŒλΌλ―Έν„°λ₯Ό μ „λΆ€ 흔듀지 말고, 핡심적인 'μž‘μ€ λ ˆλ²„'λ“€λ§Œ μ‘°μ •ν•˜μ—¬ κ±°λŒ€ μ§€λŠ₯을 λ‚΄ λͺ©μ μ— 맞게 길듀여라" β€” κ±°λŒ€ μ–Έμ–΄ λͺ¨λΈ(LLM)을 전체 λ―Έμ„Έ μ‘°μ •(Full Fine-tuning)ν•˜λŠ” λŒ€μ‹ , 극히 μΌλΆ€μ˜ νŒŒλΌλ―Έν„°λ§Œ ν•™μŠ΅μ‹œμΌœ μ»΄ν“¨νŒ… μžμ›μ„ 획기적으둜 μ ˆμ•½ν•˜λ©΄μ„œλ„ 높은 μ„±λŠ₯을 λ‹¬μ„±ν•˜λŠ” 기술(PEFT). ## πŸ“– κ΅¬μ‘°ν™”λœ 지식 (Synthesized Content) - **μΆ”μΆœλœ νŒ¨ν„΄:** "Low-rank Adaptation and Additive Modules" β€” λͺ¨λΈμ˜ κΈ°μ‘΄ κ°€μ€‘μΉ˜λŠ” κ³ μ •(Freeze)ν•΄λ‘” 채, μ˜†μ— μ•„μ£Ό μž‘μ€ 크기의 행렬을 λ§λΆ™μ΄κ±°λ‚˜(LoRA) μž…λ ₯ ν”„λ‘¬ν”„νŠΈ μ•žμ— ν•™μŠ΅ κ°€λŠ₯ν•œ 벑터λ₯Ό λΆ™μ—¬(Prompt Tuning) λ³€ν™”μ˜ μ–‘λ§Œμ„ 효율적으둜 ν•™μŠ΅ν•˜λŠ” νŒ¨ν„΄. - **μ£Όμš” 기법:** - **LoRA (Low-Rank Adaptation):** κ°€μ€‘μΉ˜ μ—…λ°μ΄νŠΈλŸ‰μ„ 저차원 ν–‰λ ¬λ‘œ λΆ„ν•΄ν•˜μ—¬ ν•™μŠ΅ νŒŒλΌλ―Έν„°λ₯Ό 10,000λ°° 이상 κ°μ†Œμ‹œν‚΄. - **Adapter Tuning:** λͺ¨λΈμ˜ μΈ΅ 사이에 μž‘μ€ 신경망(Adapter)을 μ‚½μž…ν•˜μ—¬ ν•™μŠ΅. - **Prefix/Prompt Tuning:** μž…λ ₯값에 νŠΉμˆ˜ν•œ μž„λ² λ”©μ„ μΆ”κ°€ν•˜μ—¬ λͺ¨λΈμ˜ 좜λ ₯을 μ œμ–΄. - **의의:** κ³ κ°€μ˜ GPU ν΄λŸ¬μŠ€ν„° 없이도 μ€‘μ†ŒκΈ°μ—…μ΄λ‚˜ 개인이 μžμ‹ μ˜ 데이터에 νŠΉν™”λœ κ³ μ„±λŠ₯ LLM을 ꡬ좕할 수 있게 ν•˜μ—¬, AI의 μ‹€μš©μ  μ»€μŠ€ν„°λ§ˆμ΄μ§• μ‹œλŒ€λ₯Ό μ—΄μ—ˆμŒ. ## ⚠️ λͺ¨μˆœ 및 μ—…λ°μ΄νŠΈ (Contradictions & RL Update) - **κ³Όκ±° λ°μ΄ν„°μ™€μ˜ 좩돌:** νŒŒλΌλ―Έν„°λ₯Ό 적게 ν•™μŠ΅μ‹œν‚€λ©΄ μ„±λŠ₯이 λ–¨μ–΄μ§ˆ κ²ƒμ΄λΌλŠ” μš°λ €μ™€ 달리, νŠΉμ • 도메인 μ΅œμ ν™”μ—μ„œλŠ” 였히렀 전체 λ―Έμ„Έ 쑰정보닀 과적합이 적고 μ•ˆμ •μ μΈ μ„±λŠ₯을 λ‚΄λŠ” κ²½μš°κ°€ λ§Žλ‹€λŠ” 것이 μž…μ¦λ¨. - **μ •μ±… λ³€ν™”:** Antigravity ν”„λ‘œμ νŠΈλŠ” μ—μ΄μ „νŠΈμ˜ νŠΉμ • 도메인(의료, 법λ₯ , μ½”λ”© λ“±) μ „λ¬Έμ„± κ°•ν™” μ‹œ, 전체 λͺ¨λΈ μž¬ν•™μŠ΅ λŒ€μ‹  LoRA 기반의 νŒŒλΌλ―Έν„° 효율적 ν•™μŠ΅ 방식을 ν‘œμ€€μœΌλ‘œ μ‚¬μš©ν•¨. ## πŸ”— 지식 μ—°κ²° (Graph) - [[Low-Rank-Adaptation-LoRA|Low-Rank-Adaptation-LoRA]], Transfer-Learning-Foundations, [[Natural-Language-Processing-NLP|Natural-Language-Processing-NLP]], [[Hardware-Acceleration-for-AI|Hardware-Acceleration-for-AI]] - **Raw Source:** 10_Wiki/Topics/AI/Parameter-Efficiency-in-LLMs.md