---
id: PEFT-001
category: "10_Wiki/💡 Topics/AI"
confidence_score: 1.0
tags: [ai, llm, fine-tuning, peft, efficiency]
last_reinforced: 2026-04-26
---

# Parameter-Efficient Fine-Tuning (PEFT)

## 📌 One-Line Insight (The Karpathy Summary)

> "Maximize a model's expertise without changing all of its weights." A tuning strategy that keeps most of a large model's weights frozen and trains only a small number of added parameters or a few layers, capturing both performance and cost efficiency.

## 📖 Structured Knowledge (Synthesized Content)

- **Extracted pattern:** Preserve the model's core knowledge (the pre-trained weights) while efficiently training and deploying only the small adjustments a specific task needs.
- **Key techniques:**
  - **LoRA (Low-Rank Adaptation):** Approximates the change to a weight matrix as a product of low-rank matrices and trains only those factors.
  - **Prefix Tuning:** Prepends trainable virtual tokens (a prefix) to the input to steer the model's behavior.
  - **Adapter Modules:** Inserts very small neural network layers between existing layers and trains only those inserted layers.
  - **Prompt Tuning:** Learns the prompt itself as continuous vectors rather than hand-written text, searching for the most effective instruction.
- **Advantages:** Sharply reduced training compute, model storage savings (adapters are on the order of MB), and adapters for different tasks can be managed independently.

## ⚠️ Contradictions & Updates (Contradictions & RL Update)

- **Conflict with earlier data:** The industry standard has shifted from full fine-tuning, which retrains every parameter, toward PEFT, which emphasizes resource efficiency.
- **Policy change:** The Antigravity project adopts PEFT (LoRA in particular) as the default when training on a new wiki domain, cutting hardware costs by more than 90%.

## 🔗 Knowledge Links (Graph)

- [[Low-Rank-Adaptation-LoRA|Low-Rank-Adaptation-LoRA]], [[Fine-tuning|Fine-Tuning]], [[LLM|LLM]], Transfer-Learning
- **Raw Source:** 10_Wiki/Topics/AI/Parameter-Efficient Fine-Tuning (PEFT).md
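
The LoRA entry in this note describes approximating a weight update with a low-rank matrix product. The following is a minimal pure-Python sketch of that idea, not the Antigravity project's implementation. The helper names (`matmul`, `lora_effective_weight`, `param_counts`) are hypothetical. The `alpha / r` scaling follows the convention commonly used for LoRA and is an assumption here, since this note does not specify hyperparameters.

```python
# Minimal LoRA sketch in pure Python (illustrative; all names are hypothetical).

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]

def lora_effective_weight(W, A, B, alpha, r):
    """Return W + (alpha / r) * (B @ A) without modifying the frozen W."""
    scale = alpha / r          # assumed scaling convention, not from this note
    delta = matmul(B, A)       # (d_out x r) @ (r x d_in) -> (d_out x d_in)
    return [[w + scale * d for w, d in zip(w_row, d_row)]
            for w_row, d_row in zip(W, delta)]

def param_counts(d_out, d_in, r):
    """Trainable parameters for one matrix: (full fine-tuning, LoRA)."""
    return d_out * d_in, r * (d_out + d_in)

# Frozen pre-trained weight (2 x 3); it is never updated during training.
W = [[1.0, 0.0, 2.0],
     [0.0, 1.0, 0.0]]

# Trainable rank-1 factors. In practice B usually starts at zero so the
# adapted model initially matches the pre-trained one; non-zero values are
# used here only to make the effect visible.
B = [[1.0], [2.0]]         # d_out x r
A = [[0.5, 0.0, 1.0]]      # r x d_in

W_eff = lora_effective_weight(W, A, B, alpha=2.0, r=1)
full, lora = param_counts(d_out=4096, d_in=4096, r=8)

print(W_eff)
print(full, lora, f"{lora / full:.2%}")
```

For a single 4096 x 4096 matrix at rank 8, the sketch counts 65,536 trainable parameters against 16,777,216 for full fine-tuning, roughly 0.39%. This is why adapters can be stored in megabytes and swapped per task while the base weights stay shared.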