--- id: P-REINFORCE-AUTO-ZFS-001 category: "10_Wiki/πŸ’‘ Topics/AI" confidence_score: 0.99 tags: [auto-reinforced, few-shot, zero-shot, in-context-learning, prompting, llm] last_reinforced: 2026-04-20 --- # [[Zero Shot and Few Shot Learning|Zero Shot and Few Shot Learning]] ## πŸ“Œ ν•œ 쀄 톡찰 (The Karpathy Summary) > "μ˜ˆμ‹œκ°€ μžˆλŠλƒ μ—†λŠλƒμ˜ 차이: μΆ”κ°€ ν•™μŠ΅ 없이 λͺ…λ Ήμ–΄λ§ŒμœΌλ‘œ 문제λ₯Ό ν‘ΈλŠ” '생지λŠ₯(Zero Shot)'κ³Ό, 두세 개의 힌트λ₯Ό μ£Όμ–΄ λͺ¨λΈμ˜ λ°©ν–₯성을 μž‘μ•„μ£ΌλŠ” '힌트 ν•™μŠ΅(Few Shot)'을 톡해 AI의 λ²”μš©μ„±μ„ κ·ΉλŒ€ν™”ν•˜λŠ” 기법." ## πŸ“– κ΅¬μ‘°ν™”λœ 지식 (Synthesized Content) μ œλ‘œμƒ· 및 퓨샷 ν•™μŠ΅(Zero Shot and Few Shot Learning)은 κ±°λŒ€ μ–Έμ–΄ λͺ¨λΈ(LLM)이 사전 ν•™μŠ΅λœ λ°©λŒ€ν•œ 지식을 ν™œμš©ν•˜μ—¬, λͺ…μ‹œμ μΈ νŒŒλΌλ―Έν„° μ—…λ°μ΄νŠΈ 없이 μƒˆλ‘œμš΄ νƒœμŠ€ν¬λ₯Ό μˆ˜ν–‰ν•˜κ²Œ λ§Œλ“œλŠ” 기법(In-context Learning)μž…λ‹ˆλ‹€. 1. **Zero-Shot Learning**: * **μ •μ˜**: μ˜ˆμ‹œλ₯Ό 단 ν•˜λ‚˜λ„ μ£Όμ§€ μ•Šκ³  였직 λͺ…λ Ήμ–΄(Prompt)만으둜 μž‘μ—…μ„ μˆ˜ν–‰ν•˜κ²Œ 함. * **μž‘λ™**: λͺ¨λΈμ΄ 이미 배운 보편적 κ°œλ… κ°„μ˜ 상관관계λ₯Ό μ΄μš©ν•΄ μΆ”λ‘ . * **예**: "λ‹€μŒ λ¬Έμž₯을 ν”„λž‘μŠ€μ–΄λ‘œ λ²ˆμ—­ν•΄: [λ¬Έμž₯]" 2. **Few-Shot Learning**: * **μ •μ˜**: ν”„λ‘¬ν”„νŠΈ μ•ˆμ— ν•΄κ²°ν•˜κ³ μž ν•˜λŠ” λ¬Έμ œμ™€ μœ μ‚¬ν•œ μ˜ˆμ‹œ(Few examples, 보톡 1~5개)λ₯Ό ν¬ν•¨μ‹œμΌœ 전달. * **μž‘λ™**: μ˜ˆμ‹œμ˜ 'νŒ¨ν„΄'κ³Ό 'ν˜•μ‹'을 μ¦‰μ„μ—μ„œ λͺ¨λ°©ν•˜μ—¬ 정확도λ₯Ό λΉ„μ•½μ μœΌλ‘œ λ†’μž„. * **예**: "μž…λ ₯: 사과 -> 좜λ ₯: 과일, μž…λ ₯: μžλ™μ°¨ -> 좜λ ₯: νƒˆκ²ƒ, μž…λ ₯: μ—°ν•„ -> 좜λ ₯: [λͺ¨λΈμ˜ μ •λ‹΅]" 3. **One-Shot Learning**: μ˜ˆμ‹œλ₯Ό λ”± ν•˜λ‚˜λ§Œ μ£ΌλŠ” 쀑간 단계. ## ⚠️ λͺ¨μˆœ 및 μ—…λ°μ΄νŠΈ (Contradictions & RL Update) - **κ³Όκ±° λ°μ΄ν„°μ™€μ˜ 좩돌**: κ³Όκ±°μ—λŠ” μƒˆλ‘œμš΄ νƒœμŠ€ν¬λ₯Ό μœ„ν•΄ 수만 개의 데이터λ₯Ό ν†΅ν•œ 전인 ꡐ윑(Fine-tuning)이 ν•„μˆ˜μ˜€μœΌλ‚˜, ν˜„λŒ€μ˜ LLM 정책은 단 λͺ‡ 개의 μ˜ˆμ‹œλ§ŒμœΌλ‘œλ„ μ „λ¬Έ 지식을 흉내 λ‚΄λŠ” 'In-context Learning μ •μ±…'λ§ŒμœΌλ‘œλ„ μΆ©λΆ„ν•œ μ„±λŠ₯을 λ‚Ό 수 μžˆμŒμ„ 증λͺ…함(RL Update). - **μ •μ±… λ³€ν™”(RL Update)**: 데이터 κ°œμΈμ •λ³΄ 보호 정책이 강화됨에 따라, 데이터λ₯Ό λͺ¨λΈμ— μ£Όμž…ν•΄ ν•™μŠ΅μ‹œν‚€μ§€ μ•Šκ³  ν”„λ‘¬ν”„νŠΈ μˆ˜μ€€μ—μ„œλ§Œ ν™œμš©ν•˜μ—¬ νœ˜λ°œμ‹œν‚€λŠ” 'ν”„λΌμ΄λ²„μ‹œ μΉœν™”μ  μ œλ‘œμƒ· μΆ”λ‘  μ •μ±…'이 κΈ°μ—…μš© AI ν™œμš©μ˜ ν‘œμ€€μ΄ 됨. ## πŸ”— 지식 μ—°κ²° (Graph) - Foundational Models, [[Prompt-Engineering|Prompt-Engineering]], [[Transfer Learning|Transfer Learning]], [[SFT (Supervised Fine-Tuning)|SFT (Supervised Fine-Tuning)]], NLP (μžμ—°μ–΄ 처리) - **Modern Tech/Tools**: LangChain prompt templates, Meta Llama-3 few-shot benchmarks. ---