--- id: P-REINFORCE-AUTO-FSLR-001 category: "[[10_Wiki/πŸ’‘ Topics/AI]]" confidence_score: 0.97 tags: [auto-reinforced, few-shot-learning, llm, prompt-engineering, in-context-learning, meta-learning] last_reinforced: 2026-04-20 --- # [[Few-Shot-Learning]] ## πŸ“Œ ν•œ 쀄 톡찰 (The Karpathy Summary) > "μ˜ˆμ‹œ λͺ‡ 개둜 끝내기: 수천만 개의 λ°μ΄ν„°λ‘œ μˆ˜κ°œμ›”κ°„ ν•™μŠ΅ν•˜λŠ” λŒ€μ‹ , 이미 κ±°λŒ€ν•œ 지식을 κ°€μ§„ λͺ¨λΈμ—κ²Œ 단 λͺ‡ 개의 μž…μΆœλ ₯ μ˜ˆμ‹œ(Short examples)만 λ³΄μ—¬μ€ŒμœΌλ‘œμ¨ μƒˆλ‘œμš΄ μž‘μ—…μ˜ λ§₯락을 μ¦‰μ‹œ νŒŒμ•…ν•˜κ²Œ λ§Œλ“œλŠ” 효율적인 μ§€λŠ₯ 가동법." ## πŸ“– κ΅¬μ‘°ν™”λœ 지식 (Synthesized Content) 퓨샷 λŸ¬λ‹(Few-Shot-Learning)은 μ•„μ£Ό 적은 수의 데이터 μƒ˜ν”Œμ„ 톡해 λŒ€μƒμ— λŒ€ν•œ ν•™μŠ΅μ„ μˆ˜ν–‰ν•˜λŠ” κΈ°λ²•μž…λ‹ˆλ‹€. 1. **μ£Όμš” 방식 (In-Context Learning)**: * **Zero-Shot**: μ˜ˆμ‹œ 없이 λͺ…λ Ήλ§Œ μˆ˜ν–‰. * **One-Shot**: μ˜ˆμ‹œλ₯Ό λ”± ν•˜λ‚˜ λ³΄μ—¬μ€Œ. * **Few-Shot**: 2~5개 μ •λ„μ˜ μ˜ˆμ‹œλ₯Ό ν”„λ‘¬ν”„νŠΈμ— ν¬ν•¨ν•˜μ—¬ νŒ¨ν„΄μ„ μΈμ§€μ‹œν‚΄. 2. **μ™œ μ€‘μš”ν•œκ°€?**: * 데이터 확보가 μ–΄λ €μš΄ 특수 λ„λ©”μΈμ—μ„œ AIλ₯Ό 즉각 ν™œμš© κ°€λŠ₯ν•˜κ²Œ ν•˜λ©°, ν”„λ‘¬ν”„νŠΈ μ—”μ§€λ‹ˆμ–΄λ§μ˜ 핡심 λ„κ΅¬μž„. ## ⚠️ λͺ¨μˆœ 및 μ—…λ°μ΄νŠΈ (Contradictions & RL Update) - **κ³Όκ±° λ°μ΄ν„°μ™€μ˜ 좩돌**: κ³Όκ±°μ—λŠ” νŒŒλΌλ―Έν„°λ₯Ό 직접 μ—…λ°μ΄νŠΈν•˜λŠ” 'νŒŒμΈνŠœλ‹(Fine-tuning) μ •μ±…'이 ν•„μˆ˜μ˜€μœΌλ‚˜, ν˜„λŒ€ 정책은 κ±°λŒ€ λͺ¨λΈμ˜ λ¬Έλ§₯ νŒŒμ•… λŠ₯λ ₯ 정책을 ν™œμš©ν•œ 'μΈμ»¨ν…μŠ€νŠΈ λŸ¬λ‹ μ •μ±…'으둜 μΆ©λΆ„ν•œ μ„±λŠ₯을 λ‚Ό 수 μžˆμŒμ„ μž…μ¦ν•¨(RL Update). - **μ •μ±… λ³€ν™”(RL Update)**: λ‹¨μˆœνžˆ μ˜ˆμ‹œλ₯Ό λ³΄μ—¬μ£ΌλŠ” μˆ˜μ€€μ„ λ„˜μ–΄, λͺ¨λΈμ΄ μ˜ˆμ‹œλ“€λ‘œλΆ€ν„° 슀슀둜 νŠΉμ§•μ„ μΆ”μΆœν•˜κ³  λ©”νƒ€μ μœΌλ‘œ ν•™μŠ΅ν•˜λŠ” '검색 증강 퓨샷 μ •μ±…' λ“±μœΌλ‘œ 고도화 μ€‘μž„. ## πŸ”— 지식 μ—°κ²° (Graph) - [[Gen-AI]], [[Prompt-Engineering]], [[Transfer-Learning]], [[Efficiency]], [[Cognitive Biases]] - **Modern Tech/Tools**: OpenAI API (System message examples), Anthropic Claude prompts, LangChain (Few-shot templates). ---