--- id: P-REINFORCE-AUTO-AGAR-001 category: Art confidence_score: 0.98 tags: [auto-reinforced, agent-architecture, ai-agents, cognitive-architecture, modular-design] last_reinforced: 2026-04-20 --- # [[Agent Architecture|Agent Architecture]] ## πŸ“Œ ν•œ 쀄 톡찰 (The Karpathy Summary) > "자율 μ£Όν–‰ν•˜λŠ” μ§€λŠ₯의 λ‚΄λΆ€ ꡬ쑰: λ‹¨μˆœνžˆ 닡을 λ‚΄λŠ” λͺ¨λΈμ„ λ„˜μ–΄, κΈ°μ–΅(Memory), κ³„νš(Planning), 도ꡬ ν™œμš©(Tool Use) κΈ°λŠ₯을 유기적으둜 κ²°ν•©ν•˜μ—¬ λ…λ¦½μ μœΌλ‘œ λ―Έμ…˜μ„ μˆ˜ν–‰ν•˜λŠ” μ—μ΄μ „νŠΈμ˜ λ‡Œ 섀계." ## πŸ“– κ΅¬μ‘°ν™”λœ 지식 (Synthesized Content) μ—μ΄μ „νŠΈ μ•„ν‚€ν…μ²˜(Agent Architecture)λŠ” 인곡지λŠ₯이 ν™˜κ²½μ„ μΈμ‹ν•˜κ³ , μΆ”λ‘ ν•˜λ©°, λͺ©ν‘œ 달성을 μœ„ν•΄ ν–‰λ™ν•˜λŠ” 일련의 과정을 κ΅¬μ‘°ν™”ν•œ 섀계λ₯Ό μ˜λ―Έν•©λ‹ˆλ‹€. 1. **AI μ—μ΄μ „νŠΈμ˜ 4λŒ€ ꡬ성 μš”μ†Œ**: * **Brain (The LLM)**: 핡심적인 μΆ”λ‘  및 μ˜μ‚¬κ²°ν•© μ—”μ§„. * **Planning**: λͺ©ν‘œλ₯Ό ν•˜μœ„ νƒœμŠ€ν¬λ‘œ λΆ„ν•΄(Task Decomposition) 및 μžκ°€ μ„±μ°°(Self-reflection). * **Memory**: * **Short-term**: ν˜„μž¬ λŒ€ν™”μ˜ λ§₯락 (Context Window). * **Long-term**: μ™ΈλΆ€ λ°μ΄ν„°λ² μ΄μŠ€ μ—°κ²° (RAG, Vector DB). * **Tools (Action)**: μ½”λ“œλ₯Ό μ‹€ν–‰ν•˜κ±°λ‚˜ APIλ₯Ό ν˜ΈμΆœν•˜μ—¬ ν˜„μ‹€ 세계에 영ν–₯을 λ―ΈμΉ˜λŠ” μˆ˜λ‹¨. 2. **μ•„ν‚€ν…μ²˜ νŒ¨ν„΄**: * **ReAct**: Reason + Actλ₯Ό 순차적으둜 λ°˜λ³΅ν•˜μ—¬ 문제 ν•΄κ²°. * **Plan-and-Execute**: 전체 κ³„νšμ„ λ¨Όμ € μ„Έμš°κ³  ν•˜λ‚˜μ”© μ‹€ν–‰. * **Multi-Agent**: μ „λ¬Έν™”λœ μ—¬λŸ¬ μ—μ΄μ „νŠΈκ°€ ν˜‘μ—…ν•˜λŠ” ꡬ쑰. ## ⚠️ λͺ¨μˆœ 및 μ—…λ°μ΄νŠΈ (Contradictions & RL Update) - **κ³Όκ±° λ°μ΄ν„°μ™€μ˜ 좩돌**: κ³Όκ±°μ—λŠ” ν•˜λ‚˜μ˜ κ±°λŒ€ λͺ¨λΈμ΄ λͺ¨λ“  κ±Έ λ‹€ ν•˜λŠ” 'Single-model' μ •μ±…μ΄μ—ˆμœΌλ‚˜, ν˜„λŒ€μ˜ κ³ λ‚œλ„ νƒœμŠ€ν¬ μˆ˜ν–‰ 정책은 각 κΈ°λŠ₯을 λͺ¨λ“ˆν™”ν•˜κ³  순차적으둜 μ—°κ²°ν•˜λŠ” '에이전틱 μ›Œν¬ν”Œλ‘œμš°(Agentic Workflow) μ •μ±…'으둜 νŒ¨λŸ¬λ‹€μž„μ„ μ „ν™˜ν•¨(RL Update). - **μ •μ±… λ³€ν™”(RL Update)**: μ—μ΄μ „νŠΈμ˜ 자율 ν†΅μ œ 뢈λŠ₯ 리슀크λ₯Ό λ°©μ–΄ν•˜κΈ° μœ„ν•΄, λ§€ 행동 λ‹¨κ³„λ§ˆλ‹€ 인간이 μŠΉμΈν•˜κ±°λ‚˜ κ·œμΉ™μ„ κ²€μ¦ν•˜λŠ” 'Human-in-the-loop μ—μ΄μ „νŠΈ κ±°λ²„λ„ŒμŠ€' 정책이 μ‚°μ—… ν‘œμ€€μœΌλ‘œ 채택됨. ## πŸ”— 지식 μ—°κ²° (Graph) - [[Ps-Reinforce|Ps-Reinforce]], Foundational Models, [[Workflow-Integrity|Workflow-Integrity]], Self-Correction Mechanisms, [[Tool-Usage-Optimization|Tool-Usage-Optimization]] - **Modern Tech/Tools**: LangChain, AutoGPT, BabyAGI, Microsoft AutoGen, LangGraph. ---