--- id: P-REINFORCE-AI-AGENT category: "10_Wiki/πŸ’‘ Topics/AI" confidence_score: 1.0 tags: [AI Agent, Autonomy, Planning, Reasoning, Action] last_reinforced: 2026-04-20 --- # AI-μ—μ΄μ „νŠΈ-(AI-Agent) ## πŸ“Œ ν•œ 쀄 톡찰 (The Karpathy Summary) > "λ‹¨μˆœν•œ κ³„μ‚°κΈ°μ—μ„œ 자율적인 일꾼으둜." 슀슀둜 λͺ©ν‘œλ₯Ό μ„€μ •ν•˜κ³ , κ³„νšμ„ μ„Έμš°λ©°, 도ꡬ(Browser, Terminal λ“±)λ₯Ό μ‚¬μš©ν•˜μ—¬ μ£Όμ–΄μ§„ 과업을 λκΉŒμ§€ μ™„μˆ˜ν•˜λŠ” 자율적 μ§€λŠ₯체닀. ## πŸ“– κ΅¬μ‘°ν™”λœ 지식 (Synthesized Content) - **Planning & Reasoning**: - κ±°λŒ€ μ–Έμ–΄ λͺ¨λΈ(LLM)을 λ‘λ‡Œλ‘œ μ‚Όμ•„ λ³΅μž‘ν•œ 문제λ₯Ό μž‘μ€ λ‹¨κ³„λ‘œ λΆ„ν•΄(Chain-of-Thought)ν•˜κ³  μ „λž΅μ„ μˆ˜λ¦½ν•œλ‹€. - **Action & Tool Use**: - API 호좜, μ›Ή 검색, μ½”λ“œ μ‹€ν–‰ λ“± μ™ΈλΆ€ ν™˜κ²½κ³Ό μƒν˜Έμž‘μš©ν•  수 μžˆλŠ” μΈν„°νŽ˜μ΄μŠ€λ₯Ό 톡해 μ‹€μ œ 세계에 λ³€ν™”λ₯Ό μΌμœΌν‚¨λ‹€. - **Memory Management**: - λŒ€ν™”μ˜ λ§₯락(Short-term)κ³Ό κ³Όκ±° 지식(Long-term)을 RAGλ‚˜ 체크포인트 ν˜•νƒœλ‘œ μœ μ§€ν•˜μ—¬ μΌκ΄€λœ μˆ˜ν–‰ λŠ₯λ ₯을 λ³΄μœ ν•œλ‹€. ## ⚠️ λͺ¨μˆœ 및 μ—…λ°μ΄νŠΈ (RL Update) - ν˜„μž¬μ˜ μ—μ΄μ „νŠΈλŠ” 'λ¬΄ν•œ 루프'λ‚˜ 'ν™˜κ°'에 빠질 μœ„ν—˜μ΄ 크닀. 이λ₯Ό κ·Ήλ³΅ν•˜κΈ° μœ„ν•΄ μ—μ΄μ „νŠΈκ°€ μžμ‹ μ˜ 결과물을 슀슀둜 κ²€ν† ν•˜λŠ” 'Self-Correction' 루프와, 인간이 쀑간에 κ°œμž…ν•˜λŠ” 'Human-in-the-loop' 섀계가 ν•„μˆ˜μ μ΄λ‹€. ## πŸ”— 지식 μ—°κ²° (Graph) - Related: Multi-Agent-System-(닀쀑-μ—μ΄μ „νŠΈ-μ‹œμŠ€ν…œ) , Agent-Communication-Protocol-(μ—μ΄μ „νŠΈ-톡신-κ·œμ•½) - Deployment: [[Deployment_Final_Gate]]