---
id: [[P-Reinforce|P-Reinforce]]-AUTO-RAGM-001
category: Unified
confidence_score: 1.00
tags: [auto-reinforced, rag, retrieval-augmented-generation, knowledge-base, llm-context]
last_reinforced: 2026-05-04
---

# [[Retrieval-Augmented Generation (RAG)|Retrieval-Augmented Generation (RAG)]]

## 📌 One-Line Insight (The Karpathy Summary)

> "The open-book exam done right: instead of cramming all knowledge into the model's parameters, this pragmatic AI architecture looks up relevant information in an external knowledge store whenever it is needed and hands it to the model, raising accuracy and reducing hallucination."

## 📖 Structured Knowledge (Synthesized Content)

RAG (Retrieval-Augmented Generation) is a technique that retrieves relevant documents from an external database and includes them in the prompt, so that a large language model can use recent information or domain-specific knowledge that is absent from its training data.

1. **How it works**:
    * **Indexing**: Large documents are split into small chunks, converted into vectors, and stored.
    * **Retrieval**: The document chunks whose meaning is most similar to the user's question are looked up in the database.
    * **Generation**: The retrieved chunks are passed to the model together with the question, so that the answer is generated from that evidence.
2. **Key benefits**:
    * **Reduced hallucination**: Because the model answers while looking at source documents, it is less likely to make up facts.
    * **Freshness**: Updating the external database is enough to reflect new knowledge, with no need to retrain the model.
    * **Explainability**: Answers can cite their sources (Source/Citation) explicitly, which makes them more trustworthy.
3. **Stages of evolution**:
    * **Naive RAG**: Based on plain vector search.
    * **Advanced RAG**: Adds hybrid search, re-ranking, query transformation, and similar techniques.
    * **[[Agentic RAG|Agentic RAG]]**: An agent plans its own retrieval strategy, evaluates whether the results are adequate, and iterates in a loop.
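
The indexing, retrieval, and generation steps described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not a reference implementation: the `embed` function is a toy bag-of-words stand-in for a real embedding model, the in-memory list stands in for a vector database, and all function names, the chunk size, the sample documents, and the prompt wording are hypothetical. The sketch stops at building the prompt; sending it to a model is left out.

```python
import math
import re
from collections import Counter


def chunk(text: str, size: int = 20) -> list[str]:
    """Indexing, step 1: split text into chunks of at most `size` words."""
    words = text.split()
    return [" ".join(words[i:i + size]) for i in range(0, len(words), size)]


def embed(text: str) -> Counter:
    """Toy embedding (assumption): lowercase word counts.
    A real system would call an embedding model here."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def build_index(docs: list[str], size: int = 20) -> list[tuple[str, Counter]]:
    """Indexing, step 2: store each chunk next to its vector."""
    return [(c, embed(c)) for d in docs for c in chunk(d, size)]


def retrieve(index: list[tuple[str, Counter]], query: str, k: int = 2) -> list[str]:
    """Retrieval: return the k chunks most similar to the query."""
    q = embed(query)
    ranked = sorted(index, key=lambda item: cosine(q, item[1]), reverse=True)
    return [text for text, _ in ranked[:k]]


def build_prompt(query: str, chunks: list[str]) -> str:
    """Generation input: numbered context plus the question."""
    context = "\n".join(f"[{i + 1}] {c}" for i, c in enumerate(chunks))
    return (
        "Answer using only the numbered context. Cite sources like [1].\n\n"
        f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"
    )


# Hypothetical sample documents, used only for illustration.
docs = [
    "The warranty for the X100 router lasts two years from the purchase date.",
    "Firmware updates for the X100 router are released every quarter.",
    "Our office cafeteria serves lunch between noon and two in the afternoon.",
]
question = "How long is the X100 router warranty?"

index = build_index(docs)
top = retrieve(index, question, k=2)
prompt = build_prompt(question, top)
print(prompt)
```

Numbering the chunks in the prompt is one simple way to let the model cite its sources, which ties back to the explainability benefit. Swapping `embed` for a real embedding model and the list for a vector store would change the quality of retrieval, but not the overall shape of the pipeline.
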

## ⚖️ Trade-offs & Caveats

* **Retrieval dependency**: If the retrieved results are poor, answer quality drops sharply. (Garbage In, Garbage Out)
* **Latency**: The extra external retrieval step can make responses slower than pure generation.
* **Lost in the middle**: When too much information is retrieved and passed along, the model may miss important information located in the middle of the context.

## 🔗 Knowledge Links (Graph)

* **Parent concept**: [[LLM Application Architecture|LLM Application Architecture]]
* **Related techniques**: [[Agentic RAG|Agentic RAG]], [[GraphRAG|GraphRAG]], [[Hybrid Search|Hybrid Search]], [[Re-ranking|Re-ranking]]
* **Tooling**: [[LlamaIndex|LlamaIndex]], [[LangChain|LangChain]], [[ChromaDB|ChromaDB]], [[Pinecone|Pinecone]]

---
*Last updated: 2026-05-04*