---
id: AI-RAG-001
category: "10_Wiki/💡 Topics/AI"
confidence_score: 1.0
tags: [ai, llm, rag, retrieval-augmented-generation, vector-database, semantic-search, embeddings]
last_reinforced: 2026-04-26
---

# RAG and Document Retrieval

## 📌 One-Line Insight (The Karpathy Summary)
> "Do not rely on the model's memory alone. Have it look up evidence in a vast library of knowledge (External Knowledge) before it speaks, and so make its intelligence trustworthy."
>
> RAG retrieves, at query time, recent data or private documents that a large language model was never trained on, which raises answer accuracy and reduces hallucination.

## 📖 Structured Knowledge (Synthesized Content)
- **Extracted pattern:** "Retrieve-Read-Refine". Convert the user's question into a vector and find the most similar context in the knowledge store (Retrieve), pass that context to the model together with the question (Read), and have the model generate an accurate, evidence-grounded answer (Refine).
- **Core components:**
  - **Embeddings:** Convert the meaning of text into a sequence of numbers (a vector).
  - **Vector Database:** A store that finds the closest matches among millions of vectors almost instantly (Pinecone, Milvus, Chroma, etc.).
  - **Semantic Search:** Search based on similarity of meaning rather than simple keyword matching.
- **Significance:** New knowledge can be injected immediately without retraining (fine-tuning) the model each time, and answers can cite their sources explicitly, which earns user trust.

## ⚠️ Contradictions & Updates (Contradictions & RL Update)
- **Conflict with past data:** The earlier view was that retrieving more documents is simply better. The competitive edge in RAG performance now lies in precise reranking, which selects only the core context the model actually needs, and in the model's ability to digest long contexts.
- **Policy change:** The Antigravity project embeds an advanced RAG engine to link its 1,174 knowledge documents organically, and it requires the agent to consult the relevant documents in the wiki whenever it answers.

## 🔗 Knowledge Links (Graph)
- [[Natural-Language-Processing-NLP]], [[Prompt-Engineering-Foundations]], Vector-Database-Foundations, Knowledge-Gardening-Workflow
- **Raw Source:** 10_Wiki/Topics/AI/RAG-and-Document-Retrieval.md
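
## 🧪 Example Sketch (Retrieve-Read-Refine)
The Retrieve-Read-Refine pattern described in this note can be illustrated with a minimal, dependency-free sketch. It is not the Antigravity engine or any vector database's API. The `embed` function is a toy bag-of-words stand-in for a real embedding model, and the document ids, sample texts, and function names are all hypothetical. The sketch covers only Retrieve (cosine similarity over vectors, keeping the top `k` hits) and Read (placing the retrieved context and the question in one prompt). The Refine step would be the model's own generation from that prompt.

```python
import math
import re
from collections import Counter


def embed(text: str) -> Counter:
    """Toy embedding: word counts. A real system would call an embedding model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[term] * b[term] for term in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query: str, docs: dict[str, str], k: int = 2) -> list[tuple[str, float]]:
    """Retrieve: score every document against the query and keep the top k."""
    query_vec = embed(query)
    scored = [(doc_id, cosine(query_vec, embed(text))) for doc_id, text in docs.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    # Drop documents with no overlap at all so only relevant context is passed on.
    return [(doc_id, score) for doc_id, score in scored[:k] if score > 0.0]


def build_prompt(query: str, docs: dict[str, str], hits: list[tuple[str, float]]) -> str:
    """Read: hand the retrieved context to the model together with the question."""
    context = "\n".join(f"[{doc_id}] {docs[doc_id]}" for doc_id, _ in hits)
    return (
        "Answer using only the context below and cite the bracketed source ids.\n"
        f"Context:\n{context}\n"
        f"Question: {query}"
    )


# Hypothetical mini knowledge store (ids and texts are made up for illustration).
DOCS = {
    "embeddings": "Embeddings turn the meaning of text into a vector of numbers.",
    "vector-db": "A vector database finds the nearest vectors among millions quickly.",
    "gardening": "Knowledge gardening keeps wiki notes linked and up to date.",
}

QUESTION = "How does a vector database find similar vectors?"
hits = retrieve(QUESTION, DOCS, k=2)
prompt = build_prompt(QUESTION, DOCS, hits)
print(prompt)
```

Keeping `k` small and discarding zero-score documents reflects the point made under Contradictions & Updates: passing only the context the model needs tends to matter more than passing many documents. A production pipeline would typically replace `embed` with a learned embedding model, replace the linear scan with a vector database query, and add a dedicated reranking stage between retrieval and prompt assembly.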