--- id: VEC-DB-001 category: "10_Wiki/๐Ÿ’ก Topics/AI" confidence_score: 1.0 tags: [ai, infrastructure, vector-database, rag, search-engine] last_reinforced: 2026-04-26 --- # Vector Database Selection (๋ฒกํ„ฐ DB ์„ ์ •) ## ๐Ÿ“Œ ํ•œ ์ค„ ํ†ต์ฐฐ (The Karpathy Summary) > "๋ฐ์ดํ„ฐ์˜ ์–‘, ์†๋„, ์˜ˆ์‚ฐ์— ๋งž๋Š” ์ตœ์ ์˜ '์ง€์‹ ์ €์žฅ์†Œ'๋ฅผ ์„ ํƒํ•˜๋ผ" โ€” RAG ์•„ํ‚คํ…์ฒ˜์˜ ํ•ต์‹ฌ์ธ ๋ฒกํ„ฐ ์ž„๋ฒ ๋”ฉ ๋ฐ์ดํ„ฐ๋ฅผ ์ €์žฅํ•˜๊ณ  ์œ ์‚ฌ๋„ ๊ฒ€์ƒ‰(ANN)์„ ์ˆ˜ํ–‰ํ•˜๊ธฐ ์œ„ํ•œ DB ์†”๋ฃจ์…˜ ๋น„๊ต ๋ฐ ์„ ์ • ๊ธฐ์ค€. ## ๐Ÿ“– ๊ตฌ์กฐํ™”๋œ ์ง€์‹ (Synthesized Content) - **์ถ”์ถœ๋œ ํŒจํ„ด:** ํ”„๋กœ์ ํŠธ์˜ ํ™•์žฅ์„ฑ, ์ง€์—ฐ ์‹œ๊ฐ„(Latency) ์š”๊ตฌ์‚ฌํ•ญ, ๊ธฐ์กด ๊ธฐ์ˆ  ์Šคํƒ๊ณผ์˜ ์ •ํ•ฉ์„ฑ์„ ๊ณ ๋ คํ•˜์—ฌ ์ตœ์ ์˜ ๋ฒกํ„ฐ ๊ฒ€์ƒ‰ ์—”์ง„์„ ๋งค์นญํ•˜๋Š” ์ธํ”„๋ผ ๊ฒฐ์ • ํŒจํ„ด. - **์ฃผ์š” ๋น„๊ต๊ตฐ:** - **Dedicated Vector DBs:** Milvus, Pinecone, Weaviate, Qdrant. ๊ณ ์„ฑ๋Šฅ ์ „๋ฌธ ๊ธฐ๋Šฅ ์ œ๊ณต. - **Integrated Solutions:** pgvector (PostgreSQL), Elasticsearch/OpenSearch. ๊ธฐ์กด DB์— ๋ฒกํ„ฐ ๊ฒ€์ƒ‰ ๊ธฐ๋Šฅ ์ถ”๊ฐ€. ๊ด€๋ฆฌ๊ฐ€ ์šฉ์ดํ•จ. - **Lightweight/Local:** Chroma, FAISS. ํ”„๋กœํ† ํƒ€์ดํ•‘์ด๋‚˜ ์—ฃ์ง€ ํ™˜๊ฒฝ์— ์ ํ•ฉ. - **์„ ์ • ๊ธฐ์ค€:** - **Performance:** ์ดˆ๋‹น ์ฟผ๋ฆฌ ์ฒ˜๋ฆฌ๋Ÿ‰(QPS) ๋ฐ ๊ฒ€์ƒ‰ ์ •ํ™•๋„(Recall). - **Scalability:** ์ˆ˜์–ต ๊ฑด ์ด์ƒ์˜ ๋ฐ์ดํ„ฐ ์ฒ˜๋ฆฌ ์‹œ ๋ถ„์‚ฐ ํด๋Ÿฌ์Šคํ„ฐ๋ง ์ง€์› ์—ฌ๋ถ€. - **Filtering:** ์†์„ฑ ๋ฐ์ดํ„ฐ(Metadata)์™€ ๋ฒกํ„ฐ ๊ฒ€์ƒ‰์„ ๋™์‹œ์— ์ง€์›ํ•˜๋Š”์ง€(Hybrid Search). - **Cloud vs On-premise:** ๊ด€๋ฆฌํ˜• ์„œ๋น„์Šค ์„ ํ˜ธ ์—ฌ๋ถ€. ## โš ๏ธ ๋ชจ์ˆœ ๋ฐ ์—…๋ฐ์ดํŠธ (Contradictions & RL Update) - **๊ณผ๊ฑฐ ๋ฐ์ดํ„ฐ์™€์˜ ์ถฉ๋Œ:** ์ดˆ๊ธฐ์—๋Š” FAISS์™€ ๊ฐ™์€ ๋‹จ์ˆœ ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ ์œ„์ฃผ์˜€์œผ๋‚˜, ํ˜„๋Œ€ RAG ์‹œ์Šคํ…œ์—์„œ๋Š” ๋ฐ์ดํ„ฐ ๋ฌด๊ฒฐ์„ฑ๊ณผ ๋ฉ”ํƒ€๋ฐ์ดํ„ฐ ํ•„ํ„ฐ๋ง์ด ๊ฐ•์กฐ๋˜๋ฉฐ ์ „๋ฌธ ๋ฒกํ„ฐ DB ์„œ๋น„์Šค๊ฐ€ ์ฃผ๋ฅ˜๋กœ ๋ถ€์ƒ. - **์ •์ฑ… ๋ณ€ํ™”:** Antigravity ํ”„๋กœ์ ํŠธ๋Š” ์ดˆ๊ธฐ ๋กœ์ปฌ ๊ฐœ๋ฐœ ์‹œ Chroma๋ฅผ ์‚ฌ์šฉํ•˜๋ฉฐ, ๋Œ€๊ทœ๋ชจ ์ง€์‹ ํ™•์žฅ์„ ์œ„ํ•ด pgvector ๋˜๋Š” Pinecone์œผ๋กœ์˜ ์ „ํ™˜ ์‹œ๋‚˜๋ฆฌ์˜ค๋ฅผ ์„ค๊ณ„ํ•จ. ## ๐Ÿ”— ์ง€์‹ ์—ฐ๊ฒฐ (Graph) - [[Retrieval-Augmented-Generation-RAG|Retrieval-Augmented-Generation-RAG]], [[Semantic-Search|Semantic-Search]], [[LlamaIndex|LlamaIndex]], System-Design-for-AI-Scale - **Raw Source:** 10_Wiki/Topics/AI/Vector-Database Selection.md