--- id: [[P-Reinforce|P-Reinforce]]-AUTO-RRHS-001 category: Unified confidence_score: 1.00 tags: [auto-reinforced, reranking, hybrid-search, semantic-search, lexical-search, bm25] last_reinforced: 2026-05-04 --- # [[Reranking & Hybrid Search|Reranking & Hybrid Search]] ## ๐Ÿ“Œ ํ•œ ์ค„ ํ†ต์ฐฐ (The Karpathy Summary) > "๊ฒ€์ƒ‰์˜ ํ•„ํ„ฐ๋ง๊ณผ ์žฌ์กฐํ•ฉ: ๋‹จ์ˆœํ•œ ์˜๋ฏธ์  ์œ ์‚ฌ์„ฑ(Dense)๊ณผ ์ •ํ™•ํ•œ ํ‚ค์›Œ๋“œ ๋งค์นญ(Sparse)์„ ๊ฒฐํ•ฉํ•˜๊ณ , ํ›„๋ณด๊ตฐ์„ ๋‹ค์‹œ ํ•œ๋ฒˆ ์ •๋ฐ€ ๊ฒ€์‚ฌํ•˜์—ฌ ๋ชจ๋ธ์—๊ฒŒ ๊ฐ€์žฅ ์™„๋ฒฝํ•œ ๊ทผ๊ฑฐ๋ฅผ ์ œ๊ณตํ•˜๋Š” 2๋‹จ๊ณ„ ๊ฒ€์ฆ ์‹œ์Šคํ…œ." ## ๐Ÿ“– ๊ตฌ์กฐํ™”๋œ ์ง€์‹ (Synthesized Content) RAG ์‹œ์Šคํ…œ์˜ ๊ฒ€์ƒ‰ ์ •ํ™•๋„๋ฅผ ๊ทน๋Œ€ํ™”ํ•˜๊ธฐ ์œ„ํ•ด ๋‘ ๊ฐ€์ง€ ์ด์ƒ์˜ ๊ฒ€์ƒ‰ ๋ฐฉ์‹์„ ๊ฒฐํ•ฉํ•˜๊ณ  ๊ฒฐ๊ณผ๋ฅผ ์žฌ์ •๋ ฌํ•˜๋Š” ๊ธฐ๋ฒ•์ž…๋‹ˆ๋‹ค. 1. **Hybrid Search (ํ•˜์ด๋ธŒ๋ฆฌ๋“œ ๊ฒ€์ƒ‰)**: * **Dense Retrieval (์ž„๋ฒ ๋”ฉ ๊ฒ€์ƒ‰)**: ๋ฌธ๋งฅ๊ณผ ์˜๋ฏธ๋ฅผ ํŒŒ์•…ํ•˜์—ฌ ์œ ์‚ฌํ•œ ์ •๋ณด๋ฅผ ์ฐพ์Šต๋‹ˆ๋‹ค. (์˜ˆ: "๊ธˆ์œต ์œ„๊ธฐ"์™€ "๊ฒฝ์ œ ๊ณตํ™ฉ") * **Sparse Retrieval (ํ‚ค์›Œ๋“œ ๊ฒ€์ƒ‰)**: BM25 ๋“ฑ์„ ์‚ฌ์šฉํ•˜์—ฌ ์ •ํ™•ํ•œ ๋‹จ์–ด ๋งค์นญ์„ ์ˆ˜ํ–‰ํ•ฉ๋‹ˆ๋‹ค. (์˜ˆ: ์ œํ’ˆ๋ช…, ๊ณ ์œ  ๋ช…์‚ฌ ๊ฒ€์ƒ‰) * **Reciprocal Rank Fusion (RRF)**: ๋‘ ๊ฒ€์ƒ‰ ๊ฒฐ๊ณผ์˜ ์ˆœ์œ„๋ฅผ ์ˆ˜ํ•™์ ์œผ๋กœ ๊ฒฐํ•ฉํ•˜์—ฌ ์ตœ์ข… ํ›„๋ณด๊ตฐ์„ ์‚ฐ์ถœํ•ฉ๋‹ˆ๋‹ค. 2. **Reranking (์žฌ์ˆœ์œ„ํ™”)**: * **ํ•„์š”์„ฑ**: 1์ฐจ ๊ฒ€์ƒ‰(Vector Search)์€ ์ˆ˜๋ฐฑ๋งŒ ๊ฐœ ์ค‘ ํ›„๋ณด๋ฅผ ๋นจ๋ฆฌ ์ฐพ๋Š” ๋ฐ ์ตœ์ ํ™”๋˜์–ด ์žˆ์–ด ์ •๋ฐ€๋„๊ฐ€ ๋‹ค์†Œ ๋‚ฎ์„ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. * **์ž‘๋™**: 1์ฐจ ๊ฒ€์ƒ‰์œผ๋กœ ๋ฝ‘ํžŒ ์ˆ˜์‹ญ ๊ฐœ์˜ ํ›„๋ณด๊ตฐ์— ๋Œ€ํ•ด, ํ›จ์”ฌ ๋ฌด๊ฒ๊ณ  ์ •๋ฐ€ํ•œ Cross-Encoder ๋ชจ๋ธ์„ ์‚ฌ์šฉํ•˜์—ฌ ์งˆ๋ฌธ๊ณผ์˜ ๊ด€๋ จ์„ฑ์„ ๋‹ค์‹œ ๊ณ„์‚ฐํ•˜๊ณ  ์ˆœ์œ„๋ฅผ ์žฌ๋ฐฐ์น˜ํ•ฉ๋‹ˆ๋‹ค. 3. **ํšจ๊ณผ**: * ๊ฒ€์ƒ‰ ๊ฒฐ๊ณผ์˜ ์ƒ์œ„๊ถŒ(Top-K)์— ์‹ค์ œ ์ •๋‹ต์ด ํฌํ•จ๋  ํ™•๋ฅ (Recall)๊ณผ ์ •๋‹ต๋งŒ ํฌํ•จ๋  ํ™•๋ฅ (Precision)์„ ๋™์‹œ์— ๋†’์ž…๋‹ˆ๋‹ค. ## โš–๏ธ Trade-offs & Caveats * **์ง€์—ฐ ์‹œ๊ฐ„**: Reranking ๋‹จ๊ณ„๋Š” ์ถ”๊ฐ€์ ์ธ ๋ชจ๋ธ ์—ฐ์‚ฐ์„ ํ•„์š”๋กœ ํ•˜๋ฏ€๋กœ, ์ „์ฒด ์‘๋‹ต ์†๋„๊ฐ€ ์ˆ˜๋ฐฑ ๋ฐ€๋ฆฌ์ดˆ ์ด์ƒ ๋А๋ ค์งˆ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. * **๋น„์šฉ**: ๊ณ ์„ฑ๋Šฅ Reranker ๋ชจ๋ธ์„ ์‚ฌ์šฉํ•  ๊ฒฝ์šฐ API ํ˜ธ์ถœ ๋น„์šฉ์ด๋‚˜ GPU ์ž์› ์†Œ๋ชจ๊ฐ€ ๋Š˜์–ด๋‚ฉ๋‹ˆ๋‹ค. ## ๐Ÿ”— ์ง€์‹ ์—ฐ๊ฒฐ (Graph) * **์ƒ์œ„ ์‹œ์Šคํ…œ**: [[Retrieval-Augmented Generation (RAG)|Retrieval-Augmented Generation (RAG)]] * **์—ฐ๊ด€ ๊ธฐ์ˆ **: [[Vector Databases & Search|Vector Databases & Search]], [[Embedding Models & MRL|Embedding Models & MRL]] * **์ฃผ์š” ํˆด**: Cohere Rerank, BGE-Reranker, Voyage Rerank --- *Last updated: 2026-05-04*