--- id: ALGO-LSH-001 category: "10_Wiki/๐Ÿ’ก Topics/AI" confidence_score: 1.0 tags: [algorithm, search, lsh, hashing, similarity-search, big-data] last_reinforced: 2026-04-26 --- # Locality-Sensitive Hashing (LSH, ์ง€์—ญ ๋ฏผ๊ฐ ํ•ด์‹ฑ) ## ๐Ÿ“Œ ํ•œ ์ค„ ํ†ต์ฐฐ (The Karpathy Summary) > "ํ•ด์‹œ ์ถฉ๋Œ์„ '๋ฒ„๊ทธ'๊ฐ€ ์•„๋‹Œ 'ํŠน์ง•'์œผ๋กœ ํ™œ์šฉํ•˜์—ฌ, ๋‹ฎ์€๊ผด ๋ฐ์ดํ„ฐ๋“ค์„ ๊ฐ™์€ ๋ฐ”๊ตฌ๋‹ˆ์— ๋‹ด์•„๋ผ" โ€” ๋น„์Šทํ•œ ํŠน์„ฑ์„ ๊ฐ€์ง„ ๊ณ ์ฐจ์› ๋ฐ์ดํ„ฐ๋“ค์ด ๋†’์€ ํ™•๋ฅ ๋กœ ๋™์ผํ•œ ํ•ด์‹œ ๊ฐ’์„ ๊ฐ–๊ฒŒ ํ•˜์—ฌ, ์„ ํ˜• ํƒ์ƒ‰ ์—†์ด๋„ ์œ ์‚ฌํ•œ ๋ฐ์ดํ„ฐ๋ฅผ ๋งค์šฐ ๋น ๋ฅด๊ฒŒ ์ฐพ์•„๋‚ด๋Š” ํ™•๋ฅ ์  ๊ทผ์‚ฌ ๊ฒ€์ƒ‰ ๊ธฐ๋ฒ•. ## ๐Ÿ“– ๊ตฌ์กฐํ™”๋œ ์ง€์‹ (Synthesized Content) - **์ถ”์ถœ๋œ ํŒจํ„ด:** "Probabilistic Bucketing" โ€” ๋ชจ๋“  ๋ฐ์ดํ„ฐ๋ฅผ ์ „์ˆ˜ ์กฐ์‚ฌํ•˜๋Š” ๋Œ€์‹ , ์œ ์‚ฌํ•œ ๋ฐ์ดํ„ฐ๋ผ๋ฆฌ ๊ฐ™์€ ๋ฒ„ํ‚ท(Bucket)์— ๋ชจ์ด๋„๋ก ์„ค๊ณ„๋œ ํ•ด์‹œ ํ•จ์ˆ˜๋ฅผ ํ†ตํ•ด ํƒ์ƒ‰ ๋ฒ”์œ„๋ฅผ ํš๊ธฐ์ ์œผ๋กœ ์ค„์ด๋Š” ๊ณ ์† ๊ฒ€์ƒ‰ ํŒจํ„ด. - **์ž‘๋™ ์›๋ฆฌ:** - **Projection:** ๊ณ ์ฐจ์› ๋ฒกํ„ฐ๋ฅผ ์ž„์˜์˜ ์ถ•์œผ๋กœ ํˆฌ์˜ํ•˜๊ฑฐ๋‚˜ ํ•ด์‹ฑํ•˜์—ฌ ์ฐจ์› ์ถ•์†Œ. - **Collision as Similarity:** ์ผ๋ฐ˜์ ์ธ ํ•ด์‹œ์™€ ๋ฐ˜๋Œ€๋กœ, ์œ ์‚ฌํ•œ ๋ฐ์ดํ„ฐ์ผ์ˆ˜๋ก ํ•ด์‹œ ์ถฉ๋Œ(Collision)์ด ๋นˆ๋ฒˆํ•˜๊ฒŒ ๋ฐœ์ƒํ•˜๋„๋ก ์œ ๋„. - **Candidate Selection:** ๋™์ผํ•œ ํ•ด์‹œ ๋ฒ„ํ‚ท์— ๋‹ด๊ธด ๋ฐ์ดํ„ฐ๋“ค๋งŒ์„ ๋Œ€์ƒ์œผ๋กœ ์ •๋ฐ€ํ•œ ์œ ์‚ฌ๋„ ์ธก์ • ์ˆ˜ํ–‰. - **์˜์˜:** ์ˆ˜์–ต ๊ฑด ์ด์ƒ์˜ ๋ฐ์ดํ„ฐ๊ฐ€ ์Œ“์ธ ํ™˜๊ฒฝ์—์„œ๋„ ์ค‘๋ณต ๋ฌธ์„œ ํƒ์ง€, ์œ ์‚ฌ ์ด๋ฏธ์ง€ ๊ฒ€์ƒ‰, ์ถ”์ฒœ ์‹œ์Šคํ…œ ๋“ฑ์„ ์‹ค์‹œ๊ฐ„ ์ˆ˜์ค€์œผ๋กœ ์ฒ˜๋ฆฌ ๊ฐ€๋Šฅ์ผ€ ํ•จ. ## โš ๏ธ ๋ชจ์ˆœ ๋ฐ ์—…๋ฐ์ดํŠธ (Contradictions & RL Update) - **๊ณผ๊ฑฐ ๋ฐ์ดํ„ฐ์™€์˜ ์ถฉ๋Œ:** ์™„์ „ํ•œ ์ •๋‹ต์„ ๋ณด์žฅํ•˜์ง€ ๋ชปํ•œ๋‹ค๋Š” ์ด์œ ๋กœ ์™ธ๋ฉด๋ฐ›๊ธฐ๋„ ํ–ˆ์œผ๋‚˜, ๋ฐ์ดํ„ฐ๊ฐ€ ํญ๋ฐœ์ ์œผ๋กœ ์ฆ๊ฐ€ํ•˜๋Š” ๋น…๋ฐ์ดํ„ฐ ์‹œ๋Œ€์— '์™„๋ฒฝํ•œ ์ •๋‹ต'๋ณด๋‹ค '์ถฉ๋ถ„ํžˆ ๋น ๋ฅธ ๊ทผ์‚ฌ ์ •๋‹ต'์ด ๋” ๊ฐ€์น˜ ์žˆ์Œ์„ ์ž…์ฆํ•˜๋ฉฐ ํ•„์ˆ˜ ์•Œ๊ณ ๋ฆฌ์ฆ˜์œผ๋กœ ์ •์ฐฉ๋จ. - **์ •์ฑ… ๋ณ€ํ™”:** Antigravity ํ”„๋กœ์ ํŠธ๋Š” 1,174๊ฐœ์˜ ๋ฐฉ๋Œ€ํ•œ ๋ฌธ์„œ ์ค‘ ์ค‘๋ณต๋˜๊ฑฐ๋‚˜ ์œ ์‚ฌํ•œ ๋‚ด์šฉ์ด ์žˆ๋Š”์ง€ ์ „์ˆ˜ ๊ฒ€์‚ฌํ•  ๋•Œ, ์—ฐ์‚ฐ ํšจ์œจ์„ ์œ„ํ•ด LSH ๊ธฐ๋ฐ˜์˜ 1์ฐจ ํ•„ํ„ฐ๋ง์„ ์ˆ˜ํ–‰ํ•จ. ## ๐Ÿ”— ์ง€์‹ ์—ฐ๊ฒฐ (Graph) - [[Indexing-Strategies|Indexing-Strategies]], Vector-Database-Foundations, [[Dimensionality-Reduction|Dimensionality-Reduction]], [[K-Nearest-Neighbors-K-NN|K-Nearest-Neighbors-K-NN]] - **Raw Source:** 10_Wiki/Topics/AI/Locality-Sensitive-Hashing.md