--- id: ML-OUT-DET-001 category: "10_Wiki/๐Ÿ’ก Topics/AI" confidence_score: 1.0 tags: [machine-learning, outlier-detection, anomaly-detection, statistics, isolation-forest, data-quality] last_reinforced: 2026-04-26 --- # Outlier Detection Techniques (์ด์ƒ์น˜ ํƒ์ง€ ๊ธฐ๋ฒ•) ## ๐Ÿ“Œ ํ•œ ์ค„ ํ†ต์ฐฐ (The Karpathy Summary) > "๋Œ€๋‹ค์ˆ˜์˜ ํ๋ฆ„์—์„œ ๋ฒ—์–ด๋‚œ ์†Œ์ˆ˜์˜ 'ํŠ€๋Š” ๋ฐ์ดํ„ฐ'๋ฅผ ์‹๋ณ„ํ•˜์—ฌ, ์‹œ์Šคํ…œ์˜ ์˜ค๋ฅ˜๋ฅผ ๋ฏธ์—ฐ์— ๋ฐฉ์ง€ํ•˜๊ฑฐ๋‚˜ ์ˆจ๊ฒจ์ง„ ์œ„ํ˜‘์„ ํฌ์ฐฉํ•˜๋ผ" โ€” ๋ฐ์ดํ„ฐ ์ „์ฒด์˜ ํ†ต๊ณ„์  ๊ฒฝํ–ฅ์„ฑ์—์„œ ํฌ๊ฒŒ ๋ฒ—์–ด๋‚˜ ๋ฐ์ดํ„ฐ์˜ ์งˆ์„ ๋–จ์–ด๋œจ๋ฆฌ๊ฑฐ๋‚˜ ๋ถ€์ •์  ์ด๋ฒคํŠธ๋ฅผ ์•”์‹œํ•˜๋Š” ์ด์ƒ์น˜(Outliers)๋ฅผ ํƒ์ƒ‰ํ•˜๋Š” ๋ฐฉ๋ฒ•๋ก . ## ๐Ÿ“– ๊ตฌ์กฐํ™”๋œ ์ง€์‹ (Synthesized Content) - **์ถ”์ถœ๋œ ํŒจํ„ด:** "Deviation and Isolation Analysis" โ€” ์ •์ƒ ๋ฐ์ดํ„ฐ๊ฐ€ ๋ฐ€์ง‘๋œ ์˜์—ญ์„ ์ •์˜ํ•˜๊ณ  ๊ทธ ๋ฐ–์˜ ๋ฐ์ดํ„ฐ๋ฅผ ์ฐพ๊ฑฐ๋‚˜(Z-score, IQR), ์ •์ƒ ๋ฐ์ดํ„ฐ๋ณด๋‹ค ํ›จ์”ฌ ์ ์€ ํšŸ์ˆ˜์˜ ์งˆ๋ฌธ๋งŒ์œผ๋กœ๋„ ๊ณ ๋ฆฝ(Isolation)๋˜๋Š” ๋ฐ์ดํ„ฐ๋ฅผ ์ด์ƒ์น˜๋กœ ๋ถ„๋ฅ˜ํ•˜๋Š” ํŒจํ„ด. - **์ฃผ์š” ๊ธฐ๋ฒ•:** - **Statistical:** Z-score(ํ‘œ์ค€ํŽธ์ฐจ ํ™œ์šฉ), IQR(์‚ฌ๋ถ„์œ„์ˆ˜ ํ™œ์šฉ). ์ •๊ทœ๋ถ„ํฌ๋ฅผ ๊ฐ€์ •ํ•  ๋•Œ ํšจ๊ณผ์ . - **Distance-based (KNN):** ์ฃผ๋ณ€ ์ด์›ƒ๊ณผ์˜ ๊ฑฐ๋ฆฌ๊ฐ€ ๋จผ ๋ฐ์ดํ„ฐ๋ฅผ ์ด์ƒ์น˜๋กœ ํŒ๋‹จ. - **Density-based (LOF):** ์ฃผ๋ณ€ ๋ฐ์ดํ„ฐ ๋ฐ€๋„๊ฐ€ ์ƒ๋Œ€์ ์œผ๋กœ ๋‚ฎ์€ ์ง€์ ์„ ํƒ์ง€. - **Isolation Forest:** ๋ฐ์ดํ„ฐ๋ฅผ ๋ฌด์ž‘์œ„๋กœ ๋ถ„ํ• ํ•  ๋•Œ ๋นจ๋ฆฌ ๊ณ ๋ฆฝ๋˜๋Š” ์ง€์ ์„ ์ฐพ๋Š” ํ˜„๋Œ€์  ํ‘œ์ค€. - **์˜์˜:** ์‹ ์šฉ์นด๋“œ ๋ถ€์ • ๊ฒฐ์ œ ๊ฐ์ง€, ๊ณต์žฅ ์„ค๋น„ ๊ณ ์žฅ ์˜ˆ์ง€, ๋ฐ์ดํ„ฐ ์ „์ฒ˜๋ฆฌ ๋‹จ๊ณ„์˜ ๋…ธ์ด์ฆˆ ์ œ๊ฑฐ ๋“ฑ ์‹œ์Šคํ…œ์˜ ์‹ ๋ขฐ์„ฑ์„ ์ง€ํƒฑํ•˜๋Š” ํ•ต์‹ฌ ๋ชจ๋‹ˆํ„ฐ๋ง ๊ธฐ์ˆ . ## โš ๏ธ ๋ชจ์ˆœ ๋ฐ ์—…๋ฐ์ดํŠธ (Contradictions & RL Update) - **๊ณผ๊ฑฐ ๋ฐ์ดํ„ฐ์™€์˜ ์ถฉ๋Œ:** ๋ชจ๋“  ์ด์ƒ์น˜๋ฅผ '์ œ๊ฑฐํ•ด์•ผ ํ•  ์˜ค๋ฅ˜'๋กœ ๋ณด๋˜ ๋‹จ๊ณ„์—์„œ, ์ด์ œ๋Š” ์ด์ƒ์น˜ ์ž์ฒด๊ฐ€ ๊ฐ€์žฅ ์ค‘์š”ํ•œ ์ •๋ณด๋ฅผ ๋‹ด๊ณ  ์žˆ๋Š” '์ด๋ฒคํŠธ(Fraud ๋“ฑ)'๋ผ๋Š” ์ธ์‹์œผ๋กœ ์ „ํ™˜๋˜์–ด ์ •๊ตํ•œ ๋ถ„์„์˜ ๋Œ€์ƒ์ด ๋จ. - **์ •์ฑ… ๋ณ€ํ™”:** Antigravity ํ”„๋กœ์ ํŠธ๋Š” ์—์ด์ „ํŠธ์˜ ์—ฐ์‚ฐ ๋ฆฌ์†Œ์Šค ์‚ฌ์šฉ๋Ÿ‰์ด ํ‰์†Œ ๋ถ„ํฌ๋ฅผ ํฌ๊ฒŒ ๋ฒ—์–ด๋‚  ๋•Œ, Isolation Forest ๊ธฐ๋ฐ˜์˜ ์ด์ƒ ํƒ์ง€ ์—”์ง„์„ ๊ฐ€๋™ํ•˜์—ฌ ๋น„์ •์ƒ์ ์ธ ๋ฃจํ”„๋‚˜ ํ•ดํ‚น ์‹œ๋„๋ฅผ ์‹ค์‹œ๊ฐ„์œผ๋กœ ์ฐจ๋‹จํ•จ. ## ๐Ÿ”— ์ง€์‹ ์—ฐ๊ฒฐ (Graph) - [[Out-of-distribution-Detection|Out-of-distribution-Detection]], [[Pre-processing-Data-for-AI|Pre-processing-Data-for-AI]], Clustering-Algorithms-Foundations, [[High-Availability-Systems|High-Availability-Systems]] - **Raw Source:** 10_Wiki/Topics/AI/Outlier-Detection-Techniques.md