---
id: wiki-2026-0508-algorithmic-fairness
title: Algorithmic Fairness
category: 10_Wiki/Topics
status: needs_review
canonical_id: self
aliases: [P-Reinforce-AUTO-ALFA-001]
duplicate_of: none
source_trust_level: A
confidence_score: 0.96
tags: [auto-reinforced, algorithmic-fairness, bias, Equality, machine-learning-ethics, data-governance]
raw_sources: []
last_reinforced: 2026-04-20
github_commit: pending
inferred_by: Claude Opus 4.7 (auto-normalize 2026-05-08)
tech_stack:
  language: unspecified
  framework: unspecified
---

# [[Algorithmic Fairness|Algorithmic Fairness]]

## 📌 One-Line Insight (The Karpathy Summary)

> "Stripping out the discrimination embedded in data: the engineering ethic of securing fairness at every stage, from training data to final output, so that AI does not learn biases about gender, race, or class and disadvantage anyone."

## 📖 Structured Knowledge (Synthesized Content)

Algorithmic fairness is the subfield of machine learning concerned with ensuring that an AI model's predictions are not systematically favorable or unfavorable toward particular groups.

1. **Sources of bias**:
    * **Data Bias**: The training data itself reflects existing societal prejudice or inequality.
    * **Metric Bias**: The metric used to measure performance (e.g., click-through rate) is itself designed in a way that favors a particular group.
2. **Fairness metrics**:
    * **Demographic Parity**: The rate of positive predictions must be the same for every group.
    * **Equalized Odds**: Error rates (false positive rate and false negative rate) must be equal across groups.
3. **Mitigation techniques**:
    * **Pre-[[Processing|Processing]]**: Rework the data before training so that it is balanced.
    * **In-processing**: Add a fairness constraint (penalty) during training.
    * **Post-processing**: Correct the outputs after prediction when bias is detected.
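
The two fairness metrics listed above can be checked with simple per-group counts. The following is a minimal sketch in plain Python, assuming binary labels and predictions (0/1) and a single sensitive attribute. The function names `group_rates` and `max_gap` and the toy data are hypothetical and only for illustration. Maintained implementations of these metrics exist in tools this note already lists, such as Fairlearn.

```python
from collections import defaultdict


def group_rates(y_true, y_pred, groups):
    """Per-group selection rate, false positive rate, and false negative rate.

    Hypothetical helper: assumes y_true and y_pred contain only 0 and 1.
    """
    counts = defaultdict(lambda: {"n": 0, "selected": 0, "pos": 0, "neg": 0, "fp": 0, "fn": 0})
    for yt, yp, g in zip(y_true, y_pred, groups):
        c = counts[g]
        c["n"] += 1
        c["selected"] += yp
        if yt == 1:
            c["pos"] += 1
            c["fn"] += 1 - yp  # actual positive, predicted negative
        else:
            c["neg"] += 1
            c["fp"] += yp      # actual negative, predicted positive
    rates = {}
    for g, c in counts.items():
        rates[g] = {
            "selection_rate": c["selected"] / c["n"],
            # NaN marks a group with no actual negatives or positives;
            # such groups should be handled before calling max_gap.
            "fpr": c["fp"] / c["neg"] if c["neg"] else float("nan"),
            "fnr": c["fn"] / c["pos"] if c["pos"] else float("nan"),
        }
    return rates


def max_gap(rates, key):
    """Largest difference in one rate between any two groups (0.0 means parity)."""
    values = [r[key] for r in rates.values()]
    return max(values) - min(values)


# Toy data (invented for illustration, not from any real system).
y_true = [1, 0, 1, 0, 1, 0, 1, 0]
y_pred = [1, 0, 1, 1, 0, 0, 1, 0]
groups = ["a", "a", "a", "a", "b", "b", "b", "b"]

rates = group_rates(y_true, y_pred, groups)
dp_gap = max_gap(rates, "selection_rate")  # demographic parity gap
fpr_gap = max_gap(rates, "fpr")            # equalized odds, FP side
fnr_gap = max_gap(rates, "fnr")            # equalized odds, FN side
print(dp_gap, fpr_gap, fnr_gap)
```

Demographic parity compares only `selection_rate`, while equalized odds needs both `fpr` and `fnr` gaps to be near zero. A model can satisfy one criterion and fail the other, so it is usually worth reporting them side by side.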
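
As one possible illustration of the pre-processing idea above, the sketch below uses reweighing. Each training sample gets the weight P(group) · P(label) / P(group, label), so that group and label become independent in the weighted data. This note does not name a specific pre-processing method, so reweighing is an assumed example. The function names and toy data are hypothetical.

```python
from collections import Counter


def reweighing_weights(groups, labels):
    """Weight each sample by P(group) * P(label) / P(group, label).

    Hypothetical helper: combinations that are under-represented relative
    to independence get weights above 1, over-represented ones below 1.
    """
    n = len(labels)
    group_counts = Counter(groups)
    label_counts = Counter(labels)
    joint_counts = Counter(zip(groups, labels))
    return [
        (group_counts[g] * label_counts[y]) / (n * joint_counts[(g, y)])
        for g, y in zip(groups, labels)
    ]


def weighted_positive_rate(groups, labels, weights, group):
    """Share of positive labels within one group, using sample weights."""
    total = sum(w for g, w in zip(groups, weights) if g == group)
    positive = sum(w * y for g, y, w in zip(groups, labels, weights) if g == group)
    return positive / total


# Toy data (invented): group "a" has 3 of 4 positive labels, group "b" has 1 of 4.
groups = ["a"] * 4 + ["b"] * 4
labels = [1, 1, 1, 0, 1, 0, 0, 0]

weights = reweighing_weights(groups, labels)
rate_a = weighted_positive_rate(groups, labels, weights, "a")
rate_b = weighted_positive_rate(groups, labels, weights, "b")
print(weights)
print(rate_a, rate_b)
```

The resulting weights would typically be passed to a learner that accepts per-sample weights. Whether reweighing is appropriate depends on the task, and the fairness metrics should still be measured on the trained model's predictions afterwards.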

## ⚠️ Contradictions & Updates

- **Conflict with earlier data**: Earlier practice trusted only the "mathematical objectivity" of algorithms. Current policy starts from the recognition that "objective data does not by itself produce fair outcomes" and injects "active inequality-correction policies" into the model (RL Update).
- **Policy change (RL Update)**: A policy has been established under which algorithms used in sensitive public-service decisions, such as hiring, loan screening, and sentencing prediction, must pass a mandatory "fairness impact assessment" (Fairness Audit).

## 🔗 Knowledge Links (Graph)

- [[Toxicity-and-Bias-Mitigation|Toxicity-and-Bias-Mitigation]], [[AI Accountability|AI Accountability]], [[AI Governance|AI Governance]], [[Ethics & AI|Ethics & AI]], [[Sociology of Knowledge|Sociology of Knowledge]]
- **Modern Tech/Tools**: IBM AI Fairness 360, Google What-If Tool, Fairlearn.

---

## 🤖 LLM Usage Hints (How to Use This Knowledge)

**When to use this knowledge:**
- *(TODO)*

**When not to use it:**
- *(TODO)*

## 🧪 Validation Status (Validation)

- **Information status:** needs_review
- **Source trust level:** A
- **Review reason:** *(P-Reinforce Phase 1 automatic normalization. Body text needs verification.)*

## 🧬 Duplicate Check

- **Existing similar documents:** *(TODO: see the indexer cluster report)*
- **Handling:** UPDATE (automatic normalization)
- **Reason:** Phase 1 normalization: backfilled the old template and missing fields.

## 🕓 Changelog

| Date | Change | Handling | Trust level |
|------|--------|----------|-------------|
| 2026-05-08 | P-Reinforce Phase 1 normalization (frontmatter + header standardization) | UPDATE | A |

## 💻 Code Patterns

**Pattern 1:** *(TODO: structural skeleton reflecting this project's conventions)*

```text
# TODO
```

## 🤔 Decision Criteria

**When to choose option A:**
- *(TODO)*

**When to choose option B:**
- *(TODO)*

**Default:**
> *(TODO)*

## ❌ Anti-Patterns

- **[Anti-pattern]:** *(TODO: what not to do + why + what to do instead)*