---
id: P-REINFORCE-AI-ETHICS
category: "10_Wiki/💡 Topics/AI"
confidence_score: 0.99
tags: [AI, Ethics, AISafety, Fairness, Bias]
last_reinforced: 2026-04-20
---

# [[Ethics-in-Artificial-Intelligence|Ethics-in-Artificial-Intelligence]] (AI Ethics)

## 📌 One-Line Insight (The Karpathy Summary)
> "The greater the intelligence, the heavier the weight of responsibility." AI ethics is the set of philosophical and normative guidelines holding that AI must be developed and used in ways that do not conflict with human values and that benefit humanity.

## 📖 Structured Knowledge (Synthesized Content)
- **Three Pillars of AI Ethics**:
    - **Fairness**: Remove bias from data so that the system does not make discriminatory decisions based on gender, race, or class.
    - **Transparency**: It must be possible to explain why the AI reached a given conclusion (XAI).
    - **Accountability**: Establish a clear legal framework that determines who is responsible when an AI malfunction causes harm.
- **Privacy**: Protect personal information during training-data collection and guarantee the right to be forgotten.
- **Safety**: Defend preemptively against the existential risk posed by AI that escapes human control.

## ⚠️ Contradictions and Updates (RL Update)
- Ethics is subjective and differs from one culture to another. There is considerable concern about "ethical imperialism", in which AI that embeds Western values is adopted as the worldwide standard. As a result, building "Global AI Governance" that draws on the consensus of humanity as a whole, rather than that of particular companies, is emerging as an urgent task.

## 🔗 Knowledge Links (Graph)
- Related: [[Explainable-AI (XAI)|Explainable-AI (XAI)]], [[Constitutional AI (헌법 AI)|Constitutional AI]]
- Risk: Algorithmic-Bias