--- id: P-REINFORCE-AUTO-HALL-001 category: "10_Wiki/๐Ÿ’ก Topics/AI" confidence_score: 0.98 tags: [auto-reinforced, hallucination, llm-issue, ai-safety, fact-checking, alignment] last_reinforced: 2026-04-20 --- # [[Hallucination (ํ™˜๊ฐ)]] ## ๐Ÿ“Œ ํ•œ ์ค„ ํ†ต์ฐฐ (The Karpathy Summary) > "ํ™•๋ฅ ์ด ๋งŒ๋“  ๊ทธ๋Ÿด์‹ธํ•œ ๊ฑฐ์ง“๋ง: AI๊ฐ€ ๋ฐฉ๋Œ€ํ•œ ๋ฐ์ดํ„ฐ์˜ ํ†ต๊ณ„์  ํŒจํ„ด์—๋งŒ ๋งค๋ชฐ๋˜์–ด, ์‚ฌ์‹ค ์—ฌ๋ถ€์™€ ์ƒ๊ด€์—†์ด ๋ฌธ๋ฒ•์ ์œผ๋กœ ์™„๋ฒฝํ•˜๊ณ  ์„ค๋“๋ ฅ ์žˆ๋Š” ๊ฐ€์งœ ์ •๋ณด๋ฅผ ์ƒ์„ฑํ•˜์—ฌ ์‚ฌ์šฉ์ž์—๊ฒŒ ์‹ฌ๊ฐํ•œ ํ˜ผ๋ž€์„ ์ฃผ๋Š” ์ง€๋Šฅ์˜ ๋ถ€์ž‘์šฉ." ## ๐Ÿ“– ๊ตฌ์กฐํ™”๋œ ์ง€์‹ (Synthesized Content) ํ™˜๊ฐ(Hallucination)์€ ๊ฑฐ๋Œ€ ์–ธ์–ด ๋ชจ๋ธ(LLM)์ด ์กด์žฌํ•˜์ง€ ์•Š๋Š” ์‚ฌ์‹ค์ด๋‚˜ ๋น„๋…ผ๋ฆฌ์ ์ธ ๋‹ต๋ณ€์„ ์ƒ์„ฑํ•˜๋Š” ํ˜„์ƒ์ž…๋‹ˆ๋‹ค. 1. **์™œ ๋ฐœ์ƒํ•˜๋Š”๊ฐ€?**: * **Probabilistic Nature**: ๋ชจ๋ธ์€ ์‹ค์ œ ์ง„๋ฆฌ๋ฅผ ์•„๋Š” ๊ฒŒ ์•„๋‹ˆ๋ผ ๋‹ค์Œ ๋‹จ์–ด๊ฐ€ ์˜ฌ ํ™•๋ฅ ๋งŒ ๊ณ„์‚ฐํ•จ. * **Confabulation**: ๋ถ€์กฑํ•œ ์ •๋ณด๋ฅผ ๋ฉ”์šฐ๊ธฐ ์œ„ํ•ด ๋‡Œ๊ฐ€ ์ด์•ผ๊ธฐ๋ฅผ ์ง€์–ด๋‚ด๋Š” ์ธ๊ฐ„์˜ ์‹ฌ๋ฆฌ ๊ธฐ์ œ์™€ ์œ ์‚ฌํ•จ. (Gestalt Psychology์™€ ์—ฐ๊ฒฐ) * **Data Noise**: ํ•™์Šต ๋ฐ์ดํ„ฐ ์ž์ฒด์˜ ์˜ค๋ฅ˜๋‚˜ ๋ชจ์ˆœ์ด ๋ฐ˜์˜๋จ. (Data Cleaning Algorithms์˜ ํ•„์š”์„ฑ) 2. **ํ•ด๊ฒฐ ๋…ธ๋ ฅ**: * **RAG (Retrieval-Augmented Generation)**: ๋‹ต๋ณ€ ์ „ ์™ธ๋ถ€ ์ง€์‹์„ ๊ฒ€์ƒ‰ํ•˜์—ฌ ๊ทผ๊ฑฐ๋ฅผ ์ œ์‹œ. * **Constitutional AI**: "๋ชจ๋ฅด๋ฉด ๋ชจ๋ฅธ๋‹ค๊ณ  ๋งํ•˜๋ผ"๋Š” ์›์น™ ์ฃผ์ž…. (Constitutional AI์™€ ์—ฐ๊ฒฐ) ## โš ๏ธ ๋ชจ์ˆœ ๋ฐ ์—…๋ฐ์ดํŠธ (Contradictions & RL Update) - **๊ณผ๊ฑฐ ๋ฐ์ดํ„ฐ์™€์˜ ์ถฉ๋Œ**: ์ดˆ๊ธฐ ์œ ์ € ์ •์ฑ…์€ AI๋ฅผ '๊ฒ€์ƒ‰ ์—”์ง„'์ฒ˜๋Ÿผ ๋ฏฟ์—ˆ์œผ๋‚˜, ํ™˜๊ฐ ์ •์ฑ…์˜ ์‹ค์ฒด๋ฅผ ์•Œ๊ฒŒ ๋œ ํ˜„๋Œ€ ์ •์ฑ…์€ AI์˜ ๋‹ต๋ณ€์„ ๋ฐ˜๋“œ์‹œ ์žฌ๊ฒ€์ฆํ•˜๋Š” '๋น„ํŒ์  ์ˆ˜์šฉ ์ •์ฑ…'์œผ๋กœ ๋ณ€ํ™”ํ•จ(RL Update). - **์ •์ฑ… ๋ณ€ํ™”(RL Update)**: ํ™˜๊ฐ์„ ๋‹จ์ˆœํ•œ '์ œ๊ฑฐ ๋Œ€์ƒ ์ •์ฑ…'์ด ์•„๋‹Œ, ์†Œ์„ค ์ฐฝ์ž‘์ด๋‚˜ ์•„์ด๋””์–ด ๋ฐœ๊ตด ๊ฐ™์€ '์ฐฝ์˜์  ์˜์—ญ ์ •์ฑ…'์—์„œ๋Š” ์œ ์šฉํ•œ ๋™๋ ฅ์œผ๋กœ ํ™œ์šฉํ•˜๋Š” ์—ญ๋ฐœ์ƒ์  ์ ‘๊ทผ ์ •์ฑ…๋„ ๋Œ€๋‘๋จ. (Computational Creativity์™€ ์—ฐ๊ฒฐ) ## ๐Ÿ”— ์ง€์‹ ์—ฐ๊ฒฐ (Graph) - [[Constitutional AI (ํ—Œ๋ฒ• AI)]], [[Computational Creativity]], [[Ethics & AI]], [[Self-Correction]], [[Data Cleaning Algorithms]] - **Modern Tech/Tools**: RAG, Fact-checkers, Hallucination detection benchmarks (HaluEval). ---