--- id: ATTENTION-001 category: "10_Wiki/๐Ÿ’ก Topics/AI" confidence_score: 1.0 tags: [ai, nlp, transformer, attention, deep-learning] last_reinforced: 2026-04-26 --- # NLP Attention Mechanisms (์–ดํ…์…˜ ๋ฉ”์ปค๋‹ˆ์ฆ˜) ## ๐Ÿ“Œ ํ•œ ์ค„ ํ†ต์ฐฐ (The Karpathy Summary) > "์ค‘์š”ํ•œ ๊ฒƒ์— ์ง‘์ค‘ํ•˜๊ณ  ๋‚˜๋จธ์ง€๋Š” ๋ฌด์‹œํ•˜๋ผ" โ€” ๋ฌธ์žฅ ๋‚ด์˜ ๊ฐ ๋‹จ์–ด๊ฐ€ ๋‹ค๋ฅธ ๋‹จ์–ด๋“ค๊ณผ ์–ด๋–ค ์—ฐ๊ด€์„ฑ์„ ๊ฐ€์ง€๋Š”์ง€ ๊ณ„์‚ฐํ•˜์—ฌ, ๋งฅ๋ฝ์„ ํŒŒ์•…ํ•  ๋•Œ ์ค‘์š”ํ•œ ์ •๋ณด์— ๋” ๋†’์€ ๊ฐ€์ค‘์น˜๋ฅผ ๋ถ€์—ฌํ•˜๋Š” ๋ฉ”์ปค๋‹ˆ์ฆ˜. ## ๐Ÿ“– ๊ตฌ์กฐํ™”๋œ ์ง€์‹ (Synthesized Content) - **์ถ”์ถœ๋œ ํŒจํ„ด:** ์ž…๋ ฅ ์‹œํ€€์Šค์˜ ๋ชจ๋“  ๋ถ€๋ถ„ ์ค‘์—์„œ ํ˜„์žฌ ์ฒ˜๋ฆฌ ์ค‘์ธ ์ •๋ณด์™€ ๊ฐ€์žฅ ๊ด€๋ จ์ด ๊นŠ์€ ๋ถ€๋ถ„์— '์ฃผ์˜(Attention)'๋ฅผ ๊ธฐ์šธ์—ฌ ๊ฐ€์ค‘ ํ‰๊ท ๊ฐ’์„ ๊ณ„์‚ฐํ•˜๋Š” ์ •๋ณด ์ถ”์ถœ ํŒจํ„ด. - **์„ธ๋ถ€ ๋‚ด์šฉ:** - **Self-Attention:** ํ•˜๋‚˜์˜ ๋ฌธ์žฅ ์•ˆ์—์„œ ๋‹จ์–ด๋“ค ๊ฐ„์˜ ๊ด€๊ณ„๋ฅผ ํŒŒ์•… (์˜ˆ: '๊ทธ'๊ฐ€ ๊ฐ€๋ฆฌํ‚ค๋Š” ๋Œ€์ƒ์„ ๋ฌธ๋งฅ ์†์—์„œ ์ฐพ์Œ). - **Query, Key, Value:** ์ •๋ณด๋ฅผ ์ฐพ๋Š” ์ฃผ์ฒด(Query), ์ •๋ณด์˜ ์‹๋ณ„์ž(Key), ์ •๋ณด์˜ ์‹ค์งˆ์  ๋‚ด์šฉ(Value)์œผ๋กœ ๋ฐ์ดํ„ฐ๋ฅผ ๋ถ„ํ•ดํ•˜์—ฌ ์—ฐ์‚ฐ. - **Multi-Head Attention:** ์—ฌ๋Ÿฌ ๊ฐœ์˜ ์–ดํ…์…˜ ๋ฃจํ”„๋ฅผ ๋ณ‘๋ ฌ๋กœ ์‹คํ–‰ํ•˜์—ฌ ๋‹ค์–‘ํ•œ ์ธก๋ฉด(๋ฌธ๋ฒ•, ์˜๋ฏธ, ๊ฑฐ๋ฆฌ ๋“ฑ)์—์„œ ๋ฌธ๋งฅ ๋ถ„์„. - **Evolution:** ๊ณ ์ •๋œ ๊ธธ์ด์˜ ๋ฒกํ„ฐ์— ์ •๋ณด๋ฅผ ์••์ถ•ํ•ด์•ผ ํ–ˆ๋˜ ๊ธฐ์กด ๋ชจ๋ธ์˜ ํ•œ๊ณ„๋ฅผ ๊ทน๋ณตํ•˜๊ณ  ํŠธ๋žœ์Šคํฌ๋จธ ์•„ํ‚คํ…์ฒ˜์˜ ํ•ต์‹ฌ์ด ๋จ. ## โš ๏ธ ๋ชจ์ˆœ ๋ฐ ์—…๋ฐ์ดํŠธ (Contradictions & RL Update) - **๊ณผ๊ฑฐ ๋ฐ์ดํ„ฐ์™€์˜ ์ถฉ๋Œ:** ์ดˆ๊ธฐ์—๋Š” RNN์˜ ๋ณด์กฐ ๋„๊ตฌ๋กœ ๋“ฑ์žฅํ–ˆ์œผ๋‚˜, ํ˜„์žฌ๋Š” "Attention is All You Need"๋ผ๋Š” ๋…ผ๋ฌธ ์ œ๋ชฉ์ฒ˜๋Ÿผ ๋ชจ๋ธ ์•„ํ‚คํ…์ฒ˜ ๊ทธ ์ž์ฒด๊ฐ€ ๋จ. - **์ •์ฑ… ๋ณ€ํ™”:** Antigravity ์—์ด์ „ํŠธ๋Š” ์–ดํ…์…˜ ๋งต ๋ถ„์„์„ ํ†ตํ•ด ์‚ฌ์šฉ์ž์˜ ์งˆ๋ฌธ์—์„œ ๊ฐ€์žฅ ํ•ต์‹ฌ์ ์ธ ํ‚ค์›Œ๋“œ๋ฅผ ์‹๋ณ„ํ•˜๊ณ  ๋‹ต๋ณ€์˜ ์ดˆ์ ์„ ๋งž์ถค. ## ๐Ÿ”— ์ง€์‹ ์—ฐ๊ฒฐ (Graph) - [[Transformer-Architecture|Transformer-Architecture]], [[LLM|LLM]], Neural-Networks-Foundations, Mechanistic-Interpretability - **Raw Source:** 10_Wiki/Topics/AI/NLP-Attention-Mechanisms.md