--- id: P-REINFORCE-AI-LSTM category: "10_Wiki/๐Ÿ’ก Topics/AI" confidence_score: 0.98 tags: [DeepLearning, RNN, LSTM, NLP] last_reinforced: 2026-04-20 --- # [[Long-Short-Term-Memory (LSTM)|Long-Short-Term-Memory (LSTM)]] ## ๐Ÿ“Œ ํ•œ ์ค„ ํ†ต์ฐฐ (The Karpathy Summary) > "์ •๋ณด์˜ ํ๋ฆ„์„ ์—ด๊ณ  ๋‹ซ๋Š” ์ˆ˜๋„๊ผญ์ง€๋ฅผ ๊ฐ€์ง„ ๋˜‘๋˜‘ํ•œ ๋ฉ”๋ชจ๋ฆฌ." ๊ธฐ์กด RNN์˜ ๊ณ ์งˆ๋ณ‘์ธ '์žฅ๊ธฐ ๊ธฐ์–ต ์ƒ์‹ค(Vanishing Gradient)' ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜์—ฌ, ์ˆ˜๋งŒ ๋‹จ๊ณ„ ์ด์ „์˜ ์ •๋ณด๋„ ์žŠ์ง€ ์•Š๊ณ  ํ˜„์žฌ๋กœ ๊ฐ€์ ธ์˜ค๋Š” ์‹œ๊ณ„์—ด ๋ฐ์ดํ„ฐ์˜ ํ˜๋ช…์ด๋‹ค. ## ๐Ÿ“– ๊ตฌ์กฐํ™”๋œ ์ง€์‹ (Synthesized Content) - **Cell State**: ์ •๋ณด๋ฅผ ๋‹ด๊ณ  ํ๋ฅด๋Š” '๊ธด ํ†ต๋กœ'. ๋งˆ์น˜ ์ปจ๋ฒ ์ด์–ด ๋ฒจํŠธ์ฒ˜๋Ÿผ ์ •๋ณด๋ฅผ ๋ณ€์กฐ ์—†์ด ์ „๋‹ฌํ•จ. - **The Three Gates**: - **Forget Gate**: ๊ณผ๊ฑฐ์˜ ์ •๋ณด ์ค‘ ๋ฌด์—‡์„ ๋ฒ„๋ฆด์ง€ ๊ฒฐ์ •. - **Input Gate**: ํ˜„์žฌ ๋“ค์–ด์˜จ ์ •๋ณด ์ค‘ ๋ฌด์—‡์„ ๊ธฐ์–ตํ• ์ง€ ๊ฒฐ์ •. - **Output Gate**: ํ˜„์žฌ์˜ ๊ธฐ์–ต ์ค‘ ๋ฌด์—‡์„ ๋ฐ–์œผ๋กœ ๋‚ด๋ณด๋‚ผ์ง€ ๊ฒฐ์ •. - **Utility**: ๋ฒˆ์—ญ, ์ฃผ๊ฐ€ ์˜ˆ์ธก, ์Œ์„ฑ ์ธ์‹ ๋“ฑ ์ˆœ์„œ(Sequence)๊ฐ€ ์ค‘์š”ํ•œ ๋ชจ๋“  ๋ถ„์•ผ๋ฅผ ํ‰์ •ํ–ˆ๋˜ ๋ชจ๋ธ์ด๋‹ค. ## โš ๏ธ ๋ชจ์ˆœ ๋ฐ ์—…๋ฐ์ดํŠธ (RL Update) - LSTM์€ ์‹œ๊ณ„์—ด ๋ฐ์ดํ„ฐ ์ฒ˜๋ฆฌ์— ๊ฐ•๋ ฅํ•˜์ง€๋งŒ, ์ˆœ์ฐจ์ ์œผ๋กœ ์—ฐ์‚ฐํ•ด์•ผ ํ•˜๋ฏ€๋กœ ์„ฑ๋Šฅ ์Šค์ผ€์ผ๋ง(๋ณ‘๋ ฌ ์ฒ˜๋ฆฌ)์ด ์–ด๋ ต๋‹ค. ํ˜„์žฌ๋Š” ๋ชจ๋“  ์‹œ์ ์„ ๋™์‹œ์— ๋ฐ”๋ผ๋ณด๋Š” **ํŠธ๋žœ์Šคํฌ๋จธ(Transformer)** ์•„ํ‚คํ…์ฒ˜์— ์™•์ขŒ๋ฅผ ๋‚ด์–ด์ฃผ์—ˆ์œผ๋‚˜, ๋ฐ์ดํ„ฐ๊ฐ€ ์ ๊ฑฐ๋‚˜ ์ดˆ์ €์ง€์—ฐ ํ•˜๋“œ์›จ์–ด ๊ตฌํ˜„์ด ํ•„์š”ํ•œ ํŠน์ˆ˜ ๋ถ„์•ผ์—์„œ๋Š” ์—ฌ์ „ํžˆ ํ˜„์—ญ์œผ๋กœ ํ™œ๋™ ์ค‘์ด๋‹ค. ## ๐Ÿ”— ์ง€์‹ ์—ฐ๊ฒฐ (Graph) - Related: Recurrent-Neural-Networks (RNN) , Attention-Mechanism - Rival: [[Transformer-Architecture|Transformer-Architecture]]