--- category: Unified tags: [auto-consolidated, technical-documentation] title: [[LSTM (Long Short-Term Memory)|LSTM (Long Short-Term [[memory]])]] last_updated: 2026-05-02 --- # [[LSTM (Long Short-Term Memory)|LSTM (Long Short-Term [[memory]])]] ## ๐Ÿ“Œ Brief Summary > "๊ธฐ์–ตํ•  ๊ฒƒ๊ณผ ์žŠ์„ ๊ฒƒ์„ ์Šค์Šค๋กœ ๊ฒฐ์ •ํ•˜๋Š” ๋˜‘๋˜‘ํ•œ ๋ฉ”๋ชจ๋ฆฌ ์…€" โ€” ๊ธฐ์กด RNN์˜ ๊ณ ์งˆ์ ์ธ ๋ฌธ์ œ์ธ '์žฅ๊ธฐ ์˜์กด์„ฑ(Long-term dependency)' ์†์‹ค์„ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด ๊ฒŒ์ดํŠธ(Gate) ๊ตฌ์กฐ๋ฅผ ๋„์ž…ํ•œ ์ˆœํ™˜ ์‹ ๊ฒฝ๋ง ์•„ํ‚คํ…์ฒ˜. --- > "์ •๋ณด์˜ ํ๋ฆ„์„ ์—ด๊ณ  ๋‹ซ๋Š” ์ˆ˜๋„๊ผญ์ง€๋ฅผ ๊ฐ€์ง„ ๋˜‘๋˜‘ํ•œ ๋ฉ”๋ชจ๋ฆฌ." ๊ธฐ์กด RNN์˜ ๊ณ ์งˆ๋ณ‘์ธ '์žฅ๊ธฐ ๊ธฐ์–ต ์ƒ์‹ค(Vanishing Gradient)' ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜์—ฌ, ์ˆ˜๋งŒ ๋‹จ๊ณ„ ์ด์ „์˜ ์ •๋ณด๋„ ์žŠ์ง€ ์•Š๊ณ  ํ˜„์žฌ๋กœ ๊ฐ€์ ธ์˜ค๋Š” ์‹œ๊ณ„์—ด ๋ฐ์ดํ„ฐ์˜ ํ˜๋ช…์ด๋‹ค. ## ๐Ÿ“– Core Content - **์ถ”์ถœ๋œ ํŒจํ„ด:** ์ •๋ณด์˜ ํ๋ฆ„์„ ์กฐ์ ˆํ•˜๋Š” ์„ธ ๊ฐ€์ง€ ๋ฌธ(Gate)์„ ํ†ตํ•ด, ์ค‘์š”ํ•œ ์ •๋ณด๋Š” ์˜ค๋ž˜ ๋ณด์กดํ•˜๊ณ  ๋ถˆํ•„์š”ํ•œ ์ •๋ณด๋Š” ์ฆ‰์‹œ ์ง€์›Œ๋ฒ„๋ฆฌ๋Š” ์‹œ๊ณ„์—ด ๋ฐ์ดํ„ฐ ์ฒ˜๋ฆฌ ํŒจํ„ด. - **์„ธ๋ถ€ ๋‚ด์šฉ:** - **Forget Gate:** ์ด์ „ ์ƒํƒœ์˜ ์ •๋ณด ์ค‘ ๋ฌด์—‡์„ ๋ฒ„๋ฆด์ง€ ๊ฒฐ์ •. - **Input Gate:** ํ˜„์žฌ ์ž…๋ ฅ ์ •๋ณด ์ค‘ ๋ฌด์—‡์„ ์…€ ์ƒํƒœ(Cell [[State|State]])์— ์ €์žฅํ• ์ง€ ๊ฒฐ์ •. - **Output Gate:** ๊ฐฑ์‹ ๋œ ์…€ ์ƒํƒœ๋ฅผ ๋ฐ”ํƒ•์œผ๋กœ ๋‹ค์Œ ๋‹จ๊ณ„๋กœ ์ „๋‹ฌํ•  ๊ฐ’์„ ๊ฒฐ์ •. - **Cell State:** ์ปจ๋ฒ ์ด์–ด ๋ฒจํŠธ์ฒ˜๋Ÿผ ์ •๋ณด๊ฐ€ ํ๋ฅด๋ฉฐ, ๊ฒŒ์ดํŠธ๋“ค์— ์˜ํ•ด ์ •๋ณด๊ฐ€ ์ถ”๊ฐ€๋˜๊ฑฐ๋‚˜ ์‚ญ์ œ๋จ. --- - **Cell [[State|State]]**: ์ •๋ณด๋ฅผ ๋‹ด๊ณ  ํ๋ฅด๋Š” '๊ธด ํ†ต๋กœ'. ๋งˆ์น˜ ์ปจ๋ฒ ์ด์–ด ๋ฒจํŠธ์ฒ˜๋Ÿผ ์ •๋ณด๋ฅผ ๋ณ€์กฐ ์—†์ด ์ „๋‹ฌํ•จ. - **The Three [[Gates|Gates]]**: - **Forget Gate**: ๊ณผ๊ฑฐ์˜ ์ •๋ณด ์ค‘ ๋ฌด์—‡์„ ๋ฒ„๋ฆด์ง€ ๊ฒฐ์ •. - **Input Gate**: ํ˜„์žฌ ๋“ค์–ด์˜จ ์ •๋ณด ์ค‘ ๋ฌด์—‡์„ ๊ธฐ์–ตํ• ์ง€ ๊ฒฐ์ •. - **Output Gate**: ํ˜„์žฌ์˜ ๊ธฐ์–ต ์ค‘ ๋ฌด์—‡์„ ๋ฐ–์œผ๋กœ ๋‚ด๋ณด๋‚ผ์ง€ ๊ฒฐ์ •. - **Utility**: ๋ฒˆ์—ญ, ์ฃผ๊ฐ€ ์˜ˆ์ธก, ์Œ์„ฑ ์ธ์‹ ๋“ฑ ์ˆœ์„œ(Sequence)๊ฐ€ ์ค‘์š”ํ•œ ๋ชจ๋“  ๋ถ„์•ผ๋ฅผ ํ‰์ •ํ–ˆ๋˜ ๋ชจ๋ธ์ด๋‹ค. ## โš–๏ธ Trade-offs & Caveats - **๊ณผ๊ฑฐ ๋ฐ์ดํ„ฐ์™€์˜ ์ถฉ๋Œ:** ์ž์—ฐ์–ด ์ฒ˜๋ฆฌ์˜ ๋…๋ณด์  ์กด์žฌ์˜€์œผ๋‚˜, ๋ณ‘๋ ฌ ์—ฐ์‚ฐ์ด ๋ถˆ๊ฐ€๋Šฅํ•œ ์ˆœ์ฐจ์  ๊ตฌ์กฐ๋ผ๋Š” ํ•œ๊ณ„ ๋•Œ๋ฌธ์— ํ˜„์žฌ๋Š” ํŠธ๋žœ์Šคํฌ๋จธ(Transformer) ์•„ํ‚คํ…์ฒ˜์— ์ž๋ฆฌ๋ฅผ ๋‚ด์คŒ. ํ•˜์ง€๋งŒ ์Œ์„ฑ ์ธ์‹์ด๋‚˜ ์‹œ๊ณ„์—ด ์ˆ˜์น˜ ์˜ˆ์ธก ๋ถ„์•ผ์—์„œ๋Š” ์—ฌ์ „ํžˆ ํ™œ์šฉ๋จ. - **์ •์ฑ… ๋ณ€ํ™”:** Antigravity ํ”„๋กœ์ ํŠธ์˜ ์„ผ์„œ ๋ฐ์ดํ„ฐ ๋ถ„์„(Telemetry) ๋ฐ ์‚ฌ์šฉ์ž ํ™œ๋™ ํŒจํ„ด ์˜ˆ์ธก ์‹œ, ๊ฐ€๋ฒผ์šด LSTM ๋ชจ๋ธ์„ ๋ณด์กฐ์ ์œผ๋กœ ์šด์šฉํ•จ. --- - LSTM์€ ์‹œ๊ณ„์—ด ๋ฐ์ดํ„ฐ ์ฒ˜๋ฆฌ์— ๊ฐ•๋ ฅํ•˜์ง€๋งŒ, ์ˆœ์ฐจ์ ์œผ๋กœ ์—ฐ์‚ฐํ•ด์•ผ ํ•˜๋ฏ€๋กœ ์„ฑ๋Šฅ ์Šค์ผ€์ผ๋ง(๋ณ‘๋ ฌ ์ฒ˜๋ฆฌ)์ด ์–ด๋ ต๋‹ค. ํ˜„์žฌ๋Š” ๋ชจ๋“  ์‹œ์ ์„ ๋™์‹œ์— ๋ฐ”๋ผ๋ณด๋Š” **ํŠธ๋žœ์Šคํฌ๋จธ(Transformer)** ์•„ํ‚คํ…์ฒ˜์— ์™•์ขŒ๋ฅผ ๋‚ด์–ด์ฃผ์—ˆ์œผ๋‚˜, ๋ฐ์ดํ„ฐ๊ฐ€ ์ ๊ฑฐ๋‚˜ ์ดˆ์ €์ง€์—ฐ ํ•˜๋“œ์›จ์–ด ๊ตฌํ˜„์ด ํ•„์š”ํ•œ ํŠน์ˆ˜ ๋ถ„์•ผ์—์„œ๋Š” ์—ฌ์ „ํžˆ ํ˜„์—ญ์œผ๋กœ ํ™œ๋™ ์ค‘์ด๋‹ค. ## ๐Ÿ”— Knowledge Connections - Recurrent-Neural-Network, Gated-Recurrent-Unit, [[Transformer-Architecture|Transformer-Architecture]], [[Time-Series-Analysis|Time-Series-Analysis]] - **Raw Source:** 10_Wiki/Topics/AI/LSTM (Long Short-Term Memory).md --- - Related: [[Recurrent-Neural-Networks|Recurrent-Neural-Networks]] (RNN) , Attention-Mechanism - Rival: [[Transformer-Architecture|Transformer-Architecture]]