---
id: LSTM-001
category: "10_Wiki/💡 Topics/AI"
confidence_score: 1.0
tags: [deep-learning, nlp, rnn, ai-history, time-series]
last_reinforced: 2026-04-26
---

# [[LSTM (Long Short-Term Memory)|LSTM (Long Short-Term Memory)]]

## 📌 One-Line Insight (The Karpathy Summary)

> "A smart memory cell that decides for itself what to remember and what to forget."
> A recurrent neural network architecture that introduces a gate structure to address the loss of long-term dependencies, a chronic problem of conventional RNNs.

## 📖 Structured Knowledge (Synthesized Content)

- **Extracted pattern:** A time-series processing pattern in which three gates regulate the flow of information, so that important information is preserved for a long time and unnecessary information is discarded immediately.
- **Details:**
  - **Forget Gate:** Decides which parts of the previous state to discard.
  - **Input Gate:** Decides which parts of the current input to store in the cell state.
  - **Output Gate:** Decides, based on the updated cell state, which values to pass on to the next step.
  - **Cell State:** Information flows along it like a conveyor belt, and the gates add information to it or remove information from it.

## ⚠️ Contradictions and Updates (Contradictions & RL Update)

- **Conflict with past data:** LSTM was once the dominant architecture in natural language processing, but its sequential structure cannot be parallelized across time steps, so it has since given way to the Transformer architecture. It is still used in speech recognition and numerical time-series forecasting.
- **Policy change:** In the Antigravity project, lightweight LSTM models are run in a supporting role for sensor data analysis (telemetry) and user activity pattern prediction.

## 🔗 Knowledge Links (Graph)

- Recurrent-Neural-Network, Gated-Recurrent-Unit, [[Transformer-Architecture|Transformer-Architecture]], [[Time-Series-Analysis|Time-Series-Analysis]]
- **Raw Source:** 10_Wiki/Topics/AI/LSTM (Long Short-Term Memory).md
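
As an illustration of the forget, input, and output gates and the cell state described in this note, here is a minimal sketch of one LSTM time step in plain Python. It rests on several assumptions that are not part of the note itself: the input and states are scalars rather than vectors, the names `lstm_step` and `params` are hypothetical, and the tanh "candidate" value is taken from the commonly used LSTM formulation. It is meant to show the data flow, not to serve as a reference implementation.

```python
import math


def sigmoid(x):
    """Logistic function: squashes x into (0, 1), used as a soft on/off switch."""
    return 1.0 / (1.0 + math.exp(-x))


def lstm_step(x, h_prev, c_prev, params):
    """One LSTM time step with scalar input and scalar state (illustrative only).

    params maps each of "forget", "input", "candidate", "output" to a tuple
    (w_x, w_h, b): input weight, recurrent weight, and bias. This layout is
    an assumption made for the sketch.
    """
    def unit(name, activation):
        w_x, w_h, b = params[name]
        return activation(w_x * x + w_h * h_prev + b)

    f = unit("forget", sigmoid)       # how much of the old cell state to keep
    i = unit("input", sigmoid)        # how much new information to write
    g = unit("candidate", math.tanh)  # the new information proposed for storage
    o = unit("output", sigmoid)       # how much of the cell state to expose

    c = f * c_prev + i * g            # cell state: the "conveyor belt" update
    h = o * math.tanh(c)              # hidden state passed on to the next step
    return h, c


# Hypothetical parameters: forget gate held open, input gate held shut.
remember = {
    "forget": (0.0, 0.0, 20.0),
    "input": (0.0, 0.0, -20.0),
    "candidate": (1.0, 1.0, 0.0),
    "output": (0.0, 0.0, 20.0),
}
h, c = lstm_step(x=3.0, h_prev=0.0, c_prev=0.5, params=remember)
```

With the forget gate saturated near 1 and the input gate near 0, the cell state passes through almost unchanged regardless of the current input, which is the mechanism that lets an LSTM carry information across many steps. Swapping the two biases makes the cell overwrite its memory with the candidate value instead.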