--- id: DL-SSM-001 category: "10_Wiki/πŸ’‘ Topics/AI" confidence_score: 1.0 tags: [ai, deep-learning, ssm, state-space-models, mamba, sequence-modeling, efficiency, transformer-alternative] last_reinforced: 2026-04-26 --- # State Space Models (SSM, μƒνƒœ 곡간 λͺ¨λΈ) ## πŸ“Œ ν•œ 쀄 톡찰 (The Karpathy Summary) > "λ°μ΄ν„°μ˜ 흐름을 연속적인 'μƒνƒœμ˜ λ³€ν™”'둜 λͺ¨λΈλ§ν•˜μ—¬ 트랜슀포머의 μ—°μ‚° 병λͺ©μ„ λŒνŒŒν•˜κ³ , λ¬΄ν•œμ— κ°€κΉŒμš΄ λ¬Έλ§₯을 μ„ ν˜•μ μΈ νš¨μœ¨μ„±($O(N)$)으둜 ν¬μ°©ν•˜λΌ" β€” κ³ μ „ μ œμ–΄ 이둠의 μƒνƒœ 방정식을 ν˜„λŒ€μ  μ‹ κ²½λ§μœΌλ‘œ μž¬ν•΄μ„ν•˜μ—¬ 초μž₯κΈ° μ‹œν€€μŠ€ μ²˜λ¦¬μ— μ΅œμ ν™”λœ μ°¨μ„ΈλŒ€ μ•„ν‚€ν…μ²˜. ## πŸ“– κ΅¬μ‘°ν™”λœ 지식 (Synthesized Content) - **μΆ”μΆœλœ νŒ¨ν„΄:** "Continuous State Evolution and Recurrent-Convolutional Duality" β€” μž…λ ₯을 은닉 μƒνƒœ(Hidden State)둜 μ••μΆ•ν•˜μ—¬ μ—…λ°μ΄νŠΈν•΄ λ‚˜κ°€λŠ” μˆœν™˜ 방식(Recurrent)κ³Ό, 이λ₯Ό ν•œκΊΌλ²ˆμ— μ²˜λ¦¬ν•˜λŠ” ν•©μ„±κ³± 방식(Convolutional)의 μž₯점을 κ²°ν•©ν•˜μ—¬ μ—°μ‚° 효율과 병렬성을 λ™μ‹œμ— λ‹¬μ„±ν•˜λŠ” νŒ¨ν„΄. - **핡심 νŠΉμ§•:** - **Linear Scalability:** μ‹œν€€μŠ€ 길이에 λΉ„λ‘€ν•΄ μ—°μ‚°λŸ‰μ΄ λŠ˜μ–΄λ‚¨ ($O(N)$). 트랜슀포머($O(N^2)$) λŒ€λΉ„ 압도적 효율. - **Memory Efficiency:** 전체 κ³Όκ±° 데이터λ₯Ό λ‹€ κΈ°μ–΅ν•˜μ§€ μ•Šκ³ λ„ 핡심 μƒνƒœκ°’λ§Œμ„ μœ μ§€ν•˜λ©° λ¬΄ν•œν•œ 길이 λŒ€μ‘ κ°€λŠ₯. - **Selective Mechanism (Mamba):** μ€‘μš”ν•œ μ •λ³΄λŠ” 남기고 μ‚¬μ†Œν•œ μ •λ³΄λŠ” μžŠλŠ” μ§€λŠ₯ν˜• 필터링 κΈ°λŠ₯ νƒ‘μž¬. - **의의:** ν…μŠ€νŠΈλΏλ§Œ μ•„λ‹ˆλΌ μˆ˜μ‹­λ§Œ ν”„λ ˆμž„μ˜ μ˜μƒ, κΈ΄ DNA μ—ΌκΈ°μ„œμ—΄ λ“± κΈ°μ‘΄ νŠΈλžœμŠ€ν¬λ¨Έκ°€ μ²˜λ¦¬ν•˜κΈ° νž˜λ“€μ—ˆλ˜ 'κ±°λŒ€ μ‹œν€€μŠ€' λΆ„μ„μ˜ μƒˆλ‘œμš΄ 지평을 μ—Ό. ## ⚠️ λͺ¨μˆœ 및 μ—…λ°μ΄νŠΈ (Contradictions & RL Update) - **κ³Όκ±° λ°μ΄ν„°μ™€μ˜ 좩돌:** μ‹œν€€μŠ€ λͺ¨λΈλ§μ€ μ–΄ν…μ…˜(Attention)이 μœ μΌν•œ μ •λ‹΅μ΄λΌλŠ” λ―ΏμŒμ„ κΉ¨κ³ , 고전적인 μƒνƒœ 곡간 κ°œλ…μ΄ ν˜„λŒ€μ  ν•˜λ“œμ›¨μ–΄ μ΅œμ ν™”(Flash Attentionκ³Ό μœ μ‚¬ν•œ 기법)와 λ§Œλ‚˜ 트랜슀포머λ₯Ό μœ„ν˜‘ν•˜λŠ” κ°•λ ₯ν•œ λŒ€μ•ˆμœΌλ‘œ 뢀상함. - **μ •μ±… λ³€ν™”:** Antigravity ν”„λ‘œμ νŠΈλŠ” μ‹€μ‹œκ°„μœΌλ‘œ μŸμ•„μ§€λŠ” λ°©λŒ€ν•œ μ—μ΄μ „νŠΈ 둜그 λΆ„μ„μ΄λ‚˜ μ‹€μ‹œκ°„ 슀트리밍 지식 처리 μ‹œ, μ €μ§€μ—°κ³Ό 고효율이 보μž₯된 SSM 기반의 κ²½λŸ‰ λͺ¨λΈμ„ μ‹€ν—˜μ μœΌλ‘œ μ μš©ν•¨. ## πŸ”— 지식 μ—°κ²° (Graph) - [[Self-Attention-Mechanisms|Self-Attention-Mechanisms]], Recurrent-Neural-Networks-RNN, [[Scalability-in-AI-Systems|Scalability-in-AI-Systems]], [[Sequence-to-Sequence-Models|Sequence-to-Sequence-Models]] - **Raw Source:** 10_Wiki/Topics/AI/State-Space-Models.md