"매 hidden state evolves over time, observations emerge from it". State space 는 매 control theory / signal processing 의 핵심 representation: x_{t+1}=Ax_t+Bu_t,\; y_t=Cx_t+Du_t. 매 2024-2026 의 deep learning 의 SSM (S4, Mamba, Mamba-2) 가 매 Transformer 의 long-context alternative 로 부상.
매 핵심
매 control-theory origin
Statex_t: 매 system 의 internal memory.
Inputu_t: 매 external control / observation.
Outputy_t: 매 measurable.
MatricesA, B, C, D: 매 dynamics, input mapping, observation, feedthrough.
Discrete vs continuous: \dot x = Ax + Bu (continuous) vs x_{t+1} (discrete).
매 deep learning SSM 진화
HiPPO (2020): 매 long-range memory 의 polynomial 근사 — 매 A matrix 의 영리한 init.
S4 (2022): structured SSM, FFT-based convolution view, long-range arena SOTA.
H3, Hyena: 매 SSM + gating 의 hybrid.
Mamba (2023): 매 selective SSM — 매 input-dependent A,B,C, hardware-aware parallel scan.
Mamba-2 (2024): 매 SSD (state-space duality) — 매 attention 과 의 unification.
Hybrid (2025+): 매 Jamba, Zamba, Samba — 매 Transformer + Mamba mix.
매 vs Transformer
측면
Transformer
SSM (Mamba)
Train
O(N²) parallel
O(N) parallel scan
Infer
O(N) per token, KV cache
O(1) per token, fixed state
Memory at infer
grows with context
constant
Long context
quadratic cost
linear cost
In-context recall
strong
weaker (improving with hybrids)
매 응용
Control systems — 매 robotics, aerospace, Kalman filter.
Time series — 매 Kalman / particle filter, dynamic factor model.
Long-context LLM — 매 Mamba-3B, Jamba-1.5 의 1M+ context.
DNA / genomics — 매 Caduceus, HyenaDNA 의 long sequence.
언제: SSM literature 정리, mamba-ssm boilerplate, control-system identification 의 sympy / numpy code.
언제 X: real-time safety-critical control (aerospace, medical) — 매 verified controller / formal methods 의 영역.
❌ 안티패턴
HiPPO init 무시: random init Mamba 의 long-range 성능 추락. 매 proper A init 필수.
Pure Mamba 의 in-context retrieval 기대: needle-in-haystack 약함. Hybrid 권장.
No selective mechanism: 매 vanilla S4 의 modern Mamba 보다 약함 — input-dependent param 필수.
Classical SSM 의 nonlinearity 무시: 매 real system 의 nonlinear — EKF / UKF / particle filter 사용.
CUDA scan 미활용: 매 naive Python loop 의 100배 slow. mamba-ssm 의 official kernel 사용.
🧪 검증 / 중복
Verified (Kalman 1960, Gu et al. S4 2022, Gu & Dao Mamba 2023, Dao & Gu Mamba-2 2024, Jamba 2024).
신뢰도 A.
🕓 Changelog
날짜
변경
2026-05-08
Phase 1
2026-05-10
Manual cleanup — control SSM + modern Mamba/Mamba-2/Jamba