--- id: SYS-OBS-001 category: "10_Wiki/πŸ’‘ Topics/AI" confidence_score: 1.0 tags: [systems, observability, shadowing, monitoring, mlops, distributed-tracing, reliability] last_reinforced: 2026-04-26 --- # Shadowing and Observability (μ„€λ„μž‰ 및 κ΄€μΈ‘μ„±) ## πŸ“Œ ν•œ 쀄 톡찰 (The Karpathy Summary) > "μ‚¬μš©μž λͺ¨λ₯΄κ²Œ μ‹€μ œ νŠΈλž˜ν”½μ˜ λ³΅μ‚¬λ³ΈμœΌλ‘œ λͺ¨λΈμ˜ λ‹΄λ ₯을 μ‹œν—˜(Shadowing)ν•˜κ³ , μ‹œμŠ€ν…œ λ‚΄λΆ€μ˜ λͺ¨λ“  μ‹ ν˜Έλ₯Ό 투λͺ…ν•˜κ²Œ κΈ°λ‘ν•˜μ—¬ μž₯μ• μ˜ μ§•ν›„λ₯Ό μ„ μ œμ μœΌλ‘œ ν¬μ°©ν•˜λΌ" β€” 운영 ν™˜κ²½μ— 영ν–₯을 μ£Όμ§€ μ•ŠλŠ” μ•ˆμ „ν•œ ν…ŒμŠ€νŠΈ 기법과 μ‹œμŠ€ν…œμ˜ λ™μž‘ μƒνƒœλ₯Ό μ •λ°€ν•˜κ²Œ νŒŒμ•…ν•˜κΈ° μœ„ν•œ κ΄€μΈ‘ 체계. ## πŸ“– κ΅¬μ‘°ν™”λœ 지식 (Synthesized Content) - **μΆ”μΆœλœ νŒ¨ν„΄:** "Risk-free Validation and Transparent Monitoring" β€” μƒˆλ‘œμš΄ λͺ¨λΈμ΄λ‚˜ μ½”λ“œλ₯Ό 배포할 λ•Œ μ‹€μ œ νŠΈλž˜ν”½μ„ λ³‘λ ¬λ‘œ ν˜λ €λ³΄λ‚΄ κ²°κ³Όλ₯Ό 비ꡐ κ²€μ¦ν•˜κ³ , λ©”νŠΈλ¦­/둜그/νŠΈλ ˆμ΄μ‹±μ˜ 3λŒ€ μš”μ†Œλ₯Ό κ²°ν•©ν•΄ μž₯μ• μ˜ 원인을 즉각 규λͺ…ν•˜λŠ” νŒ¨ν„΄. - **핡심 μš”μ†Œ:** - **Shadow Deployment:** 운영 μ„œλΉ„μŠ€ κ²°κ³ΌλŠ” λ¬΄μ‹œν•˜κ³  μƒˆ λͺ¨λΈμ˜ μ˜ˆμΈ‘κ°’λ§Œ κΈ°λ‘ν•˜μ—¬ μ„±λŠ₯ 비ꡐ. 리슀크 μ—†λŠ” μ‹€μ „ ν…ŒμŠ€νŠΈ κ°€λŠ₯. - **Observability (Three Pillars):** - **Metrics:** μˆ˜μΉ˜ν™”λœ μ§€ν‘œ (CPU μ‚¬μš©λŸ‰, Latency λ“±). - **Logs:** λ°œμƒν•œ μ‚¬κ±΄μ˜ 상세 기둝. - **Tracing:** μ„œλΉ„μŠ€ κ°„μ˜ 호좜 경둜 좔적 (Distributed Tracing). - **의의:** λ³΅μž‘ν•΄μ§„ λ§ˆμ΄ν¬λ‘œμ„œλΉ„μŠ€ ν™˜κ²½μ—μ„œ "무슨 일이 μΌμ–΄λ‚¬λŠ”κ°€"λ₯Ό λ„˜μ–΄ "μ™œ μΌμ–΄λ‚¬λŠ”κ°€"에 λŒ€ν•œ 닡을 μ œκ³΅ν•˜λ©°, 배포의 두렀움을 데이터 기반의 ν™•μ‹ μœΌλ‘œ λ°”κΏˆ. ## ⚠️ λͺ¨μˆœ 및 μ—…λ°μ΄νŠΈ (Contradictions & RL Update) - **κ³Όκ±° λ°μ΄ν„°μ™€μ˜ 좩돌:** λ‹¨μˆœνžˆ μ„œλ²„κ°€ μ£½μ—ˆλŠ”μ§€λ§Œ μ²΄ν¬ν•˜λ˜ 'λͺ¨λ‹ˆν„°λ§'의 μ‹œλŒ€λ₯Ό μ§€λ‚˜, μ΄μ œλŠ” λΆ„μ‚° μ‹œμŠ€ν…œ μ „μ²΄μ˜ λ§₯락을 μ΄ν•΄ν•˜κ³  μ˜ˆμƒμΉ˜ λͺ»ν•œ 문제(Unknown-Unknowns)λ₯Ό νƒμ‚¬ν•˜λŠ” 'κ΄€μΈ‘μ„±' μ€‘μ‹¬μ˜ μ—”μ§€λ‹ˆμ–΄λ§μœΌλ‘œ νŒ¨λŸ¬λ‹€μž„μ΄ 이동함. - **μ •μ±… λ³€ν™”:** Antigravity ν”„λ‘œμ νŠΈλŠ” μ—μ΄μ „νŠΈμ˜ λ‹΅λ³€ 생성 λͺ¨λΈ μ—…λ°μ΄νŠΈ μ‹œ, μ΅œμ†Œ 24μ‹œκ°„μ˜ μ„€λ„μž‰ 기간을 거쳐 κΈ°μ‘΄ λͺ¨λΈκ³Όμ˜ 응닡 ν’ˆμ§ˆ 차이λ₯Ό μ •λ°€ λΆ„μ„ν•œ ν›„ μ΅œμ’… 배포λ₯Ό 결정함. ## πŸ”— 지식 μ—°κ²° (Graph) - [[Scalability-in-AI-Systems]], [[Service-oriented-Architecture]], Reliability-Engineering, MLOps-Best-Practices - **Raw Source:** 10_Wiki/Topics/AI/Shadowing-and-Observability.md