--- id: [[MLOps|MLOps]]-DEPLOY-001 category: "10_Wiki/πŸ’‘ Topics/AI" confidence_score: 1.0 tags: [mlops, model-deployment, cicd, canary-deployment, blue-green,[[_system|system]]-design] last_reinforced: 2026-04-26 --- # Model Deployment Patterns (λͺ¨λΈ 배포 νŒ¨ν„΄) ## πŸ“Œ ν•œ 쀄 톡찰 (The Karpathy Summary) > "λͺ¨λΈμ˜ ꡐ체가 μ„œλΉ„μŠ€μ˜ 쀑단이 μ•„λ‹Œ 'μžμ—°μŠ€λŸ¬μš΄ μ§„ν™”'κ°€ λ˜λ„λ‘, μ•ˆμ „ν•˜κ³  탄λ ₯적인 배포 관문을 μ„€κ³„ν•˜λΌ" β€” λ¨Έμ‹ λŸ¬λ‹ λͺ¨λΈμ„ ν”„λ‘œλ•μ…˜ ν™˜κ²½μ— μ μš©ν•  λ•Œ 리슀크λ₯Ό μ΅œμ†Œν™”ν•˜κ³  μ•ˆμ •μ μΈ μ „ν™˜μ„ 보μž₯ν•˜κΈ° μœ„ν•œ μ•„ν‚€ν…μ²˜ νŒ¨ν„΄. ## πŸ“– κ΅¬μ‘°ν™”λœ 지식 (Synthesized Content) - **μΆ”μΆœλœ νŒ¨ν„΄:** "Staged Transition and Risk Isolation" β€” μ‹ κ·œ λͺ¨λΈμ„ μ¦‰μ‹œ 전체 μ μš©ν•˜λŠ” λŒ€μ‹ , νŠΈλž˜ν”½μ„ λ‹¨κ³„μ μœΌλ‘œ μ œμ–΄ν•˜κ±°λ‚˜ 병렬 ν™˜κ²½μ—μ„œ κ²€μ¦ν•¨μœΌλ‘œμ¨ 배포 ν›„ λ°œμƒν•  수 μžˆλŠ” μ„±λŠ₯ μ €ν•˜λ‚˜ μ˜ˆμ™Έ μƒν™©μœΌλ‘œλΆ€ν„° μ‹œμŠ€ν…œμ„ λ³΄ν˜Έν•˜λŠ” 배포 νŒ¨ν„΄. - **μ£Όμš” νŒ¨ν„΄:** - **Canary Deployment:** μ†Œμˆ˜μ˜ μ‚¬μš©μž(예: 5%)μ—κ²Œ λ¨Όμ € μ‹ κ·œ λͺ¨λΈμ„ λ…ΈμΆœν•˜μ—¬ μ§€ν‘œ 확인 ν›„ 점진적 ν™•λŒ€. - **Blue-Green Deployment:** ꡬ버전(Blue)κ³Ό 신버전(Green) ν™˜κ²½μ„ λ™μ‹œμ— λ„μ›Œλ‘κ³  λ‘œλ“œ λ°ΈλŸ°μ„œλ₯Ό 톡해 ν•œ λ²ˆμ— μŠ€μœ„μΉ­. - **Shadow Deployment:** μ‹ κ·œ λͺ¨λΈμ΄ μ‹€μ œ νŠΈλž˜ν”½μ„ λ°›μ§€λ§Œ 응닡은 λ°˜ν™˜ν•˜μ§€ μ•Šκ³ , 둜그만 남겨 μ„±λŠ₯을 비ꡐ 검증. - **A/B [[Testing|Testing]]:** 두 λͺ¨λΈμ˜ μ„±λŠ₯을 ν†΅κ³„μ μœΌλ‘œ λΉ„κ΅ν•˜μ—¬ λΉ„μ¦ˆλ‹ˆμŠ€ μ§€ν‘œμ— 더 μœ λ¦¬ν•œ λͺ¨λΈ 선택. - **의의:** λΉˆλ²ˆν•œ λͺ¨λΈ μ—…λ°μ΄νŠΈκ°€ ν•„μš”ν•œ ν˜„λŒ€ AI μ„œλΉ„μŠ€μ—μ„œ μ‹œμŠ€ν…œ μ•ˆμ •μ„±μ„ ν•΄μΉ˜μ§€ μ•Šκ³  지속적인 κ°œμ„ (CI/CD)을 κ°€λŠ₯μΌ€ 함. ## ⚠️ λͺ¨μˆœ 및 μ—…λ°μ΄νŠΈ (Contradictions & RL Update) - **κ³Όκ±° λ°μ΄ν„°μ™€μ˜ 좩돌:** λ‹¨μˆœνžˆ νŒŒμΌμ„ κ΅μ²΄ν•˜λŠ” 정적 λ°°ν¬μ—μ„œ, μ΄μ œλŠ” 데이터와 λͺ¨λΈ, μ½”λ“œκ°€ 유기적으둜 맞물렀 λ°°ν¬λ˜λŠ” 'νŒŒμ΄ν”„λΌμΈ 쀑심 배포'둜 νŒ¨λŸ¬λ‹€μž„μ΄ 전이됨. - **μ •μ±… λ³€ν™”:** Antigravity ν”„λ‘œμ νŠΈλŠ” μ—μ΄μ „νŠΈμ˜ 핡심 μΆ”λ‘  λͺ¨λΈ μ—…λ°μ΄νŠΈ μ‹œ, Shadow Deployment νŒ¨ν„΄μ„ 톡해 κΈ°μ‘΄ μ‘λ‹΅κ³Όμ˜ 일관성을 48μ‹œκ°„ 이상 κ²€μ¦ν•˜λŠ” 것을 ν‘œμ€€ 절차둜 μ‚ΌμŒ. ## πŸ”— 지식 μ—°κ²° (Graph) - [[Microservices-Architecture|Microservices-Architecture]], [[Load-Balancing-Strategies|Load-Balancing-Strategies]], [[Model-Drift-and-Monitoring|Model-Drift-and-Monitoring]], [[High-Availability-Systems|High-Availability-Systems]] - **Raw Source:** 10_Wiki/Topics/AI/Model-Deployment-Patterns.md