--- id: SYS-K8S-AI-001 category: "10_Wiki/πŸ’‘ Topics/AI" confidence_score: 1.0 tags: [infrastructure, kubernetes, ai-orchestration, mlops, gpu-scheduling, scalability] last_reinforced: 2026-04-26 --- # Kubernetes for AI Orchestration (AI μ˜€μΌ€μŠ€νŠΈλ ˆμ΄μ…˜μ„ μœ„ν•œ μΏ λ²„λ„€ν‹°μŠ€) ## πŸ“Œ ν•œ 쀄 톡찰 (The Karpathy Summary) > "μ„œλ²„μ˜ ꡰ단을 μ½”λ“œ ν•œ μ€„λ‘œ μ§€νœ˜ν•˜μ—¬, AI의 κ±°λŒ€ν•œ μ—°μ‚° λΆ€ν•˜λ₯Ό λΉˆν‹ˆμ—†μ΄ κ΄€λ¦¬ν•˜λΌ" β€” μ»¨ν…Œμ΄λ„ˆν™”λœ AI μ• ν”Œλ¦¬μΌ€μ΄μ…˜μ˜ 배포, ν™•μž₯ 및 관리λ₯Ό μžλ™ν™”ν•˜κ³ , 특히 ν¬μ†Œ μžμ›μΈ GPUλ₯Ό 효율적으둜 ν• λ‹Ή/κ³΅μœ ν•˜κ²Œ ν•΄μ£ΌλŠ” λΆ„μ‚° μ»΄ν“¨νŒ… 운영 ν”Œλž«νΌ. ## πŸ“– κ΅¬μ‘°ν™”λœ 지식 (Synthesized Content) - **μΆ”μΆœλœ νŒ¨ν„΄:** "Declarative Infrastructure" β€” μΈν”„λΌμ˜ μƒνƒœλ₯Ό μ½”λ“œλ‘œ μ„ μ–Έν•˜λ©΄ μ‹œμŠ€ν…œμ΄ 슀슀둜 κ·Έ μƒνƒœλ₯Ό μœ μ§€(Self-healing)ν•˜λ©°, λ™μ μœΌλ‘œ μžμ›μ„ ν• λ‹Ή(Auto-scaling)ν•˜μ—¬ λͺ¨λΈμ˜ μ—°μ‚° μš”κ΅¬λŸ‰μ— 즉각 λŒ€μ‘ν•˜λŠ” μžλ™ν™” νŒ¨ν„΄. - **AI νŠΉν™” κΈ°λŠ₯:** - **GPU Scheduling:** νŠΉμ • νŒŒλ“œ(Pod)에 GPU μžμ›μ„ ν• λ‹Ήν•˜κ³  λͺ¨λ‹ˆν„°λ§. - **Job Orchestration:** λŒ€κ·œλͺ¨ ν•™μŠ΅ μž‘μ—…μ„ μ—¬λŸ¬ λ…Έλ“œμ— λΆ„μ‚° λ°°μΉ˜ν•˜κ³  μ™„λ£Œ ν›„ μžμ› 회수. - **Service Mesh:** λ³΅μž‘ν•˜κ²Œ μ–½νžŒ AI λ§ˆμ΄ν¬λ‘œμ„œλΉ„μŠ€ κ°„μ˜ 톡신과 λ³΄μ•ˆ μ œμ–΄. - **의의:** μ‹€ν—˜μ‹€ μˆ˜μ€€μ˜ AI λͺ¨λΈμ„ 수백만 μ‚¬μš©μžκ°€ μ‚¬μš©ν•˜λŠ” μ—”ν„°ν”„λΌμ΄μ¦ˆ κΈ‰ μ„œλΉ„μŠ€λ‘œ ν™•μž₯ν•˜κΈ° μœ„ν•œ ν•„μˆ˜ 인프라(MLOps의 핡심). ## ⚠️ λͺ¨μˆœ 및 μ—…λ°μ΄νŠΈ (Contradictions & RL Update) - **κ³Όκ±° λ°μ΄ν„°μ™€μ˜ 좩돌:** λ‹¨μˆœ μ›Ή μ„œλΉ„μŠ€μš©μœΌλ‘œ μ“°μ΄λ˜ 단계λ₯Ό λ„˜μ–΄, μ΄μ œλŠ” Kubeflowλ‚˜ Ray와 같은 ν”„λ ˆμž„μ›Œν¬μ™€ κ²°ν•©ν•˜μ—¬ AI μ›Œν¬ν”Œλ‘œμš° 전체λ₯Ό κ΄€λ¦¬ν•˜λŠ” μ „μš© ν”Œλž«νΌμœΌλ‘œ μ§„ν™”. - **μ •μ±… λ³€ν™”:** Antigravity ν”„λ‘œμ νŠΈμ˜ ν΄λΌμš°λ“œ μΈν”„λΌλŠ” μΏ λ²„λ„€ν‹°μŠ€λ₯Ό 기반으둜 μ„€κ³„λ˜μ—ˆμœΌλ©°, μ—μ΄μ „νŠΈμ˜ λΆ€ν•˜κ°€ 급증할 경우 μžλ™μœΌλ‘œ μ»΄ν“¨νŒ… λ…Έλ“œλ₯Ό ν™•μž₯ν•˜μ—¬ μ§€μ—° μ—†λŠ” 지식 μ„œλΉ„μŠ€λ₯Ό μ œκ³΅ν•¨. ## πŸ”— 지식 μ—°κ²° (Graph) - [[Infrastructure-as-Code-IaC]], [[DevOps-for-AI-MLOps]], System-Design-for-AI-Scale, [[High-Availability-Systems]] - **Raw Source:** 10_Wiki/Topics/AI/Kubernetes-for-AI-Orchestration.md