--- id: SYS-LB-001 category: "10_Wiki/πŸ’‘ Topics/AI" confidence_score: 1.0 tags: [infrastructure, load-balancing, high-availability, scalability, system-design] last_reinforced: 2026-04-26 --- # Load Balancing Strategies (λΆ€ν•˜ λΆ„μ‚° μ „λž΅) ## πŸ“Œ ν•œ 쀄 톡찰 (The Karpathy Summary) > "단일 μ§€μ μ˜ κ³ΌλΆ€ν•˜λ₯Ό λ°©μ§€ν•˜κ³  μ‹œμŠ€ν…œ μ „μ²΄μ˜ 체λ ₯을 κ· ν˜• 있게 ν™œμš©ν•˜μ—¬, μ–΄λ– ν•œ νŒŒλ„(Traffic)에도 λ¬΄λ„ˆμ§€μ§€ μ•ŠλŠ” κ²¬κ³ ν•œ μš”μƒˆλ₯Ό κ΅¬μΆ•ν•˜λΌ" β€” ν΄λΌμ΄μ–ΈνŠΈμ˜ μš”μ²­μ„ μ—¬λŸ¬ μ„œλ²„λ‘œ 효율적으둜 λΆ„μ‚°μ‹œμΌœ 응닡 μ‹œκ°„μ„ μ΅œμ ν™”ν•˜κ³  νŠΉμ • μ„œλ²„μ˜ μž₯μ• κ°€ 전체 μ„œλΉ„μŠ€ μ€‘λ‹¨μœΌλ‘œ 이어지지 μ•Šκ²Œ ν•˜λŠ” μ‹œμŠ€ν…œ μ•„ν‚€ν…μ²˜ μ „λž΅. ## πŸ“– κ΅¬μ‘°ν™”λœ 지식 (Synthesized Content) - **μΆ”μΆœλœ νŒ¨ν„΄:** "Distributed Traffic Mediation" β€” μ€‘μ•™μ˜ λ‘œλ“œ λ°ΈλŸ°μ„œκ°€ 각 λ…Έλ“œμ˜ μƒνƒœλ₯Ό μ‹€μ‹œκ°„μœΌλ‘œ 확인(Health Check)ν•˜κ³ , κ°€μš©ν•œ μžμ›μ—κ²Œ μš”μ²­μ„ μ§€λŠ₯적으둜 μ „λ‹¬ν•˜μ—¬ μ‹œμŠ€ν…œμ˜ κ°€μš©μ„±κ³Ό ν™•μž₯성을 λ™μ‹œμ— ν™•λ³΄ν•˜λŠ” μ€‘μž¬ νŒ¨ν„΄. - **μ£Όμš” μ•Œκ³ λ¦¬μ¦˜:** - **Round Robin:** μ„œλ²„ μˆœμ„œλŒ€λ‘œ μ°¨λ‘€μ°¨λ‘€ ν• λ‹Ή. μ„œλ²„ μ„±λŠ₯이 동일할 λ•Œ 유리. - **Least Connections:** ν˜„μž¬ μ—°κ²° μˆ˜κ°€ κ°€μž₯ 적은 μ„œλ²„ μš°μ„ . μž‘μ—… 처리 μ‹œκ°„μ΄ 제각각일 λ•Œ 효과적. - **IP Hash:** ν΄λΌμ΄μ–ΈνŠΈ IPλ₯Ό ν•΄μ‹±ν•˜μ—¬ νŠΉμ • μ„œλ²„μ— κ³ μ •(Sticky Session). μ„Έμ…˜ μœ μ§€κ°€ ν•„μš”ν•  λ•Œ μ‚¬μš©. - **Weighted Strategies:** μ„œλ²„ 사양에 따라 κ°€μ€‘μΉ˜λ₯Ό λΆ€μ—¬ν•˜μ—¬ 더 쒋은 μ„œλ²„μ— 더 λ§Žμ€ λΆ€ν•˜ λ°°μ •. - **L4 vs L7:** - **L4 (Transport Layer):** IP/Port 기반 λΆ„μ‚°. λΉ λ₯΄μ§€λ§Œ μ„Έλ°€ν•œ μ œμ–΄ λΆˆκ°€. - **L7 (Application Layer):** URL, μΏ ν‚€, 헀더 λ“± μ½˜ν…μΈ  기반 λΆ„μ‚°. μ§€λŠ₯적인 λΌμš°νŒ… κ°€λŠ₯. ## ⚠️ λͺ¨μˆœ 및 μ—…λ°μ΄νŠΈ (Contradictions & RL Update) - **κ³Όκ±° λ°μ΄ν„°μ™€μ˜ 좩돌:** κ³ κ°€μ˜ ν•˜λ“œμ›¨μ–΄ μž₯λΉ„ μ€‘μ‹¬μ—μ„œ, μ΄μ œλŠ” ν΄λΌμš°λ“œ 기반의 탄λ ₯적 λ‘œλ“œ λ°ΈλŸ°μ‹±(ELB, ALB)κ³Ό μ„œλΉ„μŠ€ λ©”μ‹œ(Service Mesh)λ₯Ό ν†΅ν•œ μ •κ΅ν•œ νŠΈλž˜ν”½ μ œμ–΄λ‘œ νŒ¨λŸ¬λ‹€μž„ μ „ν™˜. - **μ •μ±… λ³€ν™”:** Antigravity ν”„λ‘œμ νŠΈμ˜ λ°±μ—”λ“œ μ•„ν‚€ν…μ²˜λŠ” L7 λ‘œλ“œ λ°ΈλŸ°μ‹±μ„ 톡해 μ—μ΄μ „νŠΈ μš”μ²­μ˜ μœ ν˜•(지식 검색 vs λͺ¨λΈ 생성)에 따라 μ΅œμ ν™”λœ μ—°μ‚° λ…Έλ“œλ‘œ νŠΈλž˜ν”½μ„ λΌμš°νŒ…ν•¨. ## πŸ”— 지식 μ—°κ²° (Graph) - [[High-Availability-Systems]], System-Design-for-AI-Scale, Cloud-Computing-Foundations, [[Kubernetes-for-AI-Orchestration]] - **Raw Source:** 10_Wiki/Topics/AI/Load-Balancing-Strategies.md