Files

T

Antigravity Agent f8b21af4be Wiki cleanup: error-doc removal, dedup merge, link normalization

10_Wiki/Topics 대규모 정리:
- 오류 캡처/미완성 stub 문서 227개 제거
- 교차폴더 중복 43클러스터 병합 (63파일 → redirect)
- 링크명 정규화: 깨진 링크 수정·redirect 직결·개념 매핑 ~2,400건
- 카테고리 MOC 6개 신규 생성
- Graph 섹션 미해결 related-keyword 링크 10,058건 제거

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-05-20 23:52:15 +09:00

5.7 KiB

Raw Blame History

id, title, category, status, canonical_id, aliases, duplicate_of, source_trust_level, confidence_score, verification_status, tags, raw_sources, last_reinforced, github_commit, tech_stack

title

API Gateway

매 한 줄

"매 single entry point — fan-out, auth, rate limit". 매 microservices 의 클라이언트 facing facade. 매 Netflix Zuul (2013) 시작 → Kong (2015) → Envoy/Istio (2017) → AWS API Gateway HTTP API (2019). 매 2026 modern stack 은 Envoy + xDS control plane, edge AI inference gateway (LiteLLM, Portkey) 의 추가.

매 핵심

매 책임

Routing: path/host/header → upstream service.
Auth/AuthZ: JWT validation, OAuth2 introspection, mTLS termination.
Rate limiting: per-key, per-IP, sliding window.
Observability: trace propagation (W3C Trace Context), metrics, access log.
Transformation: request/response shaping, protocol translation (REST↔gRPC).

매 NOT 책임

Business logic — 매 service 의 책임.
Data persistence — 매 stateless edge.
Heavy aggregation — 매 BFF (Backend-for-Frontend) layer 의 책임.

매 응용

Public API edge — Stripe, Twilio 형 SaaS API.
BFF per client — mobile/web/CLI 매 다른 shape.
LLM gateway — multi-provider routing (Claude, GPT, local), fallback, cost cap.

💻 패턴

Kong declarative config

_format_version: "3.0"
services:
  - name: orders-api
    url: http://orders.svc.cluster.local:8080
    routes:
      - name: orders-route
        paths: ["/api/orders"]
        strip_path: false
    plugins:
      - name: rate-limiting
        config: { minute: 600, policy: redis }
      - name: jwt
        config: { key_claim_name: kid }
      - name: prometheus

Envoy route config

route_config:
  virtual_hosts:
    - name: api
      domains: ["api.example.com"]
      routes:
        - match: { prefix: "/v1/orders" }
          route:
            cluster: orders_cluster
            timeout: 5s
            retry_policy:
              retry_on: 5xx,reset,connect-failure
              num_retries: 2
              per_try_timeout: 1s

AWS API Gateway HTTP API + Lambda authorizer

# SAM template
HttpApi:
  Type: AWS::Serverless::HttpApi
  Properties:
    Auth:
      Authorizers:
        JwtAuth:
          IdentitySource: $request.header.Authorization
          JwtConfiguration:
            issuer: https://auth.example.com
            audience: [api.example.com]
      DefaultAuthorizer: JwtAuth
    RouteSettings:
      "POST /orders":
        ThrottlingBurstLimit: 100
        ThrottlingRateLimit: 50

LLM gateway (Portkey-style fallback)

from portkey_ai import Portkey

client = Portkey(
    api_key="...",
    config={
        "strategy": {"mode": "fallback"},
        "targets": [
            {"provider": "anthropic", "override_params": {"model": "claude-opus-4-7"}},
            {"provider": "openai",    "override_params": {"model": "gpt-5"}},
        ],
        "cache": {"mode": "semantic", "max_age": 3600},
    },
)
resp = client.chat.completions.create(messages=[{"role":"user","content":"hi"}])

Rate limit (token bucket, Redis)

-- Kong-style Redis Lua
local key = "rl:" .. consumer_id
local tokens = tonumber(redis.call("GET", key) or "100")
if tokens <= 0 then return 429 end
redis.call("DECR", key)
redis.call("EXPIRE", key, 60)
return 200

Header-based canary

routes:
  - match:
      prefix: "/v1/checkout"
      headers: [{name: "x-canary", exact_match: "true"}]
    route: { cluster: checkout_v2 }
  - match: { prefix: "/v1/checkout" }
    route: { cluster: checkout_v1 }

gRPC-Web transcoding

http_filters:
  - name: envoy.filters.http.grpc_web
  - name: envoy.filters.http.grpc_json_transcoder
    typed_config:
      proto_descriptor: /etc/proto/api.pb
      services: ["api.OrderService"]

매 결정 기준

상황	Approach
Public SaaS API, multi-tenant	Kong / AWS API Gateway
Service mesh edge ingress	Envoy + Istio Gateway
Single-team internal API	Skip gateway → direct service + library SDK
Multi-LLM provider	Portkey / LiteLLM gateway
Heterogeneous protocols (REST+gRPC+WS)	Envoy with transcoding filters

기본값: 매 Envoy-based (Istio Gateway / Contour) 의 in-cluster, AWS API Gateway 의 fully managed edge.

🔗 Graph

🤖 LLM 활용

언제: 매 multi-service public API, 매 cross-cutting concerns (auth/rate-limit/observability) 의 centralization, 매 multi-provider LLM routing. 언제 X: 매 single monolith, 매 internal service-to-service only (use mesh sidecar), 매 hot path 의 < 100us latency 요구.

❌ 안티패턴

Smart gateway: 매 business logic 의 gateway 에 stuff — 매 deployment coupling 의 발생.
Single gateway for all clients: 매 mobile/web/partner 매 BFF 의 분리 안 함 → over-fetching.
No timeout/retry budget: 매 cascading failure 의 발생.
Auth-only gateway, no rate limit: 매 abuse vector.

🧪 검증 / 중복

Verified (Kong docs, Envoy docs, AWS API Gateway docs, Microsoft Azure Architecture Center "Gateway Aggregation" pattern).
신뢰도 A.

🕓 Changelog

날짜	변경
2026-05-08	Phase 1
2026-05-10	Manual cleanup — full content (Kong/Envoy/AWS/LLM gateway patterns)

5.7 KiB Raw Blame History