id: wiki-2026-0508-gpu-infrastructure
title: GPU Infrastructure
category: 10_Wiki/Topics
status: duplicate
canonical_id: wiki-2026-0508-gpu
duplicate_of: GPU
aliases: GPU infra, GPU cluster, AI infra, NVLink, Infiniband
source_trust_level: A
confidence_score: 0.92
verification_status: redirected
tags: duplicate, gpu, infrastructure, ai-infra
last_reinforced: 2026-05-10
github_commit: pending

GPU Infrastructure

This document is a specialization of GPU. It redirects to the canonical document.

Key summary (infrastructure-specific)

  • Multi-GPU nodes (NVLink, NVSwitch).
  • Multi-node clusters (InfiniBand, RoCE).
  • Cloud (AWS p5/p4d, Azure ND, GCP A3).
  • Colocation / on-prem (Lambda, CoreWeave).
  • Distributed training (FSDP, ZeRO, TP, PP).
  • Spot / preemptible cost optimization.
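
As a rough illustration of the spot / preemptible item above, the following is a minimal sketch of how one might compare spot and on-demand cost once work lost to preemptions is counted. The function names, the flat `rework_fraction` model, and all numbers are assumptions made for illustration only; they do not come from this page or from any provider's pricing.

```python
def effective_spot_cost(on_demand_hourly, spot_discount, job_hours, rework_fraction):
    """Estimate the total cost of a job run on spot capacity.

    All parameters are hypothetical inputs:
      on_demand_hourly: on-demand price per hour
      spot_discount:    fraction off the on-demand price (0.6 means 60% cheaper)
      job_hours:        compute hours the job needs with no interruptions
      rework_fraction:  extra hours redone after preemptions, as a fraction of job_hours
    """
    if not 0 <= spot_discount < 1:
        raise ValueError("spot_discount must be in [0, 1)")
    if rework_fraction < 0:
        raise ValueError("rework_fraction must be non-negative")
    spot_hourly = on_demand_hourly * (1 - spot_discount)
    total_hours = job_hours * (1 + rework_fraction)
    return spot_hourly * total_hours


def break_even_rework(spot_discount):
    """Rework fraction at which spot cost equals on-demand cost.

    Derived from (1 - d) * (1 + r) = 1, which gives r = d / (1 - d).
    """
    if not 0 <= spot_discount < 1:
        raise ValueError("spot_discount must be in [0, 1)")
    return spot_discount / (1 - spot_discount)


if __name__ == "__main__":
    # Assumed example: 10.0 per hour on-demand, 50% spot discount,
    # a 100-hour job, and 20% of the work redone after preemptions.
    print(effective_spot_cost(10.0, 0.5, 100, 0.2))
    print(break_even_rework(0.5))
```

Under this simplified model, spot capacity stays cheaper as long as the rework fraction is below `break_even_rework(spot_discount)`; frequent checkpointing is what keeps the rework fraction small in practice.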

🔗 Graph

🕓 Change history

Date        Change
2026-05-08  Phase 1
2026-05-10  Duplicate handling: redirected to the canonical document