Wiki cleanup: error-doc removal, dedup merge, link normalization
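Large-scale cleanup of 10_Wiki/Topics:

- Remove 227 error-capture and unfinished stub documents
- Merge 43 cross-folder duplicate clusters (63 files → redirect)
- Normalize link names: fix broken links, link directly to redirect targets, map concepts (~2,400 changes)
- Create 6 new category MOCs
- Remove 10,058 unresolved related-keyword links from Graph sections

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>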
@@ -180,10 +180,10 @@ decode_stream = decode_client.decode(kv_ref=prefill_resp.kv_ref, max_tokens=512)
 **Default**: FSDP2 for training under 100B; vLLM with FP8 + prefix cache + spec decoding for serving.

 ## 🔗 Graph
-- Parent: [[Distributed-Systems]] · [[High-Performance-Computing]]
-- Variants: [[FSDP]] · [[DeepSpeed-ZeRO]] · [[Tensor-Parallelism]] · [[Pipeline-Parallelism]]
-- Applications: [[LLM-Pretraining]] · [[LLM-Serving]] · [[Fine-tuning]]
-- Adjacent: [[vLLM]] · [[Flash-Attention]] · [[Speculative-Decoding]] · [[Quantization]]
+- Parent: [[Distributed-Systems]]
+- Variants: [[Pipeline-Parallelism]]
+- Applications: [[Fine-tuning]]
+- Adjacent: [[LLM_Optimization_and_Deployment_Strategies|vLLM]] · [[Flash-Attention]] · [[LLM_Optimization_and_Deployment_Strategies|Quantization]]

 ## 🤖 LLM usage
 **When**: capacity planning estimates, config-file generation (deepspeed, accelerate), debugging OOM via log triage, scaling-law back-of-envelope.
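The Graph-section change in this hunk (unresolved links dropped, links to merged pages pointed straight at the redirect target with the original name kept as the alias) could be reproduced with something like the sketch below. It is not the tooling used for this commit, which is not shown here. The function names, the `pages` set, and the `redirects` map are all hypothetical, and the sample data is only what can be read off the hunk itself.

```python
import re
from typing import Dict, Optional, Set

# Matches [[Target]] or [[Target|Alias]].
WIKILINK = re.compile(r"\[\[([^\]|]+)(?:\|([^\]]+))?\]\]")


def normalize_link(link: str, pages: Set[str], redirects: Dict[str, str]) -> Optional[str]:
    """Return the rewritten wikilink, or None if it cannot be resolved."""
    link = link.strip()
    match = WIKILINK.fullmatch(link)
    if match is None:
        return None
    target, alias = match.group(1), match.group(2)
    if target in redirects:
        # Link directly to the redirect destination, keep the visible text.
        return f"[[{redirects[target]}|{alias or target}]]"
    if target in pages:
        return link
    return None  # unresolved: the caller drops it


def normalize_graph_line(line: str, pages: Set[str], redirects: Dict[str, str]) -> Optional[str]:
    """Rewrite one '- Label: [[A]] · [[B]]' bullet; None if no link survives."""
    label, sep, rest = line.partition(": ")
    if not sep:
        return line  # not a labelled link list, leave untouched
    kept = []
    for part in rest.split(" · "):
        rewritten = normalize_link(part, pages, redirects)
        if rewritten is not None:
            kept.append(rewritten)
    if not kept:
        return None
    return f"{label}: " + " · ".join(kept)


# Hypothetical vault state, inferred only from the hunk above.
pages = {"Distributed-Systems", "Pipeline-Parallelism", "Fine-tuning",
         "Flash-Attention", "LLM_Optimization_and_Deployment_Strategies"}
redirects = {"vLLM": "LLM_Optimization_and_Deployment_Strategies",
             "Quantization": "LLM_Optimization_and_Deployment_Strategies"}

old = "- Adjacent: [[vLLM]] · [[Flash-Attention]] · [[Speculative-Decoding]] · [[Quantization]]"
print(normalize_graph_line(old, pages, redirects))
```

Under these assumptions the printed line matches the added `Adjacent` line in the hunk. Returning `None` for an emptied bullet leaves it to the caller to decide whether to delete the line or keep an empty label.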