chore(wiki): normalize dangling links to canonical titles (768 files / 1,200 links)

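Replaced [[wikilinks]] that differed from their target only in name (spelling/notation variants) with the target document's canonical title, reconnecting 1,200 broken links. Only matches on the normalized title or filename were applied; alias matching was excluded because of the over-merge risk (ambiguity guard). Originals are backed up in _link_reconcile_backup/. Tool: Datacollect/scripts/link_reconcile_apply.mjs

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>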
2026-06-08 12:24:15 +09:00
parent 2ddf30f8e4
commit d8a80f6272
768 changed files with 1085 additions and 1085 deletions
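
The normalization described in the commit message can be sketched roughly as follows. This is an illustrative Python sketch, not the contents of `link_reconcile_apply.mjs`: the function names (`normalize`, `build_index`, `reconcile`) and the exact folding rules (case-insensitive; hyphens, underscores, and whitespace treated as equivalent) are assumptions. It rewrites a link only when its normalized target matches exactly one canonical title, leaves display labels and anchors intact, and never consults aliases.

```python
import re
import unicodedata

# [[Target]], [[Target#anchor]], [[Target|label]] (assumed wikilink syntax)
WIKILINK = re.compile(r"\[\[([^\]|#]+)(#[^\]|]*)?(\|[^\]]*)?\]\]")


def normalize(title):
    """Fold case and treat hyphens, underscores, and whitespace as equivalent."""
    folded = unicodedata.normalize("NFKC", title).casefold()
    return re.sub(r"[\s\-_]+", " ", folded).strip()


def build_index(canonical_titles):
    """Map normalized key -> canonical title, dropping ambiguous keys."""
    index, ambiguous = {}, set()
    for title in canonical_titles:
        key = normalize(title)
        if key in index and index[key] != title:
            ambiguous.add(key)  # two different titles collide: ambiguity guard
        index[key] = title
    for key in ambiguous:
        del index[key]
    return index


def reconcile(text, canonical_titles):
    """Return (rewritten_text, number_of_links_changed)."""
    titles = set(canonical_titles)
    index = build_index(titles)
    changed = 0

    def fix(match):
        nonlocal changed
        target = match.group(1)
        anchor = match.group(2) or ""
        label = match.group(3) or ""
        if target in titles:
            return match.group(0)  # already canonical
        canonical = index.get(normalize(target))
        if canonical is None:
            return match.group(0)  # no unique match: leave the link dangling
        changed += 1
        return f"[[{canonical}{anchor}{label}]]"

    return WIKILINK.sub(fix, text), changed
```

Under these assumptions, `reconcile("[[Decision-Theory]]", ["Decision Theory"])` rewrites the link to `[[Decision Theory]]`, while a link whose normalized form collides with two different titles is left untouched rather than guessed.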
@@ -175,10 +175,10 @@ def sample_token(logits, temperature=0.7, top_p=0.9):
 **Default**: Thompson Sampling. It is a strong empirical performer and simple to implement.

 ## 🔗 Graph
- Parents: [[Reinforcement-Learning]] · [[Decision-Theory]]
+- Parents: [[Reinforcement-Learning]] · [[Decision Theory]]
 - Variants: [[Multi-Armed-Bandit]]
 - Applications: [[Recommender-Systems]] · [[Hyperparameters|Hyperparameter-Tuning]] · [[MCTS]]
- Adjacent: [[Bayesian-Optimization]] · [[Active-Learning]] · [[LLM-Sampling]]
+- Adjacent: [[Bayesian-Optimization]] · [[Active Learning]] · [[LLM-Sampling]]

 ## 🤖 LLM Usage
 **When**: sequential decisions with reward feedback. Cold-start recommenders. Generalizing A/B tests to multiple arms.