Wiki cleanup: error-doc removal, dedup merge, link normalization
10_Wiki/Topics 대규모 정리: - 오류 캡처/미완성 stub 문서 227개 제거 - 교차폴더 중복 43클러스터 병합 (63파일 → redirect) - 링크명 정규화: 깨진 링크 수정·redirect 직결·개념 매핑 ~2,400건 - 카테고리 MOC 6개 신규 생성 - Graph 섹션 미해결 related-keyword 링크 10,058건 제거 Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
This commit is contained in:
@@ -241,9 +241,9 @@ def calibrate_reward(expected, actual):
|
||||
**기본값**: 매 PPO / SAC + 매 distributional / replay. 매 brain-inspired = Dreamer.
|
||||
|
||||
## 🔗 Graph
|
||||
- 부모: [[Reinforcement-Learning]] · [[Computational-Neuroscience]]
|
||||
- 변형: [[TD-Learning]] · [[Q-Learning]] · [[Distributional-RL]] · [[Meta-RL]] · [[Successor-Representation]]
|
||||
- 응용: [[Dreamer]] · [[MuZero]] · [[AlphaGo]] · [[Disease-Modeling]]
|
||||
- 부모: [[Reinforcement-Learning]] · [[Computational-Neuroscience-RL|Computational-Neuroscience]]
|
||||
- 변형: [[TD-Learning]] · [[Distributional-RL]] · [[Meta-RL]]
|
||||
- 응용: [[Disease-Modeling]]
|
||||
- Adjacent: [[Bayesian-Brain-Hypothesis]] · [[Biological-Intelligence]] · [[Addiction-Neuroscience]] · [[Brain-Derived Neurotrophic Factor (BDNF)]]
|
||||
- Concept: [[Reward-Prediction-Error]] · [[Dopamine]] · [[Basal-Ganglia]]
|
||||
|
||||
|
||||
Reference in New Issue
Block a user