Wiki cleanup: error-doc removal, dedup merge, link normalization
10_Wiki/Topics 대규모 정리: - 오류 캡처/미완성 stub 문서 227개 제거 - 교차폴더 중복 43클러스터 병합 (63파일 → redirect) - 링크명 정규화: 깨진 링크 수정·redirect 직결·개념 매핑 ~2,400건 - 카테고리 MOC 6개 신규 생성 - Graph 섹션 미해결 related-keyword 링크 10,058건 제거 Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
This commit is contained in:
@@ -270,9 +270,9 @@ def find_max_activating(sae, feature_idx, dataset, top_k=10):
|
||||
|
||||
## 🔗 Graph
|
||||
- 부모: [[Interpretability]] · [[AI-Safety]] · [[Mechanistic-Interpretability]]
|
||||
- 변형: [[Activation-Patching]] · [[Path-Patching]] · [[Logit-Lens]] · [[ACDC]] · [[SAE]]
|
||||
- 응용: [[Steering]] · [[Feature-Visualization]] · [[Induction-Head]] · [[IOI-Circuit]]
|
||||
- Adjacent: [[Anthropic]] · [[OpenAI]] · [[AI-Alignment]] · [[TransformerLens]] · [[Neuronpedia]]
|
||||
- 변형: [[Activation-Patching]] · [[Path-Patching]] · [[ACDC]]
|
||||
- 응용: [[Steering]] · [[Induction-Head]]
|
||||
- Adjacent: [[Anthropic]] · [[AI_Safety_and_Alignment|AI-Alignment]]
|
||||
|
||||
## 🤖 LLM 활용
|
||||
**언제**: 매 alignment research. 매 model debugging. 매 capability discovery. 매 steering. 매 trust evaluation.
|
||||
@@ -289,7 +289,7 @@ def find_max_activating(sae, feature_idx, dataset, top_k=10):
|
||||
## 🧪 검증 / 중복
|
||||
- Verified (Anthropic transformer-circuits.pub, Olsson induction heads, Wang IOI, ACDC paper).
|
||||
- 신뢰도 A.
|
||||
- Related: [[AI-Alignment]] · [[AI-Safety]] · [[Anthropic-Principle]] · [[Sparse-Autoencoder]].
|
||||
- Related: [[AI_Safety_and_Alignment|AI-Alignment]] · [[AI-Safety]] · [[Anthropic-Principle]] · [[Sparse-Autoencoder]].
|
||||
|
||||
## 🕓 Changelog
|
||||
| 날짜 | 변경 |
|
||||
|
||||
Reference in New Issue
Block a user