[G1-Sync] Manual knowledge update

2026-04-30 22:42:02 +09:00
parent 0bd4f19e38
commit c36c0644a1
4888 changed files with 18470 additions and 18602 deletions
@@ -2,7 +2,7 @@
 id: NLP-LDA-001
 category: "10_Wiki/💡 Topics/AI"
 confidence_score: 1.0
-tags: [nlp, machine-learning, lda, topic-modeling, unsupervised-learning, probability]
+tags: [nlp, machine-learning, lda, topic-modeling, un[[Supervised-Learning]], probability]
 last_reinforced: 2026-04-26
 ---

@@ -19,9 +19,9 @@ last_reinforced: 2026-04-26
 - **의의:** 사람이 일일이 읽지 않아도 수천만 건의 문서에서 주요 화두(Topic)를 자동으로 추출하여 지식을 체계화할 수 있게 함.

 ## ⚠️ 모순 및 업데이트 (Contradictions & RL Update)
- **과거 데이터와의 충돌:** 문맥을 무시하는 단어 빈도 중심의 한계를 넘기 위해, 최근에는 문장의 의미를 임베딩 벡터로 파악하는 BERT 기반의 토픽 모델링(BERTopic)과 결합하여 정밀도를 높이는 추세.
+- **과거 데이터와의 충돌:** 문맥을 무시하는 단어 빈도 중심의 한계를 넘기 위해, 최근에는 문장의 의미를 임베딩 벡터로 파악하는 [[BERT]] 기반의 토픽 모델링(BERTopic)과 결합하여 정밀도를 높이는 추세.
 - **정책 변화:** Antigravity 프로젝트는 `00_Raw`에 유입되는 대규모 텍스트 데이터를 1차 분류할 때 LDA를 활용하여 위키의 어떤 카테고리에 배정할지 결정하는 클러스터링 보조 도구로 사용함.

 ## 🔗 지식 연결 (Graph)
- [[Latent-Semantic-Analysis-LSA]], Unsupervised-Learning-Foundations, NLP-Foundations, [[Exploratory-Data-Analysis]]
+- [[Latent-Semantic-[[Analysis]]-LSA]], Un[[Supervised-Learning-Foundations]], NLP-Foundations, [[Exploratory-Data-Analysis]]
 - **Raw Source:** 10_Wiki/Topics/AI/Latent-Dirichlet-Allocation.md