id, title, category, status, canonical_id, aliases, duplicate_of, source_trust_level, confidence_score, verification_status, tags, raw_sources, last_reinforced, github_commit, tech_stack
id
title
category
status
canonical_id
aliases
duplicate_of
source_trust_level
confidence_score
verification_status
tags
raw_sources
last_reinforced
github_commit
tech_stack
wiki-2026-0508-adaptive-curation
Adaptive Curation
10_Wiki/Topics
verified
self
Dynamic Curation
Personalized Curation
Active Curation
none
A
0.88
applied
recsys
curation
personalization
active-learning
feedback
2026-05-10
pending
language
framework
Python
PyTorch/scikit-learn
Adaptive Curation
매 한 줄
"매 selection 의 close-loop — 매 user feedback 의 corpus 의 reshape." . Adaptive curation 의 static collection 의 X, 매 user signal (click, dwell, rating, embedding drift) 의 use → 매 corpus / ranking / recommendation 의 dynamic adjust. 매 2026 의 LLM-augmented (semantic embedding + bandits + RLHF) 의 standard.
매 핵심
매 Components
Catalog / corpus : candidate items.
User signal : explicit (rating, like) + implicit (click, dwell, scroll depth).
Ranker / selector : scoring function (often embedding sim + bandit + LTR).
Update loop : feedback → model update → next selection.
매 Algorithms
Collaborative filtering : matrix factorization (SVD, ALS).
Content-based : TF-IDF / semantic embedding (sentence-transformers, OpenAI ada-3, Cohere v4).
Hybrid : 매 collaborative + content.
Bandits : ε-greedy, UCB, Thompson sampling — 매 explore/exploit.
Contextual bandit / LinUCB : user feature 의 use.
RLHF / DPO : 매 LLM-era curation (Claude, GPT-5).
매 응용
News feed (TikTok, X).
E-commerce product ranking (Amazon, Coupang).
Knowledge base curation (Notion AI, Glean).
RAG corpus filtering — 매 LLM context 의 dynamic selection.
💻 패턴
Pattern 1 — Embedding-based candidate retrieval
Pattern 2 — Thompson Sampling bandit
Pattern 3 — Click-through online update
Pattern 4 — Contextual ranker (LightGBM LTR)
Pattern 5 — RAG with adaptive filter
매 결정 기준
상황
Approach
Cold start, no user data
Content-based + popularity prior
Rich interaction logs
Hybrid + LTR
Real-time exploration
Thompson / LinUCB
LLM context curation
Embedding + cross-encoder rerank
Long-tail discovery
UCB exploration boost
기본값 : embedding retrieval + cross-encoder rerank + Thompson exploration.
🔗 Graph
🤖 LLM 활용
언제 : dynamic corpus, user feedback available, explore/exploit tradeoff matters, RAG context selection.
언제 X : static catalog (use plain ranking), no feedback (cold start dominates), regulated content (use rule-based).
❌ 안티패턴
Filter bubble : pure exploit 의 user 의 narrow exposure.
Feedback contamination : bot click 의 model 의 poison.
No exploration decay : ε constant — 매 mature system 의 ε ↓.
Position bias ignore : top item 의 click 의 inflate — debiasing essential.
🧪 검증 / 중복
Verified (Netflix tech blog, TikTok recsys papers, RecSys 2024 proceedings).
신뢰도 A.
🕓 Changelog
날짜
변경
2026-05-08
Phase 1
2026-05-10
Manual cleanup — FULL content (bandits, LTR, RAG patterns)