"매 mind 의 잠정적 결론". 매 evidence 의 objective ↔ subjective 의 confidence. 매 action 의 trigger. 매 AI 의 응용 — 매 agent 의 belief state, 매 RAG 의 trust scoring, 매 confirmation bias 의 detect.
📖 핵심
매 정의 (philosophical)
Belief: 매 proposition 의 true 의 mental acceptance.
Knowledge: 매 Justified True Belief (Plato).
Gettier problem: JTB 가 X 의 case (Gettier 1963).
→ 매 knowledge 의 stricter (no luck / safety / sensitivity).
매 belief 의 type
Occurrent: 매 active conscious thought.
Dispositional: 매 stored, 매 retrieve 매 ready.
De dicto vs de re: 매 about-words vs about-thing.
Implicit / explicit: 매 articulate-able.
매 belief revision (AGM)
Expansion: 매 add (no conflict).
Contraction: 매 remove.
Revision: 매 add + remove 매 conflicting.
Postulates: 매 closure, success, consistency, ...
Bayesian belief
매 belief = 매 probability (degree of confidence).
매 update via Bayes (Cox theorem).
매 coherent.
매 modern AI 의 standard.
매 cognitive bias (belief 관련)
Confirmation bias: 매 belief 의 confirm 의 selective.
Belief perseverance: 매 disconfirming evidence 후 의 retain.
Backfire effect: 매 disconfirming evidence 의 strengthen.
Sunk cost: 매 commitment 의 belief 의 maintain.
Motivated reasoning: 매 want 의 believe.
매 AI / agent 의 응용
Belief state (POMDP)
매 partially observable.
매 belief = 매 distribution over state.
매 action 의 belief 의 update.
RAG trust score
매 retrieved document 의 belief.
매 confidence = recency × authority × consistency.
Multi-agent BDI (Belief-Desire-Intention)
매 belief: world state.
매 desire: goal.
매 intention: committed plan.
매 PRS, JADE.
LLM 의 belief
매 train 의 belief 의 instillation.
매 RLHF 의 alignment.
매 calibration: 매 P(true) 의 actual frequency.
매 epistemic logic
매 K_a φ: 매 agent a 의 knows φ.
매 B_a φ: 매 belief.
매 multi-agent: 매 common knowledge.
매 Aumann's agreement theorem: 매 rational 의 동의.
💻 패턴 (응용)
Bayesian belief update
defupdate_belief(prior,likelihood_true,likelihood_false,evidence):# P(H | E) = P(E | H) * P(H) / P(E)posterior_unnorm=likelihood_true*priorevidence_prob=likelihood_true*prior+likelihood_false*(1-prior)returnposterior_unnorm/evidence_probbelief=0.3# 매 priorbelief=update_belief(belief,0.9,0.2,evidence=True)# 매 0.66belief=update_belief(belief,0.9,0.2,evidence=True)# 매 0.90
classBDIAgent:def__init__(self):self.beliefs={}# 매 facts about worldself.desires=[]# 매 goalsself.intentions=[]# 매 active plansdefperceive(self,observations):forobsinobservations:self.beliefs[obs.key]=obs.valuedefdeliberate(self):# 매 desire selection based on belieffeasible=[dfordinself.desiresifself.is_feasible(d)]returnmax(feasible,key=lambdad:d.priority)defplan(self,goal):# 매 belief 기반 의 planreturnplanner.plan(self.beliefs,goal)defexecute(self):ifnotself.intentions:goal=self.deliberate()self.intentions=self.plan(goal)action=self.intentions.pop(0)returnaction
LLM calibration
defcalibration_check(model,eval_set):# 매 P(true) 의 declared confidence vs actualbins=[(0,0.1),(0.1,0.2),...,(0.9,1.0)]bin_correct={b:[]forbinbins}forexampleineval_set:response=model.generate(example.prompt+' Reply with answer and confidence (0-1).')ans,conf=parse(response)actual=(ans==example.expected)forbinbins:ifb[0]<=conf<b[1]:bin_correct[b].append(actual)break# 매 ECE (Expected Calibration Error)ece=sum(abs(np.mean(corr)-(b[0]+b[1])/2)*len(corr)/len(eval_set)forb,corrinbin_correct.items()ifcorr)returnece
→ 매 well-calibrated = ECE 낮음.
Confirmation bias detector
defdetect_confirmation_bias(query,results,user_belief):# 매 user 의 belief 의 align 의 source 만 의 click?aligning=[rforrinresultsifr.aligns_with(user_belief)]clicked_aligning=sum(1forrinaligningifr.clicked)clicked_total=sum(1forrinresultsifr.clicked)ifclicked_total==0:returnNonebias_ratio=clicked_aligning/clicked_totalreturnbias_ratio# 매 > 0.7 = 매 strong confirmation bias
🤔 결정 기준
응용
Approach
Agent world model
POMDP belief
RAG trust
Source authority + consistency
Multi-agent
BDI
LLM calibration
ECE + temperature scaling
User UX
Diverse perspective + bias detect
Knowledge graph
Justified belief (provenance)
기본값: Bayesian belief + ECE calibration + diverse evidence.
언제: 매 agent design (belief state). 매 RAG trust scoring. 매 LLM calibration eval. 매 bias detection.
언제 X: 매 metaphysical claim 의 substitute. 매 single belief 의 deterministic system.
❌ 안티패턴
Belief 의 binary: 매 confidence 의 lose.
No update: 매 stale belief.
Confirmation bias 의 ignore: 매 echo chamber.
Calibration 무시: 매 over-confident model.
Multiple agent 의 belief 의 share assumption: 매 multi-agent fail.