"매 data → 매 truth 의 inference". Statistics 의 uncertainty 하의 decision-making — descriptive (summary), inferential (population from sample), causal (intervention) 의 3 axes. 17C Pascal/Fermat 에서 시작, 2026 에 frequentist + Bayesian + ML 의 hybrid 의 표준.
매 핵심
매 3 axes
Descriptive: mean, median, SD, distribution, viz. 매 "what is".
Inferential: confidence interval, hypothesis test, p-value, posterior. 매 "what could be".
Causal: RCT, propensity, IV, DAG. 매 "what if".
매 frequentist vs Bayesian
Frequentist: probability = 매 long-run frequency. 매 parameters fixed, data random. p-value, CI.
Bayesian: probability = 매 belief degree. 매 parameters random (prior + likelihood → posterior). credible interval.
매 ML community 의 mostly Bayesian-ish (regularization = prior).
매 inference workflow
Question → estimand 정의.
Design → sample, randomization, power.
Data collection.
Model — distribution / link function.
Estimate — MLE, MAP, posterior.
Inference — CI / credible interval, test.
Critique — diagnostics, sensitivity, replication.
매 응용
A/B testing — frequentist or Bayesian decision.
Survey / poll — sampling design.
Clinical trial — RCT, survival analysis.
ML model evaluation — bootstrap, cross-validation.
fromsklearn.linear_modelimportLogisticRegressionps=LogisticRegression().fit(X,treatment).predict_proba(X)[:,1]# 매 nearest neighbor matching on logit(ps) → ATT estimate
언제: choosing appropriate test, interpreting output, explaining p-value/CI to non-technical, drafting analysis plan.
언제 X: 매 numerical computation — scipy/statsmodels/PyMC 의 사용.
❌ 안티패턴
p < 0.05 = 매 truth: 매 decision threshold 의 X, just evidence — replication, effect size, prior 의 고려.
Correlation = causation: 매 흔한 fallacy — confounding, reverse causality, selection.
Ignoring assumptions: t-test 의 normality, regression 의 linearity → 매 diagnostic 의 필수.
Cherry-picking subgroups: 매 fishing → false positive 의 폭증.
Reporting only mean (skewed data): median, IQR 의 추가.
Sample size 의 unspecified: 매 power 의 미상.
🧪 검증 / 중복
Verified (Wasserman All of Statistics, Gelman BDA3, ASA p-value statement 2016).