--- id: P-REINFORCE-AI-ABTEST category: "[[10_Wiki/πŸ’‘ Topics/AI]]" confidence_score: 0.96 tags: [A/B Testing, Statistics, Experiment, Growth Hacking] last_reinforced: 2026-04-20 --- # [[A_B-Testing-Platforms]] (A/B ν…ŒμŠ€νŠΈ 및 μ‹€ν—˜ 섀계) ## πŸ“Œ ν•œ 쀄 톡찰 (The Karpathy Summary) > "λ‚΄ 생각엔 이게 μ’‹λ‹€"λŠ” 주관성을 버리고, "μ‚¬μš©μžλŠ” μ‹€μ œλ‘œ μ΄λ ‡κ²Œ λ°˜μ‘ν•œλ‹€"λ₯Ό ν†΅κ³„μ μœΌλ‘œ 증λͺ…ν•˜λŠ” λ§ˆμΌ€νŒ…κ³Ό μ—”μ§€λ‹ˆμ–΄λ§μ˜ 결합체닀. ## πŸ“– κ΅¬μ‘°ν™”λœ 지식 (Synthesized Content) - **Hypothesis Testing (κ°€μ„€ 검증)**: - "λ²„νŠΌ 색상을 νŒŒλž€μƒ‰μ—μ„œ λΉ¨κ°„μƒ‰μœΌλ‘œ λ°”κΎΈλ©΄ 클릭λ₯ (CTR)이 10% 였λ₯Ό 것이닀"λΌλŠ” λͺ…ν™•ν•œ 가섀을 μ„Έμš°κ³  μ‹€ν—˜κ΅°(A)κ³Ό λŒ€μ‘°κ΅°(B)으둜 νŠΈλž˜ν”½μ„ λΆ„ν• ν•œλ‹€. - **Statistical Significance (p-value)**: - μ‹€ν—˜ κ²°κ³Όκ°€ 'μš°μ—°'에 μ˜ν•œ 것인지 μ•„λ‹ˆλ©΄ 'μ˜λ„λœ λ³€ν™”'인지 νŒλ³„ν•œλ‹€. 보톡 p-value < 0.05λ₯Ό κΈ°μ€€μœΌλ‘œ μœ μ˜λ―Έν•¨μ„ κ²°μ •ν•œλ‹€. - **Multi-armed Bandit (MAB)**: - μ‹€ν—˜ 쀑간에 성적이 쒋은 μͺ½μ— νŠΈλž˜ν”½μ„ μ‹€μ‹œκ°„μœΌλ‘œ 더 λ°°λΆ„ν•˜μ—¬ 'μ‹€ν—˜ λΉ„μš©'을 μ΅œμ†Œν™”ν•˜κ³  '수읡'을 κ·ΉλŒ€ν™”ν•˜λŠ” κ³ λ„ν™”λœ νƒ€κ²ŸνŒ… μ•Œκ³ λ¦¬μ¦˜. ## ⚠️ λͺ¨μˆœ 및 μ—…λ°μ΄νŠΈ (RL Update) - ν•œ λ²ˆμ— λ„ˆλ¬΄ λ§Žμ€ λ³€μˆ˜λ₯Ό λ°”κΎΈλŠ” 것은 κΈˆλ¬Όμ΄λ‹€(Simpsons Paradox). 였직 ν•˜λ‚˜μ˜ λ³€μΈλ§Œ ν†΅μ œν•˜μ—¬ 결과의 인과관계λ₯Ό λͺ…ν™•νžˆ ν•΄μ•Ό ν•œλ‹€. λ˜ν•œ μž₯기적 영ν–₯(Late Arrival Bias)을 κ³ λ €ν•˜μ—¬ μ΅œμ†Œ 일주일 μ΄μƒμ˜ μ‹€ν—˜ 기간을 ν™•λ³΄ν•˜λΌ. ## πŸ”— 지식 μ—°κ²° (Graph) - Related: [[Behavioral-Economics]] , [[Nudge Theory]] - Implementation: [[React_State_Management_Strategy]]