--- id: MATH-STAT-TEST-001 category: "10_Wiki/๐Ÿ’ก Topics/AI" confidence_score: 1.0 tags: [math, statistics, hypothesis-testing, p-value, null-hypothesis, alternative-hypothesis, significance-level] last_reinforced: 2026-04-26 --- # Statistical Hypothesis Testing (ํ†ต๊ณ„์  ๊ฐ€์„ค ๊ฒ€์ •) ## ๐Ÿ“Œ ํ•œ ์ค„ ํ†ต์ฐฐ (The Karpathy Summary) > "๋ฐ์ดํ„ฐ๋ผ๋Š” ์ฆ๊ฑฐ๋ฅผ ํ† ๋Œ€๋กœ '์šฐ์—ฐํ•œ ์ผ์น˜'์ธ์ง€ 'ํ•„์—ฐ์  ์‚ฌ์‹ค'์ธ์ง€ ํŒ๊ฒฐ์„ ๋‚ด๋ฆฌ๊ณ , ์—„๊ฒฉํ•œ ํ™•๋ฅ ์  ์žฃ๋Œ€(P-value)๋ฅผ ํ†ตํ•ด ์ง€์‹์˜ ํƒ€๋‹น์„ฑ์„ ์ž…์ฆํ•˜๋ผ" โ€” ํ‘œ๋ณธ ๋ฐ์ดํ„ฐ๋ฅผ ํ†ตํ•ด ๋ชจ์ง‘๋‹จ์— ๋Œ€ํ•œ ๊ฐ€์„ค์ด ํ†ต๊ณ„์ ์œผ๋กœ ์œ ์˜๋ฏธํ•œ์ง€ ํŒ๋‹จํ•˜๋Š” ์ฒด๊ณ„์ ์ธ ์˜์‚ฌ๊ฒฐ์ • ํ”„๋กœ์„ธ์Šค. ## ๐Ÿ“– ๊ตฌ์กฐํ™”๋œ ์ง€์‹ (Synthesized Content) - **์ถ”์ถœ๋œ ํŒจํ„ด:** "Conflict-based Decision and Probability of Coincidence" โ€” 'ํšจ๊ณผ๊ฐ€ ์—†๋‹ค'๋Š” ๊ท€๋ฌด๊ฐ€์„ค(Null Hypothesis)์„ ์„ธ์šฐ๊ณ , ์‹ค์ œ ๋ฐ์ดํ„ฐ๊ฐ€ ๋‚˜ํƒ€๋‚  ํ™•๋ฅ ์„ ๊ณ„์‚ฐํ•˜์—ฌ ๊ทธ ํ™•๋ฅ ์ด ๋งค์šฐ ๋‚ฎ๋‹ค๋ฉด(์œ ์˜ ์ˆ˜์ค€ ๋ฏธ๋‹ฌ) ๊ท€๋ฌด๊ฐ€์„ค์„ ๊ธฐ๊ฐํ•˜๊ณ  ๋Œ€๋ฆฝ๊ฐ€์„ค(Alternative Hypothesis)์„ ์ฑ„ํƒํ•˜๋Š” ํŒจํ„ด. - **ํ•ต์‹ฌ ๊ตฌ์„ฑ ์š”์†Œ:** - **Null Hypothesis ($H_0$):** ํ˜„์žฌ์˜ ์ง€์‹์ด๋‚˜ ์ฐจ์ด๊ฐ€ ์—†๋‹ค๋Š” ๊ฐ€์ •. - **Alternative Hypothesis ($H_1$):** ์ž…์ฆํ•˜๊ณ  ์‹ถ์€ ์ƒˆ๋กœ์šด ์‚ฌ์‹ค์ด๋‚˜ ์ฐจ์ด๊ฐ€ ์žˆ๋‹ค๋Š” ๊ฐ€์ •. - **P-value:** ๊ท€๋ฌด๊ฐ€์„ค์ด ๋งž์„ ๋•Œ, ๊ด€์ธก๋œ ๋ฐ์ดํ„ฐ๊ฐ€ ๋‚˜ํƒ€๋‚  ํ™•๋ฅ . ๋‚ฎ์„์ˆ˜๋ก ๊ฐ€์„ค ๊ธฐ๊ฐ์˜ ๊ทผ๊ฑฐ๊ฐ€ ๋จ. - **Significance Level ($\alpha$):** ๊ธฐ๊ฐ ์—ฌ๋ถ€๋ฅผ ๊ฒฐ์ •ํ•˜๋Š” ๊ธฐ์ค€๊ฐ’ (์ฃผ๋กœ 0.05). - **์˜์˜:** ์ฃผ๊ด€์  ํŒ๋‹จ์„ ๋ฐฐ์ œํ•˜๊ณ  ๊ฐ๊ด€์  ์ˆ˜์น˜์— ๊ทผ๊ฑฐํ•˜์—ฌ ๊ณผํ•™์  ๋ฐœ๊ฒฌ, ์‹ ์•ฝ์˜ ํšจ๋Šฅ, ๋งˆ์ผ€ํŒ… ์ „๋žต์˜ ์„ฑ๊ณต ์—ฌ๋ถ€ ๋“ฑ์„ ํ™•์ • ์ง“๋Š” ๋ฐ์ดํ„ฐ ๊ณผํ•™์˜ ํ•ต์‹ฌ ์–ธ์–ด. ## โš ๏ธ ๋ชจ์ˆœ ๋ฐ ์—…๋ฐ์ดํŠธ (Contradictions & RL Update) - **๊ณผ๊ฑฐ ๋ฐ์ดํ„ฐ์™€์˜ ์ถฉ๋Œ:** ๋‹จ์ˆœํžˆ P-value๊ฐ€ 0.05๋ณด๋‹ค ์ž‘์œผ๋ฉด ์„ฑ๊ณต์ด๋ผ๋Š” 'P-hacking'์˜ ์œ„ํ—˜์„ฑ์ด ์ œ๊ธฐ๋˜๋ฉด์„œ, ์ด์ œ๋Š” ํšจ๊ณผ ํฌ๊ธฐ(Effect Size)์™€ ์‹ ๋ขฐ ๊ตฌ๊ฐ„(Confidence Interval)์„ ๋ณ‘ํ–‰ํ•˜์—ฌ ์‹ค์งˆ์ ์ธ ์˜๋ฏธ๋ฅผ ๋ถ„์„ํ•˜๋Š” ๊ฒƒ์ด ๊ธ€๋กœ๋ฒŒ ์—ฐ๊ตฌ ํ‘œ์ค€์ด ๋จ. - **์ •์ฑ… ๋ณ€ํ™”:** Antigravity ํ”„๋กœ์ ํŠธ๋Š” ์—์ด์ „ํŠธ์˜ ์ƒˆ๋กœ์šด ์ถ”๋ก  ์•Œ๊ณ ๋ฆฌ์ฆ˜ ๋„์ž… ์‹œ, ๊ธฐ์กด ์•Œ๊ณ ๋ฆฌ์ฆ˜๊ณผ์˜ ํ’ˆ์งˆ ์ฐจ์ด๊ฐ€ ํ†ต๊ณ„์ ์œผ๋กœ ์œ ์˜๋ฏธํ•œ์ง€ ์—„๊ฒฉํ•œ ๊ฐ€์„ค ๊ฒ€์ •(A/B Test) ๊ณผ์ •์„ ๊ฑฐ์ณ ๊ฒ€์ฆํ•จ. ## ๐Ÿ”— ์ง€์‹ ์—ฐ๊ฒฐ (Graph) - [[Statistical-Power]], [[Standard-Deviation-and-Variance]], [[Performance-Metrics-in-AI]], A-B-Testing-Foundations - **Raw Source:** 10_Wiki/Topics/AI/Statistical-Hypothesis-Testing.md