--- id: MATH-HYPO-001 category: "10_Wiki/๐Ÿ’ก Topics/AI" confidence_score: 1.0 tags: [statistics, math, hypothesis-testing, p-value, data-science] last_reinforced: 2026-04-26 --- # Hypothesis Testing (๊ฐ€์„ค ๊ฒ€์ •) ## ๐Ÿ“Œ ํ•œ ์ค„ ํ†ต์ฐฐ (The Karpathy Summary) > "๋ฐ์ดํ„ฐ์˜ ์ฆ๊ฑฐ๋ฅผ ํ† ๋Œ€๋กœ, ์šฐ๋ฆฌ๊ฐ€ ๋ฏฟ๊ณ  ์‹ถ์€ ๊ฐ€์„ค์ด '์šฐ์—ฐ'์ด๋ผ๋Š” ํ•จ์ •์— ๋น ์ง€์ง€ ์•Š์•˜๋Š”์ง€ ์—„๊ฒฉํ•˜๊ฒŒ ์‹ฌํŒํ•˜๋ผ" โ€” ํ‘œ๋ณธ ๋ฐ์ดํ„ฐ๋ฅผ ํ†ตํ•ด ๋ชจ์ง‘๋‹จ์˜ ํŠน์„ฑ์— ๋Œ€ํ•œ ๊ฐ€์„ค์ด ํ†ต๊ณ„์ ์œผ๋กœ ํƒ€๋‹นํ•œ์ง€ ๊ณ„์‚ฐํ•˜์—ฌ, ์˜์‚ฌ๊ฒฐ์ •์˜ ๋ถˆํ™•์‹ค์„ฑ์„ ์ˆ˜์น˜ํ™”๋œ ์‹ ๋ขฐ๋„๋กœ ์น˜ํ™˜ํ•˜๋Š” ๋ฐฉ๋ฒ•๋ก . ## ๐Ÿ“– ๊ตฌ์กฐํ™”๋œ ์ง€์‹ (Synthesized Content) - **์ถ”์ถœ๋œ ํŒจํ„ด:** "๊ท€๋ฌด๊ฐ€์„ค(Null Hypothesis)"์ด๋ผ๋Š” ๊ธฐ๋ณธ ์ „์ œ๋ฅผ ์„ธ์šฐ๊ณ , ๊ด€์ธก๋œ ๋ฐ์ดํ„ฐ๊ฐ€ ์ด ์ „์ œ ํ•˜์—์„œ ๋ฐœ์ƒํ•  ํ™•๋ฅ (p-value)์ด ๋งค์šฐ ๋‚ฎ๋‹ค๋ฉด ๊ท€๋ฌด๊ฐ€์„ค์„ ๊ธฐ๊ฐํ•˜๊ณ  "๋Œ€๋ฆฝ๊ฐ€์„ค(Alternative Hypothesis)"์„ ์ฑ„ํƒํ•˜๋Š” ๋…ผ๋ฆฌ์  ์ถ”๋ก  ํŒจํ„ด. - **ํ•ต์‹ฌ ์š”์†Œ:** - **Null Hypothesis ($H_0$):** ํšจ๊ณผ๋‚˜ ์ฐจ์ด๊ฐ€ ์—†๋‹ค๋Š” ๊ธฐ๋ณธ ๊ฐ€์„ค. - **Alternative Hypothesis ($H_1$):** ์ฆ๋ช…ํ•˜๊ณ  ์‹ถ์€ ํšจ๊ณผ๋‚˜ ์ฐจ์ด๊ฐ€ ์žˆ๋‹ค๋Š” ๊ฐ€์„ค. - **P-value:** ๊ท€๋ฌด๊ฐ€์„ค์ด ๋งž๋‹ค๋Š” ์ „์ œ ํ•˜์— ํ˜„์žฌ ๋ฐ์ดํ„ฐ๊ฐ€ ๊ด€์ธก๋  ํ™•๋ฅ . ๋ณดํ†ต 0.05 ๋ฏธ๋งŒ์ผ ๋•Œ ์œ ์˜๋ฏธํ•˜๋‹ค๊ณ  ํŒ๋‹จ. - **Type I & II Error:** ๋งž๋Š”๋ฐ ํ‹€๋ฆฌ๋‹ค๊ณ  ํ•˜๊ฑฐ๋‚˜(Alpha), ํ‹€๋ฆฐ๋ฐ ๋งž๋‹ค๊ณ  ํ•˜๋Š”(Beta) ์˜ค๋ฅ˜์˜ ๊ด€๋ฆฌ. - **์˜์˜:** ์ฃผ๊ด€์ ์ธ ํŒ๋‹จ์„ ๋ฐฐ์ œํ•˜๊ณ , ๊ฐ๊ด€์ ์ธ ์ง€ํ‘œ๋ฅผ ํ†ตํ•ด ๋ณ€ํ™”์˜ ์‹คํšจ์„ฑ์„ ์ฆ๋ช…ํ•˜๋Š” ๋ฐ์ดํ„ฐ ๊ณผํ•™์˜ ๊ทผ๊ฐ„. ## โš ๏ธ ๋ชจ์ˆœ ๋ฐ ์—…๋ฐ์ดํŠธ (Contradictions & RL Update) - **๊ณผ๊ฑฐ ๋ฐ์ดํ„ฐ์™€์˜ ์ถฉ๋Œ:** p-value์—๋งŒ ๊ณผ๋„ํ•˜๊ฒŒ ์˜์กดํ•˜๋Š” 'P-hacking'์˜ ์œ„ํ—˜์„ฑ์„ ๊ฒฝ๊ณ ํ•˜๋ฉฐ, ์ตœ๊ทผ์—๋Š” ํšจ๊ณผ ํฌ๊ธฐ(Effect Size)์™€ ๋ฒ ์ด์ง€์•ˆ ๊ฐ€์„ค ๊ฒ€์ •์„ ๋ณ‘ํ–‰ํ•˜๋Š” ๋ฐฉํ–ฅ์œผ๋กœ ์ •๋ฐ€๋„ ๊ฐ•ํ™”. - **์ •์ฑ… ๋ณ€ํ™”:** Antigravity ํ”„๋กœ์ ํŠธ๋Š” ์ƒˆ๋กœ์šด ์—์ด์ „ํŠธ ์•Œ๊ณ ๋ฆฌ์ฆ˜ ๋„์ž… ์‹œ, ๊ธฐ์กด ์•Œ๊ณ ๋ฆฌ์ฆ˜๊ณผ์˜ ์„ฑ๋Šฅ ์ฐจ์ด๋ฅผ ๊ฐ€์„ค ๊ฒ€์ •์„ ํ†ตํ•ด ํ†ต๊ณ„์ ์œผ๋กœ ์ฆ๋ช…ํ•œ ํ›„ ๋ฐฐํฌ๋ฅผ ๊ฒฐ์ •ํ•˜๋Š” 'Evidence-based Deployment' ์›์น™์„ ์ค€์ˆ˜ํ•จ. ## ๐Ÿ”— ์ง€์‹ ์—ฐ๊ฒฐ (Graph) - Probability-Theory, [[Exploratory-Data-Analysis]], A-B-Testing-Foundations, Decision-Making - **Raw Source:** 10_Wiki/Topics/AI/Hypothesis-Testing.md