--- id: wiki-2026-0508-statistics-data-analysis title: "Statistics & Data Analysis" category: 10_Wiki/Topics status: needs_review canonical_id: self aliases: [P-Reinforce-AUTO-SADA-001] duplicate_of: none source_trust_level: A confidence_score: 0.98 tags: [auto-reinforced, Statistics, data-Analysis, Hypothesis-Testing, data-science] raw_sources: [] last_reinforced: 2026-04-20 github_commit: pending inferred_by: Claude Opus 4.7 (auto-normalize 2026-05-08) tech_stack: language: unspecified framework: unspecified --- # [[Statistics & Data Analysis|Statistics & Data Analysis]] ## ๐Ÿ“Œ ํ•œ ์ค„ ํ†ต์ฐฐ (The Karpathy Summary) > "๋ฐ์ดํ„ฐ์˜ ๋…ธ์ด์ฆˆ๋ฅผ ๋šซ๊ณ  ์ง„์‹ค์„ ๋ณด๋Š” ๋ˆˆ: ๋ถˆํ™•์‹ค์„ฑ ๊ฐ€๋“ํ•œ ์„ธ์ƒ์˜ ์ˆซ์ž๋“ค์„ ์ˆ˜์ง‘, ์ •๋ฆฌ, ๋ถ„์„ํ•˜์—ฌ ๋ณด์ด์ง€ ์•Š๋Š” ํŒจํ„ด์„ ๋ฐœ๊ฒฌํ•˜๊ณ  ๋…ผ๋ฆฌ์ ์ธ ์˜์‚ฌ๊ฒฐ์ •์˜ ๊ทผ๊ฑฐ๋ฅผ ๋งˆ๋ จํ•˜๋Š” ์ง€์  ๋ฌด๊ธฐ." ## ๐Ÿ“– ๊ตฌ์กฐํ™”๋œ ์ง€์‹ (Synthesized Content) ํ†ต๊ณ„ ๋ฐ ๋ฐ์ดํ„ฐ ๋ถ„์„(Statistics & Data Analysis)์€ ๋ฐ์ดํ„ฐ๋ฅผ ํ†ตํ•ด ํ˜„์ƒ์„ ์ดํ•ดํ•˜๊ณ  ์ถ”๋ก ํ•˜์—ฌ ๊ฐ€์น˜ ์žˆ๋Š” ํ†ต์ฐฐ(Insight)์„ ๋„์ถœํ•˜๋Š” ๊ณผํ•™์  ๋ฐฉ๋ฒ•๋ก ์ž…๋‹ˆ๋‹ค. 1. **3๋Œ€ ๋ถ„์„ ์˜์—ญ**: * **Descriptive (๊ธฐ์ˆ  ํ†ต๊ณ„)**: ๋ฐ์ดํ„ฐ๋ฅผ ์š”์•ฝํ•˜๊ณ  ํŠน์„ฑ์„ ๋ฌ˜์‚ฌ (ํ‰๊ท , ํ‘œ์ค€ํŽธ์ฐจ, ๋ถ„ํฌ ๋“ฑ). * **Inferential (์ถ”๋ก  ํ†ต๊ณ„)**: ํ‘œ๋ณธ์„ ํ†ตํ•ด ๋ชจ์ง‘๋‹จ์˜ ์„ฑ์งˆ์„ ์ถ”์ธกํ•˜๊ณ  ๊ฐ€์„ค์„ ๊ฒ€์ • (P-value, ์‹ ๋ขฐ๊ตฌ๊ฐ„). * **Predictive (์˜ˆ์ธก ๋ถ„์„)**: ๊ณผ๊ฑฐ ๋ฐ์ดํ„ฐ๋ฅผ ๋ฐ”ํƒ•์œผ๋กœ ๋จธ์‹ ๋Ÿฌ๋‹ ๋“ฑ์„ ํ™œ์šฉํ•ด ๋ฏธ๋ž˜ ๊ฒฐ๊ณผ ์˜ˆ์ธก. 2. **ํ•ต์‹ฌ ์›Œํฌํ”Œ๋กœ์šฐ**: * ์งˆ๋ฌธ ์ •์˜ -> ๋ฐ์ดํ„ฐ ์ˆ˜์ง‘ -> ์ „์ฒ˜๋ฆฌ(Cleaning) -> ํƒ์ƒ‰์  ๋ถ„์„(EDA) -> ๋ชจ๋ธ๋ง -> ๊ฒฐ๊ณผ ํ•ด์„ ๋ฐ ์‹œ๊ฐํ™”. 3. **๋ฐ์ดํ„ฐ ์‚ฌ์ด์–ธ์Šค์™€์˜ ๊ด€๊ณ„**: * ํ†ต๊ณ„ํ•™์€ ๋ฟŒ๋ฆฌ์ด๋ฉฐ, ์—ฌ๊ธฐ์— ์ปดํ“จํ„ฐ ๊ณตํ•™์˜ ์—ฐ์‚ฐ๋ ฅ๊ณผ ๋„๋ฉ”์ธ ์ง€์‹์ด ๊ฒฐํ•ฉ๋˜์–ด ํ˜„๋Œ€์˜ ๋ฐ์ดํ„ฐ ์‚ฌ์ด์–ธ์Šค๊ฐ€ ๋จ. ## โš ๏ธ ๋ชจ์ˆœ ๋ฐ ์—…๋ฐ์ดํŠธ (Contradictions & Updates) - **๊ณผ๊ฑฐ ๋ฐ์ดํ„ฐ์™€์˜ ์ถฉ๋Œ**: ๊ณผ๊ฑฐ์—๋Š” ์ž‘์€ ํ‘œ๋ณธ(Sample)์„ ํ†ตํ•œ ์ถ”๋ก ์ด ์ค‘์š”ํ–ˆ์œผ๋‚˜, ํ˜„๋Œ€ ์ •์ฑ…์€ 'Big Data' ์ „์ฒด๋ฅผ ๋‹ค๋ฃจ๋Š” ๊ณ„์‚ฐ ํ†ต๊ณ„ํ•™๊ณผ, ์ƒ๊ด€๊ด€๊ณ„ ๋„ˆ๋จธ์˜ ์›์ธ์„ ์ฐพ๋Š” '์ธ๊ณผ ์ถ”๋ก (Causal Inference)' ์ •์ฑ…์œผ๋กœ ํŒจ๋Ÿฌ๋‹ค์ž„์ด ์ด๋™ํ•จ(RL Update). - **์ •์ฑ… ๋ณ€ํ™”(RL Update)**: '๋ฐ์ดํ„ฐ ๊ธฐ๋ฐ˜ ์˜์‚ฌ๊ฒฐ์ •(Data-Driven Decision Making)'์ด ๋ชจ๋“  ๊ณต๊ณต ๋ฐ ๋ฏผ๊ฐ„ ์ •์ฑ…์˜ ๊ธฐ๋ณธ ์š”๊ฑด์œผ๋กœ ๊ทœ์ •๋จ์— ๋”ฐ๋ผ, ๋ถ„์„ ๊ฒฐ๊ณผ์˜ ์žฌํ˜„์„ฑ(Reproducibility)๊ณผ ํˆฌ๋ช…์„ฑ์„ ํ™•๋ณดํ•˜๊ธฐ ์œ„ํ•œ '๋ฐ์ดํ„ฐ ์‹ ๋ขฐ์„ฑ ๊ฒ€์ฆ ํ‘œ์ค€' ์ˆ˜๋ฆฝ์ด ์‹œ๊ธ‰ํ•œ ์ •์ฑ… ๊ณผ์ œ๊ฐ€ ๋จ. ## ๐Ÿ”— ์ง€์‹ ์—ฐ๊ฒฐ (Graph) - [[Probability Theory|Probability Theory]], [[Quantitative Economics (แ„‰แ…ฎแ„…แ…ฃแ†ผแ„€แ…งแ†ผแ„Œแ…ฆแ„’แ…กแ†จ)|Quantitative Economics (์ˆ˜๋Ÿ‰๊ฒฝ์ œํ•™)]], [[Sensitivity-Analysis|Sensitivity-Analysis]], [[Signal in Noise|Signal in Noise]], [[Philosophy|Philosophy]] of Science - **Modern Tech/Tools**: R, Python (Pandas/Scipy), Tableau, Google BigQuery. --- ## ๐Ÿค– LLM ํ™œ์šฉ ํžŒํŠธ (How to Use This Knowledge) **์–ธ์ œ ์ด ์ง€์‹์„ ์“ฐ๋Š”๊ฐ€:** - *(TODO)* **์–ธ์ œ ์“ฐ๋ฉด ์•ˆ ๋˜๋Š”๊ฐ€:** - *(TODO)* ## ๐Ÿงช ๊ฒ€์ฆ ์ƒํƒœ (Validation) - **์ •๋ณด ์ƒํƒœ:** needs_review - **์ถœ์ฒ˜ ์‹ ๋ขฐ๋„:** A - **๊ฒ€ํ†  ์ด์œ :** *(P-Reinforce Phase 1 ์ž๋™ ์ •๊ทœํ™”. ๋ณธ๋ฌธ ๊ฒ€์ฆ ํ•„์š”.)* ## ๐Ÿงฌ ์ค‘๋ณต ๊ฒ€์‚ฌ (Duplicate Check) - **๊ธฐ์กด ์œ ์‚ฌ ๋ฌธ์„œ:** *(TODO: ์ธ๋ฑ์„œ ํด๋Ÿฌ์Šคํ„ฐ ๋ฆฌํฌํŠธ ์ฐธ์กฐ)* - **์ฒ˜๋ฆฌ ๋ฐฉ์‹:** UPDATE (์ž๋™ ์ •๊ทœํ™”) - **์ฒ˜๋ฆฌ ์ด์œ :** Phase 1 ์ •๊ทœํ™” โ€” ์˜› ํ…œํ”Œ๋ฆฟ/๋ˆ„๋ฝ ํ•„๋“œ ๋ณด๊ฐ•. ## ๐Ÿ•“ ๋ณ€๊ฒฝ ์ด๋ ฅ (Changelog) | ๋‚ ์งœ | ๋ณ€๊ฒฝ ๋‚ด์šฉ | ์ฒ˜๋ฆฌ ๋ฐฉ์‹ | ์‹ ๋ขฐ๋„ | |------|-----------|-----------|--------| | 2026-05-08 | P-Reinforce Phase 1 ์ •๊ทœํ™” (frontmatter + ํ—ค๋” ํ‘œ์ค€ํ™”) | UPDATE | A | ## ๐Ÿ’ป ์ฝ”๋“œ ํŒจํ„ด (Code Patterns) **ํŒจํ„ด 1:** *(TODO: ์ด ํ”„๋กœ์ ํŠธ ์ปจ๋ฒค์…˜ ๋ฐ˜์˜ํ•œ ๊ตฌ์กฐ ์Šค์ผˆ๋ ˆํ†ค)* ```text # TODO ``` ## ๐Ÿค” ์˜์‚ฌ๊ฒฐ์ • ๊ธฐ์ค€ (Decision Criteria) **์„ ํƒ A๋ฅผ ์จ์•ผ ํ•  ๋•Œ:** - *(TODO)* **์„ ํƒ B๋ฅผ ์จ์•ผ ํ•  ๋•Œ:** - *(TODO)* **๊ธฐ๋ณธ๊ฐ’:** > *(TODO)* ## โŒ ์•ˆํ‹ฐํŒจํ„ด (Anti-Patterns) - **[์•ˆํ‹ฐํŒจํ„ด]:** *(TODO: ๋ฌด์—‡์„ ํ•˜๋ฉด ์•ˆ ๋˜๋Š”๊ฐ€ + ์ด์œ  + ๋Œ€์‹  ๋ฌด์—‡์„)*