--- id: wiki-2026-0508-mapreduce title: MapReduce category: 10_Wiki/Topics status: needs_review canonical_id: self aliases: [P-Reinforce-AUTO-MARE-001] duplicate_of: none source_trust_level: A confidence_score: 0.94 tags: [auto-reinforced, mapreduce, Distributed-Computing, Big-Data, Parallel-Processing, cluster-computing] raw_sources: [] last_reinforced: 2026-04-20 github_commit: pending inferred_by: Claude Opus 4.7 (auto-normalize 2026-05-08) tech_stack: language: unspecified framework: unspecified --- # [[MapReduce|MapReduce]] ## ๐Ÿ“Œ ํ•œ ์ค„ ํ†ต์ฐฐ (The Karpathy Summary) > "๊ฑฐ๋Œ€ํ•œ ๋ฐ์ดํ„ฐ๋ฅผ ์ž‘๊ฒŒ ์ชผ๊ฐœ์–ด ์ •๋ณตํ•˜๋ผ: ํ˜ผ์ž์„œ๋Š” ๊ฐ๋‹น ๋ชป ํ•  ๋ฐฉ๋Œ€ํ•œ ๋ฐ์ดํ„ฐ๋ฅผ ์ˆ˜์ฒœ ๋Œ€์˜ ์ปดํ“จํ„ฐ์— ๋‚˜๋ˆ„์–ด ์ค€ ๋’ค(Map), ๊ฐ์ž ๊ณ„์‚ฐํ•œ ๊ฒฐ๊ณผ๋“ค ์ค‘์—์„œ ํ•„์š”ํ•œ ๊ฒƒ๋งŒ ๋ฝ‘์•„ ๋‹ค์‹œ ํ•˜๋‚˜๋กœ ํ•ฉ์น˜๋Š”(Reduce) ๋ถ„์‚ฐ ์ฒ˜๋ฆฌ์˜ ํ‘œ์ค€ ๋ฌธ๋ฒ•." ## ๐Ÿ“– ๊ตฌ์กฐํ™”๋œ ์ง€์‹ (Synthesized Content) ๋งต๋ฆฌ๋“€์Šค(MapReduce)๋Š” ๋Œ€๊ทœ๋ชจ ๋ฐ์ดํ„ฐ ์„ธํŠธ๋ฅผ ๋ณ‘๋ ฌ๋กœ ์ฒ˜๋ฆฌํ•˜๊ธฐ ์œ„ํ•œ ํ”„๋กœ๊ทธ๋ž˜๋ฐ ๋ชจ๋ธ์ด์ž ํ”„๋ ˆ์ž„์›Œํฌ์ž…๋‹ˆ๋‹ค. (๊ตฌ๊ธ€์— ์˜ํ•ด ๋Œ€์ค‘ํ™”) 1. **๋‘ ๋‹จ๊ณ„์˜ ๋งˆ๋ฒ•**: * **Map Step**: ์ž…๋ ฅ ๋ฐ์ดํ„ฐ๋ฅผ (Key, Value) ์Œ์œผ๋กœ ๋ณ€ํ™˜ํ•˜์—ฌ ์ž‘์€ ์ž‘์—…๋“ค๋กœ ๋ถ„์‚ฐ. * **Reduce Step**: ๊ฐ™์€ Key๋ฅผ ๊ฐ€์ง„ ๊ฒฐ๊ณผ๋ฅผ ํ•ฉ์‚ฐ(Aggregating)ํ•˜์—ฌ ์ตœ์ข… ๊ฒฐ๊ณผ ์ƒ์„ฑ. 2. **์žฅ์ **: * **[[Scalability|Scalability]]**: ์ปดํ“จํ„ฐ๋ฅผ ์ถ”๊ฐ€ํ• ์ˆ˜๋ก ์ฒ˜๋ฆฌ ๋Šฅ๋ ฅ์ด ์„ ํ˜•์ ์œผ๋กœ ์ฆ๊ฐ€. (Scalability์™€ ์—ฐ๊ฒฐ) * **[[Fault-Tolerance|Fault-Tolerance]]**: ํ•œ ๋Œ€์˜ ์ปดํ“จํ„ฐ๊ฐ€ ๊ณ ์žฅ ๋‚˜๋„ ๋‹ค๋ฅธ ์ปดํ“จํ„ฐ๊ฐ€ ์ž‘์—…์„ ๋Œ€์‹  ์ˆ˜ํ–‰. (Fault-Tolerance์™€ ์—ฐ๊ฒฐ) ## โš ๏ธ ๋ชจ์ˆœ ๋ฐ ์—…๋ฐ์ดํŠธ (Contradictions & Updates) - **๊ณผ๊ฑฐ ๋ฐ์ดํ„ฐ์™€์˜ ์ถฉ๋Œ**: ๊ณผ๊ฑฐ์—๋Š” ๋ชจ๋“  ๋น…๋ฐ์ดํ„ฐ ์ฒ˜๋ฆฌ๋ฅผ ๋งต๋ฆฌ๋“€์Šค ์ •์ฑ…์œผ๋กœ ํ•ด๊ฒฐํ•˜๋ ค ํ–ˆ์œผ๋‚˜, ํ˜„๋Œ€ ์ •์ฑ…์€ ๋””์Šคํฌ ๊ธฐ๋ฐ˜์˜ ๋А๋ฆฐ ๋งต๋ฆฌ๋“€์Šค๋ณด๋‹ค ๋ฉ”๋ชจ๋ฆฌ ๊ธฐ๋ฐ˜์˜ ๋น ๋ฅธ 'Apache Spark ์ •์ฑ…'์ด๋‚˜ '์‹ค์‹œ๊ฐ„ ์ŠคํŠธ๋ฆฌ๋ฐ ์ฒ˜๋ฆฌ ์ •์ฑ…'์„ ์„ ํ˜ธํ•จ(RL Update). - **์ •์ฑ… ๋ณ€ํ™”(RL Update)**: ๋‹จ์ˆœํžˆ ๋ฐ์ดํ„ฐ๋ฅผ ์„ธ๋Š” ์ •์ฑ…์„ ๋„˜์–ด, ๋ถ„์‚ฐ ํ™˜๊ฒฝ์—์„œ ๊ฑฐ๋Œ€ ์ธ๊ณต์ง€๋Šฅ ๋ชจ๋ธ์„ ํ•™์Šต์‹œํ‚ค๋Š” '๋ถ„์‚ฐ ๋”ฅ๋Ÿฌ๋‹ ์ •์ฑ…'์œผ๋กœ ๊ทธ ๊ฐœ๋…์  ํ† ๋Œ€๊ฐ€ ํ™•์žฅ๋˜์–ด ๊ณ„์Šน๋จ. ([[High-Performance Computing (HPC)|High-Performance Computing (HPC)]]์™€ ์—ฐ๊ฒฐ) ## ๐Ÿ”— ์ง€์‹ ์—ฐ๊ฒฐ (Graph) - [[Scalability|Scalability]], [[Fault-Tolerance|Fault-Tolerance]], [[High-Performance Computing (HPC)|High-Performance Computing (HPC)]], [[Analysis|Analysis]], [[Information-Society|Information-Society]] - **Modern Tech/Tools**: Hadoop (HDFS), Apache Spark, Google File[[_system|system]] (GFS), Hive. --- ## ๐Ÿค– LLM ํ™œ์šฉ ํžŒํŠธ (How to Use This Knowledge) **์–ธ์ œ ์ด ์ง€์‹์„ ์“ฐ๋Š”๊ฐ€:** - *(TODO)* **์–ธ์ œ ์“ฐ๋ฉด ์•ˆ ๋˜๋Š”๊ฐ€:** - *(TODO)* ## ๐Ÿงช ๊ฒ€์ฆ ์ƒํƒœ (Validation) - **์ •๋ณด ์ƒํƒœ:** needs_review - **์ถœ์ฒ˜ ์‹ ๋ขฐ๋„:** A - **๊ฒ€ํ†  ์ด์œ :** *(P-Reinforce Phase 1 ์ž๋™ ์ •๊ทœํ™”. ๋ณธ๋ฌธ ๊ฒ€์ฆ ํ•„์š”.)* ## ๐Ÿงฌ ์ค‘๋ณต ๊ฒ€์‚ฌ (Duplicate Check) - **๊ธฐ์กด ์œ ์‚ฌ ๋ฌธ์„œ:** *(TODO: ์ธ๋ฑ์„œ ํด๋Ÿฌ์Šคํ„ฐ ๋ฆฌํฌํŠธ ์ฐธ์กฐ)* - **์ฒ˜๋ฆฌ ๋ฐฉ์‹:** UPDATE (์ž๋™ ์ •๊ทœํ™”) - **์ฒ˜๋ฆฌ ์ด์œ :** Phase 1 ์ •๊ทœํ™” โ€” ์˜› ํ…œํ”Œ๋ฆฟ/๋ˆ„๋ฝ ํ•„๋“œ ๋ณด๊ฐ•. ## ๐Ÿ•“ ๋ณ€๊ฒฝ ์ด๋ ฅ (Changelog) | ๋‚ ์งœ | ๋ณ€๊ฒฝ ๋‚ด์šฉ | ์ฒ˜๋ฆฌ ๋ฐฉ์‹ | ์‹ ๋ขฐ๋„ | |------|-----------|-----------|--------| | 2026-05-08 | P-Reinforce Phase 1 ์ •๊ทœํ™” (frontmatter + ํ—ค๋” ํ‘œ์ค€ํ™”) | UPDATE | A | ## ๐Ÿ’ป ์ฝ”๋“œ ํŒจํ„ด (Code Patterns) **ํŒจํ„ด 1:** *(TODO: ์ด ํ”„๋กœ์ ํŠธ ์ปจ๋ฒค์…˜ ๋ฐ˜์˜ํ•œ ๊ตฌ์กฐ ์Šค์ผˆ๋ ˆํ†ค)* ```text # TODO ``` ## ๐Ÿค” ์˜์‚ฌ๊ฒฐ์ • ๊ธฐ์ค€ (Decision Criteria) **์„ ํƒ A๋ฅผ ์จ์•ผ ํ•  ๋•Œ:** - *(TODO)* **์„ ํƒ B๋ฅผ ์จ์•ผ ํ•  ๋•Œ:** - *(TODO)* **๊ธฐ๋ณธ๊ฐ’:** > *(TODO)* ## โŒ ์•ˆํ‹ฐํŒจํ„ด (Anti-Patterns) - **[์•ˆํ‹ฐํŒจํ„ด]:** *(TODO: ๋ฌด์—‡์„ ํ•˜๋ฉด ์•ˆ ๋˜๋Š”๊ฐ€ + ์ด์œ  + ๋Œ€์‹  ๋ฌด์—‡์„)*