---
id: SYS-PAR-COMP-001
category: "10_Wiki/💡 Topics/AI"
confidence_score: 1.0
tags: [infrastructure, [[Parallel-Computing|Parallel-Computing]], ai, [[Distributed-Systems|Distributed-Systems]], gpu, throughput]
last_reinforced: 2026-04-26
---

# Parallel Computing in AI

## 📌 One-Line Insight (The Karpathy Summary)

> "Don't try to move a huge mountain with a single shovel; expand the territory of intelligence with the power of concurrency, thousands of shovels moving at once."
> A computing paradigm that sharply reduces execution time by distributing massive data and complex computation across multiple processors (CPU, GPU, TPU) and processing them simultaneously.

## 📖 Structured Knowledge (Synthesized Content)

- **Extracted pattern:** "Divide and Conquer in Computation": identify independent units of computation, assign them to processors in parallel, and minimize data synchronization and communication overhead between processors so that overall system throughput scales as close to linearly as possible.
- **Main parallelization strategies:**
  - **Data Parallelism:** Replicate the same model on several devices, train each replica on a different data batch at the same time, then combine the gradients (typically by summing or averaging them).
  - **Model Parallelism:** When the model itself is too large to fit on one device, split its layers or parameters and place the pieces on several devices.
  - **Pipeline Parallelism:** Process the model's layer-by-layer computation like a factory conveyor belt: stages run in sequence for any one batch, while different batches occupy different stages in parallel.
- **Significance:** As Moore's Law reaches its limits, parallel computing is the physical heart of modern AI: it is what makes it possible to train and serve very large language models (LLMs) with hundreds of billions of parameters in a realistic amount of time.
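The Data Parallelism strategy above can be illustrated with a minimal sketch. It simulates the devices as plain Python lists rather than real GPUs, and it assumes a toy one-parameter model `y ≈ w * x` with mean squared error; the names `local_gradient`, `split_batch`, `all_reduce_mean`, and `data_parallel_step` are hypothetical helpers introduced only for this example, not part of any library. With equal-sized shards, averaging the per-device gradients gives the same result as computing the gradient on the full batch, which is why each replica can apply the identical update.

```python
from typing import List, Tuple

Sample = Tuple[float, float]  # (x, y) pair


def local_gradient(w: float, shard: List[Sample]) -> float:
    """Gradient of mean squared error for y ≈ w * x, computed on one shard."""
    return sum(2.0 * (w * x - y) * x for x, y in shard) / len(shard)


def split_batch(batch: List[Sample], num_devices: int) -> List[List[Sample]]:
    """Split a batch into equal-sized shards, one per simulated device."""
    if len(batch) % num_devices != 0:
        raise ValueError("batch size must be divisible by num_devices")
    size = len(batch) // num_devices
    return [batch[i * size:(i + 1) * size] for i in range(num_devices)]


def all_reduce_mean(grads: List[float]) -> float:
    """Stand-in for an all-reduce: every device ends up with the mean gradient."""
    return sum(grads) / len(grads)


def data_parallel_step(w: float, batch: List[Sample],
                       num_devices: int, lr: float) -> float:
    """One training step: shard the batch, compute local gradients, average, update."""
    shards = split_batch(batch, num_devices)
    # On real hardware these gradient computations would run concurrently.
    grads = [local_gradient(w, shard) for shard in shards]
    return w - lr * all_reduce_mean(grads)


# Toy data generated from y = 3 * x, so training should recover w close to 3.
batch = [(float(x), 3.0 * x) for x in range(1, 9)]
w = 0.0
for _ in range(200):
    w = data_parallel_step(w, batch, num_devices=4, lr=0.01)
print(round(w, 3))
```

In a real multi-GPU setup the averaging step is where interconnect bandwidth and synchronization waits enter, since every device must exchange gradients before the next step can begin.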
## ⚠️ Contradictions and Updates (Contradictions & RL Update)

- **Conflict with earlier data:** The old assumption was that speed rises in proportion to the number of devices. In practice, the share of work that cannot be parallelized, including inter-device data transfer (interconnect) and synchronization waits, caps the achievable speedup (Amdahl's Law). For example, if 5% of the work is serial, the speedup cannot exceed 20x no matter how many devices are added. Designing an "efficient distributed architecture" that keeps this overhead small has therefore become the more important question.
- **Policy change:** For large-scale knowledge embedding and vector search indexing, the Antigravity project applies data parallelism in a multi-GPU environment, improving indexing speed by more than 8x compared with a single device.

## 🔗 Knowledge Links (Graph)

- [[NVIDIA-CUDA-and-AI|NVIDIA-CUDA-and-AI]], [[Hardware-Acceleration-for-AI|Hardware-Acceleration-for-AI]], [[_system|system]]-Design-for-AI-Scale, [[High-Availability-Systems|High-Availability-Systems]]
- **Raw Source:** 10_Wiki/Topics/AI/Parallel-Computing-in-AI.md