--- id: EDGE-AI-001 category: "10_Wiki/πŸ’‘ Topics/AI" confidence_score: 1.0 tags: [ai, infrastructure, edge-computing, on-device-ai, latency-optimization] last_reinforced: 2026-04-26 --- # Edge AI and Computing (μ—£μ§€ AI와 μ»΄ν“¨νŒ…) ## πŸ“Œ ν•œ 쀄 톡찰 (The Karpathy Summary) > "데이터가 νƒœμ–΄λ‚˜λŠ” κ·Έκ³³μ—μ„œ μ§€λŠ₯을 μ¦‰μ‹œ μ‹€ν–‰ν•˜λΌ" β€” ν΄λΌμš°λ“œ μ„œλ²„μ— μ˜μ‘΄ν•˜μ§€ μ•Šκ³  μ‚¬μš©μžμ˜ 단말기(슀마트폰, IoT κΈ°κΈ°, λ‘œλ΄‡ λ“±)μ—μ„œ 직접 AI λͺ¨λΈμ„ μ‹€ν–‰ν•˜μ—¬ μ§€μ—° μ‹œκ°„μ„ 쀄이고 ν”„λΌμ΄λ²„μ‹œλ₯Ό λ³΄ν˜Έν•˜λŠ” 기술. ## πŸ“– κ΅¬μ‘°ν™”λœ 지식 (Synthesized Content) - **μΆ”μΆœλœ νŒ¨ν„΄:** λŒ€μ—­ν­(Bandwidth) ν•œκ³„μ™€ λ³΄μ•ˆ 리슀크λ₯Ό κ·Ήλ³΅ν•˜κΈ° μœ„ν•΄, 쀑앙 집쀑식 연산을 λΆ„μ‚°λœ λ‹¨λ§κΈ°λ‘œ μ „μ΄μ‹œν‚€κ³  ν•„μš”ν•œ μ •λ³΄λ§Œ μš”μ•½ν•˜μ—¬ μ „μ†‘ν•˜λŠ” λΆ„μ‚° μ§€λŠ₯ νŒ¨ν„΄. - **핡심 기술:** - **Model Compression:** μ–‘μžν™”(Quantization), 프루닝(Pruning), 지식 증λ₯˜(Distillation) 등을 톡해 λͺ¨λΈ 크기 μΆ•μ†Œ. - **NPU (Neural Processing Unit):** λͺ¨λ°”일 기기에 μ΅œμ ν™”λœ AI μ „μš© ν•˜λ“œμ›¨μ–΄ 가속기. - **On-device Learning:** μ„œλ²„ μ—°κ²° 없이 κΈ°κΈ° λ‚΄λΆ€ λ°μ΄ν„°λ‘œ λͺ¨λΈμ„ λ―Έμ„Έ μ‘°μ •. - **μž₯점:** μ΄ˆμ €μ§€μ—° 응닡(μžμœ¨μ£Όν–‰, κ²Œμž„ λ“±), μ˜€ν”„λΌμΈ μž‘λ™ κ°€λŠ₯, 데이터 유좜 λ°©μ§€, μ„œλ²„ λΉ„μš© 절감. ## ⚠️ λͺ¨μˆœ 및 μ—…λ°μ΄νŠΈ (Contradictions & RL Update) - **κ³Όκ±° λ°μ΄ν„°μ™€μ˜ 좩돌:** μ„±λŠ₯이 λΆ€μ‘±ν•œ μ—£μ§€ κΈ°κΈ°λŠ” λ‹¨μˆœ μˆ˜μ§‘λ§Œ ν•΄μ•Ό ν•œλ‹€λŠ” κ³ μ •κ΄€λ…μ—μ„œ λ²—μ–΄λ‚˜, κ°•λ ₯ν•œ λͺ¨λ°”일 ν”„λ‘œμ„Έμ„œμ˜ λ°œμ „μœΌλ‘œ μ„œλΉ™κ³Ό ν•™μŠ΅μ΄ κ°€λŠ₯ν•œ 'μ§€λŠ₯ν˜• μ—£μ§€' μ‹œλŒ€λ‘œ μ§„μž…. - **μ •μ±… λ³€ν™”:** ConnectAI ν”„λ‘œμ νŠΈλŠ” 둜컬 LLM μ—”λ“œν¬μΈνŠΈλ₯Ό ν™œμš©ν•œ '둜컬 브레인' μ „λž΅μ„ 톡해, μ‚¬μš©μžμ˜ μ½”λ“œκ°€ μ™ΈλΆ€λ‘œ μœ μΆœλ˜μ§€ μ•ŠλŠ” Edge AI μ§€ν–₯적 μ•„ν‚€ν…μ²˜λ₯Ό 좔ꡬ함. ## πŸ”— 지식 μ—°κ²° (Graph) - System-Design-for-AI-Scale, Data-Ethics-and-Privacy, [[Federated-Learning|Federated-Learning]], [[Distributed-Computing|Distributed-Computing]] - **Raw Source:** 10_Wiki/Topics/AI/Edge-AI-and-Computing.md