--- id: AI-API-001 category: "10_Wiki/๐Ÿ’ก Topics/AI" confidence_score: 1.0 tags: [software-engineering, api-design, ai-services, streaming, grpc, rest] last_reinforced: 2026-04-26 --- # API Design for AI Services (AI ์„œ๋น„์Šค๋ฅผ ์œ„ํ•œ API ๋””์ž์ธ) ## ๐Ÿ“Œ ํ•œ ์ค„ ํ†ต์ฐฐ (The Karpathy Summary) > "๊ธด ์ถ”๋ก  ์‹œ๊ฐ„๊ณผ ๊ฑฐ๋Œ€ํ•œ ๋ฐ์ดํ„ฐ ํ๋ฆ„์„ ์šฐ์•„ํ•˜๊ฒŒ ์ถ”์ƒํ™”ํ•˜๋ผ" โ€” ๋ชจ๋ธ์˜ ๋น„๊ฒฐ์ •์  ์ถœ๋ ฅ๊ณผ ๋น„๋™๊ธฐ์  ์—ฐ์‚ฐ ํŠน์„ฑ์„ ๊ณ ๋ คํ•˜์—ฌ ๊ฐœ๋ฐœ์ž๊ฐ€ ์˜ˆ์ธก ๊ฐ€๋Šฅํ•˜๊ณ  ํšจ์œจ์ ์œผ๋กœ AI ๊ธฐ๋Šฅ์„ ํ†ตํ•ฉํ•  ์ˆ˜ ์žˆ๋„๋ก ์„ค๊ณ„๋œ ์ธํ„ฐํŽ˜์ด์Šค. ## ๐Ÿ“– ๊ตฌ์กฐํ™”๋œ ์ง€์‹ (Synthesized Content) - **์ถ”์ถœ๋œ ํŒจํ„ด:** ๋™๊ธฐ์‹ ์š”์ฒญ-์‘๋‹ต์˜ ํ•œ๊ณ„๋ฅผ ๋„˜์–ด ์ŠคํŠธ๋ฆฌ๋ฐ, ๋น„๋™๊ธฐ ์ž‘์—… ํ, ์ƒํƒœ ๋ณด์กดํ˜• ์„ธ์…˜ ๋“ฑ์„ ํ†ตํ•ด ๊ณ ์‚ฌ์–‘ ์—ฐ์‚ฐ ์ž์›์„ ํšจ์œจ์ ์œผ๋กœ ๋…ธ์ถœํ•˜๋Š” ์„œ๋น„์Šค ์ธํ„ฐํŽ˜์ด์Šค ํŒจํ„ด. - **ํ•ต์‹ฌ ์„ค๊ณ„ ์›์น™:** - **Streaming First:** LLM์˜ ํ† ํฐ ์ƒ์„ฑ์„ ์‹ค์‹œ๊ฐ„์œผ๋กœ ์ „๋‹ฌํ•˜๊ธฐ ์œ„ํ•ด SSE(Server-Sent Events)๋‚˜ WebSockets ํ•„์ˆ˜ ์ ์šฉ. - **Stateless vs Stateful:** ๋Œ€ํ™” ๋งฅ๋ฝ ์œ ์ง€(Conversation ID)์™€ ๋ชจ๋ธ ๊ฐ€์ค‘์น˜ ๋…๋ฆฝ์„ฑ์„ ์œ„ํ•œ ์ƒํƒœ ๊ด€๋ฆฌ ์ „๋žต. - **Asynchronous Execution:** ์‹œ๊ฐ„์ด ์˜ค๋ž˜ ๊ฑธ๋ฆฌ๋Š” ํƒœ์Šคํฌ(์ด๋ฏธ์ง€ ์ƒ์„ฑ ๋“ฑ)๋ฅผ ์œ„ํ•œ Job ID ๊ธฐ๋ฐ˜์˜ ํด๋ง(Polling) ๋˜๋Š” ์›นํ›…(Webhook) ๊ตฌ์กฐ. - **Safety & Filtering:** API ์ˆ˜์ค€์—์„œ ์œ ํ•ด ๊ฒฐ๊ณผ๋ฌผ์„ ์ฐจ๋‹จํ•˜๋Š” ๊ฐ€๋“œ๋ ˆ์ผ ๋ ˆ์ด์–ด ํ†ตํ•ฉ. - **Version Control:** ๋ชจ๋ธ ๋ฒ„์ „ ์—…๋ฐ์ดํŠธ ์‹œ ๊ฒฐ๊ณผ๋ฌผ์˜ ๋ฏธ์„ธํ•œ ๋ณ€ํ™”๋ฅผ ๊ณ ๋ คํ•œ ์‹œ๋งจํ‹ฑ ๋ฒ„์ €๋‹. ## โš ๏ธ ๋ชจ์ˆœ ๋ฐ ์—…๋ฐ์ดํŠธ (Contradictions & RL Update) - **๊ณผ๊ฑฐ ๋ฐ์ดํ„ฐ์™€์˜ ์ถฉ๋Œ:** ์ •์ ์ธ ๋ฐ์ดํ„ฐ๋ฅผ ์ฃผ๊ณ ๋ฐ›๋˜ REST API์—์„œ, ์‹ค์‹œ๊ฐ„ ์ถ”๋ก ๊ณผ ๋Œ€๊ทœ๋ชจ ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ ๋ฐ์ดํ„ฐ๋ฅผ ์ฒ˜๋ฆฌํ•˜๋Š” ๋™์ ์ธ ์ธํ„ฐํŽ˜์ด์Šค๋กœ ์„ค๊ณ„ ์ค‘์‹ฌ์ด ์ด๋™. - **์ •์ฑ… ๋ณ€ํ™”:** Antigravity ํ”„๋กœ์ ํŠธ๋Š” ๋ชจ๋“  ์—์ด์ „ํŠธ ๊ฐ„ ํ†ต์‹ ์— gRPC ์ŠคํŠธ๋ฆฌ๋ฐ์„ ์šฐ์„  ์‚ฌ์šฉํ•˜๋ฉฐ, ์™ธ๋ถ€ ์›น ์ธํ„ฐํŽ˜์ด์Šค ์ œ๊ณต ์‹œ์—๋Š” SSE ํ‘œ์ค€์„ ์ค€์ˆ˜ํ•˜์—ฌ ์‚ฌ์šฉ์ž ๊ฒฝํ—˜์„ ์ตœ์ ํ™”ํ•จ. ## ๐Ÿ”— ์ง€์‹ ์—ฐ๊ฒฐ (Graph) - System-Design-for-AI-Scale, [[LLM]], Streaming-Data-Processing, Microservices - **Raw Source:** 10_Wiki/Topics/AI/API-Design for AI Services.md