--- id: QA-INT-TEST-001 category: "10_Wiki/๐Ÿ’ก Topics/AI" confidence_score: 1.0 tags: [software-engineering, testing, ai-qa, integration-testing, reliability, mlops] last_reinforced: 2026-04-26 --- # Integration Testing for AI (AI ํ†ตํ•ฉ ํ…Œ์ŠคํŠธ) ## ๐Ÿ“Œ ํ•œ ์ค„ ํ†ต์ฐฐ (The Karpathy Summary) > "๊ฐœ๋ณ„ ๋ถ€ํ’ˆ์˜ ์™„๋ฒฝํ•จ์— ์•ˆ์ฃผํ•˜์ง€ ๋ง๊ณ , ๊ทธ๋“ค์ด ํ†ฑ๋‹ˆ๋ฐ”ํ€ด์ฒ˜๋Ÿผ ๋งž๋ฌผ๋ ค ๋Œ์•„๊ฐ€๋Š” ์ „์ฒด์˜ ํ•˜๋ชจ๋‹ˆ๋ฅผ ๊ฒ€์ฆํ•˜๋ผ" โ€” ์—ฌ๋Ÿฌ ์†Œํ”„ํŠธ์›จ์–ด ๋ชจ๋“ˆ๊ณผ AI ๋ชจ๋ธ, ์™ธ๋ถ€ ๋ฐ์ดํ„ฐ ์†Œ์Šค ๋“ฑ์ด ์œ ๊ธฐ์ ์œผ๋กœ ์—ฐ๊ฒฐ๋˜์–ด ๋ฐ์ดํ„ฐ ํŒŒ์ดํ”„๋ผ์ธ๊ณผ ๋น„์ฆˆ๋‹ˆ์Šค ๋กœ์ง์ด ์˜ฌ๋ฐ”๋ฅด๊ฒŒ ์ž‘๋™ํ•˜๋Š”์ง€ ํ™•์ธํ•˜๋Š” ํ’ˆ์งˆ ๋ณด์ฆ ํ”„๋กœ์„ธ์Šค. ## ๐Ÿ“– ๊ตฌ์กฐํ™”๋œ ์ง€์‹ (Synthesized Content) - **์ถ”์ถœ๋œ ํŒจํ„ด:** "Contract Testing" โ€” ๊ฐ ์ปดํฌ๋„ŒํŠธ ๊ฐ„์˜ ์ž…์ถœ๋ ฅ ๊ทœ์•ฝ(Interface)์ด ์ค€์ˆ˜๋˜๋Š”์ง€ ํ™•์ธํ•˜๊ณ , ํŠนํžˆ ๋น„๊ฒฐ์ •๋ก ์ ์ธ AI ๋ชจ๋ธ์˜ ์‘๋‹ต์ด ์ „์ฒด ์‹œ์Šคํ…œ์˜ ์˜ˆ์™ธ ์ฒ˜๋ฆฌ ๋กœ์ง์„ ๋ฌด๋„ˆ๋œจ๋ฆฌ์ง€ ์•Š๋Š”์ง€ ๊ฒ€์ฆํ•˜๋Š” ํ๋ฆ„ ๋ณด์žฅ ํŒจํ„ด. - **์ฃผ์š” ํ…Œ์ŠคํŠธ ์˜์—ญ:** - **Data Pipeline Integration:** ์ˆ˜์ง‘๋œ Raw ๋ฐ์ดํ„ฐ๊ฐ€ ์ „์ฒ˜๋ฆฌ ๊ณผ์ •์„ ๊ฑฐ์ณ ์œ„ํ‚ค ์ธ๋ฑ์Šค๊นŒ์ง€ ๋ฌด๊ฒฐํ•˜๊ฒŒ ๋„๋‹ฌํ•˜๋Š”๊ฐ€? - **Agent-Tool Interaction:** ์—์ด์ „ํŠธ๊ฐ€ ์™ธ๋ถ€ ๋„๊ตฌ(Git, ํŒŒ์ผ ์‹œ์Šคํ…œ ๋“ฑ)๋ฅผ ํ˜ธ์ถœํ•˜๊ณ  ๊ทธ ๊ฒฐ๊ณผ๋ฅผ ์˜ฌ๋ฐ”๋ฅด๊ฒŒ ํ•ด์„ํ•˜๋Š”๊ฐ€? - **Model-UI Sync:** AI์˜ ์‹ค์‹œ๊ฐ„ ์ŠคํŠธ๋ฆฌ๋ฐ ์‘๋‹ต์ด ํ”„๋ก ํŠธ์—”๋“œ ์•„ํ‚คํ…์ฒ˜ ์ƒ์—์„œ ๊นจ์ง ์—†์ด ๋ Œ๋”๋ง๋˜๋Š”๊ฐ€? - **๋„์ „ ๊ณผ์ œ:** AI ์‘๋‹ต์˜ ๊ฐ€๋ณ€์„ฑ์œผ๋กœ ์ธํ•ด ์ „ํ†ต์ ์ธ Assert ๋ฌธ ์‚ฌ์šฉ์ด ํž˜๋“ฆ. -> ํ™•๋ฅ ์  ๋ฒ”์œ„ ๊ฒ€์ฆ(Probabilistic Testing)์ด๋‚˜ ๊ณจ๋“ ์…‹(Golden Set) ๋น„๊ต ๊ธฐ๋ฒ• ํ™œ์šฉ. ## โš ๏ธ ๋ชจ์ˆœ ๋ฐ ์—…๋ฐ์ดํŠธ (Contradictions & RL Update) - **๊ณผ๊ฑฐ ๋ฐ์ดํ„ฐ์™€์˜ ์ถฉ๋Œ:** ์ •์  ์ฝ”๋“œ๋ฅผ ๊ฒ€์ฆํ•˜๋˜ ๋ฐฉ์‹์—์„œ, ๋ฐ์ดํ„ฐ์˜ ๋ณ€ํ™”์™€ ๋ชจ๋ธ์˜ ํ™•๋ฅ ์  ํŠน์„ฑ๊นŒ์ง€ ๊ณ ๋ คํ•ด์•ผ ํ•˜๋Š” '๋™์  ์‹œ์Šคํ…œ ๊ฒ€์ฆ'์œผ๋กœ ํ…Œ์ŠคํŠธ์˜ ๋‚œ์ด๋„์™€ ์ค‘์š”๋„๊ฐ€ ๊ธ‰์ƒ์Šนํ•จ. - **์ •์ฑ… ๋ณ€ํ™”:** Antigravity ํ”„๋กœ์ ํŠธ๋Š” ๋ชจ๋“  ์ปค๋ฐ‹ ์ „, ์—์ด์ „ํŠธ์˜ ์ฃผ์š” ์‹œ๋‚˜๋ฆฌ์˜ค(์ง€์‹ ์ƒ์„ฑ, ํŒŒ์ผ ์ˆ˜์ • ๋“ฑ)๋ฅผ ์‹œ๋ฎฌ๋ ˆ์ด์…˜ํ•˜๋Š” ํ†ตํ•ฉ ํ…Œ์ŠคํŠธ ์ž๋™ํ™” ์Šคํฌ๋ฆฝํŠธ๋ฅผ ์‹คํ–‰ํ•˜์—ฌ ์‹œ์Šคํ…œ์˜ ๊ฐ•๊ฑด์„ฑ์„ ์œ ์ง€ํ•จ. ## ๐Ÿ”— ์ง€์‹ ์—ฐ๊ฒฐ (Graph) - [[DevOps-for-AI-MLOps|DevOps-for-AI-MLOps]], [[Software-Architecture-Patterns|Software-Architecture-Patterns]], [[Input-Validation-Strategies|Input-Validation-Strategies]], System-Design-for-AI-Scale - **Raw Source:** 10_Wiki/Topics/AI/Integration-Testing-for-AI.md