--- id: [[P-Reinforce|P-Reinforce]]-AI-ADAPTIVE-COMPUTE category: "10_Wiki/๐Ÿ’ก Topics/AI" confidence_score: 0.97 tags: [AI, [[Efficiency|Efficiency]], AdaptiveCompute, Inference] last_reinforced: 2026-04-20 --- # [[Adaptive Compute (แ„Œแ…ฅแ†จแ„‹แ…ณแ†ผแ„’แ…งแ†ผ แ„€แ…จแ„‰แ…กแ†ซแ„…แ…ฃแ†ผ แ„Œแ…ฉแ„Œแ…ฅแ†ฏ)|Adaptive Compute (์ ์‘ํ˜• ๊ณ„์‚ฐ๋Ÿ‰ ์กฐ์ ˆ)]] ## ๐Ÿ“Œ ํ•œ ์ค„ ํ†ต์ฐฐ (The Karpathy Summary) > "์‰ฌ์šด ๋ฌธ์ œ๋Š” ๋Œ€์ถฉ, ์–ด๋ ค์šด ๋ฌธ์ œ๋Š” ์‹ ์ค‘ํ•˜๊ฒŒ." ๋ชจ๋“  ์ž…๋ ฅ์— ๋™์ผํ•œ ๊ณ„์‚ฐ ์ž์›์„ ๋‚ญ๋น„ํ•˜์ง€ ์•Š๊ณ , ๊ณผ์ œ ๋‚œ์ด๋„์— ๋”ฐ๋ผ ๋ชจ๋ธ์ด ์‚ฌ์šฉํ•˜๋Š” ์—ฐ์‚ฐ๋Ÿ‰(์ธต์˜ ๊นŠ์ด, ํ™œ์„ฑ ๋‰ด๋Ÿฐ ์ˆ˜ ๋“ฑ)์„ ์œ ๋™์ ์œผ๋กœ ์กฐ์ ˆํ•˜๋Š” ์ง€๋Šฅ์  ์ž์› ๋ฐฐ๋ถ„ ๊ธฐ์ˆ ์ด๋‹ค. ## ๐Ÿ“– ๊ตฌ์กฐํ™”๋œ ์ง€์‹ (Synthesized Content) - **Early Exit**: ๋ชจ๋ธ์˜ ์ค‘๊ฐ„ ์ธต์—์„œ ์ด๋ฏธ ๊ฒฐ๊ณผ๊ฐ€ ํ™•์‹คํ•˜๋‹ค๋ฉด ์ตœ์ข… ์ธต๊นŒ์ง€ ๊ฐ€์ง€ ์•Š๊ณ  ๋ฐ”๋กœ ๊ฒฐ๊ณผ๋ฅผ ์ถœ๋ ฅํ•˜์—ฌ ์‹œ๊ฐ„๊ณผ ์—๋„ˆ์ง€๋ฅผ ์•„๋‚Œ. - **MoE (Mixture of Experts)**: ๊ฑฐ๋Œ€ ๋ชจ๋ธ์˜ ์ผ๋ถ€(์ „๊ณต ๊ต์ˆ˜)๋งŒ ํ™œ์„ฑํ™”ํ•˜์—ฌ ํŠน์ • ๋ถ„์•ผ์˜ ์งˆ๋ฌธ์—๋งŒ ์ž์›์„ ์ง‘์ค‘ํ•จ. - **Dynamic Token [[Processing|Processing]]**: ๋ฌธ๋งฅ์ƒ ์ค‘์š”ํ•˜์ง€ ์•Š์€ ๋‹จ์–ด(์กฐ์‚ฌ ๋“ฑ)๋Š” ๋‚ฎ์€ ์ •๋ฐ€๋„๋กœ ์ฒ˜๋ฆฌํ•˜๊ณ , ํ•ต์‹ฌ์ ์ธ ๋‹จ์–ด์— ์—ฐ์‚ฐ๋ ฅ์„ ๋ชฐ์•„์คŒ. - **Inference Efficiency**: ๋™์ผํ•œ ์ •ํ™•๋„๋ฅผ ์œ ์ง€ํ•˜๋ฉด์„œ ์„œ๋น™ ๋น„์šฉ(GPU ์†Œ๋ชจ)์„ ํš๊ธฐ์ ์œผ๋กœ ๋‚ฎ์ถ”๋Š” ํ•ต์‹ฌ ์—ด์‡ ๋‹ค. ## โš ๏ธ ๋ชจ์ˆœ ๋ฐ ์—…๋ฐ์ดํŠธ (RL Update) - ๊ณ„์‚ฐ๋Ÿ‰์„ ์ค„์ด๋Š” ๊ณผ์ •์—์„œ ๋ชจ๋ธ์˜ '์„ค๋ช… ๊ฐ€๋Šฅ์„ฑ'์ด ํŒŒํŽธํ™”๋  ์œ„ํ—˜์ด ์žˆ๋‹ค. ๋˜ํ•œ ์–ด๋–ค ๋ฌธ์ œ๊ฐ€ '์‰ฌ์šด ๋ฌธ์ œ'์ธ์ง€ ํŒ๋‹จํ•˜๋Š” ๊ณผ์ • ์ž์ฒด๊ฐ€ ๋˜ ๋‹ค๋ฅธ ๊ณ„์‚ฐ ์˜ค๋ฒ„ํ—ค๋“œ๊ฐ€ ๋  ์ˆ˜ ์žˆ์œผ๋ฏ€๋กœ, ํŒ๋‹จ ๋กœ์ง์„ ๊ทน๋„๋กœ ๊ฐ€๋ณ๊ฒŒ ์„ค๊ณ„ํ•˜๋Š” ๊ฒƒ์ด ์Ÿ์ ์ด๋‹ค. ์ตœ๊ทผ์—๋Š” OpenAI o1์ฒ˜๋Ÿผ ์ถ”๋ก  ์‹œ๊ฐ„์„ ์˜๋„์ ์œผ๋กœ ๋Š˜๋ ค ์„ฑ๋Šฅ์„ ๊ทน๋Œ€ํ™”ํ•˜๋Š” '์—ญ๋ฐฉํ–ฅ' ์ ์‘ํ˜• ๊ณ„์‚ฐ ์—ฐ๊ตฌ๋„ ๋ถ€์ƒํ•˜๊ณ  ์žˆ๋‹ค. ## ๐Ÿ”— ์ง€์‹ ์—ฐ๊ฒฐ (Graph) - Related: [[Model-Compression|Model-Compression]] , Mixture of Experts (MoE) - Modern Pattern: [[Test-Time Compute Scaling (แ„Žแ…ฎแ„…แ…ฉแ†ซ แ„‰แ…ตแ„€แ…กแ†ซ แ„€แ…จแ„‰แ…กแ†ซ แ„‰แ…ณแ„แ…ฆแ„‹แ…ตแ†ฏแ„…แ…ตแ†ผ)|Test-Time Compute Scaling (์ถ”๋ก  ์‹œ๊ฐ„ ๊ณ„์‚ฐ ์Šค์ผ€์ผ๋ง)]]