---
id: AI-INC-001
category: "10_Wiki/💡 Topics/AI"
confidence_score: 1.0
tags: [ai, machine-learning, incremental-learning, lifelong-learning, online-learning]
last_reinforced: 2026-04-26
---

# Incremental Learning

## 📌 One-Line Insight (The Karpathy Summary)

> "Build an intelligence that evolves by continually absorbing new knowledge without forgetting the wisdom of the past."

A machine learning technique that updates a model by incrementally incorporating new data as it arrives in real time, without retraining on the entire dataset.

## 📖 Structured Knowledge (Synthesized Content)

- **Extracted pattern:** "Streaming Intelligence," a knowledge-accumulation pattern that fine-tunes model parameters along the data stream while preventing the catastrophic forgetting that occurs when new knowledge is added.
- **Key challenges and solutions:**
  - **Catastrophic Forgetting:** New training overwrites existing weights, so previously learned knowledge is lost. Mitigated with regularization or a replay buffer.
  - **Plasticity vs Stability:** The dilemma of staying flexible enough to absorb change while still holding on to essential knowledge.
  - **Elastic Weight Consolidation (EWC):** Preserves important past knowledge by penalizing changes to the weights associated with it.
- **Significance:** In environments where data volume grows exponentially, incremental learning cuts retraining costs and makes it possible to operate a "living model" that reflects the latest trends immediately.

## ⚠️ Contradictions and Updates (Contradictions & RL Update)

- **Conflict with past data:** Training on a static dataset used to be the mainstream approach, but continual learning, which adapts to domains that change in real time, has now emerged as an essential requirement for AI systems.
- **Policy change:** To reflect the thousands of new wiki documents added every day without delay, the Antigravity project operates a lightweight incremental learning pipeline in addition to its vector index.

## 🔗 Knowledge Links (Graph)

- [[Reinforcement-Learning|Reinforcement-Learning]], Transfer-Learning-Foundations, Online-Learning-Algorithms, [[Generalization-in-AI|Generalization-in-AI]]
- **Raw Source:** 10_Wiki/Topics/AI/Incremental-Learning.md
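
The two mitigations this note names for catastrophic forgetting, an EWC-style weight penalty and a replay buffer, can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the Antigravity pipeline's actual code. The names `ewc_penalty`, `ewc_penalty_grad`, and `ReservoirReplayBuffer` are hypothetical. The penalty uses the commonly cited quadratic form `(lam / 2) * sum_i F_i * (theta_i - theta_star_i)^2`, with the importance values `F_i` (typically a diagonal Fisher information estimate) assumed to be precomputed. Reservoir sampling is one possible buffer policy chosen for the example; the note does not say which policy is used.

```python
import random
from typing import Any, List, Sequence


def ewc_penalty(theta: Sequence[float], theta_star: Sequence[float],
                fisher: Sequence[float], lam: float) -> float:
    """Quadratic EWC-style penalty: (lam / 2) * sum_i F_i * (theta_i - theta_star_i)^2.

    theta      -- current weights while learning the new data
    theta_star -- weights saved after the previous task (assumed given)
    fisher     -- per-weight importance, e.g. a diagonal Fisher estimate (assumed given)
    """
    return 0.5 * lam * sum(
        f * (t - s) ** 2 for t, s, f in zip(theta, theta_star, fisher)
    )


def ewc_penalty_grad(theta: Sequence[float], theta_star: Sequence[float],
                     fisher: Sequence[float], lam: float) -> List[float]:
    """Gradient of the penalty; add it to the new-task gradient before each update."""
    return [lam * f * (t - s) for t, s, f in zip(theta, theta_star, fisher)]


class ReservoirReplayBuffer:
    """Fixed-size buffer that keeps a uniform random sample of everything seen so far."""

    def __init__(self, capacity: int, seed: int = 0) -> None:
        self.capacity = capacity
        self.items: List[Any] = []
        self.seen = 0  # total number of stream items offered to the buffer
        self._rng = random.Random(seed)

    def add(self, item: Any) -> None:
        self.seen += 1
        if len(self.items) < self.capacity:
            # Buffer not yet full: always keep the item.
            self.items.append(item)
        else:
            # Replace a random slot with probability capacity / seen.
            j = self._rng.randrange(self.seen)
            if j < self.capacity:
                self.items[j] = item

    def sample(self, k: int) -> List[Any]:
        """Draw up to k stored examples to mix into the next training batch."""
        return self._rng.sample(self.items, min(k, len(self.items)))
```

In a training loop, each incoming batch would be combined with `buffer.sample(k)` before the gradient step, and `ewc_penalty_grad(...)` would be added to the new-data gradient. A larger `lam` favors stability and a smaller one favors plasticity, which is the trade-off described under Key challenges.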