--- id: ML-FOUND-001 category: "10_Wiki/๐Ÿ’ก Topics/AI" confidence_score: 1.0 tags: [machine-learning, ai-foundations, supervised-learning, unsupervised-learning, reinforcement-learning] last_reinforced: 2026-04-26 --- # Machine Learning Foundations (๋จธ์‹ ๋Ÿฌ๋‹ ๊ธฐ์ดˆ) ## ๐Ÿ“Œ ํ•œ ์ค„ ํ†ต์ฐฐ (The Karpathy Summary) > "๋ช…์‹œ์  ํ”„๋กœ๊ทธ๋ž˜๋ฐ์˜ ํ•œ๊ณ„๋ฅผ ๋ฐ์ดํ„ฐ์˜ ํž˜์œผ๋กœ ๋ŒํŒŒํ•˜๊ณ , ๊ธฐ๊ณ„๊ฐ€ ์Šค์Šค๋กœ ๊ทœ์น™์„ ๋ฐœ๊ฒฌํ•˜๊ฒŒ ํ•˜๋ผ" โ€” ๋ช…์‹œ์ ์ธ ๊ทœ์น™์„ ์ฝ”๋”ฉํ•˜์ง€ ์•Š๊ณ  ๋ฐ์ดํ„ฐ ์†์— ์ˆจ๊ฒจ์ง„ ํ†ต๊ณ„์  ํŒจํ„ด์„ ํ•™์Šตํ•˜์—ฌ ์˜ˆ์ธก์ด๋‚˜ ํŒ๋‹จ์„ ์ˆ˜ํ–‰ํ•˜๋Š” ์•Œ๊ณ ๋ฆฌ์ฆ˜๊ณผ ๋ชจ๋ธ์˜ ์ด์ฒด. ## ๐Ÿ“– ๊ตฌ์กฐํ™”๋œ ์ง€์‹ (Synthesized Content) - **์ถ”์ถœ๋œ ํŒจํ„ด:** "Learning from Data" โ€” ์ž…๋ ฅ๊ณผ ์ถœ๋ ฅ์˜ ๊ด€๊ณ„๋ฅผ ์ •์˜ํ•˜๋Š” ๋งค๊ฐœ๋ณ€์ˆ˜(Weights)๋ฅผ ๋ฐ์ดํ„ฐ์™€์˜ ์˜ค์ฐจ๋ฅผ ์ค„์—ฌ๊ฐ€๋ฉฐ ์ตœ์ ํ™”ํ•˜๋Š” ๊ณผ์ •์„ ํ†ตํ•ด, ํ”„๋กœ๊ทธ๋ž˜๋จธ๊ฐ€ ์ธ์ง€ํ•˜์ง€ ๋ชปํ•œ ๋ณต์žกํ•œ ๋…ผ๋ฆฌ๋ฅผ ์ถ”์ถœํ•˜๋Š” ํ•™์Šต ํŒจํ„ด. - **3๋Œ€ ์ฃผ์š” ๋ฒ”์ฃผ:** - **Supervised Learning (์ง€๋„ ํ•™์Šต):** ์ •๋‹ต(Label)์ด ์žˆ๋Š” ๋ฐ์ดํ„ฐ๋ฅผ ํ†ตํ•ด ์ž…๋ ฅ-์ถœ๋ ฅ ๋งคํ•‘ ํ•™์Šต. (๋ถ„๋ฅ˜, ํšŒ๊ท€) - **Unsupervised Learning (๋น„์ง€๋„ ํ•™์Šต):** ์ •๋‹ต ์—†์ด ๋ฐ์ดํ„ฐ ์ž์ฒด์˜ ๊ตฌ์กฐ๋‚˜ ํŒจํ„ด ํƒ์ƒ‰. (๊ตฐ์ง‘, ์ฐจ์› ์ถ•์†Œ) - **Reinforcement Learning (๊ฐ•ํ™” ํ•™์Šต):** ์‹œํ–‰์ฐฉ์˜ค๋ฅผ ํ†ตํ•œ ๋ณด์ƒ(Reward) ๊ทน๋Œ€ํ™” ์ „๋žต ํ•™์Šต. - **ํ•ต์‹ฌ ํ”„๋กœ์„ธ์Šค:** ๋ฐ์ดํ„ฐ ์ˆ˜์ง‘ -> ์ „์ฒ˜๋ฆฌ -> ๋ชจ๋ธ ์„ ํƒ -> ํ•™์Šต -> ํ‰๊ฐ€ -> ๋ฐฐํฌ ๋ฐ ๋ชจ๋‹ˆํ„ฐ๋ง. - **์˜์˜:** ๊ธฐ์กด ์†Œํ”„ํŠธ์›จ์–ด๋กœ๋Š” ํ•ด๊ฒฐ ๋ถˆ๊ฐ€๋Šฅํ–ˆ๋˜ ์ด๋ฏธ์ง€ ์ธ์‹, ์–ธ์–ด ๋ฒˆ์—ญ, ์ž์œจ ์ฃผํ–‰ ๋“ฑ์˜ ๋ณต์žกํ•œ ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๋Š” ํ˜„๋Œ€ ์ง€๋Šฅํ˜• ์‹œ์Šคํ…œ์˜ ๋ฌผ๋ฆฌ์  ๊ธฐ์ดˆ. ## โš ๏ธ ๋ชจ์ˆœ ๋ฐ ์—…๋ฐ์ดํŠธ (Contradictions & RL Update) - **๊ณผ๊ฑฐ ๋ฐ์ดํ„ฐ์™€์˜ ์ถฉ๋Œ:** ๊ทœ์น™ ๊ธฐ๋ฐ˜ AI(Expert Systems)์˜ ๊ฒฝ์ง์„ฑ์„ ๊ทน๋ณตํ–ˆ์œผ๋‚˜, ์ด์ œ๋Š” ํ•™์Šต ๋ฐ์ดํ„ฐ์˜ ํŽธํ–ฅ(Bias) ๋ฌธ์ œ์™€ ๋ชจ๋ธ์˜ ๋ธ”๋ž™๋ฐ•์Šค ํŠน์„ฑ์„ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•œ ์„ค๋ช… ๊ฐ€๋Šฅ์„ฑ(XAI) ์—ฐ๊ตฌ๊ฐ€ ํ•„์ˆ˜์ ์ธ ๋ณด์™„์ฑ…์œผ๋กœ ๋Œ€๋‘๋จ. - **์ •์ฑ… ๋ณ€ํ™”:** Antigravity ํ”„๋กœ์ ํŠธ๋Š” ๋ชจ๋“  ์ง€๋Šฅํ˜• ๊ธฐ๋Šฅ์„ ๊ตฌํ˜„ํ•  ๋•Œ, ๊ทœ์น™ ๊ธฐ๋ฐ˜์˜ ๊ฐ•๊ฑดํ•จ๊ณผ ๋จธ์‹ ๋Ÿฌ๋‹ ๊ธฐ๋ฐ˜์˜ ์œ ์—ฐํ•จ์„ ๊ฒฐํ•ฉํ•œ 'ํ•˜์ด๋ธŒ๋ฆฌ๋“œ ์ง€๋Šฅ' ์•„ํ‚คํ…์ฒ˜๋ฅผ ์ง€ํ–ฅํ•จ. ## ๐Ÿ”— ์ง€์‹ ์—ฐ๊ฒฐ (Graph) - [[Supervised-Learning-Foundations|Supervised-Learning-Foundations]], Unsupervised-Learning-Foundations, [[Reinforcement-Learning|Reinforcement-Learning]], [[Loss-Functions-Foundations|Loss-Functions-Foundations]], [[Explainable-AI-XAI|Explainable-AI-XAI]] - **Raw Source:** 10_Wiki/Topics/AI/Machine-Learning-Foundations.md