--- id: P-REINFORCE-AUTO-SSLE-001 category: "[[10_Wiki/๐Ÿ’ก Topics/AI]]" confidence_score: 0.97 tags: [auto-reinforced, self-supervised-learning, labels, pre-training, deep-learning, representation-learning] last_reinforced: 2026-04-20 --- # [[Self-Supervised-Learning]] ## ๐Ÿ“Œ ํ•œ ์ค„ ํ†ต์ฐฐ (The Karpathy Summary) > "์Šค์Šน ์—†๋Š” ํ•™์Šต์˜ ๊ธฐ์ : ์‚ฌ๋žŒ์ด ์ผ์ผ์ด ์ •๋‹ต(Label)์„ ๋‹ฌ์•„์ฃผ์ง€ ์•Š์•„๋„, ๋ฐ์ดํ„ฐ ์Šค์Šค๋กœ์˜ ์ผ๋ถ€๋ฅผ ๊ฐ€๋ฆฌ๊ณ (Masking) ๋‚˜๋จธ์ง€๋กœ ๋งž์ถ”๋Š” ๊ณผ์ •์„ ํ†ตํ•ด ์„ธ์ƒ์˜ ํŒจํ„ด์„ ํ†ต๋‹ฌํ•ด ๋‚ด๋Š” ๋ฐ์ดํ„ฐ ํšจ์œจ์„ฑ์˜ ๊ทน์น˜์ด์ž ํ˜„๋Œ€ ๊ฑฐ๋Œ€ ๋ชจ๋ธ(LLM)์˜ ์‹ฌ์žฅ." ## ๐Ÿ“– ๊ตฌ์กฐํ™”๋œ ์ง€์‹ (Synthesized Content) ์ž๊ธฐ ์ง€๋„ ํ•™์Šต(Self-Supervised-Learning)์€ ๋ ˆ์ด๋ธ”์ด ์—†๋Š” ๋ฐ์ดํ„ฐ์—์„œ ์ž์ฒด์ ์œผ๋กœ ๋ ˆ์ด๋ธ”์„ ์ƒ์„ฑํ•˜์—ฌ ํ•™์Šตํ•˜๋Š” ๋จธ์‹ ๋Ÿฌ๋‹ ๊ธฐ๋ฒ•์ž…๋‹ˆ๋‹ค. 1. **ํ•ต์‹ฌ ๋ฉ”์ปค๋‹ˆ์ฆ˜ (Pretext Tasks)**: * **Masking**: ๋ฌธ์žฅ์˜ ์ค‘๊ฐ„ ๋‹จ์–ด๋ฅผ ๊ฐ€๋ฆฌ๊ณ  ๋ฌธ๋งฅ์œผ๋กœ ๋งž์ถ”๊ธฐ (BERT ์Šคํƒ€์ผ). * **Prediction**: ๋‹ค์Œ ๋‹จ์–ด๋‚˜ ํ”„๋ ˆ์ž„์„ ์˜ˆ์ธกํ•˜๊ธฐ (GPT ์Šคํƒ€์ผ). * **Contrastive Learning**: ๊ฐ™์€ ์ด๋ฏธ์ง€ ๋ณ€ํ˜•๋ณธ๋ผ๋ฆฌ๋Š” ๊ฐ€๊น๊ฒŒ, ๋‹ค๋ฅธ ์ด๋ฏธ์ง€์™€๋Š” ๋ฉ€๊ฒŒ ๋ฐฐ์น˜ํ•˜๊ธฐ. (Representation-Learning์™€ ์—ฐ๊ฒฐ) 2. **์™œ ์ค‘์š”ํ•œ๊ฐ€?**: * ์ธํ„ฐ๋„ท์˜ ๋ฐฉ๋Œ€ํ•œ ๋‚ ๊ฒƒ์˜ ๋ฐ์ดํ„ฐ(Text, Image)๋ฅผ ์ •๋‹ต์ง€ ์ž‘์—… ์—†์ด ํ†ต์งธ๋กœ ๋จน์ผ ์ˆ˜ ์žˆ์–ด, ์ง€๋Šฅ์˜ ๊ทœ๋ชจ๋ฅผ ์ธ๊ฐ„์˜ ํ•œ๊ณ„๋ฅผ ๋„˜์–ด ๋ฌดํ•œํžˆ ํ‚ค์šธ ์ˆ˜ ์žˆ๊ธฐ ๋•Œ๋ฌธ์ž„. ## โš ๏ธ ๋ชจ์ˆœ ๋ฐ ์—…๋ฐ์ดํŠธ (Contradictions & RL Update) - **๊ณผ๊ฑฐ ๋ฐ์ดํ„ฐ์™€์˜ ์ถฉ๋Œ**: ๊ณผ๊ฑฐ์—๋Š” ์‚ฌ๋žŒ์ด ์ •๋‹ต์„ ์ค€ ๊ฒƒ๋งŒ ๋ฐฐ์šฐ๋Š” ์ง€๋„ ํ•™์Šต(Supervised)์ด ์ตœ๊ณ ์˜€์œผ๋‚˜, ํ˜„๋Œ€ ์ •์ฑ…์€ ์ง€๋„ ํ•™์Šต์„ '๋ฏธ์„ธ ์กฐ์ •'์šฉ ๋ณด์กฐ ์ •์ฑ…์œผ๋กœ ๋ฐ€์–ด๋‚ด๊ณ  ์ž๊ธฐ ์ง€๋„ ํ•™์Šต ์ •์ฑ…์ด '๊ธฐ์ดˆ ์ฒด๋ ฅ(Foundation)'์„ ๋งŒ๋“œ๋Š” ์ฃผ๋ฅ˜ ์ •์ฑ…์ด ๋จ(RL Update). - **์ •์ฑ… ๋ณ€ํ™”(RL Update)**: ๋‹จ์ˆœํžˆ ํ…์ŠคํŠธ ์˜ˆ์ธก ์ •์ฑ…์„ ๋„˜์–ด, ์„ธ์ƒ์„ ๋ฌผ๋ฆฌํ•™์ ์œผ๋กœ ์ดํ•ดํ•˜๋Š” '๋น„๋””์˜ค ์ƒ์„ฑ ๋ชจ๋ธ'์—์„œ์˜ ์ž๊ธฐ ์ง€๋„ ํ•™์Šต ์ •์ฑ…์ด ์ฐจ์„ธ๋Œ€ AI์˜ ํ•ต์‹ฌ ์ „์žฅ ์ •์ฑ…์ž„. ## ๐Ÿ”— ์ง€์‹ ์—ฐ๊ฒฐ (Graph) - [[Representation-Learning]], [[Deep Learning (DL)]], [[Machine Learning (ML)]], [[Optimization]], [[Efficiency]] - **Modern Tech/Tools**: BERT, GPT, DINO, SimCLR, MAE (Masked Autoencoders). ---