--- id: AI-PHASE-001 category: "10_Wiki/πŸ’‘ Topics/AI" confidence_score: 1.0 tags: [ai, deep-learning, phase-transitions, learning-dynamics, emergent-abilities, grokking] last_reinforced: 2026-04-26 --- # Phase Transitions in Learning (ν•™μŠ΅μ—μ„œμ˜ 상전이 ν˜„μƒ) ## πŸ“Œ ν•œ 쀄 톡찰 (The Karpathy Summary) > "μ§€λ£¨ν•œ 정체기 끝에 κ°‘μž‘μŠ€λŸ¬μš΄ κΉ¨λ‹¬μŒ(Grokking)이 μ°Ύμ•„μ˜€λ“―, λͺ¨λΈμ˜ μ§€λŠ₯은 μ„ ν˜•μ μΈ μ„±μž₯이 μ•„λ‹Œ 폭발적인 '상전이'λ₯Ό 톡해 λ„μ•½ν•œλ‹€" β€” ν•™μŠ΅ κ³Όμ •μ—μ„œ 손싀 ν•¨μˆ˜κ°€ μ™„λ§Œν•˜κ²Œ 쀄어듀닀가 νŠΉμ • μž„κ³„μ μ—μ„œ λͺ¨λΈμ˜ λ‚΄λΆ€ κ΅¬μ‘°λ‚˜ μΌλ°˜ν™” λŠ₯λ ₯이 κΈ‰κ²©νžˆ λ³€ν™”ν•˜λŠ” ν˜„μƒ. ## πŸ“– κ΅¬μ‘°ν™”λœ 지식 (Synthesized Content) - **μΆ”μΆœλœ νŒ¨ν„΄:** "Abrupt Structural and Functional Transformation" β€” 물리적 상전이(μ–ΌμŒμ΄ 물이 λ˜λŠ” 것)와 μœ μ‚¬ν•˜κ²Œ, 신경망이 λ¬΄μž‘μœ„μ μΈ μƒνƒœμ—μ„œ μ§ˆμ„œ μžˆλŠ” λ‚΄λΆ€ ν‘œμƒ(Representation)을 ν˜•μ„±ν•˜κ±°λ‚˜, νŠΉμ • 규λͺ¨ μ΄μƒμ˜ 데이터/νŒŒλΌλ―Έν„°μ—μ„œ '창발적 λŠ₯λ ₯(Emergent Abilities)'을 νšλ“ν•˜λŠ” νŒ¨ν„΄. - **μ£Όμš” ν˜„μƒ:** - **Grokking:** ν•™μŠ΅ 데이터λ₯Ό λ‹€ μ™Έμš΄(Overfitting) 이후에도 ν•œμ°Έ 더 ν•™μŠ΅μ‹œμΌ°μ„ λ•Œ, κ°‘μžκΈ° μΌλ°˜ν™” μ„±λŠ₯이 κΈ‰μƒμŠΉν•˜λŠ” ν˜„μƒ. - **Scaling Laws:** λͺ¨λΈ ν¬κΈ°λ‚˜ μ—°μ‚°λŸ‰μ΄ μž„κ³„μΉ˜λ₯Ό λ„˜μ„ λ•Œ μΆ”λ‘  λŠ₯λ ₯이 λΉ„μ•½μ μœΌλ‘œ λ°œμ „. - **Double Descent:** λͺ¨λΈ λ³΅μž‘λ„κ°€ 증가함에 따라 ν…ŒμŠ€νŠΈ μ˜€μ°¨κ°€ κ°μ†Œν•˜λ‹€ μ¦κ°€ν•˜κ³ , λ‹€μ‹œ κ°μ†Œν•˜λŠ” ν˜„μƒ. - **의의:** AI ν•™μŠ΅μ„ λ‹¨μˆœνžˆ 였차λ₯Ό μ€„μ΄λŠ” 과정이 μ•„λ‹Œ, μ§€λŠ₯이 ν˜•μ„±λ˜λŠ” 동역학적 'μ§„ν™”'의 κ³Όμ •μœΌλ‘œ μ΄ν•΄ν•˜κ²Œ ν•˜λ©°, μ΄ˆκ±°λŒ€ λͺ¨λΈμ˜ 잠재λ ₯을 μ˜ˆμΈ‘ν•˜λŠ” μ§€ν‘œκ°€ 됨. ## ⚠️ λͺ¨μˆœ 및 μ—…λ°μ΄νŠΈ (Contradictions & RL Update) - **κ³Όκ±° λ°μ΄ν„°μ™€μ˜ 좩돌:** ν•™μŠ΅μ΄ 였래될수둝 무쑰건 μ„±λŠ₯이 λ‚˜λΉ μ§„λ‹€λŠ” 초기 과적합 이둠을 μ •λ©΄μœΌλ‘œ λ°˜λ°•ν•˜λ©°, ν˜„λŒ€ λ”₯λŸ¬λ‹μ—μ„œλŠ” '상전이'λ₯Ό μœ λ„ν•˜κΈ° μœ„ν•œ μΆ©λΆ„ν•œ κ³Όμž‰ ν•™μŠ΅(Over-training)의 κ°€μΉ˜κ°€ 재발견됨. - **μ •μ±… λ³€ν™”:** Antigravity ν”„λ‘œμ νŠΈλŠ” μ—μ΄μ „νŠΈμ˜ μƒˆλ‘œμš΄ μŠ€ν‚¬ ν•™μŠ΅ μ‹œ, λ‹¨μˆœ 수렴 지점을 λ„˜μ–΄ 상전이 ν˜„μƒμ΄ λ°œμƒν•˜λŠ” 'κΉŠμ€ ν•™μŠ΅' κ΅¬κ°„κΉŒμ§€ λͺ¨λ‹ˆν„°λ§ν•˜μ—¬ 졜적의 톡찰 μˆ˜μ€€μ„ 확보함. ## πŸ”— 지식 μ—°κ²° (Graph) - [[Overfitting-and-Underfitting|Overfitting-and-Underfitting]], Deep-Learning-Foundations, Emergent-Abilities-in-LLM, [[Optimization-in-AI|Optimization-in-AI]] - **Raw Source:** 10_Wiki/Topics/AI/Phase-Transitions-in-Learning.md