--- id: DL-PARAM-SHARE-001 category: "10_Wiki/πŸ’‘ Topics/AI" confidence_score: 1.0 tags: [ai, deep-learning, parameter-sharing, cnn, rnn, weight-tying, efficiency] last_reinforced: 2026-04-26 --- # Parameter Sharing (νŒŒλΌλ―Έν„° 곡유) ## πŸ“Œ ν•œ 쀄 톡찰 (The Karpathy Summary) > "λ°μ΄ν„°μ˜ μœ„μΉ˜λ‚˜ μ‹œμ μ— 상관없이 λ™μΌν•œ 'νŠΉμ§• μΆ”μΆœκΈ°'λ₯Ό 반볡 μ‚¬μš©ν•˜μ—¬, λͺ¨λΈμ˜ λ©μΉ˜λŠ” 쀄이고 μ§€λŠ₯의 λ³΄νŽΈμ„±μ€ 높여라" β€” μ‹ κ²½λ§μ˜ μ„œλ‘œ λ‹€λ₯Έ λΆ€λΆ„μ—μ„œ λ™μΌν•œ κ°€μ€‘μΉ˜(Weight)λ₯Ό κ³΅μœ ν•¨μœΌλ‘œμ¨ ν•™μŠ΅ν•΄μ•Ό ν•  νŒŒλΌλ―Έν„° 수λ₯Ό 획기적으둜 쀄이고 μΌλ°˜ν™” μ„±λŠ₯을 λ†’μ΄λŠ” 기법. ## πŸ“– κ΅¬μ‘°ν™”λœ 지식 (Synthesized Content) - **μΆ”μΆœλœ νŒ¨ν„΄:** "Structural Symmetry and Translation Invariance" β€” μ΄λ―Έμ§€λŠ” μ–΄λŠ μœ„μΉ˜μ—μ„œλ“  같은 ν•„ν„°λ‘œ νŠΉμ§•μ„ 뽑을 수 있고(CNN), λ¬Έμž₯은 μ–΄λŠ μ‹œμ μ—μ„œλ“  같은 λ…Όλ¦¬λ‘œ λ‹€μŒμ„ μ˜ˆμΈ‘ν•  수 μžˆλ‹€(RNN)λŠ” ꡬ쑰적 가정을 λ°”νƒ•μœΌλ‘œ κ°€μ€‘μΉ˜λ₯Ό λ¬Άμ–΄λ²„λ¦¬λŠ”(Weight Tying) νŒ¨ν„΄. - **μ£Όμš” 적용 사둀:** - **CNN (Convolutional Neural Networks):** ν•˜λ‚˜μ˜ ν•„ν„°(컀널)κ°€ 이미지 전체λ₯Ό ν›‘μœΌλ©° λ™μΌν•œ κ°€μ€‘μΉ˜λ‘œ μ—°μ‚°. 곡간적 λΆˆλ³€μ„± 확보. - **RNN (Recurrent Neural Networks):** λ§€ μ‹œκ°„ 단계(Time step)λ§ˆλ‹€ λ™μΌν•œ 전이 행렬을 μ‚¬μš©ν•˜μ—¬ μ‹œν€€μŠ€ 데이터 처리. - **Siamese Networks:** 두 개의 μž…λ ₯을 μ •ν™•νžˆ λ™μΌν•œ κ°€μ€‘μΉ˜λ₯Ό κ°€μ§„ λ„€νŠΈμ›Œν¬μ— ν†΅κ³Όμ‹œμΌœ 비ꡐ. - **의의:** 과적합(Overfitting)을 λ°©μ§€ν•˜κ³  λ©”λͺ¨λ¦¬ μ‚¬μš©λŸ‰μ„ μ ˆκ°ν•˜λ©°, λ°μ΄ν„°μ˜ λŒ€μΉ­μ„±μ΄λ‚˜ λ°˜λ³΅λ˜λŠ” νŒ¨ν„΄μ„ ν¬μ°©ν•˜λŠ” 데 μ΅œμ ν™”λœ 도ꡬ. ## ⚠️ λͺ¨μˆœ 및 μ—…λ°μ΄νŠΈ (Contradictions & RL Update) - **κ³Όκ±° λ°μ΄ν„°μ™€μ˜ 좩돌:** λͺ¨λ“  νŒŒλΌλ―Έν„°κ°€ μžμœ λ‘œμ›Œμ•Ό 더 λ˜‘λ˜‘ν•  κ²ƒμ΄λΌλŠ” 초기 직관을 κΉ¨κ³ , 였히렀 νŒŒλΌλ―Έν„°λ₯Ό κ°•μ œμ μœΌλ‘œ κ³΅μœ ν–ˆμ„ λ•Œ λͺ¨λΈμ΄ λ°μ΄ν„°μ˜ 핡심적인 λΆˆλ³€ νŠΉμ§•(Invariant features)을 더 잘 λ°°μš΄λ‹€λŠ” 사싀이 λ”₯λŸ¬λ‹μ˜ 폭발적 μ„±μž₯을 μ΄λŒμ—ˆμŒ. - **μ •μ±… λ³€ν™”:** Antigravity ν”„λ‘œμ νŠΈλŠ” λ©€ν‹°λͺ¨λ‹¬ μ—μ΄μ „νŠΈ 섀계 μ‹œ, μ„œλ‘œ λ‹€λ₯Έ μž…λ ₯(이미지, ν…μŠ€νŠΈ)μ—μ„œ κ³΅ν†΅λœ 의미 곡간을 μΆ”μΆœν•˜κΈ° μœ„ν•΄ 곡유된 κ°€μ€‘μΉ˜ 측을 ν™œμš©ν•˜λŠ” μž„λ² λ”© μ•„ν‚€ν…μ²˜λ₯Ό μ μš©ν•¨. ## πŸ”— 지식 μ—°κ²° (Graph) - Convolutional-Neural-Networks-CNN, Recurrent-Neural-Networks-RNN, [[Overfitting-and-Underfitting]], [[One-Shot-Learning]] - **Raw Source:** 10_Wiki/Topics/AI/Parameter-Sharing.md