--- id: P-REINFORCE-AUTO-EVAL-001 category: "10_Wiki/πŸ’‘ Topics/AI" confidence_score: 0.94 tags: [auto-reinforced, evolutionary-algorithms, genetic-algorithms, optimization, bio-inspired, search] last_reinforced: 2026-04-20 --- # [[Evolutionary-Algorithms|Evolutionary-Algorithms]] ## πŸ“Œ ν•œ 쀄 톡찰 (The Karpathy Summary) > "μ½”λ“œλ‘œ κ΅¬ν˜„ν•œ μ μžμƒμ‘΄: 생물학적 μ§„ν™” 과정을 λͺ¨λ°©ν•˜μ—¬, μˆ˜λ§Žμ€ 해법(개체) 쀑 μ„±λŠ₯이 쒋은 κ²ƒλ“€λ§Œ 골라 ꡐ배(Crossover)ν•˜κ³  변이(Mutation)μ‹œμΌœ μ„ΈλŒ€λ₯Ό κ±°λ“­ν• μˆ˜λ‘ 점점 더 μ™„λ²½ν•œ 정닡에 κ°€κΉŒμ›Œμ§€λŠ” μžκ°€ μ΅œμ ν™” μ•Œκ³ λ¦¬μ¦˜." ## πŸ“– κ΅¬μ‘°ν™”λœ 지식 (Synthesized Content) μ§„ν™” μ•Œκ³ λ¦¬μ¦˜(Evolutionary-Algorithms)은 μžμ—° 선택 섀에 κΈ°λ°˜ν•œ ν™•λ₯ μ  μ΅œμ ν™” 탐색 κΈ°λ²•μž…λ‹ˆλ‹€. 1. **μ£Όμš” ν”„λ‘œμ„ΈμŠ€**: * **Initialization**: λ¬΄μž‘μœ„ μ†”λ£¨μ…˜ μ§‘ν•© 생성. * **Fitness Evaluation**: 각 μ†”λ£¨μ…˜μ΄ μ–Όλ§ˆλ‚˜ 문제λ₯Ό 잘 ν‘ΈλŠ”μ§€ 평가. * **Selection**: 성적이 쒋은 μƒμœ„ 개체 선택. * **Reproduction (Crossover & Mutation)**: λΆ€λͺ¨ 개체의 μž₯점을 μ„žκ±°λ‚˜ μš°μ—°ν•œ λ³€ν™”λ₯Ό μ£Όμ–΄ μƒˆλ‘œμš΄ μžμ† 생성. * **Iteration**: 졜적의 κ²°κ³Όκ°€ λ‚˜μ˜¬ λ•ŒκΉŒμ§€ λ¬΄ν•œ 반볡. 2. **μ™œ μ€‘μš”ν•œκ°€?**: * μˆ˜ν•™μ μœΌλ‘œ λ―ΈλΆ„ λΆˆκ°€λŠ₯ν•˜κ±°λ‚˜ κ·œμΉ™μ΄ λ³΅μž‘ν•˜μ—¬ 전톡적 λ°©μ‹μœΌλ‘œ ν’€κΈ° μ–΄λ €μš΄ κ±°λŒ€ μ‘°ν•© μ΅œμ ν™” λ¬Έμ œμ— κ°•λ ₯함. (Combinatorial-Optimizationκ³Ό λ°€μ ‘) ## ⚠️ λͺ¨μˆœ 및 μ—…λ°μ΄νŠΈ (Contradictions & RL Update) - **κ³Όκ±° λ°μ΄ν„°μ™€μ˜ 좩돌**: κ³Όκ±°μ—λŠ” μ—°μ‚° 속도가 λ„ˆλ¬΄ 느렀 μ‹€μš©μ„±μ΄ λ–¨μ–΄μ§„λ‹€λŠ” 정책이 λ§Žμ•˜μœΌλ‚˜, ν˜„λŒ€ 정책은 κ°•λ ₯ν•œ GPU μ—°μ‚°κ³Ό κ²°ν•©ν•˜μ—¬ AI 신경망 ꡬ쑰 자체λ₯Ό μ§„ν™”μ‹œν‚€λŠ” 'μ‹ κ²½ μ§„ν™”(Neuroevolution) μ •μ±…'으둜 λΆ€ν™œν•¨(RL Update). - **μ •μ±… λ³€ν™”(RL Update)**: κ°•ν™”ν•™μŠ΅μ˜ κ·Έλž˜λ””μ–ΈνŠΈ 방식이 λ§‰νžˆλŠ” λ³΅μž‘ν•œ ν™˜κ²½μ—μ„œ, μ§„ν™” μ•Œκ³ λ¦¬μ¦˜μ„ ν†΅ν•œ 'μ—μ΄μ „νŠΈ λͺ¨μ§‘단 ν•™μŠ΅ μ •μ±…'이 더 κ°•κ±΄ν•œ 인곡지λŠ₯을 λ§Œλ“œλŠ” λŒ€μ•ˆ μ •μ±…μœΌλ‘œ μ—°κ΅¬λ˜κ³  있음. ## πŸ”— 지식 μ—°κ²° (Graph) - [[Optimization|Optimization]], [[Combinatorial-Optimization|Combinatorial-Optimization]], [[Genetic-Algorithms|Genetic-Algorithms]], [[Complexity Theory|Complexity Theory]], [[Emergence|Emergence]] - **Modern Tech/Tools**: Neuroevolution (NEAT), CMA-ES, Evolutionary Strategies (ES). ---