--- id: [[P-Reinforce|P-Reinforce]]-AUTO-SEOP-001 category: Dev confidence_score: 0.95 tags: [auto-reinforced, [[Search|Search]]-[[Optimization|Optimization]], algorithms, pathfinding, [[Heuristic-Search|Heuristic-Search]], [[Efficiency|Efficiency]]] last_reinforced: 2026-04-20 --- # [[Search-Optimization|Search-Optimization]] ## πŸ“Œ ν•œ 쀄 톡찰 (The Karpathy Summary) > "μ΅œλ‹¨ 경둜λ₯Ό ν–₯ν•œ λμ—†λŠ” 탐색: μˆ˜μ—†μ΄ λ§Žμ€ μ„ νƒμ§€μ˜ 숲([[State|State]] Space)μ—μ„œ, λͺ©ν‘œ μ§€μ κΉŒμ§€μ˜ λΉ„μš©μ„ μ΅œμ†Œν™”ν•˜κΈ° μœ„ν•΄ νœ΄λ¦¬μŠ€ν‹±(Heuristic)μ΄λΌλŠ” λ‚˜μΉ¨λ°˜μ„ λ“€κ³  κ°€μž₯ μœ λ§ν•œ λ°©ν–₯으둜 λ°œμ„ λ“€μ΄λŠ” μ˜λ¦¬ν•œ κΈΈ μ°ΎκΈ°." ## πŸ“– κ΅¬μ‘°ν™”λœ 지식 (Synthesized Content) 탐색 μ΅œμ ν™”(Search-Optimization)λŠ” 문제의 해닡을 μ°ΎκΈ° μœ„ν•΄ κ°€λŠ₯ν•œ λͺ¨λ“  μƒνƒœλ₯Ό 효율적으둜 μ‘°μ‚¬ν•˜λŠ” κΈ°λ²•μž…λ‹ˆλ‹€. (Grail-Search적 관점 포함) 1. **μ£Όμš” μ•Œκ³ λ¦¬μ¦˜**: * **Uninformed Search**: 정보 없이 λ‹€ λ’€μ§€λŠ” 방식 (BFS, DFS). ([[Brute-force|Brute-force]]와 μ—°κ²°) * **Informed Search (Heuristic)**: λͺ©ν‘œκΉŒμ§€ 남은 거리λ₯Ό 'μΆ”μ •'ν•΄μ„œ 탐색 (A* Algorithm). * **Local Search**: ν˜„μž¬λ³΄λ‹€ λ‚˜μ€ μ£Όλ³€μœΌλ‘œλ§Œ 이동 (Hill Climbing, Simulated Annealing). 2. **μ™œ μ€‘μš”ν•œκ°€?**: * κ²Œμž„ AI의 경둜 μ°ΎκΈ°, λ¬Όλ₯˜ 배솑 μ΅œμ ν™”, 퍼즐 풀이, 그리고 μ‹ κ²½λ§μ˜ κ°€μ€‘μΉ˜λ₯Ό μ°ΎλŠ” κ³Όμ •([[Gradient-Descent|Gradient-Descent]]) μžμ²΄κ°€ κ±°λŒ€ν•œ 탐색 μ΅œμ ν™” λ¬Έμ œμž„. ## ⚠️ λͺ¨μˆœ 및 μ—…λ°μ΄νŠΈ (Contradictions & RL Update) - **κ³Όκ±° λ°μ΄ν„°μ™€μ˜ 좩돌**: κ³Όκ±°μ—λŠ” 'μ™„μ „ 탐색 μ •μ±…'으둜 정닡을 보μž₯ν•˜λ € ν–ˆμœΌλ‚˜, ν˜„λŒ€ 정책은 정닡보닀 'μΆ©λΆ„νžˆ 쒋은 ν•΄ μ •μ±…(Satisficing)'을 μ œν•œλœ μ‹œκ°„ 내에 μ°ΎλŠ” νš¨μœ¨μ„± 정책을 μš°μ„ μ‹œν•¨(RL Update). ([[Bounded-Rationality|Bounded-Rationality]]와 μ—°κ²°) - **μ •μ±… λ³€ν™”(RL Update)**: κ±°λŒ€ λͺ¨λΈμ˜ μΆ”λ‘  μ •μ±…μ—μ„œ, μˆ˜λ§Žμ€ λ‹΅λ³€ 후보 쀑 κ°€μž₯ 논리적인 경둜λ₯Ό νƒμƒ‰ν•˜λŠ” 'MCTS(Monte Carlo Tree Search)' 기반의 사고 흐름 탐색 정책이 μƒˆλ‘œμš΄ μ„±λŠ₯ ν–₯μƒμ˜ λŒνŒŒκ΅¬κ°€ 됨. ## πŸ”— 지식 μ—°κ²° (Graph) - [[Brute-force|Brute-force]], [[Optimization|Optimization]], [[Heuristics|Heuristics]], [[Combinatorial-Optimization|Combinatorial-Optimization]], [[Gradient-Descent|Gradient-Descent]] - **Modern Tech/Tools**: A* Search, MCTS, Beam Search (in NLP), AlphaGo's search engine. ---