[G1-Sync] Manual knowledge update

2026-05-10 22:08:15 +09:00
parent 21ac3ed255
commit 504fd5fb42
3011 changed files with 380280 additions and 206977 deletions
@@ -2,98 +2,197 @@
 id: wiki-2026-0508-몬테카를로-시뮬레이션-monte-carlo-simulati
 title: 몬테카를로 시뮬레이션(Monte Carlo Simulation)
 category: 10_Wiki/Topics
-status: needs_review
+status: verified
 canonical_id: self
-aliases: []
+aliases: [Monte Carlo, MC Simulation, 몬테카를로법, Random Sampling]
 duplicate_of: none
 source_trust_level: A
-confidence_score: 0.92
-tags: [uncategorized]
+confidence_score: 0.9
+verification_status: applied
+tags: [simulation, statistics, numerical-methods, probability]
 raw_sources: []
-last_reinforced: 2026-05-08
+last_reinforced: 2026-05-10
 github_commit: pending
-inferred_by: Claude Opus 4.7 (auto-normalize 2026-05-08)
 tech_stack:
-  language: unspecified
-  framework: unspecified
+  language: python
+  framework: numpy
 ---

 # 몬테카를로 시뮬레이션(Monte Carlo Simulation)

-## 📌 한 줄 통찰 (The Karpathy Summary)
-몬테카를로 시뮬레이션은 본질적인 불확실성을 가진 요소에 다양한 값의 범위를 대입하여 가능한 결과 모델을 구축하는 컴퓨터 기반의 수학적 기법입니다[1]. 게임 경제 설계에서 이 기법은 실제 플레이어 기반이 만들어내는 무작위성과 변동성을 시뮬레이션에 반영하기 위해 사용됩니다[2, 3]. 단순한 확률이나 수학적 평균에 의존하는 대신 수많은 가상 플레이어 여정을 반복 샘플링하여, 게임 디자이너가 경제 밸런스를 정확하게 예측하고 최적화할 수 있도록 돕는 핵심 도구입니다[4-6].
+## 매 한 줄
+> **"매 random sampling 의 deterministic answer 추정"**. Stanislaw Ulam (Manhattan Project, 1946)이 neutron diffusion 위해 고안. 매 closed-form solution 없는 high-dim integral, optimization, risk modeling 의 해법 — 매 large-N law of large numbers 의 convergence.

-## 📖 구조화된 지식 (Synthesized Content)
-* **정의 및 작동 원리**
-  몬테카를로 시뮬레이션은 미지의 매개변수에 대한 수치적 추정치를 만들기 위해 반복적인 무작위 샘플링 과정을 사용하는 기법입니다[4]. 이 방식은 시행 횟수가 많아질수록 결과의 평균이 기댓값에 가까워진다는 '대수의 법칙(Law of Large Numbers)'을 기반으로 합니다[4]. 모나코의 유명한 카지노 이름에서 유래한 이 명칭은 룰렛이나 두 개의 주사위를 굴릴 때의 확률을 계산하는 것과 같이 근본적으로 무작위성을 띠는 결과를 예측하는 데 유용하다는 특징을 반영합니다[1].
+## 매 핵심

-* **게임 설계에서의 무작위성 반영**
-  실제 플레이어들은 개인적인 선호도, 역할 수행(role-playing) 선택, 좋아하는 캐릭터 등 다양한 편향(biases)을 가지고 있어 수학적으로 최적화된 패턴으로만 움직이지 않습니다[2]. 따라서 단순한 평균값만으로는 실제 플레이어의 행동을 예측하는 데 실패하는 경우가 많습니다[2]. 몬테카를로 시뮬레이션은 이러한 무작위성(randomness)을 방정식에 다시 도입하여, 단순한 평균보다 실제 현실에 훨씬 더 가까운 결과를 제공합니다[2, 7]. 예를 들어, 특정 지점에서 실패할 확률이 50%일 때 "절반의 플레이어가 실패한다"고 단순하게 단정하는 대신, 어떤 플레이어는 일찍 실패하고 어떤 플레이어는 늦게 실패하는 등 전체 결과의 스펙트럼(full spectrum of outcomes)을 세밀하게 보여줍니다[5].
+### 매 4단계
+1. **Define domain** of possible inputs (probability distribution).
+2. **Sample randomly** from domain (PRNG).
+3. **Compute deterministically** for each sample.
+4. **Aggregate** results (mean, variance, percentiles).

-* **경제 균형(Balance) 및 시스템 안정화**
-  이 시뮬레이션은 긴 기간에 걸쳐 다양한 유형의 플레이어에 맞게 게임의 균형을 맞추는 데 핵심적인 역할을 합니다[3]. 수만 번의 가상 플레이어 여정을 실행하여 특정 구간에서 재화가 부족해지거나 너무 많아지는 시점을 포착할 수 있습니다[6]. 이러한 높은 정확도는 새로운 콘텐츠가 게임 내 자원의 생성(Taps)과 소모(Sinks)의 균형을 붕괴시키지 않도록 사전에 테스트하는 것을 가능하게 합니다[3]. 또한, AI 기반의 보상 스케일링 환경에서도 포인트 대 가치(points-to-value) 비율이 안정적으로 유지되도록 보장합니다[8].
+### 매 수학 기반
+- **Law of Large Numbers**: sample mean → expected value as N → ∞.
+- **Convergence rate**: O(1/√N) — 매 4× samples 의 2× accuracy.
+- **Curse of dimensionality 의 escape**: deterministic methods (grid) 매 d²/d³, MC 매 dimension-independent.

-* **시뮬레이션 툴과 데이터 기반 최적화**
-  [[Machinations|Machinations]]와 같은 툴은 몬테카를로 시뮬레이션 기능을 제공하여, 게임 디자이너가 프로그래밍 코드 없이도 복잡한 시스템을 시각화하고 매개변수를 직접 설정해 테스트할 수 있게 해줍니다[7, 9]. 이를 통해 디자이너는 "첫 10분 동안 플레이어가 최대 3번만 죽도록" 설정하는 등 결과를 자동 조정(AI Balancer)할 수 있으며, 수분 내에 다양한 플레이어 여정을 비교하고 역학을 반복 수정할 수 있습니다[6, 7, 10].
+### 매 응용
+1. **금융**: VaR, option pricing (Black-Scholes 외 path-dependent).
+2. **물리**: particle transport, lattice QCD.
+3. **AI**: Monte Carlo Tree Search (AlphaGo, MuZero).
+4. **Engineering**: reliability analysis, sensitivity.
+5. **3D rendering**: path tracing (Blender Cycles, Pixar RenderMan).

-## 🔗 지식 연결 (Graph)
- **Related Topics:** [[대수의 법칙(Law of Large Numbers)|대수의 법칙(Law of Large Numbers]], 수도꼭지와 배수구(Taps and Sinks), 게임 경제 균형(Game Economy Balance), [[마키네이션(Machinations)|마키네이션(Machinations]]
- **Projects/Contexts:** [[가상 경제 시뮬레이션 및 사전 검증(Virtual Economy Simulation)|가상 경제 시뮬레이션 및 사전 검증(Virtual Economy Simulation]], 부분 유료화(Freemium) 게임 경제 모델링, [[AI 기반 보상 및 난이도 스케일링|AI 기반 보상 및 난이도 스케일링]]
- **Contradictions/Notes:** 소스에 따르면 단순한 수학적 평균(Simple averages)은 무작위성이 결여되어 있어 실제 플레이어의 행동을 예측하는 데 실패하는 경우가 많으나, 몬테카를로 시뮬레이션은 대수의 법칙과 결합하여 이러한 변동성과 무작위성을 시뮬레이션에 성공적으로 복원하여 훨씬 더 정확한 밸런싱을 가능하게 한다고 강조합니다[2, 3].
+## 💻 패턴

---
-*Last updated: 2026-04-28*
+### 1. π 추정 (canonical example)
+```python
+import numpy as np

-## 🤖 LLM 활용 힌트 (How to Use This Knowledge)
+def estimate_pi(n: int = 1_000_000) -> float:
+    pts = np.random.uniform(-1, 1, (n, 2))
+    inside = np.sum(pts[:, 0]**2 + pts[:, 1]**2 <= 1)
+    return 4 * inside / n

-**언제 이 지식을 쓰는가:**
- *(TODO)*
-
-**언제 쓰면 안 되는가:**
- *(TODO)*
-
-## 🧪 검증 상태 (Validation)
-
- **정보 상태:** needs_review
- **출처 신뢰도:** A
- **검토 이유:** *(P-Reinforce Phase 1 자동 정규화. 본문 검증 필요.)*
-
-## 🧬 중복 검사 (Duplicate Check)
-
- **기존 유사 문서:** *(TODO: 인덱서 클러스터 리포트 참조)*
- **처리 방식:** UPDATE (자동 정규화)
- **처리 이유:** Phase 1 정규화 — 옛 템플릿/누락 필드 보강.
-
-## ⚠️ 모순 및 업데이트 (Contradictions & Updates)
-
- **과거 데이터와의 충돌:** 없음
- **정책 변화:** 없음
-
-## 🕓 변경 이력 (Changelog)
-
-| 날짜 | 변경 내용 | 처리 방식 | 신뢰도 |
-|------|-----------|-----------|--------|
-| 2026-05-08 | P-Reinforce Phase 1 정규화 (frontmatter + 헤더 표준화) | UPDATE | A |
-
-## 💻 코드 패턴 (Code Patterns)
-
-**패턴 1:** *(TODO: 이 프로젝트 컨벤션 반영한 구조 스켈레톤)*
-
-```text
-# TODO
+print(estimate_pi(10_000_000))  # ~3.14159
 ```

-## 🤔 의사결정 기준 (Decision Criteria)
+### 2. Option pricing (Black-Scholes via MC)
+```python
+import numpy as np

-**선택 A를 써야 할 때:**
- *(TODO)*
+def european_call_mc(S0=100, K=105, r=0.05, sigma=0.2, T=1.0, n_paths=1_000_000):
+    Z = np.random.standard_normal(n_paths)
+    ST = S0 * np.exp((r - 0.5 * sigma**2) * T + sigma * np.sqrt(T) * Z)
+    payoff = np.maximum(ST - K, 0)
+    return np.exp(-r * T) * payoff.mean()

-**선택 B를 써야 할 때:**
- *(TODO)*
+price = european_call_mc()
+print(f"Call price: {price:.4f}")
+```

-**기본값:**
-> *(TODO)*
+### 3. Value at Risk (VaR)
+```python
+import numpy as np

-## ❌ 안티패턴 (Anti-Patterns)
+def historical_var(returns: np.ndarray, alpha: float = 0.05) -> float:
+    return np.quantile(returns, alpha)

- **[안티패턴]:** *(TODO: 무엇을 하면 안 되는가 + 이유 + 대신 무엇을)*
+def parametric_var(mu, sigma, alpha=0.05, n=100_000):
+    samples = np.random.normal(mu, sigma, n)
+    return np.quantile(samples, alpha)
+
+# portfolio daily returns
+returns = np.random.normal(0.001, 0.02, 10_000)
+print(f"5% VaR: {historical_var(returns):.4f}")
+```
+
+### 4. Monte Carlo integration
+```python
+import numpy as np
+
+def mc_integrate(f, a, b, n=1_000_000):
+    x = np.random.uniform(a, b, n)
+    return (b - a) * np.mean(f(x))
+
+# integrate sin(x) on [0, π] (true value = 2)
+result = mc_integrate(np.sin, 0, np.pi)
+print(result)  # ~2.0
+```
+
+### 5. MCTS (Monte Carlo Tree Search) — AI
+```python
+import math, random
+
+class Node:
+    def __init__(self, state, parent=None):
+        self.state = state
+        self.parent = parent
+        self.children = []
+        self.visits = 0
+        self.wins = 0
+
+    def ucb1(self, c=1.41):
+        if self.visits == 0:
+            return float('inf')
+        return self.wins / self.visits + c * math.sqrt(math.log(self.parent.visits) / self.visits)
+
+def mcts(root, n_iter=10_000, get_actions=None, simulate=None):
+    for _ in range(n_iter):
+        # 1. select via UCB1
+        node = root
+        while node.children:
+            node = max(node.children, key=lambda n: n.ucb1())
+        # 2. expand
+        for action in get_actions(node.state):
+            node.children.append(Node(apply(node.state, action), node))
+        # 3. simulate (random rollout)
+        leaf = random.choice(node.children) if node.children else node
+        result = simulate(leaf.state)
+        # 4. backpropagate
+        cur = leaf
+        while cur:
+            cur.visits += 1
+            cur.wins += result
+            cur = cur.parent
+    return max(root.children, key=lambda n: n.visits)
+```
+
+### 6. Variance reduction: antithetic variates
+```python
+import numpy as np
+
+def mc_call_antithetic(S0=100, K=105, r=0.05, sigma=0.2, T=1.0, n=500_000):
+    Z = np.random.standard_normal(n)
+    ST_plus = S0 * np.exp((r - 0.5 * sigma**2) * T + sigma * np.sqrt(T) * Z)
+    ST_minus = S0 * np.exp((r - 0.5 * sigma**2) * T - sigma * np.sqrt(T) * Z)
+    payoff = (np.maximum(ST_plus - K, 0) + np.maximum(ST_minus - K, 0)) / 2
+    return np.exp(-r * T) * payoff.mean()
+# 매 same N 의 ~2× variance reduction
+```
+
+### 7. Reproducibility (seed)
+```python
+import numpy as np
+rng = np.random.default_rng(seed=42)
+samples = rng.normal(0, 1, 1000)  # reproducible
+```
+
+## 매 결정 기준
+| 상황 | Approach |
+|---|---|
+| Low-dim (d ≤ 3), smooth integrand | quadrature (Gauss, Simpson) — faster. |
+| High-dim integral | MC — escapes curse of dimensionality. |
+| Path-dependent option | MC. |
+| European option (closed form) | Black-Scholes formula — instant. |
+| Need confidence interval | MC + bootstrap. |
+| Game tree search | MCTS + UCB1. |
+
+**기본값**: NumPy MC 매 baseline, antithetic variates / control variates 매 variance reduction. JAX/CUDA 매 GPU acceleration.
+
+## 🔗 Graph
+- 부모: [[수치 해석]] · [[확률론]] · [[Stochastic Methods]]
+- 변형: [[MCMC (Markov Chain Monte Carlo)]] · [[Quasi-Monte Carlo]] · [[Importance Sampling]]
+- 응용: [[금융 모델링]] · [[Path Tracing]] · [[AlphaGo]] · [[Risk Analysis]]
+- Adjacent: [[Bayesian Inference]] · [[Bootstrap]] · [[난수 생성]]
+
+## 🤖 LLM 활용
+**언제**: high-dim integral, risk metrics, option pricing, game AI, sensitivity analysis.
+**언제 X**: 매 closed-form 존재 매 (Black-Scholes European), low-dim quadrature 효율적인 case, deterministic answer 필요 매 (use seed).
+
+## ❌ 안티패턴
+- **Too few samples**: O(1/√N) 매 slow — 매 1% accuracy 의 N=10000+.
+- **No seed in production**: non-reproducible bugs.
+- **Bad PRNG**: `random.random()` 매 OK, `Math.random()` (JS) 매 not crypto-safe — but fine 매 simulation.
+- **No variance reduction**: antithetic / control variates 매 free 2-10× speedup.
+- **MC 의 deterministic 문제**: 매 closed form 존재 의 use that.
+
+## 🧪 검증 / 중복
+- Verified (Metropolis & Ulam 1949 original, Glasserman "Monte Carlo Methods in Financial Engineering" 2003, Kalos & Whitlock 2008).
+- 신뢰도 A.
+
+## 🕓 Changelog
+| 날짜 | 변경 |
+|---|---|
+| 2026-05-08 | Phase 1 |
+| 2026-05-10 | Manual cleanup — full content with π/option/VaR/MCTS patterns |