--- id: STAT-FACTOR-001 category: Dev confidence_score: 1.0 tags: [[Statistics|[Statistics]], machine-learning, factor-[[Analysis|Analysis]], latent-variables, [[Dimensionality-Reduction|Dimensionality-Reduction]]] last_reinforced: 2026-04-26 --- # Factor Analysis (μš”μΈ 뢄석) ## πŸ“Œ ν•œ 쀄 톡찰 (The Karpathy Summary) > "μˆ˜λ§Žμ€ 겉λͺ¨μŠ΅ 속에 μˆ¨κ²¨μ§„ κ³΅ν†΅μ˜ 근원을 찾아라" β€” κ΄€μΈ‘λœ μ—¬λŸ¬ λ³€μˆ˜λ“€ μ‚¬μ΄μ˜ 상관관계λ₯Ό λΆ„μ„ν•˜μ—¬, 배후에 μ‘΄μž¬ν•˜λŠ” μ†Œμˆ˜μ˜ 잠재 λ³€μˆ˜(Latent Variables) ν˜Ήμ€ μš”μΈ(Factors)을 μΆ”μΆœν•˜λŠ” 톡계적 기법. ## πŸ“– κ΅¬μ‘°ν™”λœ 지식 (Synthesized Content) - **μΆ”μΆœλœ νŒ¨ν„΄:** λˆˆμ— λ³΄μ΄λŠ” 데이터(Manifest Variables)의 μš”λ™μ΄ 사싀은 보이지 μ•ŠλŠ” 핡심 동인(Latent Factors)에 μ˜ν•΄ κ²°μ •λœλ‹€κ³  κ°€μ •ν•˜κ³  κ·Έ ꡬ쑰λ₯Ό νŒŒμ•…ν•˜λŠ” ꡬ쑰적 해석 νŒ¨ν„΄. - **μ£Όμš” λͺ©μ :** - **Data Reduction:** μˆ˜λ§Žμ€ λ³€μˆ˜λ₯Ό μ†Œμˆ˜μ˜ μš”μΈμœΌλ‘œ μ••μΆ•ν•˜μ—¬ νš¨μœ¨μ„± μ¦λŒ€. - **Structure Discovery:** λ³€μˆ˜λ“€ κ°„μ˜ λ³΅μž‘ν•œ 관계 λ„€νŠΈμ›Œν¬ νŒŒμ•…. - **Scaling:** 좔상적인 κ°œλ…(예: μ§€λŠ₯, 성격, μ„œλΉ„μŠ€ λ§Œμ‘±λ„)을 μΈ‘μ • κ°€λŠ₯ν•œ μ§€ν‘œλ‘œ λ³€ν™˜. - **μž‘λ™ 원리:** λ³€μˆ˜λ“€ κ°„μ˜ 곡톡 λΆ„μ‚°(Common Variance)을 κ·ΉλŒ€ν™”ν•˜λŠ” 좕을 탐색. ## ⚠️ λͺ¨μˆœ 및 μ—…λ°μ΄νŠΈ (Contradictions & RL Update) - **κ³Όκ±° λ°μ΄ν„°μ™€μ˜ 좩돌:** PCA(μ£Όμ„±λΆ„ 뢄석)와 ν˜Όλ™ν•˜κΈ° μ‰¬μš°λ‚˜, PCAλŠ” 정보 μš”μ•½μ— μ§‘μ€‘ν•˜κ³  μš”μΈ 뢄석은 데이터가 μƒμ„±λœ '인과적 ꡬ쑰'λ₯Ό μ„€λͺ…ν•˜λŠ” 데 μ§‘μ€‘ν•œλ‹€λŠ” 차이점이 λͺ…확해짐. - **μ •μ±… λ³€ν™”:** Antigravity ν”„λ‘œμ νŠΈλŠ” μ—μ΄μ „νŠΈμ˜ μ„±λŠ₯ μ§€ν‘œ(응닡 속도, 정확도, 토큰 μ‚¬μš©λŸ‰ λ“±)λ₯Ό 뢄석할 λ•Œ, 이듀을 κ²°μ •μ§“λŠ” 잠재 μš”μΈ(예: ν•˜λ“œμ›¨μ–΄ μ„±λŠ₯, λͺ¨λΈ λ³΅μž‘λ„, λ„€νŠΈμ›Œν¬ μ§€μ—°)을 λΆ„λ¦¬ν•˜κΈ° μœ„ν•΄ μš”μΈ 뢄석 기법을 ν™œμš©ν•¨. ## πŸ”— 지식 μ—°κ²° (Graph) - [[Principal-Component-Analysis|Principal-Component-Analysis]]-PCA, [[Dimensionality-Reduction|Dimensionality-Reduction]], [[Exploratory-Data-Analysis|Exploratory-Data-Analysis]], Un[[Supervised-Learning-Foundations|Supervised-Learning-Foundations]] - **Raw Source:** 10_Wiki/Topics/AI/Factor-Analysis.md