--- id: P-REINFORCE-AUTO-ASSM-001 category: "10_Wiki/πŸ’‘ Topics/AI" confidence_score: 0.94 tags: [auto-reinforced, assessment, evaluation, feedback, measurement, educational-psychology] last_reinforced: 2026-04-20 --- # [[Assessment|Assessment]] ## πŸ“Œ ν•œ 쀄 톡찰 (The Karpathy Summary) > "μ„±μž₯을 μœ„ν•œ 거울: ν˜„μž¬μ˜ 도달 μˆ˜μ€€μ„ κ°κ΄€μ μœΌλ‘œ μΈ‘μ •ν•˜κ³ , λͺ©ν‘œμ™€μ˜ 간극을 νŒŒμ•…ν•˜μ—¬ 더 λ‚˜μ€ λ°©ν–₯으둜 λ‚˜μ•„κ°€λ„λ‘ λ•λŠ” ν”Όλ“œλ°± μ‹œμŠ€ν…œμ˜ 핡심 단계." ## πŸ“– κ΅¬μ‘°ν™”λœ 지식 (Synthesized Content) 평가(Assessment)λŠ” νŠΉμ • λŒ€μƒμ˜ λŠ₯λ ₯, κ°€μΉ˜, μ„±κ³Ό 등을 μ²΄κ³„μ μœΌλ‘œ νŒŒμ•…ν•˜κ³  등급을 λ§€κΈ°κ±°λ‚˜ ν”Όλ“œλ°±μ„ μ£ΌλŠ” 일련의 κ³Όμ •μž…λ‹ˆλ‹€. 1. **μ‹œμ  및 λͺ©μ μ— λ”°λ₯Έ λΆ„λ₯˜**: * **Formative Assessment (ν˜•μ„± 평가)**: ν•™μŠ΅ 도쀑에 μˆ˜μ‹œλ‘œ μ‹€μ‹œν•˜μ—¬ ν•™μŠ΅μžμ—κ²Œ 도움을 쀌. (Active Learningκ³Ό μ—°κ²°) * **Summative Assessment (총괄 평가)**: ν•™μŠ΅μ΄ λλ‚œ ν›„ 성취도λ₯Ό μ΅œμ’… 확인. * **Diagnostic Assessment (진단 평가)**: μ‹œμž‘ μ „ 미리 μˆ˜μ€€μ„ νŒŒμ•…ν•˜μ—¬ 졜적의 경둜 μ„€μ •. 2. **쒋은 ν‰κ°€μ˜ 쑰건**: * **Validity (타당도)**: μΈ‘μ •ν•˜κ³ μž ν•˜λŠ” 것을 μ •ν™•νžˆ μΈ‘μ •ν•˜λŠ”κ°€? * **Reliability (신뒰도)**: λˆ„κ°€ μ–Έμ œ 츑정해도 μΌκ΄€λœ κ²°κ³Όκ°€ λ‚˜μ˜€λŠ”κ°€? * **Fairness (곡정성)**: 평가 λŒ€μƒ λͺ¨λ‘μ—κ²Œ κ· λ“±ν•œ κΈ°νšŒκ°€ 보μž₯λ˜λŠ”κ°€? (Algorithmic Fairness와 μ—°κ²°) ## ⚠️ λͺ¨μˆœ 및 μ—…λ°μ΄νŠΈ (Contradictions & RL Update) - **κ³Όκ±° λ°μ΄ν„°μ™€μ˜ 좩돌**: 과거의 평가 정책은 쀄 μ„Έμš°κΈ°λ₯Ό ν†΅ν•œ '선별'이 λͺ©μ μ΄μ—ˆμœΌλ‚˜, ν˜„λŒ€μ˜ ꡐ윑 및 인사 정책은 λΆ€μ‘±ν•œ 뢀뢄을 λ©”μ›Œμ£ΌλŠ” '지속적 μ„±μž₯ 지원 μ •μ±…'으둜 νŒ¨λŸ¬λ‹€μž„μ„ μ „ν™˜ν•¨(RL Update). - **μ •μ±… λ³€ν™”(RL Update)**: AI λͺ¨λΈ 평가 μ •μ±…μ—μ„œ, λ‹¨μˆœνžˆ 벀치마크 점수(Accuracy)만 따지기보닀 λͺ¨λΈμ˜ 취약점과 μœ€λ¦¬μ„±μ„ μž…μ²΄μ μœΌλ‘œ νŒŒμ•…ν•˜λŠ” 'Multi-dimensional Assessment μ •μ±…'이 ν‘œμ€€μ΄ 됨. ## πŸ”— 지식 μ—°κ²° (Graph) - [[Active Learning|Active Learning]], [[Algorithmic Fairness|Algorithmic Fairness]], [[Type 1 vs Type 2 Errors|Type 1 vs Type 2 Errors]], [[Statistics & Data Analysis|Statistics & Data Analysis]], Self-Correction Mechanisms - **Modern Tech/Tools**: AI-automated evaluation tools, Performance dashboards (KPI/OKR). ---