--- id: AI-ROBUST-001 category: "10_Wiki/πŸ’‘ Topics/AI" confidence_score: 1.0 tags: [ai, machine-learning, [[Robustness]], adversarial-attacks, ood-detection, [[Reliability]], [[Trustworthy-AI]]] last_reinforced: 2026-04-26 --- # Robust Machine Learning (κ°•κ±΄ν•œ λ¨Έμ‹ λŸ¬λ‹) ## πŸ“Œ ν•œ 쀄 톡찰 (The Karpathy Summary) > "λ°μ΄ν„°μ˜ λ…Έμ΄μ¦ˆμ™€ μ λŒ€μ  κ³΅κ²©μ΄λΌλŠ” ν­ν’μš° μ†μ—μ„œλ„ 흔듀리지 μ•ŠλŠ” 'κ°•κ±΄ν•œ μ§€λŠ₯'을 κ΅¬μΆ•ν•˜κ³ , λ‚―μ„  ν™˜κ²½μ—μ„œλ„ μ‹ λ’°ν•  수 μžˆλŠ” νŒλ‹¨μ˜ 일관성을 μœ μ§€ν•˜λΌ" β€” λͺ¨λΈμ΄ μž…λ ₯ λ°μ΄ν„°μ˜ 변동, λ…Έμ΄μ¦ˆ, ν˜Ήμ€ μ˜λ„μ μΈ μ™œκ³‘(Adversarial Perturbation)에 λŒ€ν•΄ μ•ˆμ •μ μΈ μ„±λŠ₯을 μœ μ§€ν•˜λ„λ‘ λ§Œλ“œλŠ” λ¨Έμ‹ λŸ¬λ‹ 방법둠. ## πŸ“– κ΅¬μ‘°ν™”λœ 지식 (Synthesized Content) - **μΆ”μΆœλœ νŒ¨ν„΄:** "Adversarial Defense and Uncertainty Awareness" β€” ν•™μŠ΅ κ³Όμ •μ—μ„œ μ˜λ„μ μœΌλ‘œ μ–΄λ €μš΄ μƒ˜ν”Œμ„ μ£Όμž…ν•˜μ—¬ λ‹¨λ ¨μ‹œν‚€κ³ (Adversarial Training), μžμ‹ μ΄ λͺ¨λ₯΄λŠ” 데이터(OOD)에 λŒ€ν•΄μ„œλŠ” "λͺ¨λ₯Έλ‹€"κ³  λ‹΅ν•  수 μžˆλŠ” 확신도(Confidence)λ₯Ό κ΄€λ¦¬ν•˜λŠ” νŒ¨ν„΄. - **핡심 도전 과제:** - **Adversarial Attacks:** μ‚¬λžŒ λˆˆμ—λŠ” 보이지 μ•ŠλŠ” λ―Έμ„Έν•œ λ³€μ‘°λ‘œ λͺ¨λΈμ„ μ†μ΄λŠ” 곡격 λ°©μ–΄. - **Distribution [[Shift]]:** ν•™μŠ΅ 데이터와 μ‹€μ œ λ°μ΄ν„°μ˜ 뢄포가 λ‹¬λΌμ§ˆ λ•Œμ˜ μ„±λŠ₯ ν•˜λ½ λ°©μ§€. - **Data Corruption:** 데이터 μˆ˜μ§‘ κ³Όμ •μ˜ κ²°μΈ‘μ΄λ‚˜ 였λ₯˜μ— λŒ€ν•œ μ €ν•­λ ₯ 확보. - **의의:** μžμœ¨μ£Όν–‰, 의료 진단, μ•ˆλ³΄ λ“± μž‘μ€ 였λ₯˜κ°€ 치λͺ…적인 κ²°κ³Όλ₯Ό μ΄ˆλž˜ν•˜λŠ” 'λ―Έμ…˜ 크리티컬' λΆ„μ•Όμ—μ„œ AIκ°€ μƒμš©ν™”λ˜κΈ° μœ„ν•œ ν•„μˆ˜ 쑰건. ## ⚠️ λͺ¨μˆœ 및 μ—…λ°μ΄νŠΈ (Contradictions & RL Update) - **κ³Όκ±° λ°μ΄ν„°μ™€μ˜ 좩돌:** λ‹¨μˆœνžˆ 정확도(Accuracy)만 λ†’μœΌλ©΄ 쒋은 λͺ¨λΈμ΄λΌλŠ” μ§€ν‘œ μ§€μƒμ£Όμ˜μ—μ„œ λ²—μ–΄λ‚˜, μ΄μ œλŠ” μ΅œμ•…μ˜ μƒν™©μ—μ„œλ„ μ–Όλ§ˆλ‚˜ μ•ˆμ •μ μΈμ§€(Worst-case Robustness)κ°€ λͺ¨λΈμ˜ μ§„μ •ν•œ κ°€μΉ˜λ₯Ό κ²°μ •ν•˜λŠ” 핡심 척도가 됨. - **μ •μ±… λ³€ν™”:** Antigravity ν”„λ‘œμ νŠΈλŠ” μ—μ΄μ „νŠΈμ˜ νŒλ‹¨ 둜직 배포 μ „, μ λŒ€μ  μƒ˜ν”Œ ν…ŒμŠ€νŠΈμ™€ 이상 데이터 탐지 μ„±λŠ₯을 λ°˜λ“œμ‹œ κ²€μ¦ν•˜μ—¬ μ˜ˆμ™Έ 상황에 λŒ€ν•œ 강건성을 ν™•λ³΄ν•˜λŠ” 'Robust-First' 정책을 κ³ μˆ˜ν•¨. ## πŸ”— 지식 μ—°κ²° (Graph) - [[Trustworthy-AI]], Adversarial-Machine-Learning, OOD-Detection-Techniques, [[Outlier-Detection-Techniques]] - **Raw Source:** 10_Wiki/Topics/AI/Robust-Machine-Learning.md