--- id: [[P-Reinforce]]-AUTO-COVI-001 category: "10_Wiki/πŸ’‘ Topics/AI" confidence_score: 0.98 tags: [auto-reinforced, [[Computer-Vision]], [[Deep-Learning]], [[Pattern-Recognition]], image-[[Processing]], perception] last_reinforced: 2026-04-20 --- # [[Computer Vision]] ## πŸ“Œ ν•œ 쀄 톡찰 (The Karpathy Summary) > "λ””μ§€ν„Έ 눈의 μ§„ν™”: ν”½μ…€μ˜ λ‹¨μˆœν•œ λ‚˜μ—΄μΈ 이미지와 λΉ„λ””μ˜€ 데이터λ₯Ό 컴퓨터가 μΈκ°„μ²˜λŸΌ μ΄ν•΄ν•˜κ³ , 객체λ₯Ό μ‹λ³„ν•˜λ©°, κ³΅κ°„μ˜ 깊이λ₯Ό 읽고, 의미 μžˆλŠ” 정보λ₯Ό μΆ”μΆœν•˜κ²Œ λ§Œλ“œλŠ” 인곡지λŠ₯의 μ‹œκ° 쀑좔." ## πŸ“– κ΅¬μ‘°ν™”λœ 지식 (Synthesized Content) 컴퓨터 λΉ„μ „(Computer Vision)은 κ°€μ‹œκ΄‘μ„  λ“± 물리적 μ‹ ν˜Έλ₯Ό λ””μ§€ν„Έ λ°μ΄ν„°λ‘œ λ³€ν™˜ν•˜κ³  λΆ„μ„ν•˜μ—¬ 'λ³Έλ‹€'λŠ” ν–‰μœ„λ₯Ό κΈ°κ³„λ‘œ κ΅¬ν˜„ν•˜λŠ” κΈ°μˆ μž…λ‹ˆλ‹€. 1. **핡심 νƒœμŠ€ν¬**: * **Classification**: 무엇이 λ“€μ–΄μžˆλŠ”κ°€? (예: 개/고양이 ꡬ뢄) * **Detection**: 무엇이 '어디에' μžˆλŠ”κ°€? (Bounding Box ν‘œμ‹œ) * **Segmentation**: ν”½μ…€ λ‹¨μœ„λ‘œ 객체의 경계선 λ”°κΈ°. * **Depth Estimation**: κ³΅κ°„μ˜ μž…μ²΄μ  거리감 νŒŒμ•…. 2. **기반 기술**: * CNN(Convolutional Neural Networks)μ—μ„œ μ΅œκ·Όμ—λŠ” Vision [[Transformers]](ViT)둜 μ•„ν‚€ν…μ²˜κ°€ μ§„ν™” 쀑. ## ⚠️ λͺ¨μˆœ 및 μ—…λ°μ΄νŠΈ (Contradictions & RL Update) - **κ³Όκ±° λ°μ΄ν„°μ™€μ˜ 좩돌**: κ³Όκ±°μ—λŠ” ν•„ν„° μ œμž‘ λ“± μˆ˜λ™ νŠΉμ§• μΆ”μΆœ(Hand-crafted features) μ •μ±… μœ„μ£Όμ˜€μœΌλ‚˜, ν˜„λŒ€ 정책은 λ°μ΄ν„°λ‘œλΆ€ν„° 슀슀둜 νŠΉμ§•μ„ λ°°μš°λŠ” 'λ”₯λŸ¬λ‹ 기반 쒅단간 ν•™μŠ΅ μ •μ±…(End-to-end)'으둜 μ™„μ „νžˆ μ „ν™˜λ¨(RL Update). - **μ •μ±… λ³€ν™”(RL Update)**: 2D 이미지 뢄석 정책을 λ„˜μ–΄, μ΅œκ·Όμ—λŠ” '3D 곡간 μ§€λŠ₯ μ •μ±…'κ³Ό 'λ©€ν‹°λͺ¨λ‹¬(μ‹œκ°+μ–Έμ–΄) 톡합 μ •μ±…'이 μžμœ¨μ£Όν–‰κ³Ό 에이전틱 μ„œλΉ„μŠ€μ˜ 핡심 μ •μ±… ν† λŒ€κ°€ 됨. ## πŸ”— 지식 μ—°κ²° (Graph) - Pattern Recognition, [[Autonomous Vehicles]], [[CV_Synthesis]], [[Artificial Intelligence (AI)]], [[Robotics]] - **Modern Tech/Tools**: OpenCV, PyTorch/TensorFlow, YOLO, Segment Anything Model (SAM), NeRF. ---