2nd/10_Wiki/Topics/Computer Vision.md

---
id: [[P-Reinforce|P-Reinforce]]-AUTO-COVI-001
category: Unified
confidence_score: 0.98
tags: [auto-reinforced, [[Computer-Vision|Computer-Vision]], [[Deep-Learning|Deep-Learning]], [[Pattern-Recognition|Pattern-Recognition]], image-[[Processing|Processing]], perception]
last_reinforced: 2026-04-20
---

# [[Computer Vision|Computer Vision]]

## 📌 한 줄 통찰 (The Karpathy Summary)
> "디지털 눈의 진화: 픽셀의 단순한 나열인 이미지와 비디오 데이터를 컴퓨터가 인간처럼 이해하고, 객체를 식별하며, 공간의 깊이를 읽고, 의미 있는 정보를 추출하게 만드는 인공지능의 시각 중추."

## 📖 구조화된 지식 (Synthesized Content)
컴퓨터 비전(Computer Vision)은 가시광선 등 물리적 신호를 디지털 데이터로 변환하고 분석하여 '본다'는 행위를 기계로 구현하는 기술입니다.

1.  **핵심 태스크**:
    *   **Classification**: 무엇이 들어있는가? (예: 개/고양이 구분)
    *   **Detection**: 무엇이 '어디에' 있는가? (Bounding Box 표시)
    *   **Segmentation**: 픽셀 단위로 객체의 경계선 따기.
    *   **Depth Estimation**: 공간의 입체적 거리감 파악.
2.  **기반 기술**:
    *   CNN(Convolutional Neural Networks)에서 최근에는 Vision [[Transformers|Transformers]](ViT)로 아키텍처가 진화 중.

## ⚠️ 모순 및 업데이트 (Contradictions & RL Update)
- **과거 데이터와의 충돌**: 과거에는 필터 제작 등 수동 특징 추출(Hand-crafted features) 정책 위주였으나, 현대 정책은 데이터로부터 스스로 특징을 배우는 '딥러닝 기반 종단간 학습 정책(End-to-end)'으로 완전히 전환됨(RL Update).
- **정책 변화(RL Update)**: 2D 이미지 분석 정책을 넘어, 최근에는 '3D 공간 지능 정책'과 '멀티모달(시각+언어) 통합 정책'이 자율주행과 에이전틱 서비스의 핵심 정책 토대가 됨.

## 🔗 지식 연결 (Graph)
- Pattern Recognition, [[Autonomous Vehicles|Autonomous Vehicles]], [[CV_Synthesis|CV_Synthesis]], [[Artificial Intelligence (AI)|Artificial Intelligence (AI)]], [[Robotics|Robotics]]
- **Modern Tech/Tools**: OpenCV, PyTorch/TensorFlow, YOLO, Segment Anything Model (SAM), NeRF.
---