--- category: Unified tags: [auto-consolidated, technical-documentation] title: [[Computer Vision|Computer Vision]] last_updated: 2026-05-02 --- # [[Computer Vision|Computer Vision]] ## ๐Ÿ“Œ Brief Summary > "๋””์ง€ํ„ธ ๋ˆˆ์˜ ์ง„ํ™”: ํ”ฝ์…€์˜ ๋‹จ์ˆœํ•œ ๋‚˜์—ด์ธ ์ด๋ฏธ์ง€์™€ ๋น„๋””์˜ค ๋ฐ์ดํ„ฐ๋ฅผ ์ปดํ“จํ„ฐ๊ฐ€ ์ธ๊ฐ„์ฒ˜๋Ÿผ ์ดํ•ดํ•˜๊ณ , ๊ฐ์ฒด๋ฅผ ์‹๋ณ„ํ•˜๋ฉฐ, ๊ณต๊ฐ„์˜ ๊นŠ์ด๋ฅผ ์ฝ๊ณ , ์˜๋ฏธ ์žˆ๋Š” ์ •๋ณด๋ฅผ ์ถ”์ถœํ•˜๊ฒŒ ๋งŒ๋“œ๋Š” ์ธ๊ณต์ง€๋Šฅ์˜ ์‹œ๊ฐ ์ค‘์ถ”." --- > "ํ”ฝ์…€์˜ ๋‚˜์—ด์—์„œ ์‚ฌ๋ฌผ๊ณผ ๋งฅ๋ฝ์„ ์ฝ์–ด๋‚ด๋Š” AI์˜ ๋ˆˆ์„ ์™„์„ฑํ•˜๋ผ" โ€” ์ด๋ฏธ์ง€๋‚˜ ๋น„๋””์˜ค๋กœ๋ถ€ํ„ฐ ์œ ์˜๋ฏธํ•œ ์ •๋ณด๋ฅผ ์ถ”์ถœ, ๋ถ„์„ ๋ฐ ์ดํ•ดํ•˜๊ธฐ ์œ„ํ•œ ๊ธฐ์ˆ  ์ฒด๊ณ„๋กœ, ์ž์œจ์ฃผํ–‰๋ถ€ํ„ฐ ์˜๋ฃŒ ์˜์ƒ ํŒ๋…๊นŒ์ง€ ์‹œ๊ฐ ์ง€๋Šฅ์˜ ์ •์ˆ˜. --- > ๋””์ง€ํ„ธ ์ด๋ฏธ์ง€์™€ ๋น„๋””์˜ค์—์„œ ๊ณ ์ฐจ์›์ ์ธ ์˜๋ฏธ๋ฅผ ์ถ”์ถœํ•˜์—ฌ ๊ธฐ๊ณ„๊ฐ€ ์„ธ์ƒ์„ '๋ณด๊ณ ' '์ดํ•ดํ•˜๊ฒŒ' ๋งŒ๋“œ๋Š” AI์˜ ๊ฐ๊ฐ ๊ธฐ๊ด€. ## ๐Ÿ“– Core Content ์ปดํ“จํ„ฐ ๋น„์ „(Computer Vision)์€ ๊ฐ€์‹œ๊ด‘์„  ๋“ฑ ๋ฌผ๋ฆฌ์  ์‹ ํ˜ธ๋ฅผ ๋””์ง€ํ„ธ ๋ฐ์ดํ„ฐ๋กœ ๋ณ€ํ™˜ํ•˜๊ณ  ๋ถ„์„ํ•˜์—ฌ '๋ณธ๋‹ค'๋Š” ํ–‰์œ„๋ฅผ ๊ธฐ๊ณ„๋กœ ๊ตฌํ˜„ํ•˜๋Š” ๊ธฐ์ˆ ์ž…๋‹ˆ๋‹ค. 1. **ํ•ต์‹ฌ ํƒœ์Šคํฌ**: * **Classification**: ๋ฌด์—‡์ด ๋“ค์–ด์žˆ๋Š”๊ฐ€? (์˜ˆ: ๊ฐœ/๊ณ ์–‘์ด ๊ตฌ๋ถ„) * **Detection**: ๋ฌด์—‡์ด '์–ด๋””์—' ์žˆ๋Š”๊ฐ€? (Bounding Box ํ‘œ์‹œ) * **Segmentation**: ํ”ฝ์…€ ๋‹จ์œ„๋กœ ๊ฐ์ฒด์˜ ๊ฒฝ๊ณ„์„  ๋”ฐ๊ธฐ. * **Depth Estimation**: ๊ณต๊ฐ„์˜ ์ž…์ฒด์  ๊ฑฐ๋ฆฌ๊ฐ ํŒŒ์•…. 2. **๊ธฐ๋ฐ˜ ๊ธฐ์ˆ **: * CNN(Convolutional Neural Networks)์—์„œ ์ตœ๊ทผ์—๋Š” Vision [[Transformers|Transformers]](ViT)๋กœ ์•„ํ‚คํ…์ฒ˜๊ฐ€ ์ง„ํ™” ์ค‘. --- - **์ถ”์ถœ๋œ ํŒจํ„ด:** ๊ณ ์ฐจ์›์˜ ์‹œ๊ฐ ๋ฐ์ดํ„ฐ๋ฅผ ํŠน์ง• ์ถ”์ถœ ๋ ˆ์ด์–ด๋ฅผ ํ†ตํ•ด ์ €์ฐจ์›์˜ ์ถ”์ƒ์  ๊ฐœ๋…์œผ๋กœ ๋ณ€ํ™˜ํ•˜๊ณ , ์ด๋ฅผ ๋‹ค์‹œ ๊ฐ์ฒด ์ธ์‹์ด๋‚˜ ๋ถ„ํ•  ๋“ฑ์˜ ํƒœ์Šคํฌ๋กœ ๊ตฌ์ฒดํ™”ํ•˜๋Š” ์ธ์ง€ ํŒจํ„ด. - **ํ•ต์‹ฌ ๊ธฐ์ˆ  ๊ณ„๋ณด:** - **Traditional CV:** ์†Œ๋ฒจ ํ•„ํ„ฐ, Canny edge detection, SIFT ๋“ฑ ์ˆ˜ํ•™์  ํ•„ํ„ฐ ๊ธฐ๋ฐ˜ ํŠน์ง• ์ถ”์ถœ. - **CNN (Convolutional Neural Networks):** ์ด๋ฏธ์ง€์˜ ์ง€์—ญ์  ํŠน์ง•์„ ๊ณ„์ธต์ ์œผ๋กœ ํ•™์Šต (AlexNet, ResNet). - **Object Detection:** ์ด๋ฏธ์ง€ ๋‚ด ๋ฌผ์ฒด์˜ ์œ„์น˜์™€ ์ข…๋ฅ˜ ํŒŒ์•… (YOLO, Faster R-CNN). - **Segmentation:** ํ”ฝ์…€ ๋‹จ์œ„๋กœ ์˜์—ญ ๊ตฌ๋ถ„ (U-Net, Mask R-CNN). - **Vision Transformer (ViT):** ํ…์ŠคํŠธ ์ฒ˜๋ฆฌ์˜ ํŠธ๋žœ์Šคํฌ๋จธ ๊ตฌ์กฐ๋ฅผ ์ด๋ฏธ์ง€์— ์ ์šฉํ•˜์—ฌ ์ „์—ญ์  ๋งฅ๋ฝ ํŒŒ์•…. - **์˜์˜:** ์ธ๊ฐ„์˜ ์‹œ๊ฐ ๊ธฐ๋Šฅ์„ ๊ธฐ๊ณ„๋กœ ์™„๋ฒฝํžˆ ๊ตฌํ˜„ํ•˜์—ฌ ๋ฌผ๋ฆฌ ์„ธ๊ณ„์™€ ๋””์ง€ํ„ธ ์„ธ๊ณ„์˜ ๊ฒฝ๊ณ„๋ฅผ ํ—ˆ๋ฌพ. --- - **์ถ”์ถœ๋œ ํŒจํ„ด:** ์ด๋ฏธ์ง€ ํ”ฝ์…€์—์„œ ํŠน์ง•(Feature)์„ ์ถ”์ถœํ•˜๊ณ  ์ด๋ฅผ ๊ณ„์ธต์ ์œผ๋กœ ๊ตฌ์กฐํ™”ํ•˜์—ฌ ๊ฐ์ฒด๋ฅผ ์ธ์‹ํ•˜๋Š” ๋น„์ „ ์ฒ˜๋ฆฌ ํŒจํ„ด. - **์„ธ๋ถ€ ๋‚ด์šฉ:** - CNN(ํ•ฉ์„ฑ๊ณฑ ์‹ ๊ฒฝ๋ง)์—์„œ ViT(๋น„์ „ ํŠธ๋žœ์Šคํฌ๋จธ)๋กœ์˜ ์•„ํ‚คํ…์ฒ˜ ์ง„ํ™”. - ์ด๋ฏธ์ง€ ๋ถ„๋ฅ˜, ๊ฐ์ฒด ํƒ์ง€, ์„ธ๊ทธ๋ฉ˜ํ…Œ์ด์…˜ ๋“ฑ ํ•ต์‹ฌ ํƒœ์Šคํฌ Taxonomy ์ •์˜. - ์‹ค์‹œ๊ฐ„ ๊ฐ์ฒด ์ถ”์  ๋ฐ ๊ณต๊ฐ„ ์ดํ•ด๋ฅผ ์œ„ํ•œ ๋”ฅ๋Ÿฌ๋‹ ๊ธฐ๋ฒ• ํ†ตํ•ฉ. ## โš–๏ธ Trade-offs & Caveats - **๊ณผ๊ฑฐ ๋ฐ์ดํ„ฐ์™€์˜ ์ถฉ๋Œ**: ๊ณผ๊ฑฐ์—๋Š” ํ•„ํ„ฐ ์ œ์ž‘ ๋“ฑ ์ˆ˜๋™ ํŠน์ง• ์ถ”์ถœ(Hand-crafted features) ์ •์ฑ… ์œ„์ฃผ์˜€์œผ๋‚˜, ํ˜„๋Œ€ ์ •์ฑ…์€ ๋ฐ์ดํ„ฐ๋กœ๋ถ€ํ„ฐ ์Šค์Šค๋กœ ํŠน์ง•์„ ๋ฐฐ์šฐ๋Š” '๋”ฅ๋Ÿฌ๋‹ ๊ธฐ๋ฐ˜ ์ข…๋‹จ๊ฐ„ ํ•™์Šต ์ •์ฑ…(End-to-end)'์œผ๋กœ ์™„์ „ํžˆ ์ „ํ™˜๋จ(RL Update). - **์ •์ฑ… ๋ณ€ํ™”(RL Update)**: 2D ์ด๋ฏธ์ง€ ๋ถ„์„ ์ •์ฑ…์„ ๋„˜์–ด, ์ตœ๊ทผ์—๋Š” '3D ๊ณต๊ฐ„ ์ง€๋Šฅ ์ •์ฑ…'๊ณผ '๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ(์‹œ๊ฐ+์–ธ์–ด) ํ†ตํ•ฉ ์ •์ฑ…'์ด ์ž์œจ์ฃผํ–‰๊ณผ ์—์ด์ „ํ‹ฑ ์„œ๋น„์Šค์˜ ํ•ต์‹ฌ ์ •์ฑ… ํ† ๋Œ€๊ฐ€ ๋จ. --- - **๊ณผ๊ฑฐ ๋ฐ์ดํ„ฐ์™€์˜ ์ถฉ๋Œ:** ๋‹จ์ˆœํžˆ ํ˜•ํƒœ๋ฅผ ์ธ์‹ํ•˜๋Š” ์ˆ˜์ค€์—์„œ, ํ˜„์žฌ๋Š” [[CLIP|CLIP]]์ด๋‚˜ ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ LLM์„ ํ†ตํ•ด ์ด๋ฏธ์ง€ ์† ์ƒํ™ฉ์„ '์„ค๋ช…'ํ•˜๊ณ  '์ถ”๋ก 'ํ•˜๋Š” ๋‹จ๊ณ„๋กœ ์ง„์ž…. - **์ •์ฑ… ๋ณ€ํ™”:** Antigravity ํ”„๋กœ์ ํŠธ๋Š” ์œ„ํ‚ค ๋ฌธ์„œ ๋‚ด์˜ ๋น„์ •ํ˜• ๋„ํ‘œ๋‚˜ ์Šคํฌ๋ฆฐ์ƒท ๋ฐ์ดํ„ฐ๋ฅผ ํ…์ŠคํŠธ๋กœ ๋ณ€ํ™˜ํ•˜์—ฌ ์ง€์‹ ๋ฒ ์ด์Šค์— ํ†ตํ•ฉํ•  ๋•Œ ์ตœ์‹  ๋น„์ „-์–ธ์–ด ๋ชจ๋ธ์„ ํ™œ์šฉํ•จ. --- - **๊ณผ๊ฑฐ ๋ฐ์ดํ„ฐ์™€์˜ ์ถฉ๋Œ:** ๊ธฐํ•˜ํ•™์  ๋งค์นญ ์ค‘์‹ฌ์˜ ์ „ํ†ต์  CV์—์„œ ๋ฐ์ดํ„ฐ ๊ธฐ๋ฐ˜์˜ ์‹ ๊ฒฝ๋ง ํ•™์Šต ๋ชจ๋ธ๋กœ ํŒจ๋Ÿฌ๋‹ค์ž„ ์™„์ „ ์ „ํ™˜. - **์ •์ฑ… ๋ณ€ํ™”:** ๊ธฐ์ˆ ์  ์ •ํ™•๋„(w1)์™€ ์œค๋ฆฌ์  ํ”„๋ผ์ด๋ฒ„์‹œ ๋ณดํ˜ธ์˜ ๊ฐ€์ค‘์น˜ ๊ท ํ˜• ์กฐ์ ˆ. ## ๐Ÿ”— Knowledge Connections - Pattern Recognition, [[Autonomous Vehicles|Autonomous Vehicles]], [[CV_Synthesis|CV_Synthesis]], [[Artificial Intelligence (AI)|Artificial Intelligence (AI)]], [[Robotics|Robotics]] - **Modern Tech/Tools**: OpenCV, PyTorch/TensorFlow, YOLO, Segment Anything Model (SAM), NeRF. --- --- - [[Convolutional-Neural-Networks|Convolutional-Neural-Networks]], [[CLIP|CLIP]], Image-Processing, [[Transformer-Architecture|Transformer-Architecture]] - **Raw Source:** 10_Wiki/Topics/AI/Computer-Vision.md --- - **Parent:** 10_Wiki/๐Ÿ’ก Topics/AI - **Related:** [[CV_Synthesis|CV_Synthesis]], Object-Detection, CNN - **Raw Source:** 00_Raw/2026-04-20/[[Computer Vision|Computer Vision]].md