---
id: CV-001
category: "10_Wiki/💡 Topics/AI"
confidence_score: 1.0
tags: [ai, computer-vision, image-processing, deep-learning, cnn]
last_reinforced: 2026-04-26
---

# Computer Vision Mastery

## 📌 One-Line Insight (The Karpathy Summary)

> "Build the eye of AI, one that reads objects and context out of rows of pixels." Computer vision is the body of techniques for extracting, analyzing, and understanding meaningful information from images and video; it is the core of visual intelligence, with applications ranging from autonomous driving to medical image interpretation.

## 📖 Structured Knowledge (Synthesized Content)

- **Extracted pattern:** High-dimensional visual data is transformed by feature-extraction layers into low-dimensional abstract concepts, which are then made concrete again as tasks such as object recognition or segmentation.
- **Core technology lineage:**
  - **Traditional CV:** Feature extraction with hand-designed mathematical filters, such as the Sobel filter, Canny edge detection, and SIFT.
  - **CNN (Convolutional Neural Networks):** Learn local image features hierarchically (AlexNet, ResNet).
  - **Object Detection:** Identifies the location and class of objects within an image (YOLO, Faster R-CNN).
  - **Segmentation:** Partitions an image into regions at the pixel level (U-Net, Mask R-CNN).
  - **Vision Transformer (ViT):** Applies the Transformer architecture from text processing to images in order to capture global context.
- **Significance:** Aims to reproduce human visual function in machines, dissolving the boundary between the physical world and the digital world.

## ⚠️ Contradictions and Updates (Contradictions & RL Update)

- **Conflict with past data:** The field has moved beyond simply recognizing shapes. With CLIP and multimodal LLMs, it has entered a stage where models "describe" and "reason about" the situation shown in an image.
- **Policy change:** The Antigravity project uses current vision-language models when converting unstructured charts and screenshot data in wiki documents into text for integration into the knowledge base.

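
The "Traditional CV" entry in the technology lineage above names the Sobel filter as a hand-designed feature extractor. The following is a minimal sketch of that idea in plain Python, assuming a grayscale image stored as a list of lists of numbers. The names `correlate3x3`, `sobel_magnitude`, and `toy_image` are illustrative and do not come from the original note; a real pipeline would more likely rely on an image-processing library.

```python
import math

# Sobel kernels: respond to horizontal (X) and vertical (Y) intensity changes.
SOBEL_X = [[-1, 0, 1],
           [-2, 0, 2],
           [-1, 0, 1]]
SOBEL_Y = [[-1, -2, -1],
           [0, 0, 0],
           [1, 2, 1]]


def correlate3x3(image, kernel):
    """Slide a 3x3 kernel over a 2D image, skipping the 1-pixel border.

    This is cross-correlation (no kernel flip). For the gradient magnitude
    computed below, the sign difference from true convolution does not matter.
    """
    height, width = len(image), len(image[0])
    out = []
    for r in range(1, height - 1):
        row = []
        for c in range(1, width - 1):
            acc = 0
            for dr in (-1, 0, 1):
                for dc in (-1, 0, 1):
                    acc += kernel[dr + 1][dc + 1] * image[r + dr][c + dc]
            row.append(acc)
        out.append(row)
    return out


def sobel_magnitude(image):
    """Return the per-pixel gradient magnitude sqrt(gx^2 + gy^2)."""
    gx = correlate3x3(image, SOBEL_X)
    gy = correlate3x3(image, SOBEL_Y)
    return [[math.hypot(x, y) for x, y in zip(row_x, row_y)]
            for row_x, row_y in zip(gx, gy)]


# Hypothetical toy image: dark left half, bright right half (a vertical edge).
toy_image = [[0, 0, 0, 10, 10, 10] for _ in range(5)]
edges = sobel_magnitude(toy_image)

for row in edges:
    print(row)
```

In this toy case the response is zero in the flat regions and peaks in the two columns that straddle the dark-to-bright boundary. A CNN uses the same sliding-kernel operation, but learns the kernel weights from data instead of fixing them by hand.
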
## 🔗 Knowledge Links (Graph)

- [[Convolutional-Neural-Networks]], [[CLIP]], Image-Processing, [[Transformer-Architecture]]
- **Raw Source:** 10_Wiki/Topics/AI/Computer-Vision.md