---
id: P-REINFORCE-AI-BBOX
category: "10_Wiki/💡 Topics/AI"
confidence_score: 1.0
tags: [Bounding Box Regression, Object Detection, Computer Vision, IoU]
last_reinforced: 2026-04-20
---

# [[Bounding-Box-Regression|Bounding-Box-Regression]] (Bounding Box Regression)

## 📌 One-Line Insight (The Karpathy Summary)
> "Finding the exact address of an object in an image." Rather than marking only a rough region where an object lies, bounding box regression precisely predicts four numbers (x, y, Width, Height) so that the object is enclosed in a box. It is a core technique in computer vision.

## 📖 Structured Knowledge (Synthesized Content)
- **Coordinate Prediction**:
  - The final layer of the neural network outputs the object's center coordinates and size as continuous real values.
- **Intersection over Union (IoU)**:
  - A metric that evaluates box accuracy by measuring how much the predicted box overlaps the ground-truth box: the area of their intersection divided by the area of their union, a value between 0 and 1.
- **Anchor Boxes**:
  - Reference boxes (anchors) of various sizes and aspect ratios are laid out in advance, and the anchor most similar to the object is finely adjusted (by offsets) to determine the final position.

## ⚠️ Contradictions and Updates (RL Update)
- Non-Maximum Suppression (NMS), which keeps only one box where several overlap, has a large effect on performance, particularly when the objects themselves overlap. Recently, Transformer-based approaches (DETR) that predict the set of objects directly, without NMS, have been gaining attention.

## 🔗 Knowledge Connections (Graph)
- Related: Object-Detection, Convolutional-Neural-Networks-(CNN)
- Metric: Mean-Average-Precision-(mAP)
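
The IoU metric, anchor offsets, and NMS step mentioned in this note can be illustrated with a minimal pure-Python sketch. Several things here are assumptions rather than anything the note specifies: the function names (`iou`, `decode_anchor`, `to_corners`, `nms`) are hypothetical, the offset parameterization is the common R-CNN-style one (center shifts scaled by anchor size, log-scale width and height), and the `iou_threshold` default of 0.5 is only an illustrative choice.

```python
import math


def iou(a, b):
    """IoU of two boxes given as (x1, y1, x2, y2) corner coordinates."""
    # Corners of the intersection rectangle (may be empty).
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def decode_anchor(anchor, offsets):
    """Apply predicted offsets (dx, dy, dw, dh) to an anchor (cx, cy, w, h).

    Assumption: R-CNN-style parameterization, not taken from the note.
    """
    ax, ay, aw, ah = anchor
    dx, dy, dw, dh = offsets
    return (ax + dx * aw, ay + dy * ah, aw * math.exp(dw), ah * math.exp(dh))


def to_corners(box):
    """Convert a (cx, cy, w, h) box to (x1, y1, x2, y2) corners."""
    cx, cy, w, h = box
    return (cx - w / 2, cy - h / 2, cx + w / 2, cy + h / 2)


def nms(boxes, scores, iou_threshold=0.5):
    """Greedy NMS over corner-format boxes; returns indices of kept boxes."""
    # Visit boxes from highest to lowest score.
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep = []
    for i in order:
        # Keep a box only if it does not overlap a kept box too strongly.
        if all(iou(boxes[i], boxes[k]) <= iou_threshold for k in keep):
            keep.append(i)
    return keep
```

For example, `nms([(0, 0, 10, 10), (1, 1, 11, 11), (50, 50, 60, 60)], [0.9, 0.8, 0.7])` drops the second box, since its IoU with the first (81/119) exceeds the threshold, and keeps the distant third one. Raising the threshold keeps more overlapping boxes, which is one reason this step is sensitive when objects genuinely overlap.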