--- id: CV-OCR-001 category: "10_Wiki/πŸ’‘ Topics/AI" confidence_score: 1.0 tags: [ai, computer-vision, ocr, text-recognition, deep-learning, tesseract] last_reinforced: 2026-04-26 --- # Optical Character Recognition (OCR, κ΄‘ν•™ 문자 인식) ## πŸ“Œ ν•œ 쀄 톡찰 (The Karpathy Summary) > "ν”½μ…€μ˜ λ©μ–΄λ¦¬μ—μ„œ μ–Έμ–΄μ˜ ν˜•μƒμ„ λ°œκ²¬ν•˜κ³ , 물리적 μ„Έμƒμ˜ 기둝을 λ””μ§€ν„Έ μ§€μ‹μ˜ νλ¦„μœΌλ‘œ λ³΅μ›ν•˜λΌ" β€” μ΄λ―Έμ§€λ‚˜ μŠ€μΊ”λœ λ¬Έμ„œ λ‚΄μ˜ ν…μŠ€νŠΈλ₯Ό μ‹λ³„ν•˜μ—¬ 컴퓨터가 νŽΈμ§‘ν•˜κ³  검색할 수 μžˆλŠ” ν…μŠ€νŠΈ λ°μ΄ν„°λ‘œ λ³€ν™˜ν•˜λŠ” 기술. ## πŸ“– κ΅¬μ‘°ν™”λœ 지식 (Synthesized Content) - **μΆ”μΆœλœ νŒ¨ν„΄:** "Localization, Recognition, and Linguistic Refinement" β€” λ¨Όμ € κΈ€μžκ°€ 어디에 μžˆλŠ”μ§€ μ˜μ—­μ„ μ°Ύκ³ (Detection), 각 μ˜μ—­μ˜ 이미지λ₯Ό 문자둜 λ²ˆμ—­ν•˜λ©°(Recognition), μ–Έμ–΄ λͺ¨λΈμ„ 톡해 λ¬Έλ§₯상 μžμ—°μŠ€λŸ¬μš΄ λ‹¨μ–΄λ‘œ κ΅μ •ν•˜λŠ” 3단계 처리 νŒ¨ν„΄. - **μ£Όμš” 기술적 μ§„ν™”:** - **Classic OCR (Tesseract):** μ •ν•΄μ§„ ν°νŠΈμ™€ κΉ”λ”ν•œ λ°°κ²½ μœ„μ£Όλ‘œ μž‘λ™. - **Deep Learning OCR (CRNN, Transformer):** λΉ„μ •ν˜• λ°°κ²½, νœ˜μ–΄μ§„ ν…μŠ€νŠΈ, λ‹€μ–‘ν•œ 필체 인식 κ°€λŠ₯. - **Scene Text Recognition:** μžμ—° ν™˜κ²½ 속 κ°„νŒμ΄λ‚˜ μ‚¬λ¬Όμ˜ ν…μŠ€νŠΈ 탐지. - **의의:** λ°©λŒ€ν•œ 쒅이 λ¬Έμ„œμ˜ λ””μ§€ν„Έν™”(Digital Transformation)λ₯Ό κ°€λŠ₯μΌ€ ν•˜λ©°, μžμœ¨μ£Όν–‰μ°¨μ˜ ν‘œμ§€νŒ 인식, λ²ˆμ—­ μ•±μ˜ μ‹€μ‹œκ°„ ν…μŠ€νŠΈ μΉ˜ν™˜ λ“± μ‹€μƒν™œ μ§€λŠ₯의 ν•„μˆ˜ κ΄€λ¬Έ μ—­ν• . ## ⚠️ λͺ¨μˆœ 및 μ—…λ°μ΄νŠΈ (Contradictions & RL Update) - **κ³Όκ±° λ°μ΄ν„°μ™€μ˜ 좩돌:** λ‹¨μˆœνžˆ 'κΈ€μž ν•˜λ‚˜ν•˜λ‚˜'λ₯Ό λ§žνžˆλŠ” 단계λ₯Ό λ„˜μ–΄, μ΄μ œλŠ” λ¬Έμ„œμ˜ λ ˆμ΄μ•„μ›ƒ(ν‘œ, 리슀트 λ“±)κΉŒμ§€ νŒŒμ•…ν•˜μ—¬ κ΅¬μ‘°ν™”λœ JSON/Markdown으둜 λ³€ν™˜ν•˜λŠ” Layout Analysis 기술이 ν˜„λŒ€ OCR의 핡심 경쟁λ ₯이 됨. - **μ •μ±… λ³€ν™”:** Antigravity ν”„λ‘œμ νŠΈλŠ” μ™ΈλΆ€ 데이터 μˆ˜μ§‘ μ‹œ μ΄λ―Έμ§€λ‚˜ PDF λ‚΄μ˜ ν…μŠ€νŠΈλ₯Ό μ§€μ‹ν™”ν•˜κΈ° μœ„ν•΄, μ΅œμ‹  트랜슀포머 기반 OCR 엔진을 ν™œμš©ν•˜μ—¬ 높은 μ •ν™•λ„μ˜ ν…μŠ€νŠΈ μΆ”μΆœμ„ 보μž₯함. ## πŸ”— 지식 μ—°κ²° (Graph) - Computer-Vision-Foundations, [[Natural-Language-Processing-NLP]], [[Object-Detection-Foundations]], [[Image-Segmentation]] - **Raw Source:** 10_Wiki/Topics/AI/Optical-Character-Recognition.md