"매 tilted image 의 axis-aligned 로 복원". Document scanning, satellite imagery, photo correction 의 fundamental preprocessing. Classical (Hough line + rotation, perspective transform) + modern deep learning (DeepDeskew, DocAligner) 의 combo.
Core
Two problems
Skew correction (rotation): corrects in-plane rotation. The dominant angle is detected from Hough lines.
Perspective correction (homography): maps a 4-point quadrilateral to a rectangle. Used for documents photographed from a non-frontal viewpoint.
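To make the quadrilateral-to-rectangle mapping concrete, here is a minimal sketch that solves for the homography from four point correspondences with NumPy. The corner coordinates and the 400x500 target size are hypothetical values chosen for illustration, and the solver assumes the bottom-right entry of the matrix can be normalized to 1.

```python
import numpy as np

def homography_from_4_points(src, dst):
    """Solve for the 3x3 homography H that maps each src point to its dst point.

    src, dst: four (x, y) pairs each; no three points may be collinear.
    Assumes H[2, 2] can be fixed to 1 (true for ordinary document photos).
    """
    rows, rhs = [], []
    for (x, y), (u, v) in zip(src, dst):
        # u = (h0*x + h1*y + h2) / (h6*x + h7*y + 1), and likewise for v
        rows.append([x, y, 1, 0, 0, 0, -u * x, -u * y])
        rhs.append(u)
        rows.append([0, 0, 0, x, y, 1, -v * x, -v * y])
        rhs.append(v)
    h = np.linalg.solve(np.array(rows, dtype=float), np.array(rhs, dtype=float))
    return np.append(h, 1.0).reshape(3, 3)

def apply_homography(H, points):
    """Map (x, y) points through H, including the perspective divide."""
    pts = np.hstack([np.asarray(points, dtype=float), np.ones((len(points), 1))])
    mapped = pts @ H.T
    return mapped[:, :2] / mapped[:, 2:3]

# Hypothetical document corners in the photo, ordered TL, TR, BR, BL
quad = [(120, 80), (520, 130), (480, 620), (90, 560)]
# Target: an upright 400x500 rectangle
rect = [(0, 0), (400, 0), (400, 500), (0, 500)]

H = homography_from_4_points(quad, rect)
corners = apply_homography(H, quad)  # should land on the rectangle corners
```

A rotation has a single free parameter (the angle), while the homography above has eight, which is why four point pairs are needed. In practice OpenCV's `cv2.getPerspectiveTransform` and `cv2.warpPerspective` cover the same computation and the image resampling.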
Classical pipeline
Edge detection (Canny).
Line detection (Hough transform) or corner detection.
Dominant angle estimation or 4-point selection.
Compute the rotation matrix / homography.
Warp (affine / perspective).
Modern (deep learning)
DocTr / DocAligner (2022+): corner regression for documents.
CNN-based skew angle predictor: a single forward pass.
LayoutLMv3-based: handled as part of document understanding.
When: OCR preprocessing pipelines, normalization for document understanding, improving input quality for vision-language models.
When not: ill-defined edges (handwriting on a textured background), already aligned images (overhead shots).
❌ Anti-patterns
Single Hough line: an outlier can dominate the estimate. Use the median angle across lines.
Aggressive crop: cropping away the black border after rotation loses content.
Over-correction: leave small skew (< 0.5°) alone; correcting it costs more than it gains.
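The three remedies can be sketched in a few lines of standard-library Python. The helper names, the sample angles, and the default threshold argument are illustrative assumptions; only the 0.5° figure comes from the note above.

```python
import math
from statistics import mean, median

def robust_angle(angles_deg):
    """Median of per-line angles; a single stray line cannot pull it far."""
    return median(angles_deg)

def expanded_canvas(width, height, angle_deg):
    """Smallest axis-aligned canvas (w, h) that holds the whole rotated image."""
    a = math.radians(angle_deg)
    c, s = abs(math.cos(a)), abs(math.sin(a))
    # round() guards against floating-point noise pushing ceil() up by one
    new_w = math.ceil(round(width * c + height * s, 6))
    new_h = math.ceil(round(width * s + height * c, 6))
    return new_w, new_h

def should_correct(angle_deg, min_skew_deg=0.5):
    """Skip the warp entirely when the skew is below the threshold."""
    return abs(angle_deg) >= min_skew_deg

# Hypothetical per-line angles: four text lines plus one diagonal outlier
angles = [1.9, 2.0, 2.1, 2.2, 47.0]
naive = mean(angles)            # pulled toward the outlier
robust = robust_angle(angles)   # stays with the text lines
canvas = expanded_canvas(600, 400, robust)
```

Warping into the expanded canvas instead of cropping keeps every source pixel. With an OpenCV rotation matrix, this usually also means shifting the translation terms so the image stays centered in the larger output.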