[G1-Sync] Manual knowledge update

2026-04-30 22:42:02 +09:00
parent 0bd4f19e38
commit c36c0644a1
4888 changed files with 18470 additions and 18602 deletions
@@ -2,7 +2,7 @@
 id: DATA-PRE-001
 category: "10_Wiki/💡 Topics/AI"
 confidence_score: 1.0
-tags: [data-science, ai, machine-learning, preprocessing, data-cleaning, feature-engineering, normalization]
+tags: [data-science, ai, machine-learning, pre[[Processing]], data-cleaning, [[Feature-Engineering]], [[Normalization]]]
 last_reinforced: 2026-04-26
 ---

@@ -12,7 +12,7 @@ last_reinforced: 2026-04-26
 > "데이터의 날것 그대로를 신뢰하지 말고, 지능이 소화하기 가장 편안한 형태로 정제하고 규격화하여 모델의 잠재력을 해방하라" — 분석이나 학습에 적합하지 않은 원시 데이터를 데이터 품질을 높이고 학습 효율을 최적화하기 위해 가공하는 모든 인공지능 워크플로우의 최우선 과정.

 ## 📖 구조화된 지식 (Synthesized Content)
- **추출된 패턴:** "Data Cleansing and Structural Alignment" — 불완전한 기록(Missing value)을 메우고, 극단적인 값(Outlier)을 처리하며, 서로 다른 단위의 숫자들을 동일한 범위로 맞추어(Scaling) 모델이 특정 변수에만 휘둘리지 않게 만드는 패턴.
+- **추출된 패턴:** "Data Cleansing and Structural [[Alignment]]" — 불완전한 기록(Missing value)을 메우고, 극단적인 값(Outlier)을 처리하며, 서로 다른 단위의 숫자들을 동일한 범위로 맞추어(Scaling) 모델이 특정 변수에만 휘둘리지 않게 만드는 패턴.
 - **주요 작업 단계:**
    - **Cleaning:** 오타 수정, 결측치 처리(Imputation), 중복 데이터 제거.
    - **Transformation:** 정규화(Normalization), 표준화(Standardization), 로그 변환.