"매 personal data 가 lawful basis + minimum + purpose-limited 로 다뤄진다.". Data privacy engineering 은 매 GDPR/CCPA/LGPD/K-PIPA 의 legal requirement 를 매 storage, processing, transfer, retention 의 매 단계 에 deterministic control 로 구현. 2026 stack: classification + DLP + tokenization/PETs (DP, FHE, TEE) + consent management + DSAR automation + privacy-by-design.
Zero-Knowledge Proofs: identity 증명 without disclose.
매 응용
EU GDPR + 한국 PIPA + 중국 PIPL compliance.
Healthcare HIPAA, PCI-DSS payment.
ML training without raw data (FL, DP).
Cross-border transfer (SCC, BCR, DPF).
Right to be forgotten (RTBF) automation.
💻 패턴
Data classification + DLP
# 매 PII detection — Microsoft Presidiofrompresidio_analyzerimportAnalyzerEngineanalyzer=AnalyzerEngine()results=analyzer.analyze(text=user_input,language='en',entities=['EMAIL_ADDRESS','PHONE_NUMBER','CREDIT_CARD','PERSON','KR_RRN'])forrinresults:redact_or_mask(text,r.start,r.end)
Format-preserving tokenization
# 매 ff3-1 — preserves format (e.g., card number)fromff3importFF3Cipherc=FF3Cipher(key,tweak)token=c.encrypt("4242424242424242")# → 16-digit stringplain=c.decrypt(token)
Differential Privacy noise
importnumpyasnpdeflaplace_mechanism(true_val,sensitivity,epsilon):returntrue_val+np.random.laplace(0,sensitivity/epsilon)# 매 query: count of users in segmentnoisy_count=laplace_mechanism(true_count=1234,sensitivity=1,epsilon=1.0)
k-anonymity check
importpandasaspddefk_anonymity(df:pd.DataFrame,quasi_ids:list[str])->int:returndf.groupby(quasi_ids).size().min()# 매 ensure k>=5 before releaseassertk_anonymity(df,['zip','age','gender'])>=5