[G1-Sync] Manual knowledge update

This commit is contained in:
Antigravity Agent
2026-05-09 21:08:02 +09:00
parent f0befc887a
commit 93ec7e9056
363 changed files with 68333 additions and 64 deletions
@@ -0,0 +1,176 @@
---
id: db-time-series-patterns
title: Time-series — TimescaleDB / 다운샘플 / 보존
category: Coding
status: draft
source_trust_level: B
verification_status: conceptual
created_at: 2026-05-09
updated_at: 2026-05-09
tags: [database, time-series, timescale, vibe-coding]
tech_stack: { language: "Postgres / TimescaleDB / InfluxDB", applicable_to: ["Backend"] }
applied_in: []
aliases: [time-series, TimescaleDB, hypertable, continuous aggregate, retention policy]
---
# Time-series Patterns
> 메트릭 / 이벤트 / 로그 / IoT = time-series. **TimescaleDB (Postgres extension)** 가 modern 표준 — 일반 SQL + 시간축 최적화. InfluxDB / Prometheus 도 옵션.
## 📖 핵심 개념
- Hypertable: 시간 기준 자동 파티션.
- Continuous aggregate: 실시간 materialized view (1m / 1h / 1d 다운샘플).
- Compression: 오래된 chunk 자동 압축 (10-30x).
- Retention policy: N일 후 자동 drop.
## 💻 코드 패턴
### TimescaleDB hypertable
```sql
CREATE EXTENSION IF NOT EXISTS timescaledb;
CREATE TABLE metrics (
time TIMESTAMPTZ NOT NULL,
device_id TEXT NOT NULL,
cpu DOUBLE PRECISION,
mem DOUBLE PRECISION
);
SELECT create_hypertable('metrics', 'time', chunk_time_interval => INTERVAL '1 day');
CREATE INDEX metrics_device_time ON metrics(device_id, time DESC);
```
### Insert (대량)
```sql
COPY metrics FROM STDIN;
-- 또는 multi-row INSERT
INSERT INTO metrics(time, device_id, cpu, mem)
VALUES ('2026-05-09 10:00', 'd1', 0.5, 0.3),
('2026-05-09 10:01', 'd1', 0.6, 0.3);
```
### Query (시간 범위)
```sql
-- 최근 1시간
SELECT time_bucket('1 minute', time) AS bucket,
device_id,
avg(cpu) AS avg_cpu
FROM metrics
WHERE time > NOW() - INTERVAL '1 hour'
GROUP BY bucket, device_id
ORDER BY bucket;
```
### Continuous aggregate
```sql
CREATE MATERIALIZED VIEW metrics_hourly
WITH (timescaledb.continuous) AS
SELECT time_bucket('1 hour', time) AS hour,
device_id,
avg(cpu) AS avg_cpu,
max(cpu) AS max_cpu,
count(*) AS samples
FROM metrics
GROUP BY hour, device_id;
-- refresh 정책 (실시간으로 따라옴)
SELECT add_continuous_aggregate_policy('metrics_hourly',
start_offset => INTERVAL '3 hours',
end_offset => INTERVAL '5 minutes',
schedule_interval => INTERVAL '5 minutes');
```
### Compression
```sql
ALTER TABLE metrics SET (
timescaledb.compress,
timescaledb.compress_segmentby = 'device_id',
timescaledb.compress_orderby = 'time DESC'
);
SELECT add_compression_policy('metrics', INTERVAL '7 days');
-- 7일 지난 chunk 자동 압축
```
### Retention
```sql
SELECT add_retention_policy('metrics', INTERVAL '90 days');
-- 90일 지난 chunk 자동 drop
```
### Time-bucket gap fill
```sql
SELECT time_bucket_gapfill('1 minute', time) AS bucket,
device_id,
locf(avg(cpu)) AS cpu -- last observation carried forward
FROM metrics
WHERE time > NOW() - INTERVAL '1 hour'
AND time <= NOW()
GROUP BY bucket, device_id;
```
### Downsampling 단계
```
raw (1s) → 1분 (cont. agg) → 1시간 → 1일
```
각 단계는 다른 retention.
### InfluxDB (alternative)
```
# Line protocol
metrics,device=d1 cpu=0.5,mem=0.3 1715238000000000000
```
```ts
// Flux 쿼리
from(bucket: "default")
|> range(start: -1h)
|> filter(fn: (r) => r._measurement == "metrics")
|> aggregateWindow(every: 1m, fn: mean)
```
### Prometheus (메트릭 + 알람)
```
# scrape config
scrape_configs:
- job_name: api
static_configs:
- targets: ['api:9090']
```
PromQL:
```
rate(http_requests_total[5m])
histogram_quantile(0.99, sum by (le) (rate(http_request_duration_seconds_bucket[5m])))
```
## 🤔 의사결정 기준
| 데이터 | 추천 |
|---|---|
| 메트릭 + alert | Prometheus + Grafana |
| IoT / 센서 (장기 보관) | TimescaleDB |
| 트레이딩 / 분석 | TimescaleDB / ClickHouse |
| 단순 events | Postgres + 파티션 |
| 매우 큰 (PB) | ClickHouse / Druid |
| 짧은 (실시간 30일) | Prometheus / InfluxDB |
## ❌ 안티패턴
- **단일 테이블 1B+ rows 일반 PG**: 인덱스 거대, query 느림.
- **Index time + device 따로**: composite (device, time DESC) 가 best.
- **Retention 없음**: 영원 자라남.
- **Compression 미사용**: 디스크 5-10x 더.
- **Continuous agg 없음 — raw 매번**: 분 단위 query 가 시간.
- **Time as TEXT**: 정렬 안 됨, range 쿼리 느림. TIMESTAMPTZ.
- **Monitoring DB 에 트랜잭션**: HFT app + monitoring 같은 PG = 맥 끊김.
## 🤖 LLM 활용 힌트
- TimescaleDB = Postgres 알면 그대로.
- Hypertable + cont agg + compression + retention 4종.
- Time-bucket + gapfill + locf 활용.
## 🔗 관련 문서
- [[DB_Partitioning_Patterns]]
- [[DevOps_Observability_Stack]]
- [[Native_Battery_Network_Profiling]]