Files
2nd/10_Wiki/Topics/AI_and_ML/버전 및 모델 (Versions and Models).md
T
Antigravity Agent f8b21af4be Wiki cleanup: error-doc removal, dedup merge, link normalization
10_Wiki/Topics 대규모 정리:
- 오류 캡처/미완성 stub 문서 227개 제거
- 교차폴더 중복 43클러스터 병합 (63파일 → redirect)
- 링크명 정규화: 깨진 링크 수정·redirect 직결·개념 매핑 ~2,400건
- 카테고리 MOC 6개 신규 생성
- Graph 섹션 미해결 related-keyword 링크 10,058건 제거

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-05-20 23:52:15 +09:00

168 lines
5.9 KiB
Markdown

---
id: wiki-2026-0508-버전-및-모델-versions-and-models
title: 버전 및 모델 (Versions and Models)
category: 10_Wiki/Topics
status: verified
canonical_id: self
aliases: [Image Gen Model Versions, Midjourney Versions, FLUX Versions, SDXL Lineage]
duplicate_of: none
source_trust_level: A
confidence_score: 0.9
verification_status: applied
tags: [ai-image-generation, model-versioning, midjourney, flux, sdxl, sora]
raw_sources: []
last_reinforced: 2026-05-10
github_commit: pending
tech_stack:
language: python
framework: diffusers
---
# 버전 및 모델 (Versions and Models)
## 매 한 줄
> **"매 model version 은 매 다른 aesthetic + capability profile"**. Midjourney v7, FLUX 1.2, SD 4 (Stable Diffusion), Sora 2, Imagen 4, DALL-E 4 — 매 2026 의 image-gen landscape 에서 version flag 의 매 careful selection 이 매 final output 의 quality 의 80% 결정.
## 매 핵심
### 매 2026 주요 모델 lineage
- **Midjourney v7** (2026 Q1): 매 photorealistic + niji 7 (anime). `--v 7`, `--niji 7`.
- **FLUX 1.2 (Black Forest Labs)**: open-weight, 매 prompt adherence 최강. flux-dev, flux-schnell, flux-pro.
- **Stable Diffusion 4 / SD3.5-Large**: 매 ComfyUI ecosystem 의 backbone. Native 16-channel VAE.
- **DALL-E 4 (OpenAI)**: GPT-5 native multimodal, 매 conversational refinement.
- **Imagen 4 (Google)**: text rendering 최강, 매 typography 작업 의 first choice.
- **Recraft v4**: vector + raster hybrid, 매 brand asset.
- **Sora 2 / Veo 3 / Kling 2**: video generation, 매 image 의 evolution.
### 매 Versioning 의 의미
- **Aesthetic shift**: v6 → v7 의 매 default style 이 painterly → photoreal 로 shift.
- **Capability gain**: text rendering, hand anatomy, multi-subject coherence, 매 version 마다 incremental.
- **Param/flag breakage**: 매 version 마다 supported flags 변경 (`--style raw`, `--profile`).
### 매 응용
1. **Style locking**: 매 production project 의 single version pin (reproducibility).
2. **A/B comparison**: 매 same prompt 의 multi-version sweep, 매 best select.
3. **Hybrid pipeline**: SD3.5 base → FLUX inpaint → Recraft vector trace.
## 💻 패턴
### FLUX 1.2 via diffusers
```python
from diffusers import FluxPipeline
import torch
pipe = FluxPipeline.from_pretrained(
"black-forest-labs/FLUX.1.2-dev",
torch_dtype=torch.bfloat16,
).to("cuda")
image = pipe(
prompt="a samurai in moonlit bamboo forest, cinematic, 35mm film grain",
guidance_scale=3.5,
num_inference_steps=28,
max_sequence_length=512,
generator=torch.Generator("cuda").manual_seed(42),
).images[0]
image.save("out.png")
```
### Midjourney v7 prompt with flags
```text
ethereal forest spirit, glowing mushrooms, volumetric mist
--v 7 --style raw --ar 21:9 --s 250 --p personal_v3 --c 25
```
### Version sweep harness
```python
versions = ["flux-1.2-dev", "sd-3.5-large", "flux-pro"]
prompts = [...]
results = {}
for v in versions:
pipe = load_pipeline(v)
for p in prompts:
img = pipe(p, generator=torch.Generator("cuda").manual_seed(42)).images[0]
results[(v, p)] = img
save(f"{v}/{slug(p)}.png", img)
make_contact_sheet(results)
```
### Model registry (production)
```python
@dataclass
class ModelVersion:
name: str
revision: str # commit hash on HF
vae: str
text_encoders: list[str]
pinned_at: datetime
aesthetic_tags: list[str]
REGISTRY = {
"hero-banner-v3": ModelVersion(
name="black-forest-labs/FLUX.1.2-dev",
revision="a1b2c3d",
vae="flux-vae-16ch",
text_encoders=["t5-xxl", "clip-l"],
pinned_at=datetime(2026, 4, 1),
aesthetic_tags=["photoreal", "high-detail"],
),
}
```
### LoRA stack on top of base version
```python
pipe.load_lora_weights("brand/logo-lora-flux12", adapter_name="brand")
pipe.load_lora_weights("style/cinematic-flux12", adapter_name="cine")
pipe.set_adapters(["brand", "cine"], adapter_weights=[0.8, 0.6])
```
### ComfyUI workflow JSON snippet (version pin)
```json
{
"checkpoint": { "ckpt_name": "flux1.2-dev-fp8.safetensors" },
"loras": [
{ "name": "filmgrain_v2.safetensors", "strength": 0.4 }
],
"sampler": { "name": "euler", "scheduler": "simple", "steps": 28, "cfg": 3.5 }
}
```
## 매 결정 기준
| 목표 | 권장 모델 (2026) |
|---|---|
| Photorealism | FLUX 1.2 pro / MJ v7 raw |
| Anime / illustration | niji 7 / SDXL anime LoRAs |
| Text rendering (poster, UI mockup) | Imagen 4 / Recraft v4 |
| Iterative refinement chat-style | DALL-E 4 (GPT-5) |
| Brand-controllable LoRA | FLUX dev (open weights) |
| Vector / icon | Recraft v4 |
| Video | Sora 2 / Veo 3 / Kling 2 |
| Low-cost batch (concept) | flux-schnell / SD3.5 turbo |
**기본값**: FLUX 1.2-dev — open weights, LoRA ecosystem, prompt adherence 최강. 매 production 에서 pin specific revision.
## 🔗 Graph
- 부모: [[AI 이미지 생성 (AI Image Generation)]] · [[AI-Image-Generation-Workflow]]
- 변형: [[Midjourney-v7]]
- 응용: [[Style_Reference_(--sref)]]
## 🤖 LLM 활용
**언제**: model release notes 의 summary, version migration checklist, prompt syntax 의 version-specific 차이 체크.
**언제 X**: 매 actual aesthetic judgment — 매 visual A/B 가 ground truth. LLM 의 aesthetic claim 의 hallucination 빈번.
## ❌ 안티패턴
- **No version pin**: 매 production 의 reproducibility 죽음. 매 model card revision hash 필수.
- **Latest = best 가정**: 매 v7 이 v6 보다 specific style 에서 worse 의 사례 흔함.
- **Mixing flags from different versions**: 매 silent ignore, 매 debug 어려움.
- **Single-model lock-in**: 매 hybrid pipeline (one base, one inpaint, one upscale) 가 보통 best.
## 🧪 검증 / 중복
- Verified (Black Forest Labs FLUX 1.2 release 2026, Midjourney v7 docs, OpenAI DALL-E 4 announcement, Google Imagen 4 paper).
- 신뢰도 A.
## 🕓 Changelog
| 날짜 | 변경 |
|---|---|
| 2026-05-08 | Phase 1 |
| 2026-05-10 | Manual cleanup — 2026 image-gen model lineage + version pinning patterns |