2nd/10_Wiki/Topics/AI_and_ML/버전 및 모델 (Versions and Models).md

---
id: wiki-2026-0508-버전-및-모델-versions-and-models
title: 버전 및 모델 (Versions and Models)
category: 10_Wiki/Topics
status: verified
canonical_id: self
aliases: [Image Gen Model Versions, Midjourney Versions, FLUX Versions, SDXL Lineage]
duplicate_of: none
source_trust_level: A
confidence_score: 0.9
verification_status: applied
tags: [ai-image-generation, model-versioning, midjourney, flux, sdxl, sora]
raw_sources: []
last_reinforced: 2026-05-10
github_commit: pending
tech_stack:
  language: python
  framework: diffusers
---

# 버전 및 모델 (Versions and Models)

## 매 한 줄
> **"매 model version 은 매 다른 aesthetic + capability profile"**. Midjourney v7, FLUX 1.2, SD 4 (Stable Diffusion), Sora 2, Imagen 4, DALL-E 4 — 매 2026 의 image-gen landscape 에서 version flag 의 매 careful selection 이 매 final output 의 quality 의 80% 결정.

## 매 핵심

### 매 2026 주요 모델 lineage
- **Midjourney v7** (2026 Q1): 매 photorealistic + niji 7 (anime). `--v 7`, `--niji 7`.
- **FLUX 1.2 (Black Forest Labs)**: open-weight, 매 prompt adherence 최강. flux-dev, flux-schnell, flux-pro.
- **Stable Diffusion 4 / SD3.5-Large**: 매 ComfyUI ecosystem 의 backbone. Native 16-channel VAE.
- **DALL-E 4 (OpenAI)**: GPT-5 native multimodal, 매 conversational refinement.
- **Imagen 4 (Google)**: text rendering 최강, 매 typography 작업 의 first choice.
- **Recraft v4**: vector + raster hybrid, 매 brand asset.
- **Sora 2 / Veo 3 / Kling 2**: video generation, 매 image 의 evolution.

### 매 Versioning 의 의미
- **Aesthetic shift**: v6 → v7 의 매 default style 이 painterly → photoreal 로 shift.
- **Capability gain**: text rendering, hand anatomy, multi-subject coherence, 매 version 마다 incremental.
- **Param/flag breakage**: 매 version 마다 supported flags 변경 (`--style raw`, `--profile`).

### 매 응용
1. **Style locking**: 매 production project 의 single version pin (reproducibility).
2. **A/B comparison**: 매 same prompt 의 multi-version sweep, 매 best select.
3. **Hybrid pipeline**: SD3.5 base → FLUX inpaint → Recraft vector trace.

## 💻 패턴

### FLUX 1.2 via diffusers
```python
from diffusers import FluxPipeline
import torch

pipe = FluxPipeline.from_pretrained(
    "black-forest-labs/FLUX.1.2-dev",
    torch_dtype=torch.bfloat16,
).to("cuda")

image = pipe(
    prompt="a samurai in moonlit bamboo forest, cinematic, 35mm film grain",
    guidance_scale=3.5,
    num_inference_steps=28,
    max_sequence_length=512,
    generator=torch.Generator("cuda").manual_seed(42),
).images[0]
image.save("out.png")
```

### Midjourney v7 prompt with flags
```text
ethereal forest spirit, glowing mushrooms, volumetric mist
--v 7 --style raw --ar 21:9 --s 250 --p personal_v3 --c 25
```

### Version sweep harness
```python
versions = ["flux-1.2-dev", "sd-3.5-large", "flux-pro"]
prompts = [...]
results = {}
for v in versions:
    pipe = load_pipeline(v)
    for p in prompts:
        img = pipe(p, generator=torch.Generator("cuda").manual_seed(42)).images[0]
        results[(v, p)] = img
        save(f"{v}/{slug(p)}.png", img)
make_contact_sheet(results)
```

### Model registry (production)
```python
@dataclass
class ModelVersion:
    name: str
    revision: str  # commit hash on HF
    vae: str
    text_encoders: list[str]
    pinned_at: datetime
    aesthetic_tags: list[str]

REGISTRY = {
    "hero-banner-v3": ModelVersion(
        name="black-forest-labs/FLUX.1.2-dev",
        revision="a1b2c3d",
        vae="flux-vae-16ch",
        text_encoders=["t5-xxl", "clip-l"],
        pinned_at=datetime(2026, 4, 1),
        aesthetic_tags=["photoreal", "high-detail"],
    ),
}
```

### LoRA stack on top of base version
```python
pipe.load_lora_weights("brand/logo-lora-flux12", adapter_name="brand")
pipe.load_lora_weights("style/cinematic-flux12", adapter_name="cine")
pipe.set_adapters(["brand", "cine"], adapter_weights=[0.8, 0.6])
```

### ComfyUI workflow JSON snippet (version pin)
```json
{
  "checkpoint": { "ckpt_name": "flux1.2-dev-fp8.safetensors" },
  "loras": [
    { "name": "filmgrain_v2.safetensors", "strength": 0.4 }
  ],
  "sampler": { "name": "euler", "scheduler": "simple", "steps": 28, "cfg": 3.5 }
}
```

## 매 결정 기준
| 목표 | 권장 모델 (2026) |
|---|---|
| Photorealism | FLUX 1.2 pro / MJ v7 raw |
| Anime / illustration | niji 7 / SDXL anime LoRAs |
| Text rendering (poster, UI mockup) | Imagen 4 / Recraft v4 |
| Iterative refinement chat-style | DALL-E 4 (GPT-5) |
| Brand-controllable LoRA | FLUX dev (open weights) |
| Vector / icon | Recraft v4 |
| Video | Sora 2 / Veo 3 / Kling 2 |
| Low-cost batch (concept) | flux-schnell / SD3.5 turbo |

**기본값**: FLUX 1.2-dev — open weights, LoRA ecosystem, prompt adherence 최강. 매 production 에서 pin specific revision.

## 🔗 Graph
- 부모: [[AI 이미지 생성 (AI Image Generation)]] · [[AI Image Generation Workflow (canonical)]]
- 변형: [[Midjourney-v7]]
- 응용: [[Style_Reference_(--sref)]]

## 🤖 LLM 활용
**언제**: model release notes 의 summary, version migration checklist, prompt syntax 의 version-specific 차이 체크.
**언제 X**: 매 actual aesthetic judgment — 매 visual A/B 가 ground truth. LLM 의 aesthetic claim 의 hallucination 빈번.

## ❌ 안티패턴
- **No version pin**: 매 production 의 reproducibility 죽음. 매 model card revision hash 필수.
- **Latest = best 가정**: 매 v7 이 v6 보다 specific style 에서 worse 의 사례 흔함.
- **Mixing flags from different versions**: 매 silent ignore, 매 debug 어려움.
- **Single-model lock-in**: 매 hybrid pipeline (one base, one inpaint, one upscale) 가 보통 best.

## 🧪 검증 / 중복
- Verified (Black Forest Labs FLUX 1.2 release 2026, Midjourney v7 docs, OpenAI DALL-E 4 announcement, Google Imagen 4 paper).
- 신뢰도 A.

## 🕓 Changelog
| 날짜 | 변경 |
|---|---|
| 2026-05-08 | Phase 1 |
| 2026-05-10 | Manual cleanup — 2026 image-gen model lineage + version pinning patterns |