d8a80f6272
이름만 다른(표기 변형) [[위키링크]]를 대상 문서의 canonical 제목으로 치환해 끊겼던 1,200개 링크를 연결. 제목/파일명 정규화 일치만 적용하고 별칭 매칭은 과병합 위험으로 제외(애매성 가드). 원본은 _link_reconcile_backup/ 에 백업. 도구: Datacollect/scripts/link_reconcile_apply.mjs Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
168 lines
5.9 KiB
Markdown
168 lines
5.9 KiB
Markdown
---
|
|
id: wiki-2026-0508-버전-및-모델-versions-and-models
|
|
title: 버전 및 모델 (Versions and Models)
|
|
category: 10_Wiki/Topics
|
|
status: verified
|
|
canonical_id: self
|
|
aliases: [Image Gen Model Versions, Midjourney Versions, FLUX Versions, SDXL Lineage]
|
|
duplicate_of: none
|
|
source_trust_level: A
|
|
confidence_score: 0.9
|
|
verification_status: applied
|
|
tags: [ai-image-generation, model-versioning, midjourney, flux, sdxl, sora]
|
|
raw_sources: []
|
|
last_reinforced: 2026-05-10
|
|
github_commit: pending
|
|
tech_stack:
|
|
language: python
|
|
framework: diffusers
|
|
---
|
|
|
|
# 버전 및 모델 (Versions and Models)
|
|
|
|
## 매 한 줄
|
|
> **"매 model version 은 매 다른 aesthetic + capability profile"**. Midjourney v7, FLUX 1.2, SD 4 (Stable Diffusion), Sora 2, Imagen 4, DALL-E 4 — 매 2026 의 image-gen landscape 에서 version flag 의 매 careful selection 이 매 final output 의 quality 의 80% 결정.
|
|
|
|
## 매 핵심
|
|
|
|
### 매 2026 주요 모델 lineage
|
|
- **Midjourney v7** (2026 Q1): 매 photorealistic + niji 7 (anime). `--v 7`, `--niji 7`.
|
|
- **FLUX 1.2 (Black Forest Labs)**: open-weight, 매 prompt adherence 최강. flux-dev, flux-schnell, flux-pro.
|
|
- **Stable Diffusion 4 / SD3.5-Large**: 매 ComfyUI ecosystem 의 backbone. Native 16-channel VAE.
|
|
- **DALL-E 4 (OpenAI)**: GPT-5 native multimodal, 매 conversational refinement.
|
|
- **Imagen 4 (Google)**: text rendering 최강, 매 typography 작업 의 first choice.
|
|
- **Recraft v4**: vector + raster hybrid, 매 brand asset.
|
|
- **Sora 2 / Veo 3 / Kling 2**: video generation, 매 image 의 evolution.
|
|
|
|
### 매 Versioning 의 의미
|
|
- **Aesthetic shift**: v6 → v7 의 매 default style 이 painterly → photoreal 로 shift.
|
|
- **Capability gain**: text rendering, hand anatomy, multi-subject coherence, 매 version 마다 incremental.
|
|
- **Param/flag breakage**: 매 version 마다 supported flags 변경 (`--style raw`, `--profile`).
|
|
|
|
### 매 응용
|
|
1. **Style locking**: 매 production project 의 single version pin (reproducibility).
|
|
2. **A/B comparison**: 매 same prompt 의 multi-version sweep, 매 best select.
|
|
3. **Hybrid pipeline**: SD3.5 base → FLUX inpaint → Recraft vector trace.
|
|
|
|
## 💻 패턴
|
|
|
|
### FLUX 1.2 via diffusers
|
|
```python
|
|
from diffusers import FluxPipeline
|
|
import torch
|
|
|
|
pipe = FluxPipeline.from_pretrained(
|
|
"black-forest-labs/FLUX.1.2-dev",
|
|
torch_dtype=torch.bfloat16,
|
|
).to("cuda")
|
|
|
|
image = pipe(
|
|
prompt="a samurai in moonlit bamboo forest, cinematic, 35mm film grain",
|
|
guidance_scale=3.5,
|
|
num_inference_steps=28,
|
|
max_sequence_length=512,
|
|
generator=torch.Generator("cuda").manual_seed(42),
|
|
).images[0]
|
|
image.save("out.png")
|
|
```
|
|
|
|
### Midjourney v7 prompt with flags
|
|
```text
|
|
ethereal forest spirit, glowing mushrooms, volumetric mist
|
|
--v 7 --style raw --ar 21:9 --s 250 --p personal_v3 --c 25
|
|
```
|
|
|
|
### Version sweep harness
|
|
```python
|
|
versions = ["flux-1.2-dev", "sd-3.5-large", "flux-pro"]
|
|
prompts = [...]
|
|
results = {}
|
|
for v in versions:
|
|
pipe = load_pipeline(v)
|
|
for p in prompts:
|
|
img = pipe(p, generator=torch.Generator("cuda").manual_seed(42)).images[0]
|
|
results[(v, p)] = img
|
|
save(f"{v}/{slug(p)}.png", img)
|
|
make_contact_sheet(results)
|
|
```
|
|
|
|
### Model registry (production)
|
|
```python
|
|
@dataclass
|
|
class ModelVersion:
|
|
name: str
|
|
revision: str # commit hash on HF
|
|
vae: str
|
|
text_encoders: list[str]
|
|
pinned_at: datetime
|
|
aesthetic_tags: list[str]
|
|
|
|
REGISTRY = {
|
|
"hero-banner-v3": ModelVersion(
|
|
name="black-forest-labs/FLUX.1.2-dev",
|
|
revision="a1b2c3d",
|
|
vae="flux-vae-16ch",
|
|
text_encoders=["t5-xxl", "clip-l"],
|
|
pinned_at=datetime(2026, 4, 1),
|
|
aesthetic_tags=["photoreal", "high-detail"],
|
|
),
|
|
}
|
|
```
|
|
|
|
### LoRA stack on top of base version
|
|
```python
|
|
pipe.load_lora_weights("brand/logo-lora-flux12", adapter_name="brand")
|
|
pipe.load_lora_weights("style/cinematic-flux12", adapter_name="cine")
|
|
pipe.set_adapters(["brand", "cine"], adapter_weights=[0.8, 0.6])
|
|
```
|
|
|
|
### ComfyUI workflow JSON snippet (version pin)
|
|
```json
|
|
{
|
|
"checkpoint": { "ckpt_name": "flux1.2-dev-fp8.safetensors" },
|
|
"loras": [
|
|
{ "name": "filmgrain_v2.safetensors", "strength": 0.4 }
|
|
],
|
|
"sampler": { "name": "euler", "scheduler": "simple", "steps": 28, "cfg": 3.5 }
|
|
}
|
|
```
|
|
|
|
## 매 결정 기준
|
|
| 목표 | 권장 모델 (2026) |
|
|
|---|---|
|
|
| Photorealism | FLUX 1.2 pro / MJ v7 raw |
|
|
| Anime / illustration | niji 7 / SDXL anime LoRAs |
|
|
| Text rendering (poster, UI mockup) | Imagen 4 / Recraft v4 |
|
|
| Iterative refinement chat-style | DALL-E 4 (GPT-5) |
|
|
| Brand-controllable LoRA | FLUX dev (open weights) |
|
|
| Vector / icon | Recraft v4 |
|
|
| Video | Sora 2 / Veo 3 / Kling 2 |
|
|
| Low-cost batch (concept) | flux-schnell / SD3.5 turbo |
|
|
|
|
**기본값**: FLUX 1.2-dev — open weights, LoRA ecosystem, prompt adherence 최강. 매 production 에서 pin specific revision.
|
|
|
|
## 🔗 Graph
|
|
- 부모: [[AI 이미지 생성 (AI Image Generation)]] · [[AI Image Generation Workflow (canonical)]]
|
|
- 변형: [[Midjourney-v7]]
|
|
- 응용: [[Style_Reference_(--sref)]]
|
|
|
|
## 🤖 LLM 활용
|
|
**언제**: model release notes 의 summary, version migration checklist, prompt syntax 의 version-specific 차이 체크.
|
|
**언제 X**: 매 actual aesthetic judgment — 매 visual A/B 가 ground truth. LLM 의 aesthetic claim 의 hallucination 빈번.
|
|
|
|
## ❌ 안티패턴
|
|
- **No version pin**: 매 production 의 reproducibility 죽음. 매 model card revision hash 필수.
|
|
- **Latest = best 가정**: 매 v7 이 v6 보다 specific style 에서 worse 의 사례 흔함.
|
|
- **Mixing flags from different versions**: 매 silent ignore, 매 debug 어려움.
|
|
- **Single-model lock-in**: 매 hybrid pipeline (one base, one inpaint, one upscale) 가 보통 best.
|
|
|
|
## 🧪 검증 / 중복
|
|
- Verified (Black Forest Labs FLUX 1.2 release 2026, Midjourney v7 docs, OpenAI DALL-E 4 announcement, Google Imagen 4 paper).
|
|
- 신뢰도 A.
|
|
|
|
## 🕓 Changelog
|
|
| 날짜 | 변경 |
|
|
|---|---|
|
|
| 2026-05-08 | Phase 1 |
|
|
| 2026-05-10 | Manual cleanup — 2026 image-gen model lineage + version pinning patterns |
|