Files
2nd/10_Wiki/Topics/AI_and_ML/2026년 인공지능 시각 언어 생성 패러다임 전환 및 연속적 창작 워크플로우.md
T
koriweb d8a80f6272 chore(wiki): dangling 링크 canonical 정규화 (768파일/1200건)
이름만 다른(표기 변형) [[위키링크]]를 대상 문서의 canonical 제목으로 치환해
끊겼던 1,200개 링크를 연결. 제목/파일명 정규화 일치만 적용하고 별칭 매칭은
과병합 위험으로 제외(애매성 가드). 원본은 _link_reconcile_backup/ 에 백업.
도구: Datacollect/scripts/link_reconcile_apply.mjs

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
2026-06-08 12:24:15 +09:00

7.4 KiB

id, title, category, status, canonical_id, aliases, duplicate_of, source_trust_level, confidence_score, verification_status, tags, raw_sources, last_reinforced, github_commit
id title category status canonical_id aliases duplicate_of source_trust_level confidence_score verification_status tags raw_sources last_reinforced github_commit
wiki-2026-0508-2026년-인공지능-시각-언어-생성-패러다임-전환-및-연속 2026 AI Visual Language Generation Paradigm Shift 10_Wiki/Topics verified self
continuous creative workflow
visual AI 2026
draft mode paradigm
prompt engineering visual
none B 0.85 conceptual
image-generation
midjourney-v7
draft-mode
prompt-engineering
continuous-workflow
visual-ai
2026-05-09 pending

2026 AI Visual Language Generation Paradigm Shift

📌 한 줄 통찰

Single shot → continuous workflow. 매 draft mode 의 fast iteration + omni reference 의 consistency + post-edit 의 polish. 매 prompt 의 camera / lighting science 의 vocabulary.

📖 핵심 paradigm shift

매 evolution

2022-2023 (Era 1): Single shot

  • 매 prompt → image.
  • 매 luck.
  • 매 generic output.

2023-2024 (Era 2): Iterative

  • 매 multiple variation.
  • 매 prompt iterate.
  • 매 inpaint.

2025-2026 (Era 3): Continuous workflow

  • 매 draft mode (cheap explore).
  • 매 reference (style, character, omni).
  • 매 post-edit pipeline.
  • 매 production-quality output.

매 5-layer prompt structure

1. Subject

  • 매 specific entity (person, object, scene).
  • 매 physical detail.
  • 매 emotional / narrative context.

2. Medium

  • "Oil painting, watercolor, digital art, photo".
  • 매 era / school ("Renaissance, Bauhaus, Cyberpunk").

3. Environment / Composition

  • 매 location.
  • 매 framing ("close-up, wide shot, low angle").
  • 매 background.

4. Lighting

  • 매 type ("Golden hour, volumetric, chiaroscuro, rim light").
  • 매 source ("softbox, natural, neon").

5. Technical parameter

  • 매 lens ("85mm, 24mm, macro").
  • 매 depth ("shallow, deep").
  • 매 ratio ("--ar 16:9").
  • 매 quality ("--q 2, 8k").

매 photography vocabulary

  • Lens: 매 85mm portrait, 24mm wide, 100mm macro.
  • Aperture: f/1.4 (shallow DOF), f/8 (sharp).
  • Lighting type: golden hour, blue hour, soft light, hard light.
  • Composition: rule of thirds, leading lines, symmetry.
  • Color theory: complementary, analogous, monochrome.

Continuous workflow

Step 1: Mood board

  • 매 reference (Pinterest, ArtStation).
  • 매 style direction.

Step 2: Draft generation

  • 매 30+ variant.
  • Midjourney --draft (10x speed).
  • Flux Schnell (4 step).

Step 3: Selection

  • 매 promising 5-10.
  • 매 visual review.

Step 4: Refinement

  • 매 prompt iterate.
  • 매 reference (sref / cref / oref).

Step 5: Full quality

  • 매 selected 의 high-quality.

Step 6: Post-edit

  • 매 inpaint defects.
  • 매 outpaint extend.
  • 매 face restoration.

Step 7: Upscale

  • Real-ESRGAN.
  • Magnific.
  • Topaz.

Step 8: Final touch (optional)

  • Photoshop.
  • Lightroom (color grade).

매 reference 의 type

Style reference (sref)

  • 매 brand 의 mood.
  • 매 visual coherence.

Character reference (cref)

  • 매 person consistency.
  • 매 series / campaign.

Omni reference (oref) — Midjourney V7

  • 매 specific object identity.
  • 매 product mockup.

IP-Adapter (Stable Diffusion)

  • 매 reference image 의 style + structure.

매 model 의 specific control

Midjourney V7

  • --draft, --sref, --cref, --oref.
  • --s (stylize), --c (chaos), --w (weird).
  • 매 minimal natural language.

DALL-E 3

  • 매 natural language.
  • 매 GPT-4 의 expansion.
  • 매 negation 약.

Stable Diffusion / Flux

  • 매 weighted prompt: (keyword:1.2).
  • 매 negative prompt 강.
  • 매 LoRA, ControlNet, IP-Adapter.

매 emerging (2026)

Video generation

  • Sora (OpenAI).
  • Veo 2 (Google).
  • Runway Gen-3.
  • Kling.
  • 매 image → video.
  • 매 1 minute clip.

3D generation

  • 매 image / text → 3D mesh.
  • 매 game asset.
  • TripoSR, InstantMesh.

Real-time generation

  • LCM (Latent Consistency Model).
  • SDXL Turbo.
  • 매 < 1 sec / image.

💻 Code

Iterative workflow (production)

class CreativeWorkflow:
    def __init__(self, model="midjourney"):
        self.model = model
    
    def explore(self, base_prompt: str, n_drafts=30):
        """Stage 1: Draft."""
        variations = self.generate_variations(base_prompt)
        return self.batch_generate(variations, draft=True)
    
    def select(self, drafts, criteria="visual_quality"):
        """Stage 2: Select."""
        scored = [(d, self.score(d, criteria)) for d in drafts]
        return sorted(scored, key=lambda x: -x[1])[:5]
    
    def refine(self, selected_image, refinement_prompt):
        """Stage 3: Refine."""
        return self.generate(refinement_prompt, reference=selected_image)
    
    def post_edit(self, image):
        """Stage 4: Post-edit."""
        defects = self.detect_defects(image)
        for d in defects:
            image = self.inpaint(image, d.mask, prompt=d.fix_prompt)
        return image
    
    def upscale(self, image):
        """Stage 5: Upscale."""
        return self.upscaler.enhance(image, scale=4)

Reference-driven generation

def generate_with_references(prompt, style_ref=None, character_ref=None):
    parts = [prompt]
    if style_ref:
        parts.append(f"--sref {style_ref}")
    if character_ref:
        parts.append(f"--cref {character_ref}")
    
    full_prompt = " ".join(parts)
    return midjourney.generate(full_prompt)

Prompt builder (5-layer)

def build_prompt(subject, medium, env, lighting, params):
    return f"{subject}, {medium}, {env}, {lighting} {params}"

prompt = build_prompt(
    subject="elegant woman, age 30, blue eyes, smiling",
    medium="oil painting, Renaissance style",
    env="close-up portrait, marble background",
    lighting="chiaroscuro, dramatic light, volumetric",
    params="85mm lens, shallow depth of field --ar 3:2 --s 500"
)

Batch + cost optimization

def cost_aware_batch(prompts, target='exploration'):
    if target == 'exploration':
        return [generate(p, draft=True, steps=10) for p in prompts]
    elif target == 'production':
        return [generate(p, steps=50, upscale=True) for p in prompts]

🤔 결정 기준

Goal Workflow
Brand campaign sref + multi-iteration + post-edit
Character consistency cref / oref + LoRA
Quick concept Draft mode
Final polish Full quality + post-edit + upscale
Video Sora / Veo / Runway
3D asset TripoSR / InstantMesh

기본값: 5-layer prompt + draft mode + reference + post-edit + upscale 의 sequence.

🔗 Graph

🤖 LLM 활용

언제: 매 commercial creative project. 매 visual brand. 언제 X: 매 throwaway. 매 highly specific artist (legal).

안티패턴

  • Single prompt 의 expectation: cliche / generic.
  • No reference: brand inconsistency.
  • Skip post-edit: defect ship.
  • Generic vocab ("nice picture"): 매 specific 의 더 좋음.
  • Full quality from start: cost 폭발.

🧪 검증 / 중복

🕓 Changelog

날짜 변경
2026-05-08 Phase 1
2026-05-09 Manual cleanup — paradigm shift + 5-layer + workflow + emerging tech