"매 fixed config → 매 dynamic schedule". Parameter control 은 training / inference / game system 의 knob (learning rate, temperature, difficulty) 을 매 시간/state 에 따라 adaptively 변경하는 전략. 매 generative AI 의 sampler param (temp/top-p/top-k), RL 의 exploration ε, game 의 dynamic balancing 모두 같은 framework.
매 핵심
매 3 도메인
ML training: lr schedule (cosine, warmup), weight decay, dropout 의 adaptive.
Generative inference: temperature, top-p, top-k, repetition penalty 의 per-token 조정.
Game balance: dynamic difficulty adjustment (DDA), procedural generation 의 parameter.
fromvllmimportLLM,SamplingParamsllm=LLM("meta-llama/Llama-3.3-70B-Instruct")creative=SamplingParams(temperature=0.9,top_p=0.95,top_k=50,repetition_penalty=1.1,max_tokens=512)factual=SamplingParams(temperature=0.2,top_p=0.9,top_k=20,max_tokens=256)llm.generate("Write a poem.",creative)llm.generate("Capital of FR?",factual)