"매 real-time game translation은 매 2026 기준 매 LLM (Claude Opus 4.7, GPT-5, Gemini 2.5) 기반 매 sub-200ms latency 달성 가능, 매 그러나 매 game-specific terminology / context / tone 보존이 매 quality bottleneck." 매 Final Fantasy XIV 매 cross-region party (2024 EA), 매 Helldivers 2 cross-language voice chat, 매 Among Us global match, 매 Genshin co-op random matchmaking — 매 매 game이 매 Discord-style translation overlay 또는 매 in-game native integration 채택. 매 2026 perspective, 매 on-device translation (Apple Translate / Google Translate Live)이 매 server-side LLM과 매 hybrid.
매 핵심
매 latency budget
Voice → text (STT): 50-150ms (Whisper Large v3 / Apple SpeechRecognizer).
Text → speech (TTS): 50-200ms (ElevenLabs Turbo / Apple AVSpeech).
Total: 매 200-650ms — 매 acceptable for chat, 매 borderline for synced voice.
매 quality challenges
Game terminology: 매 'gank', 매 'AOE', 매 'CC', 매 'tank/heal/dps' — 매 generic translator는 매 비-game 의미 출력.
Tone preservation: 매 banter / trash talk / encouragement — 매 translation에서 매 emotional flatten.
Code-switching: 매 'GG', 매 'rip', 매 'gg ez' — 매 community jargon은 매 translate 안 하는 게 적절.
Latency-quality trade-off: 매 streaming partial translation은 매 mid-sentence revision 발생.
매 응용
Final Fantasy XIV (2024) Auto-Translator: 매 dictionary-based + 매 LLM hybrid — 매 dictionary는 매 game terms (zero latency), 매 free chat은 매 LLM.
Helldivers 2 voice chat (2024): 매 server-side STT + 매 translation, 매 ~400ms.
VRChat OSC translation mods (2025): 매 third-party — 매 Whisper.cpp local + 매 Llama 3 8B local 매 fully on-device.
💻 패턴
Streaming STT + translate pipeline
importasynciofromanthropicimportAsyncAnthropicclient=AsyncAnthropic()asyncdeftranslate_stream(audio_chunks,target_lang:str):# 매 1단계 — 매 streaming STT (Whisper)asyncforpartial_textinwhisper_stream(audio_chunks):# 매 2단계 — 매 LLM translation with cacheasyncwithclient.messages.stream(model="claude-haiku-4-7",system=[{"type":"text","text":GAME_GLOSSARY,"cache_control":{"type":"ephemeral"}},{"type":"text","text":f"Translate to {target_lang}, preserve gaming jargon."},],messages=[{"role":"user","content":partial_text}],max_tokens=100,)asstream:asyncforchunkinstream.text_stream:yieldchunk
Game term glossary (cached system prompt)
GAME_GLOSSARY="""
You translate game chat. Keep the following terms verbatim:
- gank, AOE, CC, tank, heal, dps, OOM, OP, nerf, buff, meta
- GG, GLHF, AFK, BRB, lol, lmao, rip, OMW, inc, bot, top, mid, jg, sup
- skillshot, juke, last-hit, deny, ward, vision, smite, flash
For roles in MOBA: keep "ADC", "support", "jungle", "carry" verbatim.
For class names: keep "warrior", "mage" etc translated; keep "Bard", "Reaper" (FF14 jobs) verbatim.
Respond ONLY with the translation, no preamble.
"""
Dictionary first-pass + LLM fallback
TERM_DICT={'ko->en':{'딜러':'DPS','탱커':'tank','힐러':'healer','버스':'carry me','버스기사':'carry player',},'en->ko':{'gank':'갱','AOE':'광역',},}asyncdeftranslate_hybrid(text:str,src:str,tgt:str)->str:direction=f"{src}->{tgt}"table=TERM_DICT.get(direction,{})# 매 모든 token이 매 dict에 있으면 매 LLM call 매 skiptokens=text.split()ifall(tintableort.isspace()fortintokens):return' '.join(table.get(t,t)fortintokens)# 매 그 외 — 매 LLMreturnawaitllm_translate(text,src,tgt)
Quality-vs-latency mode switch
classTranslationMode:INSTANT="instant"# 매 dictionary only, 매 ~5msFAST="fast"# 매 Haiku, 매 ~150msQUALITY="quality"# 매 Sonnet, 매 ~400msdefselect_mode(message_length:int,channel:str)->str:ifchannel=='voice_synced':returnTranslationMode.INSTANTifchannel=='team_chat':returnTranslationMode.FASTifchannel=='guild_chat':returnTranslationMode.QUALITYreturnTranslationMode.FAST
TTS voice cloning preservation (2026 ElevenLabs)
# 매 player의 매 own voice 보존 — 매 다른 언어로 매 same voiceasyncdefvoice_translate(audio_chunk,target_lang,voice_id):text=awaitwhisper_transcribe(audio_chunk)translated=awaitllm_translate(text,target=target_lang)audio_out=awaitelevenlabs.tts(text=translated,voice_id=voice_id,# 매 player가 매 onboarding 시 voice clonemodel='eleven_turbo_v3',latency_optimization=True,)returnaudio_out# 매 ~300ms total
Anti-toxicity filter (LLM evaluator)
asyncdeffilter_toxicity(translated:str)->str:result=awaitclient.messages.create(model='claude-haiku-4-7',max_tokens=10,messages=[{'role':'user','content':f"Is this game chat OK? Reply 'OK' or 'BLOCK'.\n{translated}",}],)returntranslatedif'OK'inresult.content[0].textelse'[message hidden]'
매 결정 기준
상황
Approach
Synced voice (FPS shoutcaller)
매 dictionary-only + 매 partial overlap acceptable
Async chat (MMO town)
매 LLM Quality mode
MOBA team chat
매 dictionary first + 매 LLM fallback
VR social
매 voice cloning + 매 ~300ms
Mobile turn-based
매 server LLM, 매 latency 무관
기본값: 매 dictionary-first + 매 LLM (Haiku) fallback + 매 prompt cache. 매 매 game-specific glossary가 매 quality 의 50% 결정.
🔗 Graph
🤖 LLM 활용
언제: 매 game chat translation의 매 core. 매 Claude Haiku 4.7 / GPT-5 nano + 매 prompt cache (system glossary)로 매 ~$0.001/message.
언제 X: 매 lore-heavy NPC dialogue → 매 사전 localization 작업 매 더 quality.
❌ 안티패턴
Generic translator (Google Translate raw): 매 game jargon 의 매 mistranslate.
No glossary cache: 매 매 message마다 glossary token 재전송 → 매 cost / latency 5×.
Block over-translate: 매 'GG' → '잘 했어요' 매 awkward — 매 jargon은 매 keep raw.
Voice without anti-toxicity: 매 translation이 매 toxicity amplifier 역할.