--- id: P-REINFORCE-AUTO-ZSCOT-001 category: "10_Wiki/๐Ÿ’ก Topics/AI" confidence_score: 0.98 tags: [auto-reinforced, zero-shot-cot, chain-of-thought, reasoning, prompting, llm] last_reinforced: 2026-04-20 --- # [[Zero-Shot-Chain-of-Thought|Zero-Shot-Chain-of-Thought]] ## ๐Ÿ“Œ ํ•œ ์ค„ ํ†ต์ฐฐ (The Karpathy Summary) > "๋งˆ๋ฒ•์˜ ์ฃผ๋ฌธ '์ฐจ๊ทผ์ฐจ๊ทผ ์ƒ๊ฐํ•ด๋ณด์ž': ๋ณต์žกํ•œ ๋ฌธ์ œ์— ๋Œ€ํ•ด ๊ฒฐ๊ณผ๋ฅผ ๋ฐ”๋กœ ๋ฌป์ง€ ์•Š๊ณ  ์ƒ๊ฐ์˜ ๊ณผ์ •์„ ์š”๊ตฌํ•จ์œผ๋กœ์จ, AI์˜ ๋…ผ๋ฆฌ ํšŒ๋กœ๋ฅผ ๊ฐ•์ œ๋กœ ํ™œ์„ฑํ™”ํ•ด ์ •๋‹ต๋ฅ ์„ ๋“œ๋ผ๋งˆํ‹ฑํ•˜๊ฒŒ ๋†’์ด๋Š” ์ถ”๋ก  ์ตœ์ ํ™” ๊ธฐํญ์ œ." ## ๐Ÿ“– ๊ตฌ์กฐํ™”๋œ ์ง€์‹ (Synthesized Content) ์ œ๋กœ์ƒท ์—ฐ์‡„ ์‚ฌ๊ณ (Zero-Shot Chain-of-Thought, Zero-Shot-CoT)๋Š” 2022๋…„ Kojima ๋“ฑ์ด ์ œ์•ˆํ•œ ๊ธฐ๋ฒ•์œผ๋กœ, ํ”„๋กฌํ”„ํŠธ ๋์— ํŠน์ • ๋ฌธ๊ตฌ๋ฅผ ๋ง๋ถ™์ด๋Š” ๊ฒƒ๋งŒ์œผ๋กœ LLM์˜ ๋‹ค๋‹จ๊ณ„ ์ถ”๋ก  ๋Šฅ๋ ฅ์„ ์ด๋Œ์–ด๋‚ด๋Š” ๋งค์šฐ ๋‹จ์ˆœํ•˜๋ฉด์„œ๋„ ๊ฐ•๋ ฅํ•œ ํ”„๋กฌํ”„ํŠธ ์—”์ง€๋‹ˆ์–ด๋ง ๊ธฐ์ˆ ์ž…๋‹ˆ๋‹ค. 1. **ํ•ต์‹ฌ ํŠธ๋ฆฌ๊ฑฐ ๋ฌธ๊ตฌ**: * **"Let's think step by step (์ฐจ๊ทผ์ฐจ๊ทผ ๋‹จ๊ณ„๋ณ„๋กœ ์ƒ๊ฐํ•ด๋ณด์ž)"** 2. **์ž‘๋™ ์›๋ฆฌ**: * ์ด ๋ฌธ๊ตฌ๋Š” ๋ชจ๋ธ์ด ๋‹ค์Œ์— ์˜ฌ ํ† ํฐ์„ ์ƒ์„ฑํ•  ๋•Œ, '์ตœ์ข… ์ •๋‹ต' ๋Œ€์‹  '์ค‘๊ฐ„ ์ถ”๋ก  ๊ณผ์ •'์„ ๋จผ์ € ์ƒ์„ฑํ•˜๊ฒŒ ์œ ๋„ํ•จ. * ๋ชจ๋ธ์€ ์ž์‹ ์ด ์•ž์„œ ์ƒ์„ฑํ•œ ์ถ”๋ก  ๋‹จ๊ณ„๋ฅผ ๋ฐ”ํƒ•์œผ๋กœ ๋‹ค์Œ ๋‹จ๊ณ„๋ฅผ ์—ฐ์‚ฐํ•˜๋ฏ€๋กœ, ๋ณต์žกํ•œ ์‚ฐ์ˆ ์ด๋‚˜ ๋…ผ๋ฆฌ ๋ฌธ์ œ์—์„œ '์‹œ์Šคํ…œ 2(๋А๋ฆฐ ์‚ฌ๊ณ )'์™€ ์œ ์‚ฌํ•œ ์ •๊ตํ•จ์„ ๋ฐœํœ˜ํ•˜๊ฒŒ ๋จ. 3. **์žฅ์ **: * **Zero-shot**: ์–ด๋– ํ•œ ํ“จ์ƒท ์˜ˆ์‹œ(Examples)๋„ ์ค€๋น„ํ•  ํ•„์š”๊ฐ€ ์—†์Œ. * **Versatility**: ์ˆ˜ํ•™, ๋…ผ๋ฆฌ, ์ƒ์‹ ์ถ”๋ก  ๋“ฑ ๋ถ„์•ผ๋ฅผ ๊ฐ€๋ฆฌ์ง€ ์•Š๊ณ  ์ ์šฉ ๊ฐ€๋Šฅ. ## โš ๏ธ ๋ชจ์ˆœ ๋ฐ ์—…๋ฐ์ดํŠธ (Contradictions & RL Update) - **๊ณผ๊ฑฐ ๋ฐ์ดํ„ฐ์™€์˜ ์ถฉ๋Œ**: ๊ณผ๊ฑฐ์—๋Š” ๋ชจ๋ธ์˜ ์ถ”๋ก  ๋Šฅ๋ ฅ์ด ์ˆœ์ „ํžˆ ํŒŒ๋ผ๋ฏธํ„ฐ ํฌ๊ธฐ์—๋งŒ ๋‹ฌ๋ฆฐ ์ค„ ์•Œ์•˜์œผ๋‚˜, Zero-Shot-CoT ์ •์ฑ…์€ '๋ช…๋ น์–ด์˜ ๊ตฌ์กฐ'๋งŒ์œผ๋กœ๋„ ์ž ์žฌ๋œ ์ง€๋Šฅ์„ ์ˆ˜์‹ญ ํผ์„ผํŠธ ๋” ๋Œ์–ด์˜ฌ๋ฆด ์ˆ˜ ์žˆ์Œ์„ ์ž…์ฆํ•˜๋ฉฐ ํ”„๋กฌํ”„ํŠธ ์—”์ง€๋‹ˆ์–ด๋ง ์ •์ฑ…์˜ ์œ„์ƒ์„ ๊ฒฉ์ƒ์‹œํ‚ด(RL Update). - **์ •์ฑ… ๋ณ€ํ™”(RL Update)**: ์—”ํ„ฐํ”„๋ผ์ด์ฆˆ AI ์†”๋ฃจ์…˜ ์ •์ฑ… ์ˆ˜๋ฆฝ ์‹œ, ๋‹ต๋ณ€์˜ ์ •ํ™•๋„์™€ ํˆฌ๋ช…์„ฑ์„ ๋†’์ด๊ธฐ ์œ„ํ•ด ๋ชจ๋“  AI ์‘๋‹ต์— Zero-Shot-CoT๋ฅผ ๊ธฐ๋ณธ ์ ์šฉํ•˜์—ฌ '์‚ฌ๊ณ ์˜ ๊ทผ๊ฑฐ'๋ฅผ ํ•จ๊ป˜ ์ถœ๋ ฅํ•˜๊ฒŒ ํ•˜๋Š” '๊ณผ์ • ์ค‘์‹ฌ ์‘๋‹ต ์ •์ฑ…'์ด ํ™•๋Œ€๋จ. ## ๐Ÿ”— ์ง€์‹ ์—ฐ๊ฒฐ (Graph) - [[Prompt-Engineering|Prompt-Engineering]], [[Zero Shot and Few Shot Learning|Zero Shot and Few Shot Learning]], [[Thought-Architecture|Thought-Architecture]], [[Decision Theory|Decision Theory]], Self-Correction Mechanisms - **Modern Tech/Tools**: OpenAI o1 model (Internal CoT), LangChain advanced chains. ---