[[Tree-of-Thought (ToT, 사고 트리)]]

📌 Brief Summary

Tree-of-Thought (ToT) is a reasoning framework in which an LLM explores a problem as a **tree** rather than as the linear sequence of steps used by Chain-of-Thought. At each intermediate step it branches into several candidate lines of thought, evaluates them, and selects which to pursue. On problems where early choices decisively shape the final result, such as chess or math puzzles, it reaches far higher accuracy than Chain-of-Thought.

---

📖 Core Content

## 1. CoT vs ToT: Structural Comparison

```
[Chain-of-Thought (CoT)]
S → T₁ → T₂ → T₃ → answer
(single linear path → one wrong step cannot be recovered)

[Tree-of-Thought (ToT)]
          S
        / | \
      T₁  T₂  T₃     ← Step 1: generate 3 thought branches
     /│\   │
    A B C  D         ← Step 2: expand each branch further
      ↓
[Evaluate] B is most promising → keep exploring only B
      ↓
  final answer
(only promising paths are explored in depth)
```

---

## 2. The 4 Core Components of ToT

| Component | Description |
|------|------|
| **Thought** | An intermediate reasoning step (a sentence, an equation, a plan, etc.) |
| **Generator** | The LLM proposes several candidate Thoughts from the current state |
| **Evaluator** | Scores how promising each Thought is (the LLM itself or a separate function) |
| **Search** | Chooses the search strategy: BFS (breadth-first) or DFS (depth-first) |

---

## 3. Search Strategies

| Strategy | Method | Suitable problems |
|------|------|---------|
| **BFS** (breadth-first) | Evaluate every Thought at the current level, then keep the top K | Stepwise problems that can be evaluated level by level |
| **DFS** (depth-first) | Explore a promising path deeply; backtrack when stuck | Search problems whose solution lies deep in the tree |
| **MCTS** (Monte Carlo Tree Search) | Simulation plus statistical selection | Games, complex decision-making |

---

## 4. Performance Figures

| Benchmark | IO (direct output) | CoT | ToT | ToT gain |
|---------|------------|-----|-----|---------|
| **Game of 24** (arithmetic puzzle) | 7.3% | 4.0% | **74%** | +67 pp (vs. IO) |
| **Creative Writing** | n/a | n/a | **higher evaluation score** | Balance of creativity and logic |
| **Mini Crosswords** | 0% | 3.7% | **20%** | +16 pp (vs. CoT) |

---

## 5. Why ToT Improves Accuracy: Causal Chain

```
[Limitation of CoT]
One wrong inference → contaminates every later step
(structural weakness of a linear path)
        ↓
[How ToT addresses it]
Generate several candidate Thoughts at once (branching)
        ↓
The LLM itself judges "Is this path heading in the right direction?"
(Self-Evaluation)
        ↓
Prune unpromising paths early (Pruning)
        ↓
Concentrate compute on promising paths
        ↓
Large accuracy gains on complex multi-step problems
```

---

## 6. Limitations of ToT

- **Compute cost**: On the order of Branch × Depth LLM calls → tens to hundreds of times the cost of CoT.
- **Speed**: Unsuitable for real-time response systems.
- **Evaluator reliability**: The LLM itself judges what counts as a "good Thought", so evaluation errors are possible.

---

🔗 Knowledge Connections

- **Related Topics:** [[Chain-of-Thought (CoT, 사고 사슬)]], [[ReAct (Reasoning + Acting)]], [[강화학습 (Reinforcement Learning)]], [[GRPO (Group Relative Policy Optimization)]], [[Multi-Hop Reasoning (다중 홉 추론)]], [[Self-Consistency (자기 일관성)]]
- **Projects/Contexts:** [[AI 추론 시스템]]
- **Contradictions/Notes:**
  - ToT has an extreme cost/performance trade-off → better suited to offline batch jobs and research than to real-time services.
  - **New keywords**: `MCTS (몬테카를로 트리 탐색)`, `Self-Evaluation`, `Backtracking` → add to the exploration queue.
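
---

The four components in section 2 and the BFS strategy in section 3 can be sketched on a Game of 24 style puzzle. This is a minimal illustration under stated assumptions, not the reference implementation: `generate`, `evaluate`, `tot_bfs`, and `beam_width` are hypothetical names, and both the generator and the evaluator are deterministic stand-ins for what would be LLM calls in a real ToT system. The evaluator in particular is a crude "closest number to 24" heuristic chosen only to keep the example self-contained.

```python
from fractions import Fraction
from itertools import combinations

TARGET = Fraction(24)

def generate(state):
    """Generator stand-in: every way to combine two remaining numbers.
    In a real ToT system an LLM would propose these candidate Thoughts."""
    nums, steps = state
    children = []
    for i, j in combinations(range(len(nums)), 2):
        a, b = nums[i], nums[j]
        rest = tuple(n for k, n in enumerate(nums) if k not in (i, j))
        options = [(a + b, f"{a} + {b}"), (a * b, f"{a} * {b}"),
                   (a - b, f"{a} - {b}"), (b - a, f"{b} - {a}")]
        if b != 0:
            options.append((a / b, f"{a} / {b}"))
        if a != 0:
            options.append((b / a, f"{b} / {a}"))
        for value, text in options:
            children.append((rest + (value,), steps + (f"{text} = {value}",)))
    return children

def evaluate(state):
    """Evaluator stand-in (assumed heuristic): higher is better when some
    remaining number is close to 24. A real system would ask the LLM."""
    nums, _ = state
    return -min(abs(TARGET - n) for n in nums)

def tot_bfs(numbers, beam_width=5):
    """BFS search: expand every state on the current level, score the
    children, and keep only the top `beam_width` (the 'top K' of section 3)."""
    frontier = [(tuple(Fraction(n) for n in numbers), ())]
    while frontier and len(frontier[0][0]) > 1:
        candidates = [child for state in frontier for child in generate(state)]
        candidates.sort(key=evaluate, reverse=True)
        frontier = candidates[:beam_width]  # pruning step
    for nums, steps in frontier:
        if nums == (TARGET,):
            return list(steps)
    return None  # no surviving path reaches 24

if __name__ == "__main__":
    print(tot_bfs([4, 9, 10, 13], beam_width=5))
    print(tot_bfs([4, 9, 10, 13], beam_width=10_000))
```

With a very large `beam_width` nothing is pruned and the search is exhaustive, so a solvable puzzle is always solved. With a small beam the search may return `None` even though a solution exists, because the heuristic can discard the correct branch early. That is the evaluator-reliability limitation from section 6 in miniature, and the number of `generate` and `evaluate` calls per level shows where the compute cost comes from.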