---
id: wiki-2026-0508-rouge-metrics
title: ROUGE Metrics
category: 10_Wiki/Topics
status: needs_review
canonical_id: self
aliases: [NLP-MET-ROUGE-001]
duplicate_of: none
source_trust_level: A
confidence_score: 1.0
tags: [ai, nlp, metrics, rouge, summarization, evaluation, text-Analysis]
raw_sources: []
last_reinforced: 2026-04-26
github_commit: pending
inferred_by: Claude Opus 4.7 (auto-normalize 2026-05-08)
---

# ROUGE Metrics

## 📌 One-Line Insight (The Karpathy Summary)

> "Quantitatively measure how many of the key words, and how much of the context, of a human-written reference summary the AI managed to 'reproduce'." ROUGE is a metric for evaluating text summarization models: it computes the degree of n-gram overlap between a model-generated summary and a reference summary.

## 📖 Structured Knowledge (Synthesized Content)

- **Extracted pattern:** "Recall-Oriented Overlap Analysis". Starting from the view that the purpose of a summary is to leave no information out, the score is driven by how many of the reference summary's words appear in the model output.
- **Main sub-metrics:**
  - **ROUGE-N:** Measures how many contiguous n-word sequences (unigrams, bigrams, and so on) overlap.
  - **ROUGE-L:** Measures similarity of sentence structure based on the longest common subsequence (LCS).
  - **ROUGE-W / ROUGE-S:** Variants that apply weighting (ROUGE-W, a weighted LCS that favors consecutive matches) or allow skips (ROUGE-S, skip-bigram overlap).
- **Significance:** Converts the potentially subjective notion of "summary quality" into an automated number, so that tens of thousands of summaries can be compared against a consistent standard and used to improve the model.
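
The ROUGE-N and ROUGE-L definitions above can be sketched in a few lines of plain Python. This is a minimal illustration, not the official ROUGE implementation: it assumes lowercased whitespace tokenization, no stemming, a single reference, and sentence-level LCS. The function names (`rouge_n`, `rouge_l`, `lcs_length`) are hypothetical and chosen only for this example.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return a multiset (Counter) of the contiguous n-grams in tokens."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def f1(precision, recall):
    """Harmonic mean of precision and recall (0.0 when both are zero)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def rouge_n(candidate, reference, n=1):
    """Clipped n-gram overlap; returns (precision, recall, f1)."""
    # Assumption: whitespace tokenization on lowercased text.
    cand = ngrams(candidate.lower().split(), n)
    ref = ngrams(reference.lower().split(), n)
    # Counter intersection keeps the minimum count of each shared n-gram.
    overlap = sum((cand & ref).values())
    precision = overlap / max(sum(cand.values()), 1)
    recall = overlap / max(sum(ref.values()), 1)
    return precision, recall, f1(precision, recall)


def lcs_length(a, b):
    """Length of the longest common subsequence, by dynamic programming."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l(candidate, reference):
    """LCS-based score; returns (precision, recall, f1)."""
    cand = candidate.lower().split()
    ref = reference.lower().split()
    lcs = lcs_length(cand, ref)
    precision = lcs / max(len(cand), 1)
    recall = lcs / max(len(ref), 1)
    return precision, recall, f1(precision, recall)


if __name__ == "__main__":
    reference = "the cat sat on the mat"
    candidate = "the cat lay on the mat"
    print("ROUGE-1:", rouge_n(candidate, reference, n=1))
    print("ROUGE-2:", rouge_n(candidate, reference, n=2))
    print("ROUGE-L:", rouge_l(candidate, reference))
```

For this pair, five of the six reference unigrams and three of the five reference bigrams are recovered, so ROUGE-1 recall is 5/6 and ROUGE-2 recall is 3/5; the LCS ("the cat … on the mat") has length 5, giving ROUGE-L recall 5/6. Recall is the recall-oriented quantity the pattern above emphasizes; precision and F1 are returned alongside it because most tooling reports all three.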

## ⚠️ Contradictions & Updates

- **Conflict with earlier data:** A summary is not good simply because many of its words overlap with the reference; ROUGE ignores semantic similarity. To offset this limitation, semantic-embedding-based metrics such as [[BERT|BERT]]Score and the "LLM-as-a-judge" approach, which uses an LLM as the evaluator, are now used alongside it.
- **Policy change:** When validating the automatic summarization feature across 1,174 wiki documents, the Antigravity project uses ROUGE-L as its default performance metric to check whether information has been omitted.

## 🔗 Knowledge Links (Graph)

- [[Natural-Language-Processing|Natural-Language-Processing-NLP]], [[Performance-Metrics-in-AI|Performance-Metrics-in-AI]], [[RAG-and-Document-Retrieval|RAG-and-Document-Retrieval]], [[Prompt-Engineering-Foundations|Prompt-Engineering-Foundations]]
- **Raw Source:** 10_Wiki/Topics/AI/ROUGE-Metrics.md

## 🤖 LLM Usage Hints (How to Use This Knowledge)

**When to use this knowledge:**
- *(TODO)*

**When not to use it:**
- *(TODO)*

## 🧪 Validation Status (Validation)

- **Information status:** needs_review
- **Source trust level:** A
- **Reason for review:** *(P-Reinforce Phase 1 automatic normalization. Body content needs verification.)*

## 🧬 Duplicate Check

- **Existing similar documents:** *(TODO: see the indexer cluster report)*
- **Action:** UPDATE (automatic normalization)
- **Reason:** Phase 1 normalization: old template and missing fields filled in.

## 🕓 Changelog

| Date | Change | Action | Trust level |
|------|--------|--------|-------------|
| 2026-05-08 | P-Reinforce Phase 1 normalization (frontmatter + header standardization) | UPDATE | A |