--- id: darwin-gรถdel-machine title: "Darwin Gรถdel Machine" category: "10_Wiki/Topics" status: "draft" verification_status: "conceptual" canonical_id: "" aliases: ["DGM"] duplicate_of: "" source_trust_level: "B" confidence_score: 0.85 created_at: 2026-06-12 updated_at: 2026-06-12 review_reason: "" merge_history: [] tags: ["research", "self envolving", "recursive-self-design"] raw_sources: ["NotebookLM Synthesis"] applied_in: ["https://github.com/jennyzzt/dgm"] github_commit: "" --- # [[Darwin Gรถdel Machine]] ## ๐ŸŽฏ ํ•œ ์ค„ ํ†ต์ฐฐ (One-line insight) ์—์ด์ „ํŠธ๊ฐ€ ์ž์‹ ์˜ ์†Œ์Šค ์ฝ”๋“œ๋ฅผ ์ง์ ‘ ์ˆ˜์ •ํ•˜๊ณ  ์„ฑ๊ณต์ ์ธ ๋ฒ„์ „์„ ์•„์นด์ด๋ธŒ์— ์ถ•์ ํ•˜๋ฉฐ ์ง„ํ™”ํ•˜๋Š”, ์ƒ๋ฌผํ•™์  ์ง„ํ™”์™€ ์žฌ๊ท€์  ์ž๊ธฐ ์„ค๊ณ„๊ฐ€ ๊ฒฐํ•ฉ๋œ ๊ฐœ๋ฐฉํ˜• ์ž๊ธฐ ๊ฐœ์„  ํ”„๋ ˆ์ž„์›Œํฌ [1-4]. ## ๐Ÿง  ํ•ต์‹ฌ ๊ฐœ๋… (Core concepts) - **์žฌ๊ท€์  ์ž๊ธฐ ์„ค๊ณ„ (Recursive Self-Design):** ๊ณ ์ •๋œ ํŒŒ๋ผ๋ฏธํ„ฐ ์ตœ์ ํ™”๋ฅผ ๋„˜์–ด ์—์ด์ „ํŠธ์˜ ์Šค์บํด๋“œ, ๋„๊ตฌ, ์›Œํฌํ”Œ๋กœ, ํ”„๋กฌํ”„ํŠธ ์ •์ฑ…์„ ์ •์˜ํ•˜๋Š” ์ฝ”๋“œ๋ฒ ์ด์Šค ์ž์ฒด๋ฅผ ์ˆ˜์ •ํ•จ [4-6]. - **์ง„ํ™”์  ์•„์นด์ด๋ธŒ (Evolutionary Archive):** ๋ชจ๋“  ์—ญ์‚ฌ์  ๋ฒ„์ „("์ข…")์„ ์ €์žฅํ•˜์—ฌ ์„ ํ˜•์  ๊ฐœ์„ ์ด ์•„๋‹Œ ๋‹ค์–‘ํ•œ ์ง„ํ™” ๊ฒฝ๋กœ(๋ถ„๊ธฐ)๋ฅผ ๋ณด์กดํ•˜๊ณ  ํƒ์ƒ‰ํ•จ [2, 7, 8]. - **์ž๊ธฐ ์ฐธ์กฐ์  ๊ฐœ์„  (Self-Referential Improvement):** ์—์ด์ „ํŠธ๊ฐ€ ์ž์‹ ์˜ ์‹คํ–‰ ๋กœ๊ทธ ๋ฐ ์—๋Ÿฌ ๊ธฐ๋ก์„ ๋ถ„์„ํ•˜์—ฌ ๋ณ‘๋ชฉ ์ง€์ ์„ ํŒŒ์•…ํ•˜๊ณ , ์ด๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•œ ์ฝ”๋“œ ํŒจ์น˜๋ฅผ ์Šค์Šค๋กœ ์ž‘์„ฑ ๋ฐ ๊ฒ€์ฆํ•จ [7, 9-11]. - **์ฐธ์‹ ์„ฑ ๊ธฐ๋ฐ˜ ์„ ํƒ (Novelty-Driven Selection):** ๋‹จ์ˆœํžˆ ๋ฒค์น˜๋งˆํฌ ์ ์ˆ˜๋ฟ๋งŒ ์•„๋‹ˆ๋ผ ์„ค๊ณ„์˜ ์ฐธ์‹ ์„ฑ ๋ณด์ƒ์„ ๊ฒฐํ•ฉํ•˜์—ฌ ์กฐ๊ธฐ ์ •์ฒด๋ฅผ ๋ฐฉ์ง€ํ•˜๊ณ  ๋‹ค์–‘ํ•œ ํ•ด๊ฒฐ์ฑ…์„ ํƒ์ƒ‰ํ•จ [2, 12]. ## ๐Ÿงฉ ์ถ”์ถœ๋œ ํŒจํ„ด (Extracted patterns) - **์ธ๊ฐ„ ์ฃผ๋„ ์ดˆ๊ธฐํ™”(0-to-1) ํŒจํ„ด:** ์ธ๊ฐ„ ์—ฐ๊ตฌ์ž๊ฐ€ ์‹œ๋“œ ์—์ด์ „ํŠธ, ์‚ฌ์šฉ ๊ฐ€๋Šฅํ•œ ๋„๊ตฌ(Bash, Edit), ์•„์นด์ด๋ธŒ ๊ทœ์น™, ์ƒŒ๋“œ๋ฐ•์Šค ๋ฐ ํ‰๊ฐ€ ํ”„๋กœํ† ์ฝœ์„ ์„ค์ •ํ•จ [10, 11, 13]. - **AI ์ฃผ๋„ ํ™•์žฅ(1-to-N) ํŒจํ„ด:** ๋ถ€๋ชจ ์—์ด์ „ํŠธ๊ฐ€ ๋กœ๊ทธ๋ฅผ ๊ฒ€์‚ฌํ•˜๊ณ  ํŒจ์น˜๋ฅผ ์ž‘์„ฑํ•˜์—ฌ ์ž์‹ ์—์ด์ „ํŠธ๋ฅผ ์ƒ์„ฑํ•˜๋ฉฐ, ์ž์‹์€ ์ปดํŒŒ์ผ ๋ฐ ๊ธฐ๋Šฅ ํ…Œ์ŠคํŠธ ํ†ต๊ณผ ์‹œ ์•„์นด์ด๋ธŒ์— ๋“ฑ๋ก๋˜์–ด ๋ฏธ๋ž˜์˜ ๋ถ€๋ชจ๊ฐ€ ๋จ [10, 11, 13, 14]. - **๊ตฌ์กฐ์  ์ฝ”๋“œ ํ˜์‹  (Structural Innovation):** ๋‹จ์ˆœํ•œ ํ•˜์ดํผํŒŒ๋ผ๋ฏธํ„ฐ ํŠœ๋‹์ด ์•„๋‹Œ, ์ •๋ฐ€ ํŒŒ์ผ ๋ทฐ์–ด, ๋ฌธ์ž์—ด ๊ต์ฒด ํ”„๋ฆฌ๋ฏธํ‹ฐ๋ธŒ, ํŒจ์น˜ ๊ฒ€์ฆ ์žฌ์‹œ๋„ ๋ฃจํ”„ ๋“ฑ ์‹คํ–‰ ์—”์ง„์˜ ๊ตฌ์กฐ์  ๊ธฐ๋Šฅ์„ ์ง์ ‘ ๊ฐœ๋ฐœํ•จ [8, 15-17]. ## ๐Ÿ“– ์„ธ๋ถ€ ๋‚ด์šฉ (Details) - **๊ฐœ์š”:** Sakana AI์™€ UBC(๋ธŒ๋ฆฌํ‹ฐ์‹œ ์ปฌ๋Ÿผ๋น„์•„ ๋Œ€ํ•™๊ต)์˜ ํ˜‘๋ ฅ์œผ๋กœ ๊ฐœ๋ฐœ๋˜์—ˆ์œผ๋ฉฐ, ์ฝ”๋”ฉ ์—์ด์ „ํŠธ๊ฐ€ ์ž์‹ ์˜ Python ์†Œ์Šค ์ฝ”๋“œ๋ฅผ ์žฌ๊ท€์ ์œผ๋กœ ์ˆ˜์ •ํ•˜์—ฌ ์„ฑ๋Šฅ์„ ๋†’์ด๋Š” ์‹œ์Šคํ…œ์ž„ [3, 13, 18]. - **์ง„ํ™” ํ”„๋กœ์„ธ์Šค:** 1. ์•„์นด์ด๋ธŒ์—์„œ ๋ถ€๋ชจ ์—์ด์ „ํŠธ๋ฅผ ์ƒ˜ํ”Œ๋งํ•จ [7, 14]. 2. ๋ถ€๋ชจ ์—์ด์ „ํŠธ(Claude 3.5 Sonnet ๋“ฑ ์‚ฌ์šฉ)๊ฐ€ ์ž์‹ ์˜ ๋ฒค์น˜๋งˆํฌ ์ˆ˜ํ–‰ ๋กœ๊ทธ๋ฅผ ๋ถ„์„ํ•˜์—ฌ ์•Œ๊ณ ๋ฆฌ์ฆ˜ ๋ณ‘๋ชฉ์„ ํŒŒ์•…ํ•จ [9, 10, 19]. 3. ์ฝ”๋“œ ์ˆ˜์ • ์ œ์•ˆ์„ ์ƒ์„ฑํ•˜๊ณ  ์ž์‹ ์˜ ์ฝ”๋“œ๋ฒ ์ด์Šค์— ํŒจ์น˜๋ฅผ ์ ์šฉํ•˜์—ฌ ์ž์‹์„ ์ƒ์„ฑํ•จ [7, 14]. 4. ์ž์‹ ์—์ด์ „ํŠธ๋ฅผ ์Šคํ…Œ์ด์ง• ๊ณผ์ œ์—์„œ ํ‰๊ฐ€ํ•˜์—ฌ ์ปดํŒŒ์ผ ๋ฐ ๊ธฐ๋ณธ ๊ธฐ๋Šฅ ์œ ์ง€ ์—ฌ๋ถ€๋ฅผ ํ™•์ธํ•œ ํ›„ ์•„์นด์ด๋ธŒ์— ์ถ”๊ฐ€ํ•จ [7, 11, 14]. - **์„ฑ๋Šฅ ์ง€ํ‘œ:** 80ํšŒ ๋ฐ˜๋ณต(Iteration) ์ˆ˜ํ–‰ ๊ฒฐ๊ณผ, **SWE-bench Verified ํŒจ์Šค์œจ์ด 20%์—์„œ 50%๋กœ ํ–ฅ์ƒ**๋˜์—ˆ์œผ๋ฉฐ, **Polyglot ์ฝ”๋”ฉ ๋ฒค์น˜๋งˆํฌ ์ ์ˆ˜๋Š” 14.2%์—์„œ 30.7%๋กœ ์ƒ์Šน**ํ•จ [9, 12, 20, 21]. - **์ฃผ์š” ์ฝ”๋“œ ์ˆ˜์ค€ ๊ฐœ์„  ์‚ฌํ•ญ:** - **Granular file viewing:** ์ „์ฒด ํŒŒ์ผ ๋Œ€์‹  ๊ด€๋ จ ๋ผ์ธ ๋ฒ”์œ„๋งŒ ์กฐํšŒํ•˜์—ฌ ์ปจํ…์ŠคํŠธ ๋ถ€๋‹ด ๊ฐ์†Œ [16]. - **String-replacement editing:** ์ „์ฒด ํŒŒ์ผ ํŽธ์ง‘์„ ๊ณ ์œ  ๋งค์น˜ ๊ธฐ๋ฐ˜์˜ ์ •๋ฐ€ ๋ถ€๋ถ„ ๋ฌธ์ž์—ด ๊ต์ฒด๋กœ ๋Œ€์ฒด [16]. - **Patch validation & retry:** ๋นˆ ํŒจ์น˜๋‚˜ ํ…Œ์ŠคํŠธ ์ „์šฉ ํŒจ์น˜๋ฅผ ๊ฐ์ง€ํ•˜๊ณ  ์†Œ์Šค ํŒŒ์ผ ๋ณ€๊ฒฝ์œผ๋กœ ์žฌ์‹œ๋„ [16]. - **History-aware tracking:** ์ด์ „ ์‹œ๋„ ๊ธฐ๋ก์„ ํ™œ์šฉํ•˜์—ฌ ์ดํ›„์˜ ํŒจ์น˜ ์ƒ์„ฑ์„ ์กฐ๊ฑดํ™”ํ•จ [15-17]. - **ํ•ต์‹ฌ ๋ฐœ๊ฒฌ:** ์•„์นด์ด๋ธŒ ๊ธฐ๋ฐ˜ ํƒ์ƒ‰์„ ์ œ๊ฑฐํ–ˆ์„ ๋•Œ ์„ฑ๋Šฅ์ด 50%์—์„œ 23%๋กœ ๊ธ‰๊ฐํ–ˆ์œผ๋ฉฐ, ์ด๋Š” ๋‹ค์–‘ํ•œ "๋””๋”ค๋Œ(stepping stones)"์„ ๋ณด์กดํ•˜๋Š” ์ง„ํ™”์  ๋ฐฉ์‹์ด ์ž๊ธฐ ๊ฐœ์„ ์˜ ํ•ต์‹ฌ์ž„์„ ์‹œ์‚ฌํ•จ [8, 12, 22]. ## โš–๏ธ ๋ชจ์ˆœ ๋ฐ ์—…๋ฐ์ดํŠธ (Contradictions & updates) - **์ตœ์ ํ™” vs ์„ค๊ณ„:** ์ผ๋ฐ˜์ ์ธ AI ์ตœ์ ํ™”๋Š” ๊ณ ์ •๋œ ์„ค๊ณ„ ๊ณต๊ฐ„ ๋‚ด ํŒŒ๋ผ๋ฏธํ„ฐ ์กฐ์ •(D_t+1 = D_t)์— ๊ทธ์น˜์ง€๋งŒ, DGM์€ ์„ค๊ณ„ ๊ณต๊ฐ„ ์ž์ฒด๋ฅผ ๋ณ€๊ฒฝ(S_t+1 = ฮจ(S_t...))ํ•˜๋Š” '์ž๊ธฐ ์„ค๊ณ„'๋ฅผ ์ˆ˜ํ–‰ํ•จ [6, 23, 24]. - **๊ณ ์ •๋œ ์™ธ๋ถ€ ๋ฃจํ”„:** ์—์ด์ „ํŠธ์˜ ๋‚ด๋ถ€ ๋„๊ตฌ์™€ ์›Œํฌํ”Œ๋กœ๋Š” ์ง„ํ™”ํ•˜์ง€๋งŒ, ์ง„ํ™”๋ฅผ ์ฃผ๋„ํ•˜๋Š” ์™ธ๋ถ€์˜ ๊ฐœ๋ฐฉํ˜• ํƒ์ƒ‰ ๋ฃจํ”„๋‚˜ ๋ณด์ƒ ๊ทœ์น™ ์ž์ฒด๋Š” ์•„์ง AI๊ฐ€ ์ˆ˜์ •ํ•˜์ง€ ๋ชปํ•˜๋Š” ๊ณ ์ •๋œ ๊ฒฝ๊ณ„๋กœ ๋‚จ์•„ ์žˆ์Œ [25, 26]. - **์•ˆ์ „์„ฑ ์ด์Šˆ:** ์†Œ์Šค ์ˆ˜์ค€์˜ ์ž๊ธฐ ์ˆ˜์ •์€ ์•ˆ์ „ ๊ฐ€๋“œ๋ ˆ์ผ์„ ์šฐํšŒํ•  ์œ„ํ—˜์ด ์žˆ์œผ๋ฏ€๋กœ, ์ƒŒ๋“œ๋ฐ•์‹ฑ๊ณผ ๋ถˆ๋ณ€์˜ ๊ฐ์‚ฌ ๋กœ๊ทธ(Audit trail) ๋ฐ ์ธ๊ฐ„ ์Šน์ธ ๊ฒŒ์ดํŠธ๊ฐ€ ํ•„์ˆ˜์ ์œผ๋กœ ์š”๊ตฌ๋จ [27-30]. ## ๐Ÿ› ๏ธ ์ ์šฉ ์‚ฌ๋ก€ (Applied in summary) - **SWE-bench Verified ๊ฐœ์„ :** ์‹ค์ œ GitHub ์ด์Šˆ ํ•ด๊ฒฐ๋ ฅ์„ ์ธก์ •ํ•˜๋Š” ๋ฒค์น˜๋งˆํฌ์—์„œ ์žฌ๊ท€์  ์ž๊ธฐ ์ˆ˜์ •์„ ํ†ตํ•ด ์„ฑ๋Šฅ์„ ๋‘ ๋ฐฐ ์ด์ƒ ๋Œ์–ด์˜ฌ๋ฆผ [9, 17, 18]. - **Polyglot Benchmark:** ๋‹ค๊ตญ์–ด ์ฝ”๋”ฉ ๋Šฅ๋ ฅ ํ‰๊ฐ€์—์„œ ์‹œ๋“œ ์—์ด์ „ํŠธ ๋Œ€๋น„ 16.5%p์˜ ์ ˆ๋Œ€์  ์„ฑ๋Šฅ ํ–ฅ์ƒ์„ ๊ธฐ๋กํ•จ [20, 31]. - **์ž๊ธฐ ์ˆ˜๋ช… ์œ ์ง€๋ ฅ(Operational Integrity):** ์•„์นด์ด๋ธŒ๋ฅผ ์‚ฌ์šฉํ•˜์ง€ ์•Š๋Š” ๊ฒฝ์šฐ๋ณด๋‹ค DGM ๋ฐฉ์‹์ด ์ฝ”๋“œ ํŽธ์ง‘ ๊ธฐ๋Šฅ์˜ ๋ฌด๊ฒฐ์„ฑ์„ ์œ ์ง€ํ•˜๋Š” ๋น„์œจ(51.3%)์ด ํ›จ์”ฌ ๋†’๊ฒŒ ๋‚˜ํƒ€๋‚จ [15, 17, 32, 33]. ## โœ… ๊ฒ€์ฆ ์ƒํƒœ ๋ฐ ์‹ ๋ขฐ๋„ - **์ƒํƒœ:** draft - **๊ฒ€์ฆ ๋‹จ๊ณ„:** conceptual (์‹ค์ œ ์ ์šฉ ์‚ฌ๋ก€๊ฐ€ SWE-bench ๋“ฑ ํ‘œ์ค€ ๋ฒค์น˜๋งˆํฌ์—์„œ ์ž…์ฆ๋จ) - **์ถœ์ฒ˜ ์‹ ๋ขฐ๋„:** B (๊ณต์‹ ์—ฐ๊ตฌ ๋ณด๊ณ ์„œ ๋ฐ Sakana AI RSI Lab ๊ธฐ์ˆ  ๋ฌธ์„œ ๊ธฐ๋ฐ˜) - **์ค‘๋ณต ๊ฒ€์‚ฌ ๊ฒฐ๊ณผ:** ์‹ ๊ทœ ์ƒ์„ฑ (New discovery) ## ๐Ÿ“ ๋ณ€๊ฒฝ ์ด๋ ฅ (Change history) - 2026-06-12: Initial draft generated via Datacollector_MAC P-Reinforce engine. (Li et al., 2026 ๋ฐ Sakana AI 2025/2026 ์†Œ์Šค ๊ธฐ๋ฐ˜) [18, 34, 35].