feat(growth): Correction Loop — 정정 1회가 시스템 세 곳을 성장시키는 파이프라인 (v2.2.223)

self-evolving 고도화: 사용자 정정이 곧 Ground Truth — 정답지를 사람이 따로 만들지 않고, 태그 통계가 리포트에 머물지 않고 다음 턴의 행동을 바꾼다. ① 정정 감지·태깅 (correctionLoop.ts + agent.ts 훅, fire-and-forget): - "아니야/틀렸어/~가 아니라" 류 정정 발화 감지 (보수적 — 추임새 "아니"는 제외) - LLM 오류 분류 (사실오류/근거누락/맥락누락/추론오류/지시불이행/형식오류, 실패 시 휴리스틱 fallback) → error-tag frontmatter 레슨(lessons/) 저장 - 동시에 회귀 케이스 적립: .astra/eval/corrections.jsonl {질문, 틀린답, 정정} ② 주간 성장 사이클 확장 (1.5단계): - 정정 회귀 테스트: 정정받은 질문을 두뇌 검색 컨텍스트와 함께 재실행 → LLM-judge "같은 실수 반복?" 판정 → growth/regression-report.md (사이클당 ≤8건) - 약점 프로필: 최근 60일 태그 통계 → growth/weakness-profile.json ③ 결핍의 행동화 (memoryContext): - GROUNDING 약함 + agent scope 적용 중 → 전체 두뇌 1회 재검색 (scope 가 정답 문서를 가리는 경우 구제, 더 강한 근거일 때만 채택) - 그래도 약함 → 학습 큐에 지식 공백 자동 proposed 등록 (질문 해시 중복 차단, 20건 폭주 방지, 승인은 사람 — Permission Based Learning 유지) - 약점 프로필 → [자기검토] 블록 주입 (태그 2회 이상만): "너는 최근 X 정정을 N회 받았다 — <유형별 자기검토 지시>" 테스트 25건 추가 (감지 패턴·프로필 집계·큐 등록·영속화·fallback 분류). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
2026-06-11 19:28:46 +09:00
parent 67927b1d4e
commit 72faa07480
7 changed files with 650 additions and 10 deletions
@@ -20,6 +20,7 @@ import { SessionManager } from './core/session';
 import { AgentWorkflowManager } from './agents/AgentWorkflowManager';
 import { buildAstraModeArchitectureContext } from './lib/contextBuilders/astraModeArchitecture';
 import { isScheduleRequest, buildScheduleContext } from './lib/contextBuilders/scheduleContext';
+import { looksLikeCorrection, captureCorrection } from './intelligence/correctionLoop';
 import { shouldUseMultiAgentWorkflow } from './lib/contextBuilders/multiAgentRouting';
 import { buildThinkingPartnerResponseContract } from './lib/contextBuilders/thinkingPartnerContract';
 import { buildDroppedHistorySummary } from './lib/contextBuilders/droppedHistorySummary';
@@ -536,6 +537,28 @@ export class AgentExecutor {
                }
            }

+            // [Correction Loop ①] 이 발화가 직전 답변에 대한 *정정*이면 fire-and-forget
+            // 캡처 — 오류 분류 → 태깅 레슨 + 회귀 케이스(.astra/eval/corrections.jsonl).
+            // 정정 자체가 Ground Truth 가 되어 주간 회귀 테스트·약점 프로필의 원료가 된다.
+            // 턴 응답을 막지 않는다 (await 없음).
+            if (prompt && loopDepth === 0 && activeBrain?.localBrainPath && looksLikeCorrection(prompt)) {
+                const visible = this.chatHistory.filter(m => !m.internal);
+                const lastAssistant = [...visible].reverse().find(m => m.role === 'assistant');
+                const lastUserIdx = lastAssistant ? visible.lastIndexOf(lastAssistant) - 1 : -1;
+                const priorQuestion = lastUserIdx >= 0 && visible[lastUserIdx]?.role === 'user' ? visible[lastUserIdx].content : '';
+                if (lastAssistant && priorQuestion) {
+                    void captureCorrection({
+                        brainPath: activeBrain.localBrainPath,
+                        question: priorQuestion,
+                        wrongAnswer: lastAssistant.content,
+                        correction: prompt,
+                        llm: { baseUrl: config.ollamaUrl, model: configDefaultModel },
+                    }).then(file => {
+                        if (file) logInfo('Correction Loop: 정정 캡처 완료.', { lesson: file });
+                    }).catch((e: any) => logError('Correction Loop 캡처 실패 (무시).', { error: e?.message ?? String(e) }));
+                }
+            }
+
            // 2. Setup History
            if (prompt !== null) {
                if (loopDepth === 0) {