Files
connectai/.astra/project-context/architecture.md
T
g1nation 0a97324f1b feat: v2.2.92 → v2.2.158 — god-file 분해 + Stocks feature + 대화 연속성
R56–R59: agent.ts 2731→1529줄 god-file 분해 (25 modules)
  · attrParsers + LLM 메서드 8개 (callNonStreaming, streamChatOnce 등)
  · executeActions 415줄 → 8 handler 그룹 (file/run/list/brain/calendar/sheets/tasks)
  · handlePrompt 1100줄 → 7 phase 모듈 (system prompt + budget + autoContinue 등)

R50–R55: extension.ts 1145→349줄 (telegram/settings/provider commands 분리)

Stocks feature 신규: /stocks slash command (v2.2.152~158)
  · .astra/stocks.json 저장소 + Yahoo Finance 현재가 갱신
  · 8 키워드 필터 (ROE/성장성/유동성/수익성/영업효율/기술력/안정성/PBR)
  · Naver 시가총액 페이지 JSON API (m.stock.naver.com) 발굴
  · LLM Top 5 매력도 분석 + Telegram 자동 보고서
  · KST 09:00/15:00 watcher 자동 모니터링

대화 연속성 (v2.2.150~157):
  · [PRIOR TURN CONCLUSION] block 으로 직전 결론 anchor
  · thin follow-up 분류 → boilerplate 헤더 suppression
  · slash 명령 결과 chatHistory mirror (capture wrapper)
  · echo/parrot 금지 system prompt rule

기타: /stocks 슬래시 자동완성 dropdown UI, Naver JSON API 전환 (cheerio 제거)

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-25 09:59:32 +09:00

34 KiB
Raw Blame History

ConnectAI — Project Architecture Context

Snapshot

  • Workspace: ConnectAI v2.2.158 (absolute path varies by environment; resolved from the active VS Code workspace)
  • Description: The personal intelligence layer for Antigravity and VS Code. A private cognitive partner for deep project context, memory, and proactive strategic decision-making.
  • Stack: TypeScript, Node.js, VS Code Extension, LM Studio SDK, Test runner
  • Stats: 395 source files, ~63,423 lines across 5 top-level modules.

Last Refresh

  • Time: 2026-05-25T00:59:03.313Z
  • Files newly analysed: 1
  • Files reused from cache: 394

Directory Map

mindmap
  root((ConnectAI))
    src/
      features/
      sidebar/
      lib/
      agent/
      core/
      extension/
    media/
    tests/
      helpers/
      integration/
      mocks/
    core_py/
    docs/
      records/
      docs/

Module Dependencies

Arrows: which top-level module imports from which.

flowchart LR
    src["src/<br/>247 files"]
    media["media/<br/>6 files"]
    tests["tests/<br/>37 files"]
    core_py["core_py/<br/>6 files"]
    docs["docs/<br/>99 files"]
    tests --> src

Entry Points

Files to read first when learning the codebase.

  • src/extension.ts
  • media/sidebar.html — Astra
  • package.json — npm package manifest

Hub Files

Imported by many other files — touching these has wide blast radius.

  • src/utils.ts — referenced by 87 files
  • src/agent.ts — referenced by 34 files
  • src/config.ts — referenced by 32 files
  • src/core/services.ts — referenced by 14 files
  • src/features/company/index.ts — referenced by 14 files · Public API for 1인 기업 모드. Consumers (sidebarProvider, chatHandlers, command handlers) import from this barrel so internal layout can move around without touching every call site.
  • src/features/company/types.ts — referenced by 14 files · Type definitions for the 1인 기업 (One-Person Company) mode. The mode turns the user into a virtual CEO that dispatches work to a roster of specialist agents. Each turn produces a session directory conta
  • src/sidebarProvider.ts — referenced by 11 files
  • src/lib/contextManager.ts — referenced by 10 files · Context Manager (컨텍스트 한계 관리) "context length = 132k" 는 "답변을 132k 토큰까지 생성해도 된다" 가 아닙니다. 시스템 프롬프트 + 대화 기록 + 입력 문서 + 생성될 답변 + 여유분 ≤ context length 이 모듈은 요청을 보내기 전에 입력 토큰을 추정하고, - 동적으로 출력 상한(maxTokens)을 계

Modules

src/ — 247 files, ~45,859 lines

Sub-directories

  • src/features/ (87) — Astra Office — public API. 다음 세션에서 추가될 OfficeSnapshot presenter / schema 도 같은 entry 로 노출 예정. 현재 노출: full webview panel H
  • src/sidebar/ (35) — Brain profile lifecycle 의 pure helpers — sidebarProvider 의 add/edit/delete 흐름에서 modal UI 와 config 쓰기를 제외한 데이터 변환 만 격리. 현
  • src/lib/ (28) — Astra Mode Architecture Context Builder. 의도: 사용자가 Astra 자체의 mode 디자인 (Guard vs Multi-Agent 가 별도 모드여야 하는지) 을 묻는 메타 질문에 답할
  • src/agent/ (25) — 25 files (.ts)
  • src/core/ (15) — Astra Path Resolver (경로 해결기) Astra의 모든 데이터 파일(.astra 디렉토리)의 경로를 중앙에서 관리합니다. 확장 프로그램의 설치 경로(extensionUri) 기반으로 .astra 디렉토
  • src/extension/ (8) — 8 files (.ts)
  • src/memory/ (8) — Episodic Memory (일화 기억) 과거 대화/회의/결정의 맥락 흐름을 저장합니다. 세션 종료 시 자동으로 에피소드를 요약하여 저장합니다. "왜 이렇게 결정했는지", "어떤 흐름으로 진행했는지" 기록. 저장
  • src/retrieval/ (8) — Brain Index — persistent, mtime-keyed tokenized cache of the Second Brain RAG 검색은 매 질의마다 브레인의 모든 .md 파일을 읽고 토크나이즈해서 TF-I
  • src/docs/ (6) — src Chronicle Records
  • src/integrations/ (6) — Per-chat conversation history for the Telegram bot. Why this exists: the previous bot was stateless — every inbound mess
  • src/lmstudio/ (4) — 4 files (.ts)
  • src/skills/ (4) — 4 files (.ts)

Key files

  • src/utils.ts (471 lines)
  • src/agent.ts (1487 lines)
  • src/config.ts (418 lines)
  • src/features/company/types.ts (446 lines) — Type definitions for the 1인 기업 (One-Person Company) mode. The mode turns the user into a virtual CEO that dispatches work to a roster of specialist agents. Each turn produces a session directory conta
  • src/sidebarProvider.ts (3194 lines)
  • src/core/services.ts (176 lines)
  • src/lib/contextManager.ts (278 lines) — Context Manager (컨텍스트 한계 관리) "context length = 132k" 는 "답변을 132k 토큰까지 생성해도 된다" 가 아닙니다. 시스템 프롬프트 + 대화 기록 + 입력 문서 + 생성될 답변 + 여유분 ≤ context length 이 모듈은 요청을 보내기 전에 입력 토큰을 추정하고, - 동적으로 출력 상한(maxTokens)을 계
  • src/features/company/companyConfig.ts (896 lines) — State + config plumbing for 1인 기업 모드. Two surfaces: - CompanyState (runtime data: enabled flag, company name, which agents are active, per-agent model overrides). Persisted in VS Code's globalState so
  • src/integrations/telegram/telegramClient.ts (154 lines)
  • src/lib/paths.ts (151 lines)
  • src/agent/actions/types.ts (41 lines)
  • src/skills/agentKnowledgeMap.ts (374 lines)
  • src/features/stocks/types.ts (53 lines) — Stocks 모듈 공유 타입. investresults/targetstocks.json 스키마를 그대로 받아서, ConnectAI 의 /.astra/stocks.json 으로 옮긴 뒤 같은 필드명을 유지. 한글 필드명은 사용자의 도메인 데이터라 변경하지 않는다 — 마이그레이션 충돌 회피 + 사용자가 직접 JSON 편집할 때 frictio
  • src/lib/contextBuilders/promptDetection.ts (85 lines) — 사용자 prompt 의 의도 분류 류 detection helpers. 모두 stateless 정규식 매칭. 옛 코드는 agent.ts 의 private 메서드로 박혀 있었는데, system prompt 빌더 (buildJarvisProjectBriefContext 등) 가 이걸 의존하면서 god-file 안에서 서로 얽힘. 헬퍼만 먼저 떼면 의존 그래프가
  • src/retrieval/lessonHelpers.ts (325 lines) — Lesson / Experience Memory — pure helpers (no vscode dependency) "Lesson" = a markdown file in the active brain that captures a past mistake/risk and how to avoid repeating it. Identified by a lessons
  • src/memory/types.ts (126 lines) — Memory Type Definitions (메모리 타입 정의) Astra의 5-Layer Cognitive Memory System의 모든 타입을 정의합니다. ① Short-Term ② Long-Term ③ Project ④ Procedural ⑤ Episodic
  • src/retrieval/scoring.ts (541 lines) — Scoring Engine — TF-IDF + Bilingual Tokenizer 단순 includes() 키워드 매칭을 넘어서, TF-IDF 가중치 기반의 문서 스코어링을 제공합니다. 한국어/영어 양국어 토크나이저를 포함합니다.
  • src/security.ts (159 lines)
  • src/features/secondBrainTrace.ts (792 lines)
  • src/features/providers/types.ts (63 lines) — Cloud LLM provider routing — model id prefix → provider id 매핑. Prefix 규칙: openrouter:anthropic/claude-3.5-sonnet → { provider: 'openrouter', model: 'anthropic/claude-3.5-sonnet' } anthropic:claude-3-5
  • src/integrations/telegram/telegramBot.ts (270 lines)
  • src/lib/contextBuilders/localProjectIntent.ts (233 lines)
  • src/lib/engine.ts (1114 lines)
  • src/lmstudio/streamer.ts (252 lines)
  • src/core/responseRecovery.ts (310 lines) — Response Recovery — Thought Quarantine + Final-only Retry + Auto-Continuation The user already asked their question; they're waiting for an answer, not for a chance to babysit the generation engine. S

media/ — 6 files, ~7,649 lines

Key files

  • media/sidebar.css (2104 lines) — Stylesheet
  • media/sidebar.js (3921 lines)
  • media/sidebar.html (539 lines) — Astra
  • media/settings-panel.html (406 lines) — Astra Settings
  • media/settings-panel.css (210 lines) — Stylesheet
  • media/settings-panel.js (469 lines)

tests/ — 37 files, ~5,875 lines

Depends on: src/

Sub-directories

  • tests/helpers/ (1) — MockLLMClient — IAIService 의 Mock 구현체. 의도: 회사 모드 dispatcher / ChunkedWriter / ceoPlanner 등 LLM 을 호출하는 코드 경로를 CI 환경에서도 테스
  • tests/integration/ (1) — MockLLMClient 자체의 sanity test. 이게 통과하면 dispatcher / ceoPlanner / ChunkedWriter 등 IAIService 를 받는 코드가 실제 LLM 없이 단위 / inte
  • tests/mocks/ (1) — 1 files (.js)

Key files

  • tests/helpers/mockLLMClient.ts (112 lines) — MockLLMClient — IAIService 의 Mock 구현체. 의도: 회사 모드 dispatcher / ChunkedWriter / ceoPlanner 등 LLM 을 호출하는 코드 경로를 CI 환경에서도 테스트 가능하게. 실제 Ollama / LM Studio 없이도 응답을 미리 정의하거나 동적으로 생성 가능. 사용 예: const ai = new
  • tests/agentEngine.test.ts (413 lines) — AgentEngine Tests — Chunked Writer Architecture 예전 buildup(planner → researcher → reflector → writer → synthesizer)을 단일 ChunkedWriter 의 outline → section[N] → polish 로 교체한 뒤의 회귀 테스트. 다루는 범위: 1. ErrorC
  • tests/lmStudioLifecycle.test.ts (326 lines) — Unit tests for ModelLifecycleManager. Strategy: inject mock ILMStudioClient and a simple in-memory IActivityTracker. No real LM Studio or SDK is touched — the manager file does not import the SDK dire
  • tests/localPathPreflight.test.ts (520 lines)
  • tests/telegramBot.test.ts (363 lines) — Unit tests for TelegramBot + truncateForTelegram. Strategy: - TelegramBot is driven by an injected ITelegramClient stub. We script getUpdates to return queued batches and assert that: - the offset cur
  • tests/lmStudioStreamer.test.ts (222 lines) — Unit tests for LMStudioStreamer. Strategy: inject a fake ILMStudioClient that returns a fake model handle whose respond() yields a controllable async iterable. No real SDK or WebSocket touched.
  • tests/secondBrainTrace.test.ts (407 lines)
  • tests/approvalQueue.test.ts (164 lines) — Unit tests for ApprovalQueue. Strategy: drive enqueue → approve / reject / clear / pre-empt directly, confirm the onChange event fires at the right moments and callbacks fire exactly once.
  • tests/projectScaffolder.test.ts (135 lines) — Unit tests for FileSystemProjectScaffolder. Drives against a real temp directory so end-to-end file IO + path-traversal defenses are exercised.
  • tests/resilience_stress.test.ts (197 lines) — Resilience & Boundary Stress Test Suite (v2.77.3) 이 테스트는 ConnectAI 엔진이 극한의 환경(인증 실패, 네트워크 차단, 타임아웃 등)에서 얼마나 안정적으로 복구되고, 신뢰성 지표(Resilience Metrics)를 정확히 기록하는지 검증합니다.
  • tests/skillInjectionService.test.ts (172 lines) — Unit tests for FileSystemSkillInjectionService. Strategy: drive the service against a real temp directory so path-traversal defenses and writeFileSync paths are exercised end-to-end. The service accep
  • tests/dataProcessor.test.ts (87 lines) — /
  • tests/findBrainFilesCache.test.ts (80 lines) — Unit tests for findBrainFiles TTL cache.
  • tests/integration/mockLLMClient.test.ts (86 lines) — MockLLMClient 자체의 sanity test. 이게 통과하면 dispatcher / ceoPlanner / ChunkedWriter 등 IAIService 를 받는 코드가 실제 LLM 없이 단위 / integration 테스트 가능. 향후 dispatcher 의 multi-stage flow 같은 큰 integration 테스트는 이 mock 을
  • tests/officeSchema.test.ts (241 lines)
  • tests/paths.test.ts (84 lines) — Unit tests for the centralized path resolver.
  • tests/systemSpecs.test.ts (90 lines) — Unit tests for SystemSpecs + HeuristicModelMemoryEstimator. Strategy: - HeuristicModelMemoryEstimator is pure — directly drive it with model ids. - NodeSystemSpecsProvider depends on os. so we test: a
  • tests/transaction.test.ts (68 lines) — /
  • tests/vulnerability.test.ts (60 lines) — /
  • tests/brainIndex.test.ts (107 lines)
  • tests/calendarApi.test.ts (131 lines)
  • tests/contextManager.test.ts (149 lines)
  • tests/icsParser.test.ts (134 lines)
  • tests/lessonHelpers.test.ts (191 lines)
  • tests/projectChronicle.test.ts (199 lines)

core_py/ — 6 files, ~409 lines

Key files

  • core_py/events.py (64 lines)
  • core_py/inference.py (91 lines)
  • core_py/loader.py (61 lines)
  • core_py/monitoring.py (56 lines)
  • core_py/optimizer.py (55 lines)
  • core_py/queue_worker.py (82 lines)

docs/ — 99 files, ~3,631 lines

Sub-directories

  • docs/records/ (86) — Astra Project Chronicle Records
  • docs/docs/ (5) — docs Chronicle Records

Key files

  • docs/TELEGRAM_REMOTE_EXECUTION_PLAN.md (452 lines) — Telegram Remote Execution 기획서
  • docs/AgentEngine_Architecture.md (314 lines) — AgentEngine Architecture Document
  • docs/records/ConnectAI/timeline.md (209 lines) — Project Timeline
  • docs/ASTRA_OFFICE_REFACTOR.md (198 lines) — Astra Office Refactor — Design Doc
  • docs/EXPERIENCE_MEMORY_PLAN.md (122 lines) — Experience Memory (Mistake / Lesson Loop) — Implementation Plan
  • docs/records/ConnectAI/development/2026-05-02_connectai_project_knowledge_overview.md (121 lines) — Astra Project Knowledge Overview
  • docs/records/ConnectAI/development/2026-05-03_connectai_project_knowledge_overview.md (121 lines) — Astra Project Knowledge Overview
  • docs/Advanced_Features_Implementation_Guide.md (40 lines) — Advanced Features Implementation Guide
  • docs/PROJECT_CHRONICLE_GUARD_ROADMAP.md (43 lines) — Project Chronicle Guard: Search Engine Roadmap
  • docs/UX_UI_Consistency_Guidelines.md (44 lines) — UX/UI Consistency Guidelines
  • docs/docs/records/docs/README.md (18 lines) — docs Chronicle Records
  • docs/docs/records/docs/bugs/BUG-0001-viewed-integration-retrieval-test-ts-1-59-integration-retrie.md (16 lines) — Bug: Viewed integrationretrieval.test.ts:1-59 integrationretrieval.test.ts를 통해 ...
  • docs/docs/records/docs/chronicle.config.json (11 lines) — JSON configuration
  • docs/docs/records/docs/project-profile.md (31 lines) — Project Profile
  • docs/docs/records/docs/timeline.md (7 lines) — Project Timeline
  • docs/records/ConnectAI/README.md (18 lines) — Astra Project Chronicle Records
  • docs/records/ConnectAI/bugs/BUG-0001-volumes-data-project-antigravity-connectai-프로젝트-코드-리뷰-해줄-수-있.md (16 lines) — Bug: /Volumes/Data/project/Antigravity/ConnectAI 프로젝트 코드 리뷰 해줄 수 있어? 개선할 부분이 있는지, 그러고...
  • docs/records/ConnectAI/bugs/BUG-0002-지금-내가-분석-요청하고-너가-답을-줄때-아래-템플릿에-맞춰-답을-써주고-있는데-개선-포인트가-있는지-확인해.md (16 lines) — Bug: 지금 내가 분석 요청하고 너가 답을 줄때 아래 템플릿에 맞춰 답을 써주고 있는데, 개선 포인트가 있는지 확인해줘. ## 내가 보는 위험 가장 큰...
  • docs/records/ConnectAI/bugs/BUG-0003-volumes-data-project-antigravity-connectai-내-질문에-대한-답변이-잘-정리.md (16 lines) — Bug: /Volumes/Data/project/Antigravity/ConnectAI 내 질문에 대한 답변이 잘 정리되서 알려주긴 하는데 focused...
  • docs/records/ConnectAI/bugs/BUG-0004-volumes-data-project-antigravity-connectai-내-질문에-대한-답변이-잘-정리.md (16 lines) — Bug: /Volumes/Data/project/Antigravity/ConnectAI 내 질문에 대한 답변이 잘 정리되서 알려주긴 하는데 focused...
  • docs/records/ConnectAI/bugs/BUG-0005-다시한번-답줘-volumes-data-project-antigravity-connectai-내-질문에-대한-.md (16 lines) — Bug: 다시한번 답줘. /Volumes/Data/project/Antigravity/ConnectAI 내 질문에 대한 답변이 잘 정리되서 알려주긴 하는...
  • docs/records/ConnectAI/bugs/BUG-0006-volumes-data-project-antigravity-connectai-내-질문에-대한-답변이-잘-정리.md (16 lines) — Bug: /Volumes/Data/project/Antigravity/ConnectAI 내 질문에 대한 답변이 잘 정리되서 알려주긴 하는데 focused...
  • docs/records/ConnectAI/bugs/BUG-0007-volumes-data-project-antigravity-connectai-내-질문에-대한-답변이-잘-정리.md (16 lines) — Bug: /Volumes/Data/project/Antigravity/ConnectAI 내 질문에 대한 답변이 잘 정리되서 알려주긴 하는데 focused...
  • docs/records/ConnectAI/bugs/BUG-0008-volumes-data-project-antigravity-connectai-내-질문에-대한-답변이-잘-정리.md (16 lines) — Bug: /Volumes/Data/project/Antigravity/ConnectAI 내 질문에 대한 답변이 잘 정리되서 알려주긴 하는데 focused...
  • docs/records/ConnectAI/bugs/BUG-0009-문제점을-읽고-어떻게-개선하는게-최선인지-분석해주면-좋겠어-알겠습니다-지금부터-connectai-프로젝트-에.md (16 lines) — Bug: 문제점을 읽고 어떻게 개선하는게 최선인지 분석해주면 좋겠어. 알겠습니다. 지금부터 ConnectAI 프로젝트에만 완전히 집중하겠습니다. ...

VS Code Extension Surface

  • Extension ID: g1nation.astra
  • Activation events: onStartupFinished
  • Commands (29):
    • g1nation.newChat — Astra: New Chat
    • g1nation.exportChat — Astra: Export Chat as Markdown
    • g1nation.explainSelection — Astra: Explain Selected Code
    • g1nation.focusChat — Astra: Focus Chat Input
    • g1nation.showBrainNetwork — Astra: Show Brain Topology
    • g1nation.approval.focus — Astra: Focus Approval Panel
    • g1nation.scaffoldProject — Astra: Scaffold New Project
    • g1nation.telegram.setBotToken — Astra: Set Telegram Bot Token
    • g1nation.telegram.clearBotToken — Astra: Clear Telegram Bot Token
    • g1nation.telegram.testConnection — Astra: Test Telegram Connection
    • g1nation.settings.focus — Astra: Open Settings Panel
    • g1nation.skills.editKnowledgeMap — Astra: Edit Agent ↔ Knowledge Map
    • g1nation.openChat — Astra: Open Chat (Editor Column)
    • g1nation.setupDatacollect — Astra: Setup Datacollect Dependencies (yt-dlp, youtube-transcript-api)
    • g1nation.lesson.create — Astra: New Lesson (Experience Memory)
    • g1nation.lesson.fromConversation — Astra: New Lesson from Current Conversation
    • g1nation.lesson.manage — Astra: Browse / Manage Lessons
    • g1nation.architecture.refresh — Astra: Refresh Project Architecture Context
    • g1nation.architecture.detach — Astra: Detach Project Architecture Context
    • g1nation.architecture.attach — Astra: Attach Project Architecture Context
    • g1nation.architecture.open — Astra: Open Project Architecture Doc
    • g1nation.company.toggle — Astra: Toggle 1인 기업 Mode
    • g1nation.company.manage — Astra: Manage 1인 기업 Agents
    • g1nation.company.openSessions — Astra: Open 1인 기업 Sessions Folder
    • g1nation.company.pixelOffice.open — Astra: Open Pixel Office (Full Screen)
    • g1nation.calendar.connect — Astra: Google Calendar (iCal) 연결 📅
    • g1nation.calendar.refresh — Astra: Google Calendar 새로고침 📅
    • g1nation.calendar.connectOAuth — Astra: Google Calendar OAuth 연결 (쓰기) 🔐
    • g1nation.devilAgent.toggle — Astra: Toggle Devil Agent 🎭
  • Configuration (93 settings):
    • g1nation.multiAgentEnabled (boolean) (default: false) — Enable Multi-Agent Workflow (Planner -> Researcher -> Writer) for complex tasks.
    • g1nation.datacollectBridgeUrl (string) (default: "http://127.0.0.1:3002") — Wiki/Datacollect MCP Bridge URL. /research, /benchmark, /youtube chat slash commands route here. The Bridge must be running (npm run bridge in the Datacollect project).
    • g1nation.datacollectSavePath (string) (default: "")
    • g1nation.datacollectCrawlDepth (number) (default: 1)
    • g1nation.datacollectMaxPages (number) (default: 8)
    • g1nation.datacollectSynthesisTemperature (number) (default: 0.1)
    • g1nation.chatTemperature (number) (default: 0.3)
    • g1nation.memoryEnabled (boolean) (default: true) — Enable layered memory injection before each model response.
    • g1nation.memoryShortTermMessages (number) (default: 8) — Number of recent conversation messages included as short-term memory.
    • g1nation.memoryMediumTermSessions (number) (default: 5) — Number of recent saved chat sessions included as medium-term memory.
    • g1nation.memoryLongTermFiles (number) (default: 6) — Number of relevant Second Brain markdown files included as long-term memory.
    • g1nation.ollamaUrl (string) (default: "http://127.0.0.1:11434") — Base URL for Ollama or LM Studio. Default: http://127.0.0.1:11434
    • g1nation.defaultModel (string) (default: "gemma4:e2b") — Default model name to use for chat requests.
    • g1nation.requestTimeout (number) (default: 300) — Request timeout in seconds. Default: 300
    • g1nation.contextLength (number) (default: 32768) — Model context window in tokens (prompt + generation combined). Set this to the value your loaded model is actually running with in LM Studio / Ollama. Astra budgets prompt and output against this so i
    • g1nation.maxOutputTokens (number) (default: 4096) — Upper bound on tokens generated per response. The effective limit is reduced automatically when the prompt is large so input + output stays within g1nation.contextLength. Default: 4096
    • g1nation.contextSafetyMargin (number) (default: 2048) — Tokens kept free as a safety buffer for token-count estimation error. Default: 2048
    • g1nation.contextOverflowPolicy (string) (default: "stopAtLimit") — Fallback behavior (LM Studio) if the prompt still exceeds the context window after Astra's own budgeting. 'stopAtLimit' fails clearly so you notice; 'truncateMiddle'/'rollingWindow' drop content silen
    • g1nation.autoCompactHistory (boolean) (default: true) — Automatically drop the oldest conversation messages from the request when the prompt would exceed the context budget (the on-screen chat history is unaffected). Default: true
    • g1nation.smallModelContextCap (number) (default: 0) — Optional safety knob, OFF by default (0). Some very small models (≤3B) emit an empty/EOS response when given a prompt near their context window even though it nominally fits. If you observe that with
    • g1nation.autoContinueOnOutputLimit (boolean) (default: true) — When a reply is cut off because it hit the output-token limit, Astra continues it internally (compressed request — original question + the answer so far, not the whole context again) and shows one mer
    • g1nation.maxAutoContinuations (number) (default: 4) — Maximum number of automatic continuation rounds per reply (prevents runaway loops). Raise it (e.g. 56) for long-form answers on slow local models; set 0 to disable auto-continuation. Default: 4
    • g1nation.finalOnlyRetryOnThoughtLeak (boolean) (default: true) — If the model emits only hidden reasoning (, <|channel|>thought, "Thinking Process:" …) and no user-visible answer, Astra silently re-asks it for the final answer only. Hidden reasoning is never
    • g1nation.lmStudio.idleTimeoutMs (number) (default: 300000) — Auto-eject the loaded LM Studio model after this many milliseconds of inactivity. Set to 0 to disable. Default: 300000 (5 minutes).
    • g1nation.lmStudio.autoLoadOnSelect (boolean) (default: true) — Automatically load LM Studio models into memory when selected from the Astra sidebar.
    • g1nation.lmStudio.sampling.topP (number) (default: 0.9) — Nucleus sampling cutoff. Small / quantized models often spew wrong-neighbour tokens (한글 깨짐: 붕괴→붕점) when the tail is wide. Lower (0.80.9) tightens; 1.0 disables. Applied to both SDK and REST paths.
    • g1nation.lmStudio.sampling.topK (number) (default: 20) — Top-K sampling cutoff. 0 disables. Default 20 — tighter for small models, raise to 4080 for large models that already sample well.
    • g1nation.lmStudio.sampling.minP (number) (default: 0.05) — Min-P floor — discards tokens with probability below this fraction of the top token. Good defence against rare-token glitches. 0 disables.
    • g1nation.lmStudio.sampling.repeatPenalty (number) (default: 1.1) — Repeat / frequency penalty to curb stutter (것입니다서입니다…). 1.0 disables. Values 1.051.2 are typical.
    • g1nation.lmStudio.statsInBudget (boolean) (default: true) — Show token/s and time-to-first-token from LM Studio prediction stats in the context-budget badge after each turn (SDK path only).
    • g1nation.lmStudio.draftModel (string) (default: "") — [Speculative decoding] LM Studio model key of a small draft model (e.g. 'gemma-2b-it') used to accelerate the main model. Empty disables. 1.53x throughput on large models. The draft must be downloade
    • g1nation.lmStudio.load.flashAttention (boolean) (default: true) — [Load option] Enable Flash Attention when loading models. Faster generation + lower memory on compatible hardware, especially helpful for long contexts. Default: true.
    • g1nation.lmStudio.load.gpuOffloadRatio (string) (default: "max") — [Load option] How much of the model to offload to GPU. 'max' = all (default), 'off' = CPU only, or a number 01 (e.g. '0.5' = half). Numeric strings are parsed.
    • g1nation.lmStudio.load.offloadKVCacheToGpu (boolean) (default: true) — [Load option] Keep KV cache on GPU memory. Faster but requires VRAM headroom. Default: true.
    • g1nation.lmStudio.load.keepModelInMemory (boolean) (default: true) — [Load option] Prevent the model from being swapped out of system memory. Improves interactive responsiveness; raises RAM use. Default: true.
    • g1nation.lmStudio.load.useFp16ForKVCache (boolean) (default: false) — [Load option] Store KV cache in FP16 (halves cache memory). Tiny quality impact for most models — try if you run out of VRAM at long contexts. Default: false.
    • g1nation.lmStudio.load.evalBatchSize (number) (default: 0) — [Load option] Token batch size during evaluation. 0 = engine default. Higher (5121024) improves prefill speed on GPU at the cost of memory.
    • g1nation.localBrainPath (string) (default: "") — Folder path for your local Second Brain knowledge base. Leave empty to use the default folder.
    • g1nation.brainProfiles (array) (default: []) — Multiple brain profiles. Each item supports id, name, localBrainPath, secondBrainRepo, and description.
    • g1nation.activeBrainId (string) (default: "") — Active brain profile id used for the current chat context.
    • g1nation.secondBrainRepo (string) (default: "") — Optional GitHub repository URL used for Second Brain sync.
    • g1nation.autoPushBrain (boolean) (default: false) — Automatically commit and push Second Brain changes after updates.
    • g1nation.maxContextSize (number) (default: 32000) — Maximum character count for active file context. Default: 32000
    • g1nation.maxAutoSteps (number) (default: 50) — Maximum autonomous steps the agent can take per request. Default: 50
    • g1nation.dryRun (boolean) (default: false) — If enabled, the agent will ask for approval before committing any file changes.
    • g1nation.telegram.enabled (boolean) (default: false) — Enable the Telegram bot integration. When on, Astra polls a bot you configure and replies to incoming messages. Off by default — Astra remains 100% local until you opt in.
    • g1nation.telegram.allowedChatIds (array) (default: []) — Optional allowlist of Telegram chat IDs that may message the bot. When empty, every chat that messages the bot is accepted (use with caution).
    • g1nation.telegram.defaultAgent (string) (default: "") — Agent name (matches an entry in the Agent ↔ Knowledge map) used to scope Second Brain retrieval for Telegram replies. Empty falls back to the map's defaultAgent, then to whole-brain search.
    • g1nation.telegram.agentByChatId (object) (default: {}) — Per-chat override of the Telegram agent. Keys are stringified chat IDs, values are agent names from the knowledge map. Overrides telegram.defaultAgent for the listed chats.
    • g1nation.telegram.contextChunks (number) (default: 6) — How many Second Brain excerpts to inject into Telegram replies. Set 0 to disable RAG (plain prompt only).
    • g1nation.skillKnowledgeMapPath (string) (default: "") — Absolute path to the agent ↔ knowledge mapping JSON. When empty, defaults to '/.astra/agent-knowledge-map.json'.
    • g1nation.skillKnowledgeMap (object) (default: {}) — Inline fallback for the agent ↔ knowledge mapping. Used only when the JSON file is missing. Shape: { defaultAgent?, agents: [{ name, knowledgeFolders, model?, description? }] }. Folder paths can be ab
    • g1nation.agentSkillsPath (string) (default: "") — Absolute path to the agent skills folder (.agent/skills/*.md). When empty, defaults to '/.agent/skills'. Use this on Windows or when your skills live outside the workspace.
    • g1nation.embeddingModel (string) (default: "") — Embedding model registered in LM Studio / Ollama (e.g. 'text-embedding-bge-small-en-v1.5', 'nomic-embed-text', 'multilingual-e5-small'). When empty, Astra uses TF-IDF only. When set, the brain is embe
    • g1nation.embeddingBlendAlpha (number) (default: 0.5) — Hybrid score blend: 0 = pure TF-IDF (sparse / keyword), 1 = pure embedding cosine (dense / semantic), 0.5 = balanced. Only used when g1nation.embeddingModel is set. Default 0.5.
    • g1nation.knowledgeMix.secondBrainWeight (number) (default: 50) — Knowledge Mix (0100): how heavily the assistant should lean on Second Brain evidence vs. its own general knowledge. 0 = Second Brain disabled (model knowledge only). 50 = balanced (legacy default). 1
    • g1nation.workflow.multiAgentMode (string) (default: "auto")
    • g1nation.workflow.autoCtxFractionThreshold (number) (default: 0.3)
    • g1nation.chunkedSwitchTokens (number) (default: 50000)
    • g1nation.chunkedMaxSections (number) (default: 3)
    • …and 33 more

Dependencies

  • Runtime (2): @lmstudio/sdk, pdf-parse
  • Dev (8): @types/jest, @types/node, @types/vscode, @vercel/ncc, esbuild, jest, ts-jest, typescript

README Excerpt

Pulled from the project root README — first ~2 KB.

Astra (by g1nation)

Astra는 Antigravity 및 VS Code 환경에서 작동하는 대표님 전용 **지능형 운영 레이어(Personal Intelligence Layer)**입니다. 단순한 명령 수행을 넘어, 프로젝트의 맥락과 대표님의 의사결정 패턴을 학습하여 최적의 전략적 조언을 제공하는 독립적인 인지 파트너입니다.

🌌 Antigravity & VS Code Unified Assistant

Astra는 범용 AI와 달리 특정 플랫폼에 종속되지 않으며, Antigravity 워크스페이스의 깊은 맥락과 VS Code의 강력한 개발 도구를 하나로 연결합니다.

1. 전용 지능형 판단 체계 (Personal Cognition Layer)

v4.0 운영 정책이 코어에 이식되어 데이터의 신뢰도를 대표님의 기준에 맞춰 스스로 평가합니다. 상충되는 정보 발견 시 즉각적인 **[CONFLICT WARNING]**을 통해 객관적인 판단 근거를 제시합니다.

2. 고밀도 전략 지식망 (Strategic Knowledge Hub)

대표님의 Second Brain과 Antigravity 내의 모든 지식을 온톨로지 기반으로 구조화합니다. 비즈니스 전략, 기술 아키텍처, 리스크 관리가 하나로 통합된 지식 그래프를 통해 추론의 깊이를 보장합니다.

3. 선제적 파트너십 (Proactive Partnership)

작업이 완료된 후, 대표님이 다음에 내려야 할 **전략적 의사결정 포크(Decision Forks)**를 선제적으로 제안합니다. 사용자의 명령을 기다리지 않고, 프로젝트의 흐름을 먼저 읽고 길을 제시합니다.

🛠️ 주요 기능 및 권한

Astra는 대표님의 명시적인 승인 하에 로컬 시스템의 강력한 제어 권한을 행사하여 생산성을 극대화합니다.

작업 범주 설명
플랫폼 최적화 Antigravity 워크스페이스와 VS Code 사이의 유기적인 맥락 전환 및 동기화를 지원합니다.
자율 워크플로우 다중 에이전트 협업을 통해 복잡한 비즈니스 요구사항을 즉시 실행 가능한 단계별 계획으로 분해합니다.
지식 자산화 흩어진 정보들을 P-Reinforce v3.0 표준에 맞게 위키화하여 영구적인 지식 자산으로 전환합니다.
보안 및 프라이버시 100% 로컬 환경에서 작동하여 대표님의 소중한 데이터가 외부로 유출되지 않음을 보장합니다.

🚀 설치 및 시작하기

패키지 설치

  1. g1nation에서 배포된 최신 v2.65.0 VSIX 파일을 확보합니다.
  2. VS Code 명령 팔레트(Cmd+Shift+P)에서 Extensions: Install from VSIX를 선택하여 설치합니다.
  3. Antigravity 환경과 연동하여 나만의 지능형 레이어를 활성화합니다.

Designed for High-Performance Decision Making. Copyright (C) g1nation. All rights reserved.

Last auto-scan: 2026-05-25T00:59:03.313Z · signature fca24b52

Purpose

TODO