f8b21af4be
10_Wiki/Topics 대규모 정리: - 오류 캡처/미완성 stub 문서 227개 제거 - 교차폴더 중복 43클러스터 병합 (63파일 → redirect) - 링크명 정규화: 깨진 링크 수정·redirect 직결·개념 매핑 ~2,400건 - 카테고리 MOC 6개 신규 생성 - Graph 섹션 미해결 related-keyword 링크 10,058건 제거 Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
187 lines
6.3 KiB
Markdown
187 lines
6.3 KiB
Markdown
---
|
|
id: wiki-2026-0508-tool-usage-optimization
|
|
title: Tool Usage Optimization
|
|
category: 10_Wiki/Topics
|
|
status: verified
|
|
canonical_id: self
|
|
aliases: [Function Calling, Tool Use, Agent Tools]
|
|
duplicate_of: none
|
|
source_trust_level: A
|
|
confidence_score: 0.9
|
|
verification_status: applied
|
|
tags: [llm, tool-use, function-calling, agentic]
|
|
raw_sources: []
|
|
last_reinforced: 2026-05-10
|
|
github_commit: pending
|
|
tech_stack:
|
|
language: Python
|
|
framework: Anthropic SDK / OpenAI SDK / MCP
|
|
---
|
|
|
|
# Tool Usage Optimization
|
|
|
|
## 매 한 줄
|
|
> **"매 LLM의 외부 capability 의 invoke"**. Tool use (function calling) 매 LLM 이 external API/code/DB 의 structured call 하는 패러다임. 2023 OpenAI function calling → 2024 parallel tools → 2025 MCP standard → 2026 매 every agent 의 backbone. 매 quality 매 tool 정의의 crisp / parallel call / error handling / cache strategy 에 의해 결정.
|
|
|
|
## 매 핵심
|
|
|
|
### 매 핵심 mechanics
|
|
- **Schema** — 매 JSON Schema 로 tool input 정의 — 매 model 이 fill.
|
|
- **Choice** — `auto` / `tool` (forced) / `none`.
|
|
- **Parallel** — 매 single turn 에 multiple tool calls (Claude / GPT-4o+).
|
|
- **Iterative loop** — 매 tool result 의 feed back → continue → repeat.
|
|
|
|
### 매 design principles
|
|
- **Crisp descriptions** — 매 tool description 매 "use this when X, not when Y" 명시.
|
|
- **Few well-named tools** > 매 many overlapping — 매 selection error rate 의 reduce.
|
|
- **Idempotent / safe** — 매 retry-safe — 매 LLM 매 retry 함.
|
|
- **Structured errors** — 매 tool error 매 LLM 이 recover 할 수 있게 actionable.
|
|
|
|
### 매 modern (2025-2026)
|
|
- **MCP (Model Context Protocol)** — 매 Anthropic 이 2024-11 release. 매 client/server tool sharing standard. 매 Claude Desktop, Cursor, Windsurf 모두 support.
|
|
- **Tool result caching** — 매 expensive tool (DB query, web fetch) 매 cache + hash check.
|
|
- **Tool budget** — 매 agent 의 max-call limit 의 prevent runaway.
|
|
|
|
### 매 응용
|
|
1. Code agents (file ops, shell, search).
|
|
2. Customer support (ticket, KB, account API).
|
|
3. Data analysis (SQL, plot, fetch).
|
|
|
|
## 💻 패턴
|
|
|
|
### Anthropic tool definition
|
|
```python
|
|
tools = [{
|
|
"name": "get_weather",
|
|
"description": "Get current weather. Use ONLY for weather; do not use for forecasts beyond 24h.",
|
|
"input_schema": {
|
|
"type": "object",
|
|
"properties": {
|
|
"location": {"type": "string", "description": "City, country (e.g. 'Seoul, KR')"},
|
|
"unit": {"type": "string", "enum": ["c", "f"], "default": "c"},
|
|
},
|
|
"required": ["location"],
|
|
},
|
|
}]
|
|
|
|
resp = client.messages.create(
|
|
model="claude-opus-4-7",
|
|
max_tokens=2000,
|
|
tools=tools,
|
|
messages=[{"role": "user", "content": "Weather in Seoul?"}],
|
|
)
|
|
```
|
|
|
|
### Parallel tool execution
|
|
```python
|
|
import asyncio
|
|
|
|
async def run_tool(call):
|
|
return await TOOLS[call.name](**call.input)
|
|
|
|
# 매 LLM이 매 multiple tool_use blocks 의 emit
|
|
calls = [b for b in resp.content if b.type == "tool_use"]
|
|
results = await asyncio.gather(*(run_tool(c) for c in calls))
|
|
# 매 모든 result 의 single user message 로 feed back
|
|
```
|
|
|
|
### Tool loop (agentic)
|
|
```python
|
|
messages = [{"role": "user", "content": user_query}]
|
|
for _ in range(MAX_TURNS):
|
|
resp = client.messages.create(model="claude-opus-4-7", tools=tools, messages=messages)
|
|
messages.append({"role": "assistant", "content": resp.content})
|
|
if resp.stop_reason != "tool_use":
|
|
break
|
|
tool_results = [
|
|
{"type": "tool_result", "tool_use_id": b.id,
|
|
"content": str(TOOLS[b.name](**b.input))}
|
|
for b in resp.content if b.type == "tool_use"
|
|
]
|
|
messages.append({"role": "user", "content": tool_results})
|
|
```
|
|
|
|
### Structured error
|
|
```python
|
|
def search_db(query: str):
|
|
try:
|
|
return {"ok": True, "rows": db.execute(query).fetchall()}
|
|
except SQLError as e:
|
|
return {"ok": False, "error": str(e),
|
|
"hint": "Check column names; use `\\d table` to inspect."}
|
|
```
|
|
|
|
### Result caching
|
|
```python
|
|
import hashlib, json, functools
|
|
|
|
@functools.lru_cache(maxsize=1024)
|
|
def _cached_fetch(url_hash: str, url: str):
|
|
return requests.get(url, timeout=10).text
|
|
|
|
def web_fetch(url: str):
|
|
return _cached_fetch(hashlib.sha256(url.encode()).hexdigest(), url)
|
|
```
|
|
|
|
### Prompt cache + tools (Anthropic)
|
|
```python
|
|
resp = client.messages.create(
|
|
model="claude-opus-4-7",
|
|
tools=tools, # 매 tools 의 cacheable
|
|
system=[{"type": "text", "text": SYSTEM, "cache_control": {"type": "ephemeral"}}],
|
|
messages=messages,
|
|
)
|
|
```
|
|
|
|
### MCP server (Python)
|
|
```python
|
|
from mcp.server.fastmcp import FastMCP
|
|
mcp = FastMCP("my-tools")
|
|
|
|
@mcp.tool()
|
|
def query_orders(customer_id: str) -> list[dict]:
|
|
"""Return recent orders for a customer."""
|
|
return db.fetch_orders(customer_id)
|
|
|
|
if __name__ == "__main__":
|
|
mcp.run()
|
|
```
|
|
|
|
## 매 결정 기준
|
|
| 상황 | Approach |
|
|
|---|---|
|
|
| Few tools, in-app | Direct SDK tools |
|
|
| Many shared tools across apps | MCP server |
|
|
| Latency-critical | Pre-fetch / parallel tools |
|
|
| Expensive tools | Cache by input hash |
|
|
| Untrusted LLM output | Validate + sandbox tool exec |
|
|
| Long agent loop | Tool budget + checkpoint |
|
|
|
|
**기본값**: 매 5-15 well-named tools, parallel calling enabled, structured errors, prompt caching on system + tool definitions.
|
|
|
|
## 🔗 Graph
|
|
- 변형: [[Function-Calling]] · [[MCP]] · [[ReAct]]
|
|
- 응용: [[Code-Agent]]
|
|
- Adjacent: [[RAG]] · [[Structured-Output]]
|
|
|
|
## 🤖 LLM 활용
|
|
**언제**: 매 external state / capability 의 필요한 모든 LLM app — search, DB, email, code exec, API call.
|
|
**언제 X**: 매 pure text generation (summary, translate) — 매 tool 매 unnecessary.
|
|
|
|
## ❌ 안티패턴
|
|
- **Vague tool names**: `"do_thing"` — 매 LLM 매 selection 의 fail.
|
|
- **Too many tools**: 매 30+ tool 매 confuse — 매 group / route.
|
|
- **No retry on transient error**: 매 502 / timeout 매 transient — 매 retry-safe + LLM 의 hint.
|
|
- **Streaming tool output to user mid-call**: 매 tool result 매 internal — final assistant text 만 user 에.
|
|
- **No max-turn limit**: 매 infinite loop 의 risk.
|
|
|
|
## 🧪 검증 / 중복
|
|
- Verified (Anthropic tool use docs, OpenAI function calling, MCP spec 2024-11, parallel tool calling).
|
|
- 신뢰도 A.
|
|
|
|
## 🕓 Changelog
|
|
| 날짜 | 변경 |
|
|
|---|---|
|
|
| 2026-05-08 | Phase 1 |
|
|
| 2026-05-10 | Manual cleanup — modern tool use + MCP + caching patterns |
|