3 kuukautta sitten · e19fd15694
--- a/.refactor-knowledge-complete.md
+++ b/.refactor-knowledge-complete.md
@@ -0,0 +1,234 @@
 
															+# 知识管理系统重构完成报告
														
 
															+
														
 
															+## 重构日期
														
 
															+2026-03-05
														
 
															+
														
 
															+## 核心改动
														
 
															+
														
 
															+### 1. 数据结构调整
														
 
															+
														
 
															+按照 `agent/docs/knowledge.md` 定义，完成以下字段调整：
														
 
															+
														
 
															+**旧结构 → 新结构：**
														
 
															+- `scenario` → `task`（任务描述）
														
 
															+- `tags_type` → `types`（支持多选数组）
														
 
															+- 新增 `tags`（JSON 对象，业务标签）
														
 
															+- 新增 `scopes`（数组，可见范围）
														
 
															+- 新增 `owner`（所有者）
														
 
															+- `source_*` 字段 → `source`（嵌套对象）
														
 
															+- `eval_*` 和 `metrics_*` 字段 → `eval`（嵌套对象）
														
 
															+- 保留顶层 `message_id`
														
 
															+
														
 
															+**新数据结构示例：**
														
 
															+```json
														
 
															+{
														
 
															+  "id": "knowledge-xxx",
														
 
															+  "message_id": "msg-xxx",
														
 
															+  "types": ["strategy", "tool"],
														
 
															+  "task": "任务描述",
														
 
															+  "tags": {"category": "preference"},
														
 
															+  "scopes": ["org:cybertogether"],
														
 
															+  "owner": "agent:research_agent",
														
 
															+  "content": "知识内容",
														
 
															+  "source": {
														
 
															+    "name": "资源名称",
														
 
															+    "category": "exp",
														
 
															+    "urls": ["https://example.com"],
														
 
															+    "agent_id": "research_agent",
														
 
															+    "submitted_by": "user@example.com",
														
 
															+    "timestamp": "2026-03-05T12:00:00Z"
														
 
															+  },
														
 
															+  "eval": {
														
 
															+    "score": 4,
														
 
															+    "helpful": 5,
														
 
															+    "harmful": 0,
														
 
															+    "confidence": 0.9,
														
 
															+    "helpful_history": [],
														
 
															+    "harmful_history": []
														
 
															+  },
														
 
															+  "created_at": "2026-03-05T12:00:00Z",
														
 
															+  "updated_at": "2026-03-05T12:00:00Z"
														
 
															+}
														
 
															+```
														
 
															+
														
 
															+### 2. 数据库迁移
														
 
															+
														
 
															+**文件：** `knowhub/server.py`
														
 
															+
														
 
															+- 重建 knowledge 表结构
														
 
															+- 使用 JSON 字段存储 `types`, `tags`, `scopes`, `source`, `eval`
														
 
															+- 删除旧的扁平化字段（`tags_type`, `scenario`, `source_*`, `eval_*`, `metrics_*`）
														
 
															+- 备份旧数据库到 `knowhub.db.backup-20260305`
														
 
															+
														
 
															+### 3. API 更新
														
 
															+
														
 
															+**文件：** `knowhub/server.py`
														
 
															+
														
 
															+所有 knowledge API 已更新：
														
 
															+
														
 
															+- `POST /api/knowledge` - 保存知识（使用新结构）
														
 
															+- `GET /api/knowledge/search` - 搜索知识（参数 `tags_type` → `types`）
														
 
															+- `GET /api/knowledge` - 列出知识（参数 `tags_type` → `types`，新增 `scopes`）
														
 
															+- `GET /api/knowledge/{id}` - 获取知识（返回新结构）
														
 
															+- `PUT /api/knowledge/{id}` - 更新知识（使用嵌套 eval 结构）
														
 
															+- `POST /api/knowledge/batch_update` - 批量更新（使用嵌套 eval 结构）
														
 
															+- `POST /api/knowledge/slim` - 知识瘦身（使用新结构）
														
 
															+
														
 
															+### 4. CLI 工具更新
														
 
															+
														
 
															+**文件：** `knowhub/skill/cli.py`
														
 
															+
														
 
															+完全重写 CLI 工具以匹配新的数据结构：
														
 
															+
														
 
															+```bash
														
 
															+# 搜索知识
														
 
															+python -m knowhub.skill.cli search "查询内容" --types strategy
														
 
															+
														
 
															+# 保存知识
														
 
															+python -m knowhub.skill.cli save \
														
 
															+  --task "任务描述" \
														
 
															+  --content "知识内容" \
														
 
															+  --types strategy,tool \
														
 
															+  --tags '{"category":"preference"}' \
														
 
															+  --scopes org:cybertogether
														
 
															+
														
 
															+# 列出知识
														
 
															+python -m knowhub.skill.cli list --limit 10 --types strategy
														
 
															+
														
 
															+# 更新知识
														
 
															+python -m knowhub.skill.cli update knowledge-xxx \
														
 
															+  --score 5 \
														
 
															+  --helpful-case "有效案例"
														
 
															+
														
 
															+# 批量更新
														
 
															+python -m knowhub.skill.cli batch-update --file feedback.json
														
 
															+
														
 
															+# 知识瘦身
														
 
															+python -m knowhub.skill.cli slim --model google/gemini-2.0-flash-001
														
 
															+```
														
 
															+
														
 
															+### 5. Agent 工具更新
														
 
															+
														
 
															+**文件：** `agent/tools/builtin/knowledge.py`
														
 
															+
														
 
															+更新所有工具函数：
														
 
															+
														
 
															+**knowledge_search:**
														
 
															+- 参数 `tags_type` → `types`
														
 
															+- 输出显示 `task` 而不是 `scenario`
														
 
															+
														
 
															+**knowledge_save:**
														
 
															+- 参数 `scenario` → `task`
														
 
															+- 参数 `tags_type` → `types`
														
 
															+- 新增参数：`tags`, `scopes`, `owner`, `source_name`, `source_category`, `submitted_by`
														
 
															+- **重要：** 默认值在 agent 代码中设置（不是服务器端）：
														
 
															+  - `scopes` 默认 `["org:cybertogether"]`
														
 
															+  - `owner` 默认 `f"agent:{agent_id}"`
														
 
															+
														
 
															+**knowledge_list:**
														
 
															+- 参数 `tags_type` → `types`
														
 
															+- 新增参数：`scopes`
														
 
															+
														
 
															+**knowledge_slim:**
														
 
															+- 默认模型改为 `google/gemini-2.0-flash-001`
														
 
															+
														
 
															+### 6. 清理旧代码
														
 
															+
														
 
															+**已删除/备份：**
														
 
															+- `agent/tools/builtin/experience.py` → `experience.py.old`（旧的经验系统）
														
 
															+- `agent/tools/builtin/__init__.py` - 删除 `get_experience` 导入和导出
														
 
															+- `agent/core/runner.py` - 删除 `experiences_path` 参数和 `_load_experiences()` 方法
														
 
															+- `agent/core/runner.py` - 从 BUILTIN_TOOLS 列表中删除 `get_experience`
														
 
															+
														
 
															+### 7. 备份文件
														
 
															+
														
 
															+所有修改前的文件都已备份：
														
 
															+- `knowhub/server.py.old`
														
 
															+- `knowhub/skill/cli.py.old`
														
 
															+- `agent/tools/builtin/knowledge.py.old`
														
 
															+- `agent/tools/builtin/experience.py.old`
														
 
															+- `knowhub.db.backup-20260305`
														
 
															+
														
 
															+## 测试建议
														
 
															+
														
 
															+### 1. 启动 KnowHub Server
														
 
															+
														
 
															+```bash
														
 
															+cd knowhub
														
 
															+python server.py
														
 
															+```
														
 
															+
														
 
															+### 2. 测试 CLI 工具
														
 
															+
														
 
															+```bash
														
 
															+# 保存知识
														
 
															+python -m knowhub.skill.cli save \
														
 
															+  --task "测试任务" \
														
 
															+  --content "测试内容" \
														
 
															+  --types strategy
														
 
															+
														
 
															+# 搜索知识
														
 
															+python -m knowhub.skill.cli search "测试"
														
 
															+
														
 
															+# 列出知识
														
 
															+python -m knowhub.skill.cli list
														
 
															+```
														
 
															+
														
 
															+### 3. 测试 Agent 工具
														
 
															+
														
 
															+在 agent 代码中调用：
														
 
															+
														
 
															+```python
														
 
															+from agent.tools.builtin.knowledge import knowledge_save, knowledge_search
														
 
															+
														
 
															+# 保存知识
														
 
															+await knowledge_save(
														
 
															+    task="测试任务",
														
 
															+    content="测试内容",
														
 
															+    types=["strategy"],
														
 
															+    agent_id="test_agent"
														
 
															+)
														
 
															+
														
 
															+# 搜索知识
														
 
															+await knowledge_search(
														
 
															+    query="测试",
														
 
															+    types=["strategy"]
														
 
															+)
														
 
															+```
														
 
															+
														
 
															+## 注意事项
														
 
															+
														
 
															+1. **默认值设置位置：** 按照用户要求，默认 org (`scopes`) 和 owner 在 agent 代码中设置，不在服务器端设置。
														
 
															+
														
 
															+2. **数据库重建：** 旧数据库已备份，新数据库为空。如需迁移旧数据，需要编写迁移脚本。
														
 
															+
														
 
															+3. **完全移除旧系统：** 已删除所有旧的经验系统代码（experience.py, get_experience 等），不保留兼容接口。
														
 
															+
														
 
															+4. **环境变量：** 确保设置 `OPEN_ROUTER_API_KEY` 和 `KNOWHUB_API`。
														
 
															+
														
 
															+## 下一步
														
 
															+
														
 
															+1. 测试所有 API 端点
														
 
															+2. 如需要，编写数据迁移脚本
														
 
															+3. 更新相关文档
														
 
															+
														
 
															+## 文件清单
														
 
															+
														
 
															+**已修改：**
														
 
															+- `knowhub/server.py` - KnowHub Server（数据库 + API）
														
 
															+- `knowhub/skill/cli.py` - CLI 工具
														
 
															+- `agent/tools/builtin/knowledge.py` - Agent 工具集成
														
 
															+- `agent/tools/builtin/__init__.py` - 删除旧的 experience 导入
														
 
															+- `agent/core/runner.py` - 删除 experiences_path 和相关代码
														
 
															+
														
 
															+**已删除/备份：**
														
 
															+- `agent/tools/builtin/experience.py` → `experience.py.old`
														
 
															+
														
 
															+**已备份：**
														
 
															+- `knowhub/server.py.old`
														
 
															+- `knowhub/skill/cli.py.old`
														
 
															+- `agent/tools/builtin/knowledge.py.old`
														
 
															+- `knowhub.db.backup-20260305`
														
 
															+
														
 
															+**新增：**
														
 
															+- `.refactor-knowledge-complete.md` - 本文档
														
--- a/.refactor-knowledge.md
+++ b/.refactor-knowledge.md
@@ -134,6 +134,9 @@
 
															 - [x] 测试 knowledge_batch_update 工具
														
 
															 - [x] 修复环境变量加载问题（添加 load_dotenv）
														
 
															 - [x] 调整 LLM 模型为 gemini-2.0-flash-001
														
 
															+- [x] 实现 CLI 工具（knowhub/cli.py）
														
 
															+- [x] 更新 skill 文档（knowhub/skill/knowhub.md）
														
 
															+- [x] 添加 CLI 使用文档（knowhub/CLI.md）
														
 
															 - [ ] 测试 goal focus 自动注入
														
 
															 - [ ] 测试完整流程（保存→检索→注入）
														
 
															 - [ ] 清理注释代码（可选）
														
--- a/agent/core/runner.py
+++ b/agent/core/runner.py
@@ -104,15 +104,16 @@ BUILTIN_TOOLS = [
 
															     # 搜索工具
														
 
															     "search_posts",
														
 
															-    "get_experience",
														
 
															     "get_search_suggestions",
														
 
															     # 知识管理工具
														
 
															     "knowledge_search",
														
 
															     "knowledge_save",
														
 
															     "knowledge_update",
														
 
															-    "list_knowledge",
														
 
															-    
														
 
															+    "knowledge_batch_update",
														
 
															+    "knowledge_list",
														
 
															+    "knowledge_slim",
														
 
															+
														
 
															     # 沙箱工具
														
 
															     # "sandbox_create_environment",
														
@@ -198,7 +199,6 @@ class AgentRunner:
 
															         embedding_call: Optional[Callable] = None,
														
 
															         config: Optional[AgentConfig] = None,
														
 
															         skills_dir: Optional[str] = None,
														
 
															-        experiences_path: Optional[str] = "./.cache/experiences.md",
														
 
															         goal_tree: Optional[GoalTree] = None,
														
 
															         debug: bool = False,
														
 
															     ):
														
@@ -215,7 +215,6 @@ class AgentRunner:
 
															             utility_llm_call: 轻量 LLM（用于生成任务标题等），可选
														
 
															             config: [向后兼容] AgentConfig
														
 
															             skills_dir: Skills 目录路径
														
 
															-            experiences_path: 经验文件路径（默认 ./.cache/experiences.md）
														
 
															             goal_tree: 初始 GoalTree（可选）
														
 
															             debug: 保留参数（已废弃）
														
 
															         """
														
@@ -228,8 +227,6 @@ class AgentRunner:
 
															         self.utility_llm_call = utility_llm_call
														
 
															         self.config = config or AgentConfig()
														
 
															         self.skills_dir = skills_dir
														
 
															-        # 保留 experiences_path 参数以向后兼容，但不再使用（经验已迁移到知识系统）
														
 
															-        self.experiences_path = experiences_path or "./.cache/experiences.md"
														
 
															         self.goal_tree = goal_tree
														
 
															         self.debug = debug
														
 
															         self._cancel_events: Dict[str, asyncio.Event] = {}  # trace_id → cancel event
														
@@ -2196,15 +2193,3 @@ created_at: {datetime.now().strftime('%Y-%m-%d %H:%M:%S')}
 
															         if not skills:
														
 
															             return ""
														
 
															         return "\n\n".join(s.to_prompt_text() for s in skills)
														
 
															-
														
 
															-    def _load_experiences(self) -> str:
														
 
															-        """从文件加载经验（./.cache/experiences.md）"""
														
 
															-        if not self.experiences_path:
														
 
															-            return ""
														
 
															-        try:
														
 
															-            if os.path.exists(self.experiences_path):
														
 
															-                with open(self.experiences_path, "r", encoding="utf-8") as f:
														
 
															-                    return f.read().strip()
														
 
															-        except Exception as e:
														
 
															-            logger.warning(f"Failed to load experiences from {self.experiences_path}: {e}")
														
 
															-        return ""
														
--- a/agent/docs/knowledge.md
+++ b/agent/docs/knowledge.md
@@ -26,12 +26,6 @@
 
															     owner: 所有者（格式：{entity_type}:{entity_id}，唯一）
														
 
															         谁创建的，谁有权修改/删除
														
 
															-    visibility: 可见性级别（快速过滤标签）
														
 
															-        private：私有（仅所有者）
														
 
															-        shared：共享（多个实体）
														
 
															-        org：组织级
														
 
															-        public：公开
														
 
															-
														
 
															     content:
														
 
															         基于类型的具体内容，相对完整的一条知识
														
@@ -50,6 +44,7 @@
 
															         confidence: 置信度（0-1）
														
 
															         helpful_history: [(query+trace_id+outcome), ]用于记录反馈时的调用总结
														
 
															         harmful_history: []
														
 
															+
														
 
															 知识检索机制
														
 
															     检索流程
														
 
															         1. 构建可见范围
														
--- a/agent/tools/builtin/__init__.py
+++ b/agent/tools/builtin/__init__.py
@@ -15,11 +15,10 @@ from agent.tools.builtin.file.grep import grep_content
 
															 from agent.tools.builtin.bash import bash_command
														
 
															 from agent.tools.builtin.skill import skill, list_skills
														
 
															 from agent.tools.builtin.subagent import agent, evaluate
														
 
															-from agent.tools.builtin.experience import get_experience
														
 
															 from agent.tools.builtin.search import search_posts, get_search_suggestions
														
 
															 from agent.tools.builtin.sandbox import (sandbox_create_environment, sandbox_run_shell,
														
 
															                                          sandbox_rebuild_with_ports,sandbox_destroy_environment)
														
 
															-from agent.tools.builtin.knowledge import(knowledge_search,knowledge_save,knowledge_list,knowledge_update)
														
 
															+from agent.tools.builtin.knowledge import(knowledge_search,knowledge_save,knowledge_list,knowledge_update,knowledge_batch_update,knowledge_slim)
														
 
															 from agent.trace.goal_tool import goal
														
 
															 # 导入浏览器工具以触发注册
														
 
															 import agent.tools.builtin.browser  # noqa: F401
														
@@ -36,11 +35,12 @@ __all__ = [
 
															     # 系统工具
														
 
															     "bash_command",
														
 
															     "skill",
														
 
															-    "get_experience",
														
 
															     "knowledge_search",
														
 
															     "knowledge_save",
														
 
															     "knowledge_list",
														
 
															     "knowledge_update",
														
 
															+    "knowledge_batch_update",
														
 
															+    "knowledge_slim",
														
 
															     "list_skills",
														
 
															     "agent",
														
 
															     "evaluate",
														
--- a/agent/tools/builtin/experience.py
+++ b/agent/tools/builtin/experience.py
@@ -1,487 +0,0 @@
 
															-import logging
														
 
															-import os
														
 
															-import yaml
														
 
															-import json
														
 
															-import asyncio
														
 
															-import re
														
 
															-from typing import List, Optional, Dict, Any
														
 
															-from datetime import datetime
														
 
															-from ...llm.openrouter import openrouter_llm_call
														
 
															-
														
 
															-logger = logging.getLogger(__name__)
														
 
															-
														
 
															-# 默认经验存储路径（当无法从 context 获取时使用）
														
 
															-DEFAULT_EXPERIENCES_PATH = "./.cache/experiences_restore.md"
														
 
															-
														
 
															-def _get_experiences_path(context: Optional[Any] = None) -> str:
														
 
															-    """
														
 
															-    从 context 中获取 experiences_path，回退到默认路径。
														
 
															-
														
 
															-    context 可能包含 runner 引用，从中读取配置的路径。
														
 
															-    """
														
 
															-    if context and isinstance(context, dict):
														
 
															-        runner = context.get("runner")
														
 
															-        if runner and hasattr(runner, "experiences_path"):
														
 
															-            path = runner.experiences_path or DEFAULT_EXPERIENCES_PATH
														
 
															-            print(f"[Experience] 使用 runner 配置的路径: {runner.experiences_path}")
														
 
															-            return path
														
 
															-
														
 
															-    print(f"[Experience] 使用默认路径: {DEFAULT_EXPERIENCES_PATH}")
														
 
															-    return DEFAULT_EXPERIENCES_PATH
														
 
															-
														
 
															-# ===== 经验进化重写 =====
														
 
															-async def _evolve_body_with_llm(old_body: str, feedback: str) -> str:
														
 
															-    """
														
 
															-    使用检索级别的小模型 (Flash Lite) 执行经验进化重写。
														
 
															-    """
														
 
															-    prompt = f"""你是一个 AI Agent 经验库管理员。请根据反馈建议，对现有的 ACE 规范经验进行重写进化。
														
 
															-
														
 
															-【原经验内容】:
														
 
															-{old_body}
														
 
															-
														
 
															-【实战反馈建议】:
														
 
															-{feedback}
														
 
															-
														
 
															-【重写要求】:
														
 
															-1. 保持 ACE 规范：当 [条件/Context] 时，应该 [动作/Action]（原因：[逻辑/Reason]）。
														
 
															-2. 融合知识：将反馈中的避坑指南、新参数或修正后的选择逻辑融入原经验，使其更具通用性和准确性。
														
 
															-3. 语言：简洁直接，使用中文。
														
 
															-4. 禁止：严禁输出任何开场白、解释语或 Markdown 标题，直接返回重写后的正文。
														
 
															-"""
														
 
															-    try:
														
 
															-        # 调用与检索路由相同的廉价模型
														
 
															-        response = await openrouter_llm_call(
														
 
															-            messages=[{"role": "user", "content": prompt}],
														
 
															-            model="google/gemini-2.0-flash-001" 
														
 
															-        )
														
 
															-        
														
 
															-        evolved_content = response.get("content", "").strip()
														
 
															-        
														
 
															-        # 简单安全校验：如果 LLM 返回太短或为空，回退到原内容+追加
														
 
															-        if len(evolved_content) < 5:
														
 
															-            raise ValueError("LLM output too short")
														
 
															-            
														
 
															-        return evolved_content
														
 
															-        
														
 
															-    except Exception as e:
														
 
															-        logger.warning(f"小模型进化失败，采用追加模式回退: {e}")
														
 
															-        timestamp = datetime.now().strftime('%Y-%m-%d')
														
 
															-        return f"{old_body}\n- [Update {timestamp}]: {feedback}"
														
 
															-    
														
 
															-# ===== 核心挑选逻辑 =====
														
 
															-
														
 
															-async def _route_experiences_by_llm(query_text: str, metadata_list: List[Dict], k: int = 3) -> List[str]:
														
 
															-    """
														
 
															-    第一阶段：语义路由。
														
 
															-    让 LLM 挑选出 2*k 个语义相关的 ID。
														
 
															-    """
														
 
															-    if not metadata_list:
														
 
															-        return []
														
 
															-
														
 
															-    # 扩大筛选范围到 2*k
														
 
															-    routing_k = k * 2
														
 
															-    
														
 
															-    routing_data = [
														
 
															-        {
														
 
															-            "id": m["id"],
														
 
															-            "tags": m["tags"],
														
 
															-            "helpful": m["metrics"]["helpful"]
														
 
															-        } for m in metadata_list
														
 
															-    ]
														
 
															-
														
 
															-    prompt = f"""
														
 
															-你是一个经验检索专家。根据用户的当前意图，从下列经验元数据中挑选出最相关的最多 {routing_k} 个经验 ID。
														
 
															-意图："{query_text}"
														
 
															-
														
 
															-可选经验列表：
														
 
															-{json.dumps(routing_data, ensure_ascii=False, indent=1)}
														
 
															-
														
 
															-请直接输出 ID 列表，用逗号分隔（例如: ex_01, ex_02）。若无相关项请输出 "None"。
														
 
															-"""
														
 
															-
														
 
															-    try:
														
 
															-        print(f"\n[Step 1: 语义路由] 意图: '{query_text}' | 候选总数: {len(metadata_list)} | 目标提取数: {routing_k}")
														
 
															-        
														
 
															-        response = await openrouter_llm_call(
														
 
															-            messages=[{"role": "user", "content": prompt}],
														
 
															-            model="google/gemini-2.0-flash-001" 
														
 
															-        )
														
 
															-        
														
 
															-        content = response.get("content", "").strip()
														
 
															-        selected_ids = [idx.strip() for idx in re.split(r'[,\s]+', content) if idx.strip().startswith("ex_")]
														
 
															-        
														
 
															-        print(f"[Step 1: 语义路由] LLM 初选 ID ({len(selected_ids)}个): {selected_ids}")
														
 
															-        return selected_ids
														
 
															-    except Exception as e:
														
 
															-        logger.error(f"LLM 经验路由失败: {e}")
														
 
															-        return []
														
 
															-
														
 
															-async def _get_structured_experiences(query_text: str, top_k: int = 3, context: Optional[Any] = None):
														
 
															-    """
														
 
															-    1. 解析物理文件
														
 
															-    2. 语义路由：提取 2*k 个 ID
														
 
															-    3. 质量精排：基于 Metrics 筛选出最终的 k 个
														
 
															-    """
														
 
															-    print(f"[Experience System]  runner.experiences_path:  {context.get('runner').experiences_path if context and context.get('runner') else None}")
														
 
															-    experiences_path = _get_experiences_path(context)
														
 
															-
														
 
															-    if not os.path.exists(experiences_path):
														
 
															-        print(f"[Experience System] 警告: 经验文件不存在 ({experiences_path})")
														
 
															-        return []
														
 
															-
														
 
															-    with open(experiences_path, "r", encoding="utf-8") as f:
														
 
															-        file_content = f.read()
														
 
															-
														
 
															-    # --- 阶段 1: 解析 ---
														
 
															-    # 使用正则表达式匹配 YAML frontmatter 块，避免误分割
														
 
															-    pattern = r'---\n(.*?)\n---\n(.*?)(?=\n---\n|\Z)'
														
 
															-    matches = re.findall(pattern, file_content, re.DOTALL)
														
 
															-
														
 
															-    content_map = {}
														
 
															-    metadata_list = []
														
 
															-
														
 
															-    for yaml_str, raw_body in matches:
														
 
															-        try:
														
 
															-            metadata = yaml.safe_load(yaml_str)
														
 
															-
														
 
															-            # 检查 metadata 类型
														
 
															-            if not isinstance(metadata, dict):
														
 
															-                logger.error(f"跳过损坏的经验块: metadata 不是 dict，而是 {type(metadata).__name__}")
														
 
															-                continue
														
 
															-
														
 
															-            eid = metadata.get("id")
														
 
															-            if not eid:
														
 
															-                logger.error("跳过损坏的经验块: 缺少 id 字段")
														
 
															-                continue
														
 
															-
														
 
															-            meta_item = {
														
 
															-                "id": eid,
														
 
															-                "tags": metadata.get("tags", {}),
														
 
															-                "metrics": metadata.get("metrics", {"helpful": 0, "harmful": 0}),
														
 
															-            }
														
 
															-            metadata_list.append(meta_item)
														
 
															-            content_map[eid] = {
														
 
															-                "content": raw_body.strip(),
														
 
															-                "metrics": meta_item["metrics"]
														
 
															-            }
														
 
															-        except Exception as e:
														
 
															-            logger.error(f"跳过损坏的经验块: {e}")
														
 
															-            continue
														
 
															-
														
 
															-    # --- 阶段 2: 语义路由 (取 2*k) ---
														
 
															-    candidate_ids = await _route_experiences_by_llm(query_text, metadata_list, k=top_k)
														
 
															-
														
 
															-    # --- 阶段 3: 质量精排 (根据 Metrics 选出最终的 k) ---
														
 
															-    print(f"[Step 2: 质量精排] 正在根据 Metrics 对候选经验进行打分...")
														
 
															-    scored_items = []
														
 
															-    
														
 
															-    for eid in candidate_ids:
														
 
															-        if eid in content_map:
														
 
															-            item = content_map[eid]
														
 
															-            metrics = item["metrics"]
														
 
															-            # 计算综合分：Helpful 是正分，Harmful 是双倍惩罚扣分
														
 
															-            quality_score = metrics["helpful"] - (metrics["harmful"] * 2.0)
														
 
															-            
														
 
															-            # 过滤门槛：如果被标记为严重有害（score < -2），直接丢弃
														
 
															-            if quality_score < -2:
														
 
															-                print(f"  - 剔除有害经验: {eid} (Helpful: {metrics['helpful']}, Harmful: {metrics['harmful']})")
														
 
															-                continue
														
 
															-                
														
 
															-            scored_items.append({
														
 
															-                "id": eid,
														
 
															-                "content": item["content"],
														
 
															-                "helpful": metrics["helpful"],
														
 
															-                "quality_score": quality_score
														
 
															-            })
														
 
															-
														
 
															-    # 按照质量分排序，质量分相同时按 helpful 排序
														
 
															-    final_sorted = sorted(scored_items, key=lambda x: (x["quality_score"], x["helpful"]), reverse=True)
														
 
															-    
														
 
															-    # 截取最终的 top_k
														
 
															-    result = final_sorted[:top_k]
														
 
															-    
														
 
															-    print(f"[Step 2: 质量精排] 最终选定经验: {[it['id'] for it in result]}")
														
 
															-    print(f"[Experience System] 检索结束。\n")
														
 
															-    return result
														
 
															-
														
 
															-async def _batch_update_experiences(update_map: Dict[str, Dict[str, Any]], context: Optional[Any] = None):
														
 
															-    """
														
 
															-    物理层：批量更新经验。
														
 
															-    修正点：正确使用 new_sections 集合，确保文件结构的完整性与并发进化的同步。
														
 
															-    """
														
 
															-    experiences_path = _get_experiences_path(context)
														
 
															-
														
 
															-    if not os.path.exists(experiences_path) or not update_map:
														
 
															-        return 0
														
 
															-
														
 
															-    with open(experiences_path, "r", encoding="utf-8") as f:
														
 
															-        full_content = f.read()
														
 
															-
														
 
															-    # 使用正则表达式解析，避免误分割
														
 
															-    pattern = r'---\n(.*?)\n---\n(.*?)(?=\n---\n|\Z)'
														
 
															-    matches = re.findall(pattern, full_content, re.DOTALL)
														
 
															-
														
 
															-    new_entries = []
														
 
															-    evolution_tasks = []
														
 
															-    evolution_registry = {}  # task_idx -> entry_idx
														
 
															-
														
 
															-    # --- 第一阶段：处理所有块 ---
														
 
															-    for yaml_str, body in matches:
														
 
															-        try:
														
 
															-            meta = yaml.safe_load(yaml_str)
														
 
															-            if not isinstance(meta, dict):
														
 
															-                logger.error(f"跳过损坏的经验块: metadata 不是 dict")
														
 
															-                continue
														
 
															-
														
 
															-            eid = meta.get("id")
														
 
															-            if not eid:
														
 
															-                logger.error("跳过损坏的经验块: 缺少 id")
														
 
															-                continue
														
 
															-
														
 
															-            if eid in update_map:
														
 
															-                instr = update_map[eid]
														
 
															-                action = instr.get("action")
														
 
															-                feedback = instr.get("feedback")
														
 
															-
														
 
															-                # 处理 mixed 中间态
														
 
															-                if action == "mixed":
														
 
															-                    meta["metrics"]["helpful"] += 1
														
 
															-                    action = "evolve"
														
 
															-
														
 
															-                if action == "helpful":
														
 
															-                    meta["metrics"]["helpful"] += 1
														
 
															-                elif action == "harmful":
														
 
															-                    meta["metrics"]["harmful"] += 1
														
 
															-                elif action == "evolve" and feedback:
														
 
															-                    # 注册进化任务
														
 
															-                    task = _evolve_body_with_llm(body.strip(), feedback)
														
 
															-                    evolution_tasks.append(task)
														
 
															-                    # 记录该任务对应的 entry 索引
														
 
															-                    evolution_registry[len(evolution_tasks) - 1] = len(new_entries)
														
 
															-                    meta["metrics"]["helpful"] += 1
														
 
															-
														
 
															-                meta["updated_at"] = datetime.now().strftime('%Y-%m-%d %H:%M:%S')
														
 
															-
														
 
															-            # 序列化并加入 new_entries
														
 
															-            meta_str = yaml.dump(meta, allow_unicode=True).strip()
														
 
															-            new_entries.append((meta_str, body.strip()))
														
 
															-
														
 
															-        except Exception as e:
														
 
															-            logger.error(f"跳过损坏的经验块: {e}")
														
 
															-            continue
														
 
															-
														
 
															-    # --- 第二阶段：并发进化 ---
														
 
															-    if evolution_tasks:
														
 
															-        print(f"🧬 并发处理 {len(evolution_tasks)} 条经验进化...")
														
 
															-        evolved_results = await asyncio.gather(*evolution_tasks)
														
 
															-
														
 
															-        # 精准回填：替换对应 entry 的 body
														
 
															-        for task_idx, entry_idx in evolution_registry.items():
														
 
															-            meta_str, _ = new_entries[entry_idx]
														
 
															-            new_entries[entry_idx] = (meta_str, evolved_results[task_idx].strip())
														
 
															-
														
 
															-    # --- 第三阶段：原子化写回 ---
														
 
															-    final_parts = []
														
 
															-    for meta_str, body in new_entries:
														
 
															-        final_parts.append(f"---\n{meta_str}\n---\n{body}\n")
														
 
															-
														
 
															-    final_content = "\n".join(final_parts)
														
 
															-    with open(experiences_path, "w", encoding="utf-8") as f:
														
 
															-        f.write(final_content)
														
 
															-
														
 
															-    return len(update_map)
														
 
															-
														
 
															-# ===== 经验库瘦身 =====
														
 
															-
														
 
															-async def slim_experiences(model: str = "anthropic/claude-sonnet-4.5", context: Optional[Any] = None) -> str:
														
 
															-    """
														
 
															-    经验库瘦身：调用顶级大模型，将经验库中语义相似的经验合并精简。
														
 
															-    返回瘦身报告字符串。
														
 
															-    """
														
 
															-    experiences_path = _get_experiences_path(context)
														
 
															-
														
 
															-    if not os.path.exists(experiences_path):
														
 
															-        return "经验文件不存在，无需瘦身。"
														
 
															-
														
 
															-    with open(experiences_path, "r", encoding="utf-8") as f:
														
 
															-        file_content = f.read()
														
 
															-
														
 
															-    # 使用正则表达式解析，避免误分割
														
 
															-    pattern = r'---\n(.*?)\n---\n(.*?)(?=\n---\n|\Z)'
														
 
															-    matches = re.findall(pattern, file_content, re.DOTALL)
														
 
															-
														
 
															-    parsed = []
														
 
															-    for yaml_str, body in matches:
														
 
															-        try:
														
 
															-            meta = yaml.safe_load(yaml_str)
														
 
															-            if not isinstance(meta, dict):
														
 
															-                continue
														
 
															-            parsed.append({"meta": meta, "body": body.strip()})
														
 
															-        except Exception:
														
 
															-            continue
														
 
															-
														
 
															-    if len(parsed) < 2:
														
 
															-        return f"经验库仅有 {len(parsed)} 条，无需瘦身。"
														
 
															-
														
 
															-    # 构造发给大模型的内容
														
 
															-    entries_text = ""
														
 
															-    for p in parsed:
														
 
															-        m = p["meta"]
														
 
															-        entries_text += f"[ID: {m.get('id')}] [Tags: {m.get('tags', {})}] "
														
 
															-        entries_text += f"[Metrics: {m.get('metrics', {})}]\n"
														
 
															-        entries_text += f"{p['body']}\n\n"
														
 
															-
														
 
															-    prompt = f"""你是一个 AI Agent 经验库管理员。以下是当前经验库的全部条目，请执行瘦身操作：
														
 
															-
														
 
															-【任务】:
														
 
															-1. 识别语义高度相似或重复的经验，将它们合并为一条更精炼、更通用的经验。
														
 
															-2. 合并时保留 helpful 最高的那条的 ID 和 metrics（metrics 中 helpful/harmful 取各条之和）。
														
 
															-3. 对于独立的、无重复的经验，保持原样不动。
														
 
															-4. 保持 ACE 规范格式：当 [条件/Context] 时，应该 [动作/Action]（原因：[逻辑/Reason]）。
														
 
															-
														
 
															-【当前经验库】:
														
 
															-{entries_text}
														
 
															-
														
 
															-【输出格式要求】:
														
 
															-严格按以下格式输出每条经验，条目之间用 === 分隔：
														
 
															-ID: <保留的id>
														
 
															-TAGS: <yaml格式的tags>
														
 
															-METRICS: <yaml格式的metrics>
														
 
															-BODY: <合并后的经验正文>
														
 
															-===
														
 
															-
														
 
															-最后一行输出合并报告，格式：
														
 
															-REPORT: 原有 X 条，合并后 Y 条，精简了 Z 条。
														
 
															-
														
 
															-禁止输出任何开场白或解释。"""
														
 
															-
														
 
															-    try:
														
 
															-        print(f"\n[经验瘦身] 正在调用 {model} 分析 {len(parsed)} 条经验...")
														
 
															-        response = await openrouter_llm_call(
														
 
															-            messages=[{"role": "user", "content": prompt}],
														
 
															-            model=model
														
 
															-        )
														
 
															-        content = response.get("content", "").strip()
														
 
															-        if not content:
														
 
															-            return "大模型返回为空，瘦身失败。"
														
 
															-
														
 
															-        # 解析大模型输出，重建经验文件
														
 
															-        report_line = ""
														
 
															-        new_entries = []
														
 
															-        blocks = [b.strip() for b in content.split("===") if b.strip()]
														
 
															-
														
 
															-        for block in blocks:
														
 
															-            if block.startswith("REPORT:"):
														
 
															-                report_line = block
														
 
															-                continue
														
 
															-
														
 
															-            lines = block.split("\n")
														
 
															-            eid, tags, metrics, body_lines = None, {}, {}, []
														
 
															-            current_field = None
														
 
															-            for line in lines:
														
 
															-                if line.startswith("ID:"):
														
 
															-                    eid = line[3:].strip()
														
 
															-                    current_field = None
														
 
															-                elif line.startswith("TAGS:"):
														
 
															-                    try:
														
 
															-                        tags = yaml.safe_load(line[5:].strip()) or {}
														
 
															-                    except Exception:
														
 
															-                        tags = {}
														
 
															-                    current_field = None
														
 
															-                elif line.startswith("METRICS:"):
														
 
															-                    try:
														
 
															-                        metrics = yaml.safe_load(line[8:].strip()) or {}
														
 
															-                    except Exception:
														
 
															-                        metrics = {"helpful": 0, "harmful": 0}
														
 
															-                    current_field = None
														
 
															-                elif line.startswith("BODY:"):
														
 
															-                    body_lines.append(line[5:].strip())
														
 
															-                    current_field = "body"
														
 
															-                elif current_field == "body":
														
 
															-                    body_lines.append(line)
														
 
															-
														
 
															-            if eid and body_lines:
														
 
															-                meta = {
														
 
															-                    "id": eid,
														
 
															-                    "tags": tags,
														
 
															-                    "metrics": metrics,
														
 
															-                    "updated_at": datetime.now().strftime('%Y-%m-%d %H:%M:%S'),
														
 
															-                }
														
 
															-                meta_str = yaml.dump(meta, allow_unicode=True).strip()
														
 
															-                body_str = "\n".join(body_lines).strip()
														
 
															-                new_entries.append(f"---\n{meta_str}\n---\n{body_str}\n")
														
 
															-
														
 
															-        if not new_entries:
														
 
															-            return "解析大模型输出失败，经验库未修改。"
														
 
															-
														
 
															-        # 写回文件
														
 
															-        final = "\n".join(new_entries)
														
 
															-        with open(experiences_path, "w", encoding="utf-8") as f:
														
 
															-            f.write(final)
														
 
															-
														
 
															-        result = f"瘦身完成：{len(parsed)} → {len(new_entries)} 条经验。"
														
 
															-        if report_line:
														
 
															-            result += f"\n{report_line}"
														
 
															-        print(f"[经验瘦身] {result}")
														
 
															-        return result
														
 
															-
														
 
															-    except Exception as e:
														
 
															-        logger.error(f"经验瘦身失败: {e}")
														
 
															-        return f"瘦身失败: {e}"
														
 
															-
														
 
															-# ===== 对外 Tool 接口 =====
														
 
															-
														
 
															-from agent.tools import tool, ToolContext
														
 
															-
														
 
															-@tool(description="通过两阶段检索获取最相关的历史经验")
														
 
															-async def get_experience(query: str, k: int = 3, context: Optional[ToolContext] = None):
														
 
															-    """
														
 
															-    通过两阶段检索获取最相关的历史经验。
														
 
															-    第一阶段语义匹配(2*k)，第二阶段质量精排(k)。
														
 
															-    """
														
 
															-    relevant_items = await _get_structured_experiences(
														
 
															-        query_text=query,
														
 
															-        top_k=k,
														
 
															-        context=context
														
 
															-    )
														
 
															-
														
 
															-    if not relevant_items:
														
 
															-        return "未找到足够相关的优质经验。"
														
 
															-
														
 
															-    return {
														
 
															-        "items": relevant_items,
														
 
															-        "count": len(relevant_items)
														
 
															-    }
														
 
															-
														
 
															-@tool()
														
 
															-async def update_experiences(feedback_list: List[Dict[str, Any]], context: Optional[ToolContext] = None):
														
 
															-    """
														
 
															-    批量反馈历史经验的有效性。
														
 
															-    
														
 
															-    Args:
														
 
															-        feedback_list: 评价列表，每个元素包含:
														
 
															-            - ex_id: (str) 经验 ID
														
 
															-            - is_effective: (bool) 是否有效
														
 
															-            - feedback: (str, optional) 改进建议，若有效且有建议则触发经验进化
														
 
															-    """
														
 
															-    if not feedback_list:
														
 
															-        return "反馈列表为空。"
														
 
															-
														
 
															-    # 将 Agent 的输入转换为底层函数需要的映射表格式
														
 
															-    update_map = {}
														
 
															-    for item in feedback_list:
														
 
															-        ex_id = item.get("ex_id")
														
 
															-        is_effective = item.get("is_effective")
														
 
															-        comment = item.get("feedback", "")
														
 
															-
														
 
															-        action = "helpful" if is_effective else "harmful"
														
 
															-        if is_effective and comment:
														
 
															-            action = "evolve"
														
 
															-        
														
 
															-        update_map[ex_id] = {
														
 
															-            "action": action,
														
 
															-            "feedback": comment
														
 
															-        }
														
 
															-
														
 
															-    count = await _batch_update_experiences(update_map, context)
														
 
															-    return f"成功同步了 {count} 条经验的反馈。感谢你的评价！"
														
--- a/agent/tools/builtin/knowledge.py
+++ b/agent/tools/builtin/knowledge.py
@@ -21,7 +21,7 @@ async def knowledge_search(
 
															     query: str,
														
 
															     top_k: int = 5,
														
 
															     min_score: int = 3,
														
 
															-    tags_type: Optional[List[str]] = None,
														
 
															+    types: Optional[List[str]] = None,
														
 
															     context: Optional[ToolContext] = None,
														
 
															 ) -> ToolResult:
														
 
															     """
														
@@ -31,7 +31,7 @@ async def knowledge_search(
 
															         query: 搜索查询（任务描述）
														
 
															         top_k: 返回数量（默认 5）
														
 
															         min_score: 最低评分过滤（默认 3）
														
 
															-        tags_type: 按类型过滤（tool/usecase/definition/plan/strategy）
														
 
															+        types: 按类型过滤（user_profile/strategy/tool/usecase/definition/plan）
														
 
															         context: 工具上下文
														
 
															     Returns:
														
@@ -43,8 +43,8 @@ async def knowledge_search(
 
															             "top_k": top_k,
														
 
															             "min_score": min_score,
														
 
															         }
														
 
															-        if tags_type:
														
 
															-            params["tags_type"] = ",".join(tags_type)
														
 
															+        if types:
														
 
															+            params["types"] = ",".join(types)
														
 
															         async with httpx.AsyncClient(timeout=60.0) as client:
														
 
															             response = await client.get(f"{KNOWHUB_API}/api/knowledge/search", params=params)
														
@@ -65,8 +65,10 @@ async def knowledge_search(
 
															         output_lines = [f"查询: {query}\n", f"找到 {count} 条相关知识:\n"]
														
 
															         for idx, item in enumerate(results, 1):
														
 
															-            output_lines.append(f"\n### {idx}. [{item['id']}] (⭐ {item.get('score', 3)})")
														
 
															-            output_lines.append(f"**场景**: {item['scenario'][:150]}...")
														
 
															+            eval_data = item.get("eval", {})
														
 
															+            score = eval_data.get("score", 3)
														
 
															+            output_lines.append(f"\n### {idx}. [{item['id']}] (⭐ {score})")
														
 
															+            output_lines.append(f"**任务**: {item['task'][:150]}...")
														
 
															             output_lines.append(f"**内容**: {item['content'][:200]}...")
														
 
															         return ToolResult(
														
@@ -91,11 +93,17 @@ async def knowledge_search(
 
															 @tool()
														
 
															 async def knowledge_save(
														
 
															-    scenario: str,
														
 
															+    task: str,
														
 
															     content: str,
														
 
															-    tags_type: List[str],
														
 
															+    types: List[str],
														
 
															+    tags: Optional[Dict[str, str]] = None,
														
 
															+    scopes: Optional[List[str]] = None,
														
 
															+    owner: Optional[str] = None,
														
 
															+    source_name: str = "",
														
 
															+    source_category: str = "exp",
														
 
															     urls: List[str] = None,
														
 
															     agent_id: str = "research_agent",
														
 
															+    submitted_by: str = "",
														
 
															     score: int = 3,
														
 
															     message_id: str = "",
														
 
															     context: Optional[ToolContext] = None,
														
@@ -104,11 +112,17 @@ async def knowledge_save(
 
															     保存新知识
														
 
															     Args:
														
 
															-        scenario: 任务描述（在什么情景下 + 要完成什么目标）
														
 
															+        task: 任务描述（在什么情景下 + 要完成什么目标）
														
 
															         content: 核心内容
														
 
															-        tags_type: 知识类型标签，可选：tool, usecase, definition, plan, strategy
														
 
															+        types: 知识类型标签，可选：user_profile, strategy, tool, usecase, definition, plan
														
 
															+        tags: 业务标签（JSON 对象）
														
 
															+        scopes: 可见范围（默认 ["org:cybertogether"]）
														
 
															+        owner: 所有者（默认 agent:{agent_id}）
														
 
															+        source_name: 来源名称
														
 
															+        source_category: 来源类别（paper/exp/skill/book）
														
 
															         urls: 参考来源链接列表
														
 
															         agent_id: 执行此调研的 agent ID
														
 
															+        submitted_by: 提交者
														
 
															         score: 初始评分 1-5（默认 3）
														
 
															         message_id: 来源 Message ID
														
 
															         context: 工具上下文
														
@@ -117,14 +131,33 @@ async def knowledge_save(
 
															         保存结果
														
 
															     """
														
 
															     try:
														
 
															+        # 设置默认值（在 agent 代码中，不是服务器端）
														
 
															+        if scopes is None:
														
 
															+            scopes = ["org:cybertogether"]
														
 
															+        if owner is None:
														
 
															+            owner = f"agent:{agent_id}"
														
 
															+
														
 
															         payload = {
														
 
															-            "scenario": scenario,
														
 
															+            "message_id": message_id,
														
 
															+            "types": types,
														
 
															+            "task": task,
														
 
															+            "tags": tags or {},
														
 
															+            "scopes": scopes,
														
 
															+            "owner": owner,
														
 
															             "content": content,
														
 
															-            "tags_type": tags_type,
														
 
															-            "urls": urls or [],
														
 
															-            "agent_id": agent_id,
														
 
															-            "score": score,
														
 
															-            "message_id": message_id
														
 
															+            "source": {
														
 
															+                "name": source_name,
														
 
															+                "category": source_category,
														
 
															+                "urls": urls or [],
														
 
															+                "agent_id": agent_id,
														
 
															+                "submitted_by": submitted_by,
														
 
															+            },
														
 
															+            "eval": {
														
 
															+                "score": score,
														
 
															+                "helpful": 1,
														
 
															+                "harmful": 0,
														
 
															+                "confidence": 0.5,
														
 
															+            }
														
 
															         }
														
 
															         async with httpx.AsyncClient(timeout=30.0) as client:
														
@@ -136,8 +169,8 @@ async def knowledge_save(
 
															         return ToolResult(
														
 
															             title="✅ 知识已保存",
														
 
															-            output=f"知识 ID: {knowledge_id}\n\n场景:\n{scenario[:100]}...",
														
 
															-            long_term_memory=f"保存知识: {knowledge_id} - {scenario[:50]}",
														
 
															+            output=f"知识 ID: {knowledge_id}\n\n任务:\n{task[:100]}...",
														
 
															+            long_term_memory=f"保存知识: {knowledge_id} - {task[:50]}",
														
 
															             metadata={"knowledge_id": knowledge_id}
														
 
															         )
														
@@ -273,7 +306,8 @@ async def knowledge_batch_update(
 
															 @tool()
														
 
															 async def knowledge_list(
														
 
															     limit: int = 10,
														
 
															-    tags_type: Optional[List[str]] = None,
														
 
															+    types: Optional[List[str]] = None,
														
 
															+    scopes: Optional[List[str]] = None,
														
 
															     context: Optional[ToolContext] = None,
														
 
															 ) -> ToolResult:
														
 
															     """
														
@@ -281,7 +315,8 @@ async def knowledge_list(
 
															     Args:
														
 
															         limit: 返回数量限制（默认 10）
														
 
															-        tags_type: 按类型过滤（可选）
														
 
															+        types: 按类型过滤（可选）
														
 
															+        scopes: 按范围过滤（可选）
														
 
															         context: 工具上下文
														
 
															     Returns:
														
@@ -289,8 +324,10 @@ async def knowledge_list(
 
															     """
														
 
															     try:
														
 
															         params = {"limit": limit}
														
 
															-        if tags_type:
														
 
															-            params["tags_type"] = ",".join(tags_type)
														
 
															+        if types:
														
 
															+            params["types"] = ",".join(types)
														
 
															+        if scopes:
														
 
															+            params["scopes"] = ",".join(scopes)
														
 
															         async with httpx.AsyncClient(timeout=30.0) as client:
														
 
															             response = await client.get(f"{KNOWHUB_API}/api/knowledge", params=params)
														
@@ -309,8 +346,9 @@ async def knowledge_list(
 
															         output_lines = [f"共找到 {count} 条知识:\n"]
														
 
															         for item in results:
														
 
															-            score = item.get("eval", {}).get("score", 3)
														
 
															-            output_lines.append(f"- [{item['id']}] (⭐{score}) {item['scenario'][:60]}...")
														
 
															+            eval_data = item.get("eval", {})
														
 
															+            score = eval_data.get("score", 3)
														
 
															+            output_lines.append(f"- [{item['id']}] (⭐{score}) {item['task'][:60]}...")
														
 
															         return ToolResult(
														
 
															             title="📚 知识列表",
														
@@ -329,14 +367,14 @@ async def knowledge_list(
 
															 @tool()
														
 
															 async def knowledge_slim(
														
 
															-    model: str = "anthropic/claude-sonnet-4.5",
														
 
															+    model: str = "google/gemini-2.0-flash-001",
														
 
															     context: Optional[ToolContext] = None,
														
 
															 ) -> ToolResult:
														
 
															     """
														
 
															     知识库瘦身：调用顶级大模型，将知识库中语义相似的知识合并精简
														
 
															     Args:
														
 
															-        model: 使用的模型（默认 claude-sonnet-4.5）
														
 
															+        model: 使用的模型（默认 gemini-2.0-flash-001）
														
 
															         context: 工具上下文
														
 
															     Returns:
														
@@ -370,29 +408,3 @@ async def knowledge_slim(
 
															             error=str(e)
														
 
															         )
														
 
															-
														
 
															-# 兼容接口：get_experience
														
 
															-@tool(description="检索历史经验（strategy 标签的知识）")
														
 
															-async def get_experience(
														
 
															-    query: str,
														
 
															-    k: int = 3,
														
 
															-    context: Optional[ToolContext] = None,
														
 
															-) -> ToolResult:
														
 
															-    """
														
 
															-    检索历史经验（兼容接口，实际调用 knowledge_search 并过滤 strategy 标签）
														
 
															-
														
 
															-    Args:
														
 
															-        query: 搜索查询（任务描述）
														
 
															-        k: 返回数量（默认 3）
														
 
															-        context: 工具上下文
														
 
															-
														
 
															-    Returns:
														
 
															-        相关经验列表
														
 
															-    """
														
 
															-    return await knowledge_search(
														
 
															-        query=query,
														
 
															-        top_k=k,
														
 
															-        min_score=1,  # 经验的评分门槛较低
														
 
															-        tags_type=["strategy"],
														
 
															-        context=context
														
 
															-    )
														
--- a/agent/tools/builtin/knowledge.py.backup
+++ b/agent/tools/builtin/knowledge.py.backup
@@ -1,1183 +0,0 @@
 
															-"""
														
 
															-原子知识保存工具
														
 
															-
														
 
															-提供便捷的 API 让 Agent 快速保存结构化的原子知识
														
 
															-"""
														
 
															-
														
 
															-import os
														
 
															-import re
														
 
															-import json
														
 
															-import yaml
														
 
															-import logging
														
 
															-from datetime import datetime
														
 
															-from pathlib import Path
														
 
															-from typing import List, Dict, Optional, Any
														
 
															-from agent.tools import tool, ToolResult, ToolContext
														
 
															-from ...llm.openrouter import openrouter_llm_call
														
 
															-
														
 
															-logger = logging.getLogger(__name__)
														
 
															-
														
 
															-
														
 
															-def _generate_knowledge_id() -> str:
														
 
															-    """生成知识原子 ID（带微秒和随机后缀避免冲突）"""
														
 
															-    import uuid
														
 
															-    timestamp = datetime.now().strftime('%Y%m%d-%H%M%S')
														
 
															-    random_suffix = uuid.uuid4().hex[:4]
														
 
															-    return f"knowledge-{timestamp}-{random_suffix}"
														
 
															-
														
 
															-
														
 
															-def _format_yaml_list(items: List[str], indent: int = 2) -> str:
														
 
															-    """格式化 YAML 列表"""
														
 
															-    if not items:
														
 
															-        return "[]"
														
 
															-    indent_str = " " * indent
														
 
															-    return "\n" + "\n".join(f"{indent_str}- {item}" for item in items)
														
 
															-
														
 
															-
														
 
															-@tool()
														
 
															-async def save_knowledge(
														
 
															-    scenario: str,
														
 
															-    content: str,
														
 
															-    tags_type: List[str],
														
 
															-    urls: List[str] = None,
														
 
															-    agent_id: str = "research_agent",
														
 
															-    score: int = 3,
														
 
															-    trace_id: str = "",
														
 
															-) -> ToolResult:
														
 
															-    """
														
 
															-    保存原子知识到本地文件（JSON 格式）
														
 
															-
														
 
															-    Args:
														
 
															-        scenario: 任务描述（在什么情景下 + 要完成什么目标 + 得到能达成一个什么结果）
														
 
															-        content: 核心内容
														
 
															-        tags_type: 知识类型标签，可选：tool, usercase, definition, plan, strategy
														
 
															-        urls: 参考来源链接列表（论文/GitHub/博客等）
														
 
															-        agent_id: 执行此调研的 agent ID
														
 
															-        score: 初始评分 1-5（默认 3）
														
 
															-        trace_id: 当前 trace ID（可选）
														
 
															-
														
 
															-    Returns:
														
 
															-        保存结果
														
 
															-    """
														
 
															-    try:
														
 
															-        # 生成 ID
														
 
															-        knowledge_id = _generate_knowledge_id()
														
 
															-
														
 
															-        # 准备目录
														
 
															-        knowledge_dir = Path(".cache/knowledge_atoms")
														
 
															-        knowledge_dir.mkdir(parents=True, exist_ok=True)
														
 
															-
														
 
															-        # 构建文件路径（使用 .json 扩展名）
														
 
															-        file_path = knowledge_dir / f"{knowledge_id}.json"
														
 
															-
														
 
															-        # 构建 JSON 数据结构
														
 
															-        knowledge_data = {
														
 
															-            "id": knowledge_id,
														
 
															-            "trace_id": trace_id or "N/A",
														
 
															-            "tags": {
														
 
															-                "type": tags_type
														
 
															-            },
														
 
															-            "scenario": scenario,
														
 
															-            "content": content,
														
 
															-            "trace": {
														
 
															-                "urls": urls or [],
														
 
															-                "agent_id": agent_id,
														
 
															-                "timestamp": datetime.now().isoformat()
														
 
															-            },
														
 
															-            "eval": {
														
 
															-                "score": score,
														
 
															-                "helpful": 0,
														
 
															-                "harmful": 0,
														
 
															-                "helpful_history": [],
														
 
															-                "harmful_history": []
														
 
															-            },
														
 
															-            "metrics": {
														
 
															-                "helpful": 1,
														
 
															-                "harmful": 0
														
 
															-            },
														
 
															-            "created_at": datetime.now().strftime('%Y-%m-%d %H:%M:%S')
														
 
															-        }
														
 
															-
														
 
															-        # 保存为 JSON 文件
														
 
															-        with open(file_path, "w", encoding="utf-8") as f:
														
 
															-            json.dump(knowledge_data, f, ensure_ascii=False, indent=2)
														
 
															-
														
 
															-        return ToolResult(
														
 
															-            title="✅ 原子知识已保存",
														
 
															-            output=f"知识 ID: {knowledge_id}\n文件路径: {file_path}\n\n场景:\n{scenario[:100]}...",
														
 
															-            long_term_memory=f"保存原子知识: {knowledge_id} - {scenario[:50]}",
														
 
															-            metadata={"knowledge_id": knowledge_id, "file_path": str(file_path)}
														
 
															-        )
														
 
															-
														
 
															-    except Exception as e:
														
 
															-        return ToolResult(
														
 
															-            title="❌ 保存失败",
														
 
															-            output=f"错误: {str(e)}",
														
 
															-            error=str(e)
														
 
															-        )
														
 
															-
														
 
															-
														
 
															-@tool()
														
 
															-async def update_knowledge(
														
 
															-    knowledge_id: str,
														
 
															-    add_helpful_case: Optional[Dict[str, str]] = None,
														
 
															-    add_harmful_case: Optional[Dict[str, str]] = None,
														
 
															-    update_score: Optional[int] = None,
														
 
															-    evolve_feedback: Optional[str] = None,
														
 
															-) -> ToolResult:
														
 
															-    """
														
 
															-    更新已有的原子知识的评估反馈
														
 
															-
														
 
															-    Args:
														
 
															-        knowledge_id: 知识 ID（如 research-20260302-001）
														
 
															-        add_helpful_case: 添加好用的案例 {"case_id": "...", "scenario": "...", "result": "...", "timestamp": "..."}
														
 
															-        add_harmful_case: 添加不好用的案例 {"case_id": "...", "scenario": "...", "result": "...", "timestamp": "..."}
														
 
															-        update_score: 更新评分（1-5）
														
 
															-        evolve_feedback: 经验进化反馈（当提供时，会使用 LLM 重写知识内容）
														
 
															-
														
 
															-    Returns:
														
 
															-        更新结果
														
 
															-    """
														
 
															-    try:
														
 
															-        # 查找文件（支持 JSON 和 MD 格式）
														
 
															-        knowledge_dir = Path(".cache/knowledge_atoms")
														
 
															-        json_path = knowledge_dir / f"{knowledge_id}.json"
														
 
															-        md_path = knowledge_dir / f"{knowledge_id}.md"
														
 
															-
														
 
															-        file_path = None
														
 
															-        if json_path.exists():
														
 
															-            file_path = json_path
														
 
															-            is_json = True
														
 
															-        elif md_path.exists():
														
 
															-            file_path = md_path
														
 
															-            is_json = False
														
 
															-        else:
														
 
															-            return ToolResult(
														
 
															-                title="❌ 文件不存在",
														
 
															-                output=f"未找到知识文件: {knowledge_id}",
														
 
															-                error="文件不存在"
														
 
															-            )
														
 
															-
														
 
															-        # 读取现有内容
														
 
															-        with open(file_path, "r", encoding="utf-8") as f:
														
 
															-            content = f.read()
														
 
															-
														
 
															-        # 解析数据
														
 
															-        if is_json:
														
 
															-            data = json.loads(content)
														
 
															-        else:
														
 
															-            # 解析 YAML frontmatter
														
 
															-            yaml_match = re.search(r'^---\n(.*?)\n---', content, re.DOTALL)
														
 
															-            if not yaml_match:
														
 
															-                return ToolResult(
														
 
															-                    title="❌ 格式错误",
														
 
															-                    output=f"无法解析知识文件格式: {file_path}",
														
 
															-                    error="格式错误"
														
 
															-                )
														
 
															-            data = yaml.safe_load(yaml_match.group(1))
														
 
															-
														
 
															-        # 更新内容
														
 
															-        updated = False
														
 
															-        summary = []
														
 
															-
														
 
															-        if add_helpful_case:
														
 
															-            data["eval"]["helpful"] += 1
														
 
															-            data["eval"]["helpful_history"].append(add_helpful_case)
														
 
															-            data["metrics"]["helpful"] += 1
														
 
															-            summary.append(f"添加 helpful 案例: {add_helpful_case.get('case_id')}")
														
 
															-            updated = True
														
 
															-
														
 
															-        if add_harmful_case:
														
 
															-            data["eval"]["harmful"] += 1
														
 
															-            data["eval"]["harmful_history"].append(add_harmful_case)
														
 
															-            data["metrics"]["harmful"] += 1
														
 
															-            summary.append(f"添加 harmful 案例: {add_harmful_case.get('case_id')}")
														
 
															-            updated = True
														
 
															-
														
 
															-        if update_score is not None:
														
 
															-            data["eval"]["score"] = update_score
														
 
															-            summary.append(f"更新评分: {update_score}")
														
 
															-            updated = True
														
 
															-
														
 
															-        # 经验进化机制
														
 
															-        if evolve_feedback:
														
 
															-            old_content = data.get("content", "")
														
 
															-            evolved_content = await _evolve_knowledge_with_llm(old_content, evolve_feedback)
														
 
															-            data["content"] = evolved_content
														
 
															-            data["metrics"]["helpful"] += 1
														
 
															-            summary.append(f"知识进化: 基于反馈重写内容")
														
 
															-            updated = True
														
 
															-
														
 
															-        if not updated:
														
 
															-            return ToolResult(
														
 
															-                title="⚠️ 无更新",
														
 
															-                output="未指定任何更新内容",
														
 
															-                long_term_memory="尝试更新原子知识但未指定更新内容"
														
 
															-            )
														
 
															-
														
 
															-        # 更新时间戳
														
 
															-        data["updated_at"] = datetime.now().strftime('%Y-%m-%d %H:%M:%S')
														
 
															-
														
 
															-        # 保存更新
														
 
															-        if is_json:
														
 
															-            with open(file_path, "w", encoding="utf-8") as f:
														
 
															-                json.dump(data, f, ensure_ascii=False, indent=2)
														
 
															-        else:
														
 
															-            # 重新生成 YAML frontmatter
														
 
															-            meta_str = yaml.dump(data, allow_unicode=True).strip()
														
 
															-            with open(file_path, "w", encoding="utf-8") as f:
														
 
															-                f.write(f"---\n{meta_str}\n---\n")
														
 
															-
														
 
															-        return ToolResult(
														
 
															-            title="✅ 原子知识已更新",
														
 
															-            output=f"知识 ID: {knowledge_id}\n文件路径: {file_path}\n\n更新内容:\n" + "\n".join(f"- {s}" for s in summary),
														
 
															-            long_term_memory=f"更新原子知识: {knowledge_id}"
														
 
															-        )
														
 
															-
														
 
															-    except Exception as e:
														
 
															-        return ToolResult(
														
 
															-            title="❌ 更新失败",
														
 
															-            output=f"错误: {str(e)}",
														
 
															-            error=str(e)
														
 
															-        )
														
 
															-
														
 
															-
														
 
															-@tool()
														
 
															-async def list_knowledge(
														
 
															-    limit: int = 10,
														
 
															-    tags_type: Optional[List[str]] = None,
														
 
															-) -> ToolResult:
														
 
															-    """
														
 
															-    列出已保存的原子知识
														
 
															-
														
 
															-    Args:
														
 
															-        limit: 返回数量限制（默认 10）
														
 
															-        tags_type: 按类型过滤（可选）
														
 
															-
														
 
															-    Returns:
														
 
															-        知识列表
														
 
															-    """
														
 
															-    try:
														
 
															-        knowledge_dir = Path(".cache/knowledge_atoms")
														
 
															-
														
 
															-        if not knowledge_dir.exists():
														
 
															-            return ToolResult(
														
 
															-                title="📂 知识库为空",
														
 
															-                output="还没有保存任何原子知识",
														
 
															-                long_term_memory="知识库为空"
														
 
															-            )
														
 
															-
														
 
															-        # 获取所有文件
														
 
															-        files = sorted(knowledge_dir.glob("*.md"), key=lambda x: x.stat().st_mtime, reverse=True)
														
 
															-
														
 
															-        if not files:
														
 
															-            return ToolResult(
														
 
															-                title="📂 知识库为空",
														
 
															-                output="还没有保存任何原子知识",
														
 
															-                long_term_memory="知识库为空"
														
 
															-            )
														
 
															-
														
 
															-        # 读取并过滤
														
 
															-        results = []
														
 
															-        for file_path in files[:limit]:
														
 
															-            with open(file_path, "r", encoding="utf-8") as f:
														
 
															-                content = f.read()
														
 
															-
														
 
															-            # 提取关键信息
														
 
															-            import re
														
 
															-            id_match = re.search(r"id: (.+)", content)
														
 
															-            scenario_match = re.search(r"scenario: \|\n  (.+)", content)
														
 
															-            score_match = re.search(r"score: (\d+)", content)
														
 
															-
														
 
															-            knowledge_id = id_match.group(1) if id_match else "unknown"
														
 
															-            scenario = scenario_match.group(1) if scenario_match else "N/A"
														
 
															-            score = score_match.group(1) if score_match else "N/A"
														
 
															-
														
 
															-            results.append(f"- [{knowledge_id}] (⭐{score}) {scenario[:60]}...")
														
 
															-
														
 
															-        output = f"共找到 {len(files)} 条原子知识，显示最近 {len(results)} 条:\n\n" + "\n".join(results)
														
 
															-
														
 
															-        return ToolResult(
														
 
															-            title="📚 原子知识列表",
														
 
															-            output=output,
														
 
															-            long_term_memory=f"列出 {len(results)} 条原子知识"
														
 
															-        )
														
 
															-
														
 
															-    except Exception as e:
														
 
															-        return ToolResult(
														
 
															-            title="❌ 列表失败",
														
 
															-            output=f"错误: {str(e)}",
														
 
															-            error=str(e)
														
 
															-        )
														
 
															-
														
 
															-
														
 
															-# ===== 语义检索功能 =====
														
 
															-
														
 
															-async def _route_knowledge_by_llm(query_text: str, metadata_list: List[Dict], k: int = 5) -> List[str]:
														
 
															-    """
														
 
															-    第一阶段：语义路由。
														
 
															-    让 LLM 挑选出 2*k 个语义相关的 ID。
														
 
															-    """
														
 
															-    if not metadata_list:
														
 
															-        return []
														
 
															-
														
 
															-    # 扩大筛选范围到 2*k
														
 
															-    routing_k = k * 2
														
 
															-
														
 
															-    routing_data = [
														
 
															-        {
														
 
															-            "id": m["id"],
														
 
															-            "tags": m["tags"],
														
 
															-            "scenario": m["scenario"][:100]  # 只取前100字符
														
 
															-        } for m in metadata_list
														
 
															-    ]
														
 
															-
														
 
															-    prompt = f"""
														
 
															-你是一个知识检索专家。根据用户的当前任务需求，从下列原子知识元数据中挑选出最相关的最多 {routing_k} 个知识 ID。
														
 
															-任务需求："{query_text}"
														
 
															-
														
 
															-可选知识列表：
														
 
															-{json.dumps(routing_data, ensure_ascii=False, indent=1)}
														
 
															-
														
 
															-请直接输出 ID 列表，用逗号分隔（例如: knowledge-20260302-001, research-20260302-002）。若无相关项请输出 "None"。
														
 
															-"""
														
 
															-
														
 
															-    try:
														
 
															-        print(f"\n[Step 1: 知识语义路由] 任务: '{query_text}' | 候选总数: {len(metadata_list)} | 目标提取数: {routing_k}")
														
 
															-
														
 
															-        response = await openrouter_llm_call(
														
 
															-            messages=[{"role": "user", "content": prompt}],
														
 
															-            model="google/gemini-2.0-flash-001"
														
 
															-        )
														
 
															-
														
 
															-        content = response.get("content", "").strip()
														
 
															-        selected_ids = [idx.strip() for idx in re.split(r'[,\s]+', content) if idx.strip().startswith(("knowledge-", "research-"))]
														
 
															-
														
 
															-        print(f"[Step 1: 知识语义路由] LLM 初选 ID ({len(selected_ids)}个): {selected_ids}")
														
 
															-        return selected_ids
														
 
															-    except Exception as e:
														
 
															-        logger.error(f"LLM 知识路由失败: {e}")
														
 
															-        return []
														
 
															-
														
 
															-
														
 
															-async def _evolve_knowledge_with_llm(old_content: str, feedback: str) -> str:
														
 
															-    """
														
 
															-    使用 LLM 进行知识进化重写（类似经验进化机制）
														
 
															-    """
														
 
															-    prompt = f"""你是一个 AI Agent 知识库管理员。请根据反馈建议，对现有的知识内容进行重写进化。
														
 
															-
														
 
															-【原知识内容】:
														
 
															-{old_content}
														
 
															-
														
 
															-【实战反馈建议】:
														
 
															-{feedback}
														
 
															-
														
 
															-【重写要求】:
														
 
															-1. 融合知识：将反馈中的避坑指南、新参数或修正后的选择逻辑融入原知识，使其更具通用性和准确性。
														
 
															-2. 保持结构：如果原内容有特定格式（如 Markdown、代码示例等），请保持该格式。
														
 
															-3. 语言：简洁直接，使用中文。
														
 
															-4. 禁止：严禁输出任何开场白、解释语或额外的 Markdown 标题，直接返回重写后的正文。
														
 
															-"""
														
 
															-    try:
														
 
															-        response = await openrouter_llm_call(
														
 
															-            messages=[{"role": "user", "content": prompt}],
														
 
															-            model="google/gemini-2.0-flash-001"
														
 
															-        )
														
 
															-
														
 
															-        evolved_content = response.get("content", "").strip()
														
 
															-
														
 
															-        # 简单安全校验：如果 LLM 返回太短或为空，回退到原内容+追加
														
 
															-        if len(evolved_content) < 5:
														
 
															-            raise ValueError("LLM output too short")
														
 
															-
														
 
															-        return evolved_content
														
 
															-
														
 
															-    except Exception as e:
														
 
															-        logger.warning(f"知识进化失败，采用追加模式回退: {e}")
														
 
															-        timestamp = datetime.now().strftime('%Y-%m-%d')
														
 
															-        return f"{old_content}\n\n---\n[Update {timestamp}]: {feedback}"
														
 
															-
														
 
															-
														
 
															-async def _route_knowledge_by_llm(query_text: str, metadata_list: List[Dict], k: int = 5) -> List[str]:
														
 
															-    """
														
 
															-    第一阶段：语义路由。
														
 
															-    让 LLM 挑选出 2*k 个语义相关的 ID。
														
 
															-    """
														
 
															-    if not metadata_list:
														
 
															-        return []
														
 
															-
														
 
															-    # 扩大筛选范围到 2*k
														
 
															-    routing_k = k * 2
														
 
															-
														
 
															-    routing_data = [
														
 
															-        {
														
 
															-            "id": m["id"],
														
 
															-            "tags": m["tags"],
														
 
															-            "scenario": m["scenario"][:100]  # 只取前100字符
														
 
															-        } for m in metadata_list
														
 
															-    ]
														
 
															-
														
 
															-    prompt = f"""
														
 
															-你是一个知识检索专家。根据用户的当前任务需求，从下列原子知识元数据中挑选出最相关的最多 {routing_k} 个知识 ID。
														
 
															-任务需求："{query_text}"
														
 
															-
														
 
															-可选知识列表：
														
 
															-{json.dumps(routing_data, ensure_ascii=False, indent=1)}
														
 
															-
														
 
															-请直接输出 ID 列表，用逗号分隔（例如: knowledge-20260302-001, research-20260302-002）。若无相关项请输出 "None"。
														
 
															-"""
														
 
															-
														
 
															-    try:
														
 
															-        print(f"\n[Step 1: 知识语义路由] 任务: '{query_text}' | 候选总数: {len(metadata_list)} | 目标提取数: {routing_k}")
														
 
															-
														
 
															-        response = await openrouter_llm_call(
														
 
															-            messages=[{"role": "user", "content": prompt}],
														
 
															-            model="google/gemini-2.0-flash-001"
														
 
															-        )
														
 
															-
														
 
															-        content = response.get("content", "").strip()
														
 
															-        selected_ids = [idx.strip() for idx in re.split(r'[,\s]+', content) if idx.strip().startswith(("knowledge-", "research-"))]
														
 
															-
														
 
															-        print(f"[Step 1: 知识语义路由] LLM 初选 ID ({len(selected_ids)}个): {selected_ids}")
														
 
															-        return selected_ids
														
 
															-    except Exception as e:
														
 
															-        logger.error(f"LLM 知识路由失败: {e}")
														
 
															-        return []
														
 
															-
														
 
															-
														
 
															-async def _get_structured_knowledge(
														
 
															-    query_text: str,
														
 
															-    top_k: int = 5,
														
 
															-    min_score: int = 3,
														
 
															-    context: Optional[Any] = None,
														
 
															-    tags_filter: Optional[List[str]] = None
														
 
															-) -> List[Dict]:
														
 
															-    """
														
 
															-    语义检索原子知识（包括经验）
														
 
															-
														
 
															-    1. 解析知识库文件（支持 JSON 和 YAML 格式）
														
 
															-    2. 语义路由：提取 2*k 个 ID
														
 
															-    3. 质量精排：基于评分筛选出最终的 k 个
														
 
															-
														
 
															-    Args:
														
 
															-        query_text: 查询文本
														
 
															-        top_k: 返回数量
														
 
															-        min_score: 最低评分过滤
														
 
															-        context: 上下文（兼容 experience 接口）
														
 
															-        tags_filter: 标签过滤（如 ["strategy"] 只返回经验）
														
 
															-    """
														
 
															-    knowledge_dir = Path(".cache/knowledge_atoms")
														
 
															-
														
 
															-    if not knowledge_dir.exists():
														
 
															-        print(f"[Knowledge System] 警告: 知识库目录不存在 ({knowledge_dir})")
														
 
															-        return []
														
 
															-
														
 
															-    # 同时支持 .json 和 .md 文件
														
 
															-    json_files = list(knowledge_dir.glob("*.json"))
														
 
															-    md_files = list(knowledge_dir.glob("*.md"))
														
 
															-    files = json_files + md_files
														
 
															-
														
 
															-    if not files:
														
 
															-        print(f"[Knowledge System] 警告: 知识库为空")
														
 
															-        return []
														
 
															-
														
 
															-    # --- 阶段 1: 解析所有知识文件 ---
														
 
															-    content_map = {}
														
 
															-    metadata_list = []
														
 
															-
														
 
															-    for file_path in files:
														
 
															-        try:
														
 
															-            with open(file_path, "r", encoding="utf-8") as f:
														
 
															-                content = f.read()
														
 
															-
														
 
															-            # 根据文件扩展名选择解析方式
														
 
															-            if file_path.suffix == ".json":
														
 
															-                # 解析 JSON 格式
														
 
															-                metadata = json.loads(content)
														
 
															-            else:
														
 
															-                # 解析 YAML frontmatter（兼容旧格式）
														
 
															-                yaml_match = re.search(r'^---\n(.*?)\n---', content, re.DOTALL)
														
 
															-                if not yaml_match:
														
 
															-                    logger.warning(f"跳过无效文件: {file_path}")
														
 
															-                    continue
														
 
															-                metadata = yaml.safe_load(yaml_match.group(1))
														
 
															-
														
 
															-            if not isinstance(metadata, dict):
														
 
															-                logger.warning(f"跳过损坏的知识文件: {file_path}")
														
 
															-                continue
														
 
															-
														
 
															-            kid = metadata.get("id")
														
 
															-            if not kid:
														
 
															-                logger.warning(f"跳过缺少 id 的知识文件: {file_path}")
														
 
															-                continue
														
 
															-
														
 
															-            # 提取 scenario 和 content
														
 
															-            scenario = metadata.get("scenario", "").strip()
														
 
															-            content_text = metadata.get("content", "").strip()
														
 
															-
														
 
															-            # 标签过滤
														
 
															-            tags = metadata.get("tags", {})
														
 
															-            if tags_filter:
														
 
															-                # 检查 tags.type 是否包含任何过滤标签
														
 
															-                tag_types = tags.get("type", [])
														
 
															-                if isinstance(tag_types, str):
														
 
															-                    tag_types = [tag_types]
														
 
															-                if not any(tag in tag_types for tag in tags_filter):
														
 
															-                    continue  # 跳过不匹配的标签
														
 
															-
														
 
															-            meta_item = {
														
 
															-                "id": kid,
														
 
															-                "tags": tags,
														
 
															-                "scenario": scenario,
														
 
															-                "score": metadata.get("eval", {}).get("score", 3),
														
 
															-                "helpful": metadata.get("metrics", {}).get("helpful", 0),
														
 
															-                "harmful": metadata.get("metrics", {}).get("harmful", 0),
														
 
															-            }
														
 
															-            metadata_list.append(meta_item)
														
 
															-            content_map[kid] = {
														
 
															-                "scenario": scenario,
														
 
															-                "content": content_text,
														
 
															-                "tags": tags,
														
 
															-                "score": meta_item["score"],
														
 
															-                "helpful": meta_item["helpful"],
														
 
															-                "harmful": meta_item["harmful"],
														
 
															-            }
														
 
															-        except Exception as e:
														
 
															-            logger.error(f"解析知识文件失败 {file_path}: {e}")
														
 
															-            continue
														
 
															-
														
 
															-    if not metadata_list:
														
 
															-        print(f"[Knowledge System] 警告: 没有有效的知识条目")
														
 
															-        return []
														
 
															-
														
 
															-    # --- 阶段 2: 语义路由 (取 2*k) ---
														
 
															-    candidate_ids = await _route_knowledge_by_llm(query_text, metadata_list, k=top_k)
														
 
															-
														
 
															-    # --- 阶段 3: 质量精排 (根据评分和反馈选出最终的 k) ---
														
 
															-    print(f"[Step 2: 知识质量精排] 正在根据评分和反馈进行打分...")
														
 
															-    scored_items = []
														
 
															-
														
 
															-    for kid in candidate_ids:
														
 
															-        if kid in content_map:
														
 
															-            item = content_map[kid]
														
 
															-            score = item["score"]
														
 
															-            helpful = item["helpful"]
														
 
															-            harmful = item["harmful"]
														
 
															-
														
 
															-            # 计算综合分：基础分 + helpful - harmful*2
														
 
															-            quality_score = score + helpful - (harmful * 2.0)
														
 
															-
														
 
															-            # 过滤门槛：评分低于 min_score 或质量分过低
														
 
															-            if score < min_score or quality_score < 0:
														
 
															-                print(f"  - 剔除低质量知识: {kid} (Score: {score}, Helpful: {helpful}, Harmful: {harmful})")
														
 
															-                continue
														
 
															-
														
 
															-            scored_items.append({
														
 
															-                "id": kid,
														
 
															-                "scenario": item["scenario"],
														
 
															-                "content": item["content"],
														
 
															-                "tags": item["tags"],
														
 
															-                "score": score,
														
 
															-                "quality_score": quality_score,
														
 
															-                "metrics": {
														
 
															-                    "helpful": helpful,
														
 
															-                    "harmful": harmful
														
 
															-                }
														
 
															-            })
														
 
															-
														
 
															-    # 按照质量分排序
														
 
															-    final_sorted = sorted(scored_items, key=lambda x: x["quality_score"], reverse=True)
														
 
															-
														
 
															-    # 截取最终的 top_k
														
 
															-    result = final_sorted[:top_k]
														
 
															-
														
 
															-    print(f"[Step 2: 知识质量精排] 最终选定知识: {[it['id'] for it in result]}")
														
 
															-    print(f"[Knowledge System] 检索结束。\n")
														
 
															-    return result
														
 
															-
														
 
															-
														
 
															-@tool()
														
 
															-async def search_knowledge(
														
 
															-    query: str,
														
 
															-    top_k: int = 5,
														
 
															-    min_score: int = 3,
														
 
															-    tags_type: Optional[List[str]] = None,
														
 
															-    context: Optional[ToolContext] = None,
														
 
															-) -> ToolResult:
														
 
															-    """
														
 
															-    语义检索原子知识库
														
 
															-
														
 
															-    Args:
														
 
															-        query: 搜索查询（任务描述）
														
 
															-        top_k: 返回数量（默认 5）
														
 
															-        min_score: 最低评分过滤（默认 3）
														
 
															-        tags_type: 按类型过滤（tool/usercase/definition/plan）
														
 
															-        context: 工具上下文
														
 
															-
														
 
															-    Returns:
														
 
															-        相关知识列表
														
 
															-    """
														
 
															-    try:
														
 
															-        relevant_items = await _get_structured_knowledge(
														
 
															-            query_text=query,
														
 
															-            top_k=top_k,
														
 
															-            min_score=min_score
														
 
															-        )
														
 
															-
														
 
															-        if not relevant_items:
														
 
															-            return ToolResult(
														
 
															-                title="🔍 未找到相关知识",
														
 
															-                output=f"查询: {query}\n\n知识库中暂无相关的高质量知识。建议进行调研。",
														
 
															-                long_term_memory=f"知识检索: 未找到相关知识 - {query[:50]}"
														
 
															-            )
														
 
															-
														
 
															-        # 格式化输出
														
 
															-        output_lines = [f"查询: {query}\n", f"找到 {len(relevant_items)} 条相关知识:\n"]
														
 
															-
														
 
															-        for idx, item in enumerate(relevant_items, 1):
														
 
															-            output_lines.append(f"\n### {idx}. [{item['id']}] (⭐ {item['score']})")
														
 
															-            output_lines.append(f"**场景**: {item['scenario'][:150]}...")
														
 
															-            output_lines.append(f"**内容**: {item['content'][:200]}...")
														
 
															-
														
 
															-        return ToolResult(
														
 
															-            title="✅ 知识检索成功",
														
 
															-            output="\n".join(output_lines),
														
 
															-            long_term_memory=f"知识检索: 找到 {len(relevant_items)} 条相关知识 - {query[:50]}",
														
 
															-            metadata={
														
 
															-                "count": len(relevant_items),
														
 
															-                "knowledge_ids": [item["id"] for item in relevant_items],
														
 
															-                "items": relevant_items
														
 
															-            }
														
 
															-        )
														
 
															-
														
 
															-    except Exception as e:
														
 
															-        logger.error(f"知识检索失败: {e}")
														
 
															-        return ToolResult(
														
 
															-            title="❌ 检索失败",
														
 
															-            output=f"错误: {str(e)}",
														
 
															-            error=str(e)
														
 
															-        )
														
 
															-
														
 
															-
														
 
															-@tool(description="通过两阶段检索获取最相关的历史经验（strategy 标签的知识）")
														
 
															-async def get_experience(
														
 
															-    query: str,
														
 
															-    k: int = 3,
														
 
															-    context: Optional[ToolContext] = None,
														
 
															-) -> ToolResult:
														
 
															-    """
														
 
															-    检索历史经验（兼容旧接口，实际调用 search_knowledge 并过滤 strategy 标签）
														
 
															-
														
 
															-    Args:
														
 
															-        query: 搜索查询（任务描述）
														
 
															-        k: 返回数量（默认 3）
														
 
															-        context: 工具上下文
														
 
															-
														
 
															-    Returns:
														
 
															-        相关经验列表
														
 
															-    """
														
 
															-    try:
														
 
															-        relevant_items = await _get_structured_knowledge(
														
 
															-            query_text=query,
														
 
															-            top_k=k,
														
 
															-            min_score=1,  # 经验的评分门槛较低
														
 
															-            context=context,
														
 
															-            tags_filter=["strategy"]  # 只返回经验
														
 
															-        )
														
 
															-
														
 
															-        if not relevant_items:
														
 
															-            return ToolResult(
														
 
															-                title="🔍 未找到相关经验",
														
 
															-                output=f"查询: {query}\n\n经验库中暂无相关的经验。",
														
 
															-                long_term_memory=f"经验检索: 未找到相关经验 - {query[:50]}",
														
 
															-                metadata={"items": [], "count": 0}
														
 
															-            )
														
 
															-
														
 
															-        # 格式化输出（兼容旧格式）
														
 
															-        output_lines = [f"查询: {query}\n", f"找到 {len(relevant_items)} 条相关经验:\n"]
														
 
															-
														
 
															-        for idx, item in enumerate(relevant_items, 1):
														
 
															-            output_lines.append(f"\n### {idx}. [{item['id']}]")
														
 
															-            output_lines.append(f"{item['content'][:300]}...")
														
 
															-
														
 
															-        return ToolResult(
														
 
															-            title="✅ 经验检索成功",
														
 
															-            output="\n".join(output_lines),
														
 
															-            long_term_memory=f"经验检索: 找到 {len(relevant_items)} 条相关经验 - {query[:50]}",
														
 
															-            metadata={
														
 
															-                "items": relevant_items,
														
 
															-                "count": len(relevant_items)
														
 
															-            }
														
 
															-        )
														
 
															-
														
 
															-    except Exception as e:
														
 
															-        logger.error(f"经验检索失败: {e}")
														
 
															-        return ToolResult(
														
 
															-            title="❌ 检索失败",
														
 
															-            output=f"错误: {str(e)}",
														
 
															-            error=str(e)
														
 
															-        )
														
 
															-
														
 
															-
														
 
															-# ===== 批量更新功能（类似经验机制）=====
														
 
															-
														
 
															-async def _batch_update_knowledge(
														
 
															-    update_map: Dict[str, Dict[str, Any]],
														
 
															-    context: Optional[Any] = None
														
 
															-) -> int:
														
 
															-    """
														
 
															-    内部函数：批量更新知识（兼容 experience 接口）
														
 
															-
														
 
															-    Args:
														
 
															-        update_map: 更新映射 {knowledge_id: {"action": "helpful/harmful/evolve", "feedback": "..."}}
														
 
															-        context: 上下文（兼容 experience 接口）
														
 
															-
														
 
															-    Returns:
														
 
															-        成功更新的数量
														
 
															-    """
														
 
															-    if not update_map:
														
 
															-        return 0
														
 
															-
														
 
															-    knowledge_dir = Path(".cache/knowledge_atoms")
														
 
															-    if not knowledge_dir.exists():
														
 
															-        return 0
														
 
															-
														
 
															-    success_count = 0
														
 
															-    evolution_tasks = []
														
 
															-    evolution_registry = {}  # task_idx -> (file_path, data)
														
 
															-
														
 
															-    for knowledge_id, instr in update_map.items():
														
 
															-        try:
														
 
															-            # 查找文件
														
 
															-            json_path = knowledge_dir / f"{knowledge_id}.json"
														
 
															-            md_path = knowledge_dir / f"{knowledge_id}.md"
														
 
															-
														
 
															-            file_path = None
														
 
															-            is_json = False
														
 
															-            if json_path.exists():
														
 
															-                file_path = json_path
														
 
															-                is_json = True
														
 
															-            elif md_path.exists():
														
 
															-                file_path = md_path
														
 
															-                is_json = False
														
 
															-            else:
														
 
															-                continue
														
 
															-
														
 
															-            # 读取并解析
														
 
															-            with open(file_path, "r", encoding="utf-8") as f:
														
 
															-                content = f.read()
														
 
															-
														
 
															-            if is_json:
														
 
															-                data = json.loads(content)
														
 
															-            else:
														
 
															-                yaml_match = re.search(r'^---\n(.*?)\n---', content, re.DOTALL)
														
 
															-                if not yaml_match:
														
 
															-                    continue
														
 
															-                data = yaml.safe_load(yaml_match.group(1))
														
 
															-
														
 
															-            # 更新 metrics
														
 
															-            action = instr.get("action")
														
 
															-            feedback = instr.get("feedback", "")
														
 
															-
														
 
															-            # 处理 mixed 中间态
														
 
															-            if action == "mixed":
														
 
															-                data["metrics"]["helpful"] = data.get("metrics", {}).get("helpful", 0) + 1
														
 
															-                action = "evolve"
														
 
															-
														
 
															-            if action == "helpful":
														
 
															-                data["metrics"]["helpful"] = data.get("metrics", {}).get("helpful", 0) + 1
														
 
															-            elif action == "harmful":
														
 
															-                data["metrics"]["harmful"] = data.get("metrics", {}).get("harmful", 0) + 1
														
 
															-            elif action == "evolve" and feedback:
														
 
															-                # 注册进化任务
														
 
															-                old_content = data.get("content", "")
														
 
															-                task = _evolve_knowledge_with_llm(old_content, feedback)
														
 
															-                evolution_tasks.append(task)
														
 
															-                evolution_registry[len(evolution_tasks) - 1] = (file_path, data, is_json)
														
 
															-                data["metrics"]["helpful"] = data.get("metrics", {}).get("helpful", 0) + 1
														
 
															-
														
 
															-            data["updated_at"] = datetime.now().strftime('%Y-%m-%d %H:%M:%S')
														
 
															-
														
 
															-            # 如果不需要进化，直接保存
														
 
															-            if action != "evolve" or not feedback:
														
 
															-                if is_json:
														
 
															-                    with open(file_path, "w", encoding="utf-8") as f:
														
 
															-                        json.dump(data, f, ensure_ascii=False, indent=2)
														
 
															-                else:
														
 
															-                    meta_str = yaml.dump(data, allow_unicode=True).strip()
														
 
															-                    with open(file_path, "w", encoding="utf-8") as f:
														
 
															-                        f.write(f"---\n{meta_str}\n---\n")
														
 
															-                success_count += 1
														
 
															-
														
 
															-        except Exception as e:
														
 
															-            logger.error(f"更新知识失败 {knowledge_id}: {e}")
														
 
															-            continue
														
 
															-
														
 
															-    # 并发进化
														
 
															-    if evolution_tasks:
														
 
															-        import asyncio
														
 
															-        print(f"🧬 并发处理 {len(evolution_tasks)} 条知识进化...")
														
 
															-        evolved_results = await asyncio.gather(*evolution_tasks)
														
 
															-
														
 
															-        # 回填进化结果
														
 
															-        for task_idx, (file_path, data, is_json) in evolution_registry.items():
														
 
															-            data["content"] = evolved_results[task_idx].strip()
														
 
															-
														
 
															-            if is_json:
														
 
															-                with open(file_path, "w", encoding="utf-8") as f:
														
 
															-                    json.dump(data, f, ensure_ascii=False, indent=2)
														
 
															-            else:
														
 
															-                meta_str = yaml.dump(data, allow_unicode=True).strip()
														
 
															-                with open(file_path, "w", encoding="utf-8") as f:
														
 
															-                    f.write(f"---\n{meta_str}\n---\n")
														
 
															-            success_count += 1
														
 
															-
														
 
															-    return success_count
														
 
															-
														
 
															-
														
 
															-@tool()
														
 
															-async def batch_update_knowledge(
														
 
															-    feedback_list: List[Dict[str, Any]],
														
 
															-    context: Optional[ToolContext] = None,
														
 
															-) -> ToolResult:
														
 
															-    """
														
 
															-    批量反馈知识的有效性（类似经验机制）
														
 
															-
														
 
															-    Args:
														
 
															-        feedback_list: 评价列表，每个元素包含:
														
 
															-            - knowledge_id: (str) 知识 ID
														
 
															-            - is_effective: (bool) 是否有效
														
 
															-            - feedback: (str, optional) 改进建议，若有效且有建议则触发知识进化
														
 
															-
														
 
															-    Returns:
														
 
															-        批量更新结果
														
 
															-    """
														
 
															-    try:
														
 
															-        if not feedback_list:
														
 
															-            return ToolResult(
														
 
															-                title="⚠️ 反馈列表为空",
														
 
															-                output="未提供任何反馈",
														
 
															-                long_term_memory="批量更新知识: 反馈列表为空"
														
 
															-            )
														
 
															-
														
 
															-        knowledge_dir = Path(".cache/knowledge_atoms")
														
 
															-        if not knowledge_dir.exists():
														
 
															-            return ToolResult(
														
 
															-                title="❌ 知识库不存在",
														
 
															-                output="知识库目录不存在",
														
 
															-                error="知识库不存在"
														
 
															-            )
														
 
															-
														
 
															-        success_count = 0
														
 
															-        failed_items = []
														
 
															-
														
 
															-        for item in feedback_list:
														
 
															-            knowledge_id = item.get("knowledge_id")
														
 
															-            is_effective = item.get("is_effective")
														
 
															-            feedback = item.get("feedback", "")
														
 
															-
														
 
															-            if not knowledge_id:
														
 
															-                failed_items.append({"id": "unknown", "reason": "缺少 knowledge_id"})
														
 
															-                continue
														
 
															-
														
 
															-            try:
														
 
															-                # 查找文件
														
 
															-                json_path = knowledge_dir / f"{knowledge_id}.json"
														
 
															-                md_path = knowledge_dir / f"{knowledge_id}.md"
														
 
															-
														
 
															-                file_path = None
														
 
															-                is_json = False
														
 
															-                if json_path.exists():
														
 
															-                    file_path = json_path
														
 
															-                    is_json = True
														
 
															-                elif md_path.exists():
														
 
															-                    file_path = md_path
														
 
															-                    is_json = False
														
 
															-                else:
														
 
															-                    failed_items.append({"id": knowledge_id, "reason": "文件不存在"})
														
 
															-                    continue
														
 
															-
														
 
															-                # 读取并解析
														
 
															-                with open(file_path, "r", encoding="utf-8") as f:
														
 
															-                    content = f.read()
														
 
															-
														
 
															-                if is_json:
														
 
															-                    data = json.loads(content)
														
 
															-                else:
														
 
															-                    yaml_match = re.search(r'^---\n(.*?)\n---', content, re.DOTALL)
														
 
															-                    if not yaml_match:
														
 
															-                        failed_items.append({"id": knowledge_id, "reason": "格式错误"})
														
 
															-                        continue
														
 
															-                    data = yaml.safe_load(yaml_match.group(1))
														
 
															-
														
 
															-                # 更新 metrics
														
 
															-                if is_effective:
														
 
															-                    data["metrics"]["helpful"] = data.get("metrics", {}).get("helpful", 0) + 1
														
 
															-                    # 如果有反馈建议，触发进化
														
 
															-                    if feedback:
														
 
															-                        old_content = data.get("content", "")
														
 
															-                        evolved_content = await _evolve_knowledge_with_llm(old_content, feedback)
														
 
															-                        data["content"] = evolved_content
														
 
															-                else:
														
 
															-                    data["metrics"]["harmful"] = data.get("metrics", {}).get("harmful", 0) + 1
														
 
															-
														
 
															-                data["updated_at"] = datetime.now().strftime('%Y-%m-%d %H:%M:%S')
														
 
															-
														
 
															-                # 保存
														
 
															-                if is_json:
														
 
															-                    with open(file_path, "w", encoding="utf-8") as f:
														
 
															-                        json.dump(data, f, ensure_ascii=False, indent=2)
														
 
															-                else:
														
 
															-                    meta_str = yaml.dump(data, allow_unicode=True).strip()
														
 
															-                    with open(file_path, "w", encoding="utf-8") as f:
														
 
															-                        f.write(f"---\n{meta_str}\n---\n")
														
 
															-
														
 
															-                success_count += 1
														
 
															-
														
 
															-            except Exception as e:
														
 
															-                failed_items.append({"id": knowledge_id, "reason": str(e)})
														
 
															-                continue
														
 
															-
														
 
															-        output_lines = [f"成功更新 {success_count} 条知识"]
														
 
															-        if failed_items:
														
 
															-            output_lines.append(f"\n失败 {len(failed_items)} 条:")
														
 
															-            for item in failed_items:
														
 
															-                output_lines.append(f"  - {item['id']}: {item['reason']}")
														
 
															-
														
 
															-        return ToolResult(
														
 
															-            title="✅ 批量更新完成",
														
 
															-            output="\n".join(output_lines),
														
 
															-            long_term_memory=f"批量更新知识: 成功 {success_count} 条，失败 {len(failed_items)} 条"
														
 
															-        )
														
 
															-
														
 
															-    except Exception as e:
														
 
															-        logger.error(f"批量更新知识失败: {e}")
														
 
															-        return ToolResult(
														
 
															-            title="❌ 批量更新失败",
														
 
															-            output=f"错误: {str(e)}",
														
 
															-            error=str(e)
														
 
															-        )
														
 
															-
														
 
															-
														
 
															-# ===== 知识库瘦身功能（类似经验机制）=====
														
 
															-
														
 
															-@tool()
														
 
															-async def slim_knowledge(
														
 
															-    model: str = "anthropic/claude-sonnet-4.5",
														
 
															-    context: Optional[ToolContext] = None,
														
 
															-) -> ToolResult:
														
 
															-    """
														
 
															-    知识库瘦身：调用顶级大模型，将知识库中语义相似的知识合并精简
														
 
															-
														
 
															-    Args:
														
 
															-        model: 使用的模型（默认 claude-sonnet-4.5）
														
 
															-        context: 工具上下文
														
 
															-
														
 
															-    Returns:
														
 
															-        瘦身结果报告
														
 
															-    """
														
 
															-    try:
														
 
															-        knowledge_dir = Path(".cache/knowledge_atoms")
														
 
															-
														
 
															-        if not knowledge_dir.exists():
														
 
															-            return ToolResult(
														
 
															-                title="📂 知识库不存在",
														
 
															-                output="知识库目录不存在，无需瘦身",
														
 
															-                long_term_memory="知识库瘦身: 目录不存在"
														
 
															-            )
														
 
															-
														
 
															-        # 获取所有文件
														
 
															-        json_files = list(knowledge_dir.glob("*.json"))
														
 
															-        md_files = list(knowledge_dir.glob("*.md"))
														
 
															-        files = json_files + md_files
														
 
															-
														
 
															-        if len(files) < 2:
														
 
															-            return ToolResult(
														
 
															-                title="📂 知识库过小",
														
 
															-                output=f"知识库仅有 {len(files)} 条，无需瘦身",
														
 
															-                long_term_memory=f"知识库瘦身: 仅有 {len(files)} 条"
														
 
															-            )
														
 
															-
														
 
															-        # 解析所有知识
														
 
															-        parsed = []
														
 
															-        for file_path in files:
														
 
															-            try:
														
 
															-                with open(file_path, "r", encoding="utf-8") as f:
														
 
															-                    content = f.read()
														
 
															-
														
 
															-                if file_path.suffix == ".json":
														
 
															-                    data = json.loads(content)
														
 
															-                else:
														
 
															-                    yaml_match = re.search(r'^---\n(.*?)\n---', content, re.DOTALL)
														
 
															-                    if not yaml_match:
														
 
															-                        continue
														
 
															-                    data = yaml.safe_load(yaml_match.group(1))
														
 
															-
														
 
															-                parsed.append({
														
 
															-                    "file_path": file_path,
														
 
															-                    "data": data,
														
 
															-                    "is_json": file_path.suffix == ".json"
														
 
															-                })
														
 
															-            except Exception as e:
														
 
															-                logger.error(f"解析文件失败 {file_path}: {e}")
														
 
															-                continue
														
 
															-
														
 
															-        if len(parsed) < 2:
														
 
															-            return ToolResult(
														
 
															-                title="📂 有效知识过少",
														
 
															-                output=f"有效知识仅有 {len(parsed)} 条，无需瘦身",
														
 
															-                long_term_memory=f"知识库瘦身: 有效知识 {len(parsed)} 条"
														
 
															-            )
														
 
															-
														
 
															-        # 构造发给大模型的内容
														
 
															-        entries_text = ""
														
 
															-        for p in parsed:
														
 
															-            data = p["data"]
														
 
															-            entries_text += f"[ID: {data.get('id')}] [Tags: {data.get('tags', {})}] "
														
 
															-            entries_text += f"[Metrics: {data.get('metrics', {})}] [Score: {data.get('eval', {}).get('score', 3)}]\n"
														
 
															-            entries_text += f"Scenario: {data.get('scenario', 'N/A')}\n"
														
 
															-            entries_text += f"Content: {data.get('content', '')[:200]}...\n\n"
														
 
															-
														
 
															-        prompt = f"""你是一个 AI Agent 知识库管理员。以下是当前知识库的全部条目，请执行瘦身操作：
														
 
															-
														
 
															-【任务】:
														
 
															-1. 识别语义高度相似或重复的知识，将它们合并为一条更精炼、更通用的知识。
														
 
															-2. 合并时保留 helpful 最高的那条的 ID 和 metrics（metrics 中 helpful/harmful 取各条之和）。
														
 
															-3. 对于独立的、无重复的知识，保持原样不动。
														
 
															-4. 保持原有的知识结构和格式。
														
 
															-
														
 
															-【当前知识库】:
														
 
															-{entries_text}
														
 
															-
														
 
															-【输出格式要求】:
														
 
															-严格按以下格式输出每条知识，条目之间用 === 分隔：
														
 
															-ID: <保留的id>
														
 
															-TAGS: <yaml格式的tags>
														
 
															-METRICS: <yaml格式的metrics>
														
 
															-SCORE: <评分>
														
 
															-SCENARIO: <场景描述>
														
 
															-CONTENT: <合并后的知识内容>
														
 
															-===
														
 
															-
														
 
															-最后一行输出合并报告，格式：
														
 
															-REPORT: 原有 X 条，合并后 Y 条，精简了 Z 条。
														
 
															-
														
 
															-禁止输出任何开场白或解释。"""
														
 
															-
														
 
															-        print(f"\n[知识瘦身] 正在调用 {model} 分析 {len(parsed)} 条知识...")
														
 
															-        response = await openrouter_llm_call(
														
 
															-            messages=[{"role": "user", "content": prompt}],
														
 
															-            model=model
														
 
															-        )
														
 
															-        content = response.get("content", "").strip()
														
 
															-        if not content:
														
 
															-            return ToolResult(
														
 
															-                title="❌ 大模型返回为空",
														
 
															-                output="大模型返回为空，瘦身失败",
														
 
															-                error="大模型返回为空"
														
 
															-            )
														
 
															-
														
 
															-        # 解析大模型输出
														
 
															-        report_line = ""
														
 
															-        new_entries = []
														
 
															-        blocks = [b.strip() for b in content.split("===") if b.strip()]
														
 
															-
														
 
															-        for block in blocks:
														
 
															-            if block.startswith("REPORT:"):
														
 
															-                report_line = block
														
 
															-                continue
														
 
															-
														
 
															-            lines = block.split("\n")
														
 
															-            kid, tags, metrics, score, scenario, content_lines = None, {}, {}, 3, "", []
														
 
															-            current_field = None
														
 
															-
														
 
															-            for line in lines:
														
 
															-                if line.startswith("ID:"):
														
 
															-                    kid = line[3:].strip()
														
 
															-                    current_field = None
														
 
															-                elif line.startswith("TAGS:"):
														
 
															-                    try:
														
 
															-                        tags = yaml.safe_load(line[5:].strip()) or {}
														
 
															-                    except Exception:
														
 
															-                        tags = {}
														
 
															-                    current_field = None
														
 
															-                elif line.startswith("METRICS:"):
														
 
															-                    try:
														
 
															-                        metrics = yaml.safe_load(line[8:].strip()) or {}
														
 
															-                    except Exception:
														
 
															-                        metrics = {"helpful": 0, "harmful": 0}
														
 
															-                    current_field = None
														
 
															-                elif line.startswith("SCORE:"):
														
 
															-                    try:
														
 
															-                        score = int(line[6:].strip())
														
 
															-                    except Exception:
														
 
															-                        score = 3
														
 
															-                    current_field = None
														
 
															-                elif line.startswith("SCENARIO:"):
														
 
															-                    scenario = line[9:].strip()
														
 
															-                    current_field = "scenario"
														
 
															-                elif line.startswith("CONTENT:"):
														
 
															-                    content_lines.append(line[8:].strip())
														
 
															-                    current_field = "content"
														
 
															-                elif current_field == "scenario":
														
 
															-                    scenario += "\n" + line
														
 
															-                elif current_field == "content":
														
 
															-                    content_lines.append(line)
														
 
															-
														
 
															-            if kid and content_lines:
														
 
															-                new_data = {
														
 
															-                    "id": kid,
														
 
															-                    "tags": tags,
														
 
															-                    "scenario": scenario,
														
 
															-                    "content": "\n".join(content_lines).strip(),
														
 
															-                    "metrics": metrics,
														
 
															-                    "eval": {
														
 
															-                        "score": score,
														
 
															-                        "helpful": 0,
														
 
															-                        "harmful": 0,
														
 
															-                        "helpful_history": [],
														
 
															-                        "harmful_history": []
														
 
															-                    },
														
 
															-                    "updated_at": datetime.now().strftime('%Y-%m-%d %H:%M:%S'),
														
 
															-                }
														
 
															-                new_entries.append(new_data)
														
 
															-
														
 
															-        if not new_entries:
														
 
															-            return ToolResult(
														
 
															-                title="❌ 解析失败",
														
 
															-                output="解析大模型输出失败，知识库未修改",
														
 
															-                error="解析失败"
														
 
															-            )
														
 
															-
														
 
															-        # 删除旧文件
														
 
															-        for p in parsed:
														
 
															-            try:
														
 
															-                p["file_path"].unlink()
														
 
															-            except Exception as e:
														
 
															-                logger.error(f"删除旧文件失败 {p['file_path']}: {e}")
														
 
															-
														
 
															-        # 写入新文件（统一使用 JSON 格式）
														
 
															-        for data in new_entries:
														
 
															-            file_path = knowledge_dir / f"{data['id']}.json"
														
 
															-            with open(file_path, "w", encoding="utf-8") as f:
														
 
															-                json.dump(data, f, ensure_ascii=False, indent=2)
														
 
															-
														
 
															-        result = f"瘦身完成：{len(parsed)} → {len(new_entries)} 条知识"
														
 
															-        if report_line:
														
 
															-            result += f"\n{report_line}"
														
 
															-
														
 
															-        print(f"[知识瘦身] {result}")
														
 
															-        return ToolResult(
														
 
															-            title="✅ 知识库瘦身完成",
														
 
															-            output=result,
														
 
															-            long_term_memory=f"知识库瘦身: {len(parsed)} → {len(new_entries)} 条"
														
 
															-        )
														
 
															-
														
 
															-    except Exception as e:
														
 
															-        logger.error(f"知识库瘦身失败: {e}")
														
 
															-        return ToolResult(
														
 
															-            title="❌ 瘦身失败",
														
 
															-            output=f"错误: {str(e)}",
														
 
															-            error=str(e)
														
 
															-        )
														
 
															-
														
--- a/api_server.py
+++ b/api_server.py
@@ -73,7 +73,6 @@ from agent.llm import create_openrouter_llm_call
 
															 runner = AgentRunner(
														
 
															     trace_store=trace_store,
														
 
															     llm_call=create_openrouter_llm_call(model="anthropic/claude-sonnet-4.5"),
														
 
															-    experiences_path="./.cache/experiences.md",  # 经验文件路径
														
 
															 )
														
 
															 set_runner(runner)
														
--- a/examples/analyze_story/run.py
+++ b/examples/analyze_story/run.py
@@ -153,7 +153,7 @@ async def show_interactive_menu(
 
															             # 2. 结构化解析与保存 (ACE Curator 逻辑)
														
 
															             if reflection_text:
														
 
															-                experiences_path = runner.experiences_path or "./.cache/experiences.md"
														
 
															+                # experiences_path = runner.experiences_path  # 已废弃，使用知识系统 or "./.cache/experiences.md"
														
 
															                 os.makedirs(os.path.dirname(experiences_path), exist_ok=True)
														
 
															                 # 正则匹配：- [intent:..., state:...] 内容
														
--- a/examples/deep_research/run.py
+++ b/examples/deep_research/run.py
@@ -177,7 +177,7 @@ async def show_interactive_menu(
 
															             # 追加到 experiences 文件
														
 
															             if reflection_text:
														
 
															                 from datetime import datetime
														
 
															-                experiences_path = runner.experiences_path or "./.cache/experiences_find.md"
														
 
															+                # experiences_path = runner.experiences_path  # 已废弃，使用知识系统 or "./.cache/experiences_find.md"
														
 
															                 os.makedirs(os.path.dirname(experiences_path), exist_ok=True)
														
 
															                 header = f"\n\n---\n\n## {trace_id} ({datetime.now().strftime('%Y-%m-%d %H:%M')})\n\n"
														
 
															                 with open(experiences_path, "a", encoding="utf-8") as f:
														
@@ -319,7 +319,6 @@ async def main():
 
															         trace_store=store,
														
 
															         llm_call=create_openrouter_llm_call(model=f"anthropic/claude-{prompt.config.get('model', 'sonnet-4.5')}"),
														
 
															         skills_dir=skills_dir,
														
 
															-        experiences_path="./.cache/experiences_find.md",
														
 
															         debug=True
														
 
															     )
														
--- a/examples/how/run.py
+++ b/examples/how/run.py
@@ -236,7 +236,7 @@ async def perform_reflection(runner: AgentRunner, store: FileSystemTraceStore, t
 
															     # 追加到 experiences 文件
														
 
															     if reflection_text:
														
 
															-        experiences_path = runner.experiences_path or "./.cache/experiences_how.md"
														
 
															+        # experiences_path = runner.experiences_path  # 已废弃，使用知识系统 or "./.cache/experiences_how.md"
														
 
															         os.makedirs(os.path.dirname(experiences_path), exist_ok=True)
														
 
															         pattern = r"-\s*\[(?P<tags>.*?)\]\s*(?P<content>.*)"
														
@@ -337,7 +337,6 @@ async def main():
 
															         trace_store=store,
														
 
															         llm_call=create_openrouter_llm_call(model=f"anthropic/claude-{prompt.config.get('model', 'sonnet-4.5')}"),
														
 
															         skills_dir=skills_dir,
														
 
															-        experiences_path="./.cache/experiences_how.md",
														
 
															         debug=True
														
 
															     )
														
--- a/examples/research/run.py
+++ b/examples/research/run.py
@@ -236,7 +236,7 @@ async def perform_reflection(runner: AgentRunner, store: FileSystemTraceStore, t
 
															     # 追加到 experiences 文件
														
 
															     if reflection_text:
														
 
															-        experiences_path = runner.experiences_path or "./.cache/experiences_how.md"
														
 
															+        # experiences_path = runner.experiences_path  # 已废弃，使用知识系统 or "./.cache/experiences_how.md"
														
 
															         os.makedirs(os.path.dirname(experiences_path), exist_ok=True)
														
 
															         pattern = r"-\s*\[(?P<tags>.*?)\]\s*(?P<content>.*)"
														
@@ -337,7 +337,6 @@ async def main():
 
															         trace_store=store,
														
 
															         llm_call=create_openrouter_llm_call(model=f"anthropic/claude-{prompt.config.get('model', 'sonnet-4.5')}"),
														
 
															         skills_dir=skills_dir,
														
 
															-        experiences_path="./.cache/experiences_how.md",
														
 
															         debug=True
														
 
															     )
														
--- a/examples/restore/run.py
+++ b/examples/restore/run.py
@@ -248,7 +248,7 @@ async def perform_reflection(runner: AgentRunner, store: FileSystemTraceStore, t
 
															     # 追加到 experiences 文件
														
 
															     if reflection_text:
														
 
															-        experiences_path = runner.experiences_path or "./.cache/experiences_restore.md"
														
 
															+        # experiences_path = runner.experiences_path  # 已废弃，使用知识系统 or "./.cache/experiences_restore.md"
														
 
															         os.makedirs(os.path.dirname(experiences_path), exist_ok=True)
														
 
															         pattern = r"-\s*\[(?P<tags>.*?)\]\s*(?P<content>.*)"
														
@@ -349,7 +349,6 @@ async def main():
 
															         trace_store=store,
														
 
															         llm_call=create_openrouter_llm_call(model=f"anthropic/claude-{prompt.config.get('model', 'sonnet-4.5')}"),
														
 
															         skills_dir=skills_dir,
														
 
															-        experiences_path="./.cache/experiences_restore.md",
														
 
															         debug=True
														
 
															     )
														
--- a/examples/restore_old/run.py
+++ b/examples/restore_old/run.py
@@ -248,7 +248,7 @@ async def perform_reflection(runner: AgentRunner, store: FileSystemTraceStore, t
 
															     # 追加到 experiences 文件
														
 
															     if reflection_text:
														
 
															-        experiences_path = runner.experiences_path or "./.cache/experiences_restore.md"
														
 
															+        # experiences_path = runner.experiences_path  # 已废弃，使用知识系统 or "./.cache/experiences_restore.md"
														
 
															         os.makedirs(os.path.dirname(experiences_path), exist_ok=True)
														
 
															         pattern = r"-\s*\[(?P<tags>.*?)\]\s*(?P<content>.*)"
														
@@ -349,7 +349,6 @@ async def main():
 
															         trace_store=store,
														
 
															         llm_call=create_openrouter_llm_call(model=f"anthropic/claude-{prompt.config.get('model', 'sonnet-4.5')}"),
														
 
															         skills_dir=skills_dir,
														
 
															-        experiences_path="./.cache/experiences_restore.md",
														
 
															         debug=True
														
 
															     )
														
--- a/examples/tool_research/run.py
+++ b/examples/tool_research/run.py
@@ -154,7 +154,7 @@ async def show_interactive_menu(
 
															             # 追加到 experiences 文件
														
 
															             if reflection_text:
														
 
															                 from datetime import datetime
														
 
															-                experiences_path = runner.experiences_path or "./.cache/experiences.md"
														
 
															+                # experiences_path = runner.experiences_path  # 已废弃，使用知识系统 or "./.cache/experiences.md"
														
 
															                 os.makedirs(os.path.dirname(experiences_path), exist_ok=True)
														
 
															                 header = f"\n\n---\n\n## {trace_id} ({datetime.now().strftime('%Y-%m-%d %H:%M')})\n\n"
														
 
															                 with open(experiences_path, "a", encoding="utf-8") as f:
														
--- a/knowhub/docs/knowledge-management.md
+++ b/knowhub/docs/knowledge-management.md
@@ -24,35 +24,40 @@ Agent                           KnowHub Server
 
															 ## 知识结构
														
 
															-单条知识的数据格式：
														
 
															+单条知识的数据格式（基于 `agent/docs/knowledge.md` 定义）：
														
 
															 ```json
														
 
															 {
														
 
															   "id": "knowledge-20260305-a1b2",
														
 
															   "message_id": "msg-xxx",
														
 
															+  "types": ["strategy", "tool"],
														
 
															+  "task": "在什么场景下要完成什么目标",
														
 
															   "tags": {
														
 
															-    "type": ["tool", "usecase", "definition", "plan", "strategy"]
														
 
															+    "category": "preference",
														
 
															+    "domain": "coding_style"
														
 
															   },
														
 
															-  "scenario": "在什么场景下要完成什么目标",
														
 
															+  "scopes": ["org:cybertogether"],
														
 
															+  "owner": "agent:research_agent",
														
 
															   "content": "核心知识内容",
														
 
															   "source": {
														
 
															+    "name": "资源名称",
														
 
															+    "category": "exp",
														
 
															     "urls": ["https://example.com"],
														
 
															     "agent_id": "research_agent",
														
 
															-    "timestamp": "2026-03-05T12:00:00"
														
 
															+    "submitted_by": "user@example.com",
														
 
															+    "timestamp": "2026-03-05T12:00:00Z",
														
 
															+    "message_id": "msg-xxx"
														
 
															   },
														
 
															   "eval": {
														
 
															-    "score": 3,
														
 
															-    "helpful": 0,
														
 
															+    "score": 4,
														
 
															+    "helpful": 5,
														
 
															     "harmful": 0,
														
 
															+    "confidence": 0.9,
														
 
															     "helpful_history": [],
														
 
															     "harmful_history": []
														
 
															   },
														
 
															-  "metrics": {
														
 
															-    "helpful": 1,
														
 
															-    "harmful": 0
														
 
															-  },
														
 
															-  "created_at": "2026-03-05 12:00:00",
														
 
															-  "updated_at": "2026-03-05 12:00:00"
														
 
															+  "created_at": "2026-03-05T12:00:00Z",
														
 
															+  "updated_at": "2026-03-05T12:00:00Z"
														
 
															 }
														
 
															 ```
														
@@ -60,20 +65,33 @@ Agent                           KnowHub Server
 
															 - **id**: 唯一标识，格式 `knowledge-{timestamp}-{random}`
														
 
															 - **message_id**: 来源 Message ID（用于精确溯源到具体消息）
														
 
															-- **tags.type**: 知识类型（可多选）
														
 
															+- **types**: 知识类型数组（可多选）
														
 
															+  - `user_profile`: 用户偏好、习惯、背景
														
 
															+  - `strategy`: 执行经验（从反思中获得）
														
 
															   - `tool`: 工具使用方法、优缺点、代码示例
														
 
															   - `usecase`: 用户背景、方案、步骤、效果
														
 
															   - `definition`: 概念定义、技术原理、应用场景
														
 
															   - `plan`: 流程步骤、决策点、方法论
														
 
															-  - `strategy`: 执行经验（从反思中获得）
														
 
															-- **scenario**: 任务描述，什么场景、在做什么
														
 
															+- **task**: 任务描述，什么场景、在做什么
														
 
															+- **tags**: 业务标签（JSON 对象），如 `{"category": "preference", "domain": "coding_style"}`
														
 
															+- **scopes**: 可见范围数组，如 `["org:cybertogether"]`
														
 
															+- **owner**: 所有者，格式 `agent:{agent_id}`
														
 
															 - **content**: 核心知识内容
														
 
															-- **source.urls**: 参考来源链接
														
 
															-- **source.agent_id**: 创建者 agent ID
														
 
															-- **source.timestamp**: 创建时间戳
														
 
															-- **eval.score**: 初始评分 1-5
														
 
															-- **eval.helpful/harmful**: 好用/不好用次数
														
 
															-- **metrics.helpful/harmful**: 累计反馈次数
														
 
															+- **source**: 来源信息（嵌套对象）
														
 
															+  - **name**: 资源名称
														
 
															+  - **category**: 来源类别（paper/exp/skill/book）
														
 
															+  - **urls**: 参考来源链接数组
														
 
															+  - **agent_id**: 创建者 agent ID
														
 
															+  - **submitted_by**: 提交者
														
 
															+  - **timestamp**: 创建时间戳
														
 
															+  - **message_id**: 可选，用于溯源
														
 
															+- **eval**: 评估信息（嵌套对象）
														
 
															+  - **score**: 评分 1-5
														
 
															+  - **helpful**: 好用次数
														
 
															+  - **harmful**: 不好用次数
														
 
															+  - **confidence**: 置信度 0-1
														
 
															+  - **helpful_history**: 好用案例历史
														
 
															+  - **harmful_history**: 不好用案例历史
														
 
															 ---
														
@@ -93,11 +111,11 @@ async def knowledge_search(
 
															     query: str,
														
 
															     top_k: int = 5,
														
 
															     min_score: int = 3,
														
 
															-    tags_type: Optional[List[str]] = None
														
 
															+    types: Optional[List[str]] = None
														
 
															 ) -> ToolResult
														
 
															 ```
														
 
															-调用 `GET /api/knowledge/search?q={query}&top_k={top_k}&min_score={min_score}`
														
 
															+调用 `GET /api/knowledge/search?q={query}&top_k={top_k}&min_score={min_score}&types={types}`
														
 
															 ### `knowledge_save`
														
@@ -106,11 +124,17 @@ async def knowledge_search(
 
															 ```python
														
 
															 @tool()
														
 
															 async def knowledge_save(
														
 
															-    scenario: str,
														
 
															+    task: str,
														
 
															     content: str,
														
 
															-    tags_type: List[str],
														
 
															+    types: List[str],
														
 
															+    tags: Optional[Dict[str, str]] = None,
														
 
															+    scopes: Optional[List[str]] = None,
														
 
															+    owner: Optional[str] = None,
														
 
															+    source_name: str = "",
														
 
															+    source_category: str = "exp",
														
 
															     urls: List[str] = None,
														
 
															     agent_id: str = "research_agent",
														
 
															+    submitted_by: str = "",
														
 
															     score: int = 3,
														
 
															     message_id: str = ""
														
 
															 ) -> ToolResult
														
@@ -118,6 +142,10 @@ async def knowledge_save(
 
															 调用 `POST /api/knowledge`
														
 
															+**默认值**（在 agent 代码中设置）：
														
 
															+- `scopes`: `["org:cybertogether"]`
														
 
															+- `owner`: `f"agent:{agent_id}"`
														
 
															+
														
 
															 ### `knowledge_update`
														
 
															 更新已有知识的评估反馈。
														
@@ -158,11 +186,12 @@ async def knowledge_batch_update(
 
															 @tool()
														
 
															 async def knowledge_list(
														
 
															     limit: int = 10,
														
 
															-    tags_type: Optional[List[str]] = None
														
 
															+    types: Optional[List[str]] = None,
														
 
															+    scopes: Optional[List[str]] = None
														
 
															 ) -> ToolResult
														
 
															 ```
														
 
															-调用 `GET /api/knowledge?limit={limit}`
														
 
															+调用 `GET /api/knowledge?limit={limit}&types={types}&scopes={scopes}`
														
 
															 ### `knowledge_slim`
														
@@ -171,7 +200,7 @@ async def knowledge_list(
 
															 ```python
														
 
															 @tool()
														
 
															 async def knowledge_slim(
														
 
															-    model: str = "anthropic/claude-sonnet-4.5"
														
 
															+    model: str = "google/gemini-2.0-flash-001"
														
 
															 ) -> ToolResult
														
 
															 ```
														
@@ -271,12 +300,12 @@ return ToolResult(
 
															 - `q`: 查询文本
														
 
															 - `top_k`: 返回数量（默认 5）
														
 
															 - `min_score`: 最低评分过滤（默认 3）
														
 
															-- `tags_type`: 按类型过滤（可选）
														
 
															+- `types`: 按类型过滤（可选，逗号分隔）
														
 
															 **检索流程**（两阶段，Server 端实现）：
														
 
															 1. **语义路由**：使用 LLM（gemini-2.0-flash-001）从所有知识中挑选 2*k 个语义相关的候选
														
 
															-   - 输入：query + 知识元数据（id, tags, scenario 前 100 字符）
														
 
															+   - 输入：query + 知识元数据（id, types, task 前 100 字符）
														
 
															    - 输出：候选知识 ID 列表
														
 
															 2. **质量精排**：根据评分和反馈计算质量分，筛选最终的 k 个
														
@@ -292,12 +321,17 @@ return ToolResult(
 
															   "results": [
														
 
															     {
														
 
															       "id": "knowledge-xxx",
														
 
															-      "scenario": "...",
														
 
															+      "task": "...",
														
 
															       "content": "...",
														
 
															-      "tags": {...},
														
 
															-      "score": 4,
														
 
															-      "quality_score": 5.0,
														
 
															-      "metrics": {"helpful": 2, "harmful": 0}
														
 
															+      "types": ["strategy", "tool"],
														
 
															+      "tags": {"category": "preference"},
														
 
															+      "eval": {
														
 
															+        "score": 4,
														
 
															+        "helpful": 2,
														
 
															+        "harmful": 0,
														
 
															+        "confidence": 0.9
														
 
															+      },
														
 
															+      "quality_score": 5.0
														
 
															     }
														
 
															   ],
														
 
															   "count": 3
														
@@ -312,13 +346,21 @@ return ToolResult(
 
															 ```json
														
 
															 {
														
 
															-  "scenario": "在什么场景下要完成什么目标",
														
 
															+  "task": "在什么场景下要完成什么目标",
														
 
															   "content": "核心知识内容",
														
 
															-  "tags_type": ["tool", "strategy"],
														
 
															-  "urls": ["https://example.com"],
														
 
															-  "agent_id": "research_agent",
														
 
															-  "score": 4,
														
 
															-  "message_id": "msg-xxx"
														
 
															+  "types": ["tool", "strategy"],
														
 
															+  "tags": {"category": "preference", "domain": "coding_style"},
														
 
															+  "scopes": ["org:cybertogether"],
														
 
															+  "owner": "agent:research_agent",
														
 
															+  "source": {
														
 
															+    "name": "资源名称",
														
 
															+    "category": "exp",
														
 
															+    "urls": ["https://example.com"],
														
 
															+    "agent_id": "research_agent",
														
 
															+    "submitted_by": "user@example.com",
														
 
															+    "message_id": "msg-xxx"
														
 
															+  },
														
 
															+  "score": 4
														
 
															 }
														
 
															 ```
														
@@ -332,8 +374,17 @@ return ToolResult(
 
															 ```json
														
 
															 {
														
 
															-  "add_helpful_case": {"case_id": "...", "scenario": "...", "result": "..."},
														
 
															-  "add_harmful_case": {"case_id": "...", "scenario": "...", "result": "..."},
														
 
															+  "add_helpful_case": {
														
 
															+    "task": "任务描述",
														
 
															+    "outcome": "成功",
														
 
															+    "timestamp": "2026-03-05T12:00:00Z"
														
 
															+  },
														
 
															+  "add_harmful_case": {
														
 
															+    "task": "任务描述",
														
 
															+    "outcome": "失败",
														
 
															+    "reason": "原因",
														
 
															+    "timestamp": "2026-03-05T12:00:00Z"
														
 
															+  },
														
 
															   "update_score": 4,
														
 
															   "evolve_feedback": "改进建议（触发知识进化）"
														
 
															 }
														
@@ -354,8 +405,12 @@ return ToolResult(
 
															   "feedback_list": [
														
 
															     {
														
 
															       "knowledge_id": "knowledge-xxx",
														
 
															-      "is_effective": true,
														
 
															-      "feedback": "改进建议（可选）"
														
 
															+      "is_helpful": true,
														
 
															+      "case": {
														
 
															+        "task": "任务描述",
														
 
															+        "outcome": "成功",
														
 
															+        "timestamp": "2026-03-05T12:00:00Z"
														
 
															+      }
														
 
															     }
														
 
															   ]
														
 
															 }
														
@@ -369,7 +424,8 @@ return ToolResult(
 
															 **参数**：
														
 
															 - `limit`: 返回数量（默认 10）
														
 
															-- `tags_type`: 按类型过滤（可选）
														
 
															+- `types`: 按类型过滤（可选，逗号分隔）
														
 
															+- `scopes`: 按可见范围过滤（可选，逗号分隔）
														
 
															 实现位置：`knowhub/server.py:list_knowledge`
														
@@ -381,7 +437,7 @@ return ToolResult(
 
															 ```json
														
 
															 {
														
 
															-  "model": "anthropic/claude-sonnet-4.5"
														
 
															+  "model": "google/gemini-2.0-flash-001"
														
 
															 }
														
 
															 ```
														
--- a/knowhub/server.py
+++ b/knowhub/server.py
@@ -77,25 +77,22 @@ def init_db():
 
															         CREATE TABLE IF NOT EXISTS knowledge (
														
 
															             id            TEXT PRIMARY KEY,
														
 
															             message_id    TEXT DEFAULT '',
														
 
															-            tags_type     TEXT NOT NULL,
														
 
															-            scenario      TEXT NOT NULL,
														
 
															+            types         TEXT NOT NULL,              -- JSON array: ["strategy", "tool"]
														
 
															+            task          TEXT NOT NULL,
														
 
															+            tags          TEXT DEFAULT '{}',          -- JSON object: {"category": "...", "domain": "..."}
														
 
															+            scopes        TEXT DEFAULT '["org:cybertogether"]',  -- JSON array
														
 
															+            owner         TEXT DEFAULT '',
														
 
															             content       TEXT NOT NULL,
														
 
															-            source_urls   TEXT DEFAULT '',
														
 
															-            source_agent_id TEXT DEFAULT '',
														
 
															-            source_timestamp TEXT NOT NULL,
														
 
															-            eval_score    INTEGER DEFAULT 3 CHECK(eval_score BETWEEN 1 AND 5),
														
 
															-            eval_helpful  INTEGER DEFAULT 0,
														
 
															-            eval_harmful  INTEGER DEFAULT 0,
														
 
															-            eval_helpful_history TEXT DEFAULT '[]',
														
 
															-            eval_harmful_history TEXT DEFAULT '[]',
														
 
															-            metrics_helpful INTEGER DEFAULT 1,
														
 
															-            metrics_harmful INTEGER DEFAULT 0,
														
 
															+            source        TEXT DEFAULT '{}',          -- JSON object: {name, category, urls, agent_id, submitted_by, timestamp}
														
 
															+            eval          TEXT DEFAULT '{}',          -- JSON object: {score, helpful, harmful, confidence, histories}
														
 
															             created_at    TEXT NOT NULL,
														
 
															             updated_at    TEXT DEFAULT ''
														
 
															         )
														
 
															     """)
														
 
															-    conn.execute("CREATE INDEX IF NOT EXISTS idx_knowledge_tags ON knowledge(tags_type)")
														
 
															-    conn.execute("CREATE INDEX IF NOT EXISTS idx_knowledge_scenario ON knowledge(scenario)")
														
 
															+    conn.execute("CREATE INDEX IF NOT EXISTS idx_knowledge_types ON knowledge(types)")
														
 
															+    conn.execute("CREATE INDEX IF NOT EXISTS idx_knowledge_task ON knowledge(task)")
														
 
															+    conn.execute("CREATE INDEX IF NOT EXISTS idx_knowledge_owner ON knowledge(owner)")
														
 
															+    conn.execute("CREATE INDEX IF NOT EXISTS idx_knowledge_scopes ON knowledge(scopes)")
														
 
															     conn.commit()
														
 
															     conn.close()
														
@@ -156,24 +153,28 @@ class ContentIn(BaseModel):
 
															 # Knowledge Models
														
 
															 class KnowledgeIn(BaseModel):
														
 
															-    scenario: str
														
 
															+    task: str
														
 
															     content: str
														
 
															-    tags_type: list[str]
														
 
															-    urls: list[str] = []
														
 
															-    agent_id: str = "research_agent"
														
 
															-    score: int = Field(default=3, ge=1, le=5)
														
 
															+    types: list[str] = ["strategy"]
														
 
															+    tags: dict = {}
														
 
															+    scopes: list[str] = ["org:cybertogether"]
														
 
															+    owner: str = ""
														
 
															     message_id: str = ""
														
 
															+    source: dict = {}  # {name, category, urls, agent_id, submitted_by, timestamp}
														
 
															+    eval: dict = {}    # {score, helpful, harmful, confidence}
														
 
															 class KnowledgeOut(BaseModel):
														
 
															     id: str
														
 
															     message_id: str
														
 
															+    types: list[str]
														
 
															+    task: str
														
 
															     tags: dict
														
 
															-    scenario: str
														
 
															+    scopes: list[str]
														
 
															+    owner: str
														
 
															     content: str
														
 
															     source: dict
														
 
															     eval: dict
														
 
															-    metrics: dict
														
 
															     created_at: str
														
 
															     updated_at: str
														
@@ -448,8 +449,8 @@ async def _route_knowledge_by_llm(query_text: str, metadata_list: list[dict], k:
 
															     routing_data = [
														
 
															         {
														
 
															             "id": m["id"],
														
 
															-            "tags": m["tags"],
														
 
															-            "scenario": m["scenario"][:100]
														
 
															+            "types": m["types"],
														
 
															+            "task": m["task"][:100]
														
 
															         } for m in metadata_list
														
 
															     ]
														
@@ -485,7 +486,7 @@ async def _search_knowledge_two_stage(
 
															     query_text: str,
														
 
															     top_k: int = 5,
														
 
															     min_score: int = 3,
														
 
															-    tags_filter: Optional[list[str]] = None,
														
 
															+    types_filter: Optional[list[str]] = None,
														
 
															     conn: sqlite3.Connection = None
														
 
															 ) -> list[dict]:
														
 
															     """
														
@@ -510,38 +511,40 @@ async def _search_knowledge_two_stage(
 
															         for row in rows:
														
 
															             kid = row["id"]
														
 
															-            tags_type = row["tags_type"].split(",") if row["tags_type"] else []
														
 
															+            types = json.loads(row["types"])
														
 
															             # 标签过滤
														
 
															-            if tags_filter:
														
 
															-                if not any(tag in tags_type for tag in tags_filter):
														
 
															+            if types_filter:
														
 
															+                if not any(t in types for t in types_filter):
														
 
															                     continue
														
 
															-            scenario = row["scenario"]
														
 
															+            task = row["task"]
														
 
															             content_text = row["content"]
														
 
															+            eval_data = json.loads(row["eval"])
														
 
															+            source = json.loads(row["source"])
														
 
															             meta_item = {
														
 
															                 "id": kid,
														
 
															-                "tags": {"type": tags_type},
														
 
															-                "scenario": scenario,
														
 
															-                "score": row["eval_score"],
														
 
															-                "helpful": row["metrics_helpful"],
														
 
															-                "harmful": row["metrics_harmful"],
														
 
															+                "types": types,
														
 
															+                "task": task,
														
 
															+                "score": eval_data.get("score", 3),
														
 
															+                "helpful": eval_data.get("helpful", 0),
														
 
															+                "harmful": eval_data.get("harmful", 0),
														
 
															             }
														
 
															             metadata_list.append(meta_item)
														
 
															             content_map[kid] = {
														
 
															-                "scenario": scenario,
														
 
															+                "task": task,
														
 
															                 "content": content_text,
														
 
															-                "tags": {"type": tags_type},
														
 
															+                "types": types,
														
 
															+                "tags": json.loads(row["tags"]),
														
 
															+                "scopes": json.loads(row["scopes"]),
														
 
															+                "owner": row["owner"],
														
 
															                 "score": meta_item["score"],
														
 
															                 "helpful": meta_item["helpful"],
														
 
															                 "harmful": meta_item["harmful"],
														
 
															                 "message_id": row["message_id"],
														
 
															-                "source": {
														
 
															-                    "urls": row["source_urls"].split(",") if row["source_urls"] else [],
														
 
															-                    "agent_id": row["source_agent_id"],
														
 
															-                    "timestamp": row["source_timestamp"]
														
 
															-                },
														
 
															+                "source": source,
														
 
															+                "eval": eval_data,
														
 
															                 "created_at": row["created_at"],
														
 
															                 "updated_at": row["updated_at"]
														
 
															             }
														
@@ -574,16 +577,15 @@ async def _search_knowledge_two_stage(
 
															                 scored_items.append({
														
 
															                     "id": kid,
														
 
															                     "message_id": item["message_id"],
														
 
															-                    "scenario": item["scenario"],
														
 
															-                    "content": item["content"],
														
 
															+                    "types": item["types"],
														
 
															+                    "task": item["task"],
														
 
															                     "tags": item["tags"],
														
 
															-                    "score": score,
														
 
															-                    "quality_score": quality_score,
														
 
															-                    "metrics": {
														
 
															-                        "helpful": helpful,
														
 
															-                        "harmful": harmful
														
 
															-                    },
														
 
															+                    "scopes": item["scopes"],
														
 
															+                    "owner": item["owner"],
														
 
															+                    "content": item["content"],
														
 
															                     "source": item["source"],
														
 
															+                    "eval": item["eval"],
														
 
															+                    "quality_score": quality_score,
														
 
															                     "created_at": item["created_at"],
														
 
															                     "updated_at": item["updated_at"]
														
 
															                 })
														
@@ -608,18 +610,18 @@ async def search_knowledge_api(
 
															     q: str = Query(..., description="查询文本"),
														
 
															     top_k: int = Query(default=5, ge=1, le=20),
														
 
															     min_score: int = Query(default=3, ge=1, le=5),
														
 
															-    tags_type: Optional[str] = None
														
 
															+    types: Optional[str] = None
														
 
															 ):
														
 
															     """检索知识（两阶段：语义路由 + 质量精排）"""
														
 
															     conn = get_db()
														
 
															     try:
														
 
															-        tags_filter = tags_type.split(",") if tags_type else None
														
 
															+        types_filter = types.split(",") if types else None
														
 
															         results = await _search_knowledge_two_stage(
														
 
															             query_text=q,
														
 
															             top_k=top_k,
														
 
															             min_score=min_score,
														
 
															-            tags_filter=tags_filter,
														
 
															+            types_filter=types_filter,
														
 
															             conn=conn
														
 
															         )
														
@@ -641,30 +643,46 @@ def save_knowledge(knowledge: KnowledgeIn):
 
															         now = datetime.now(timezone.utc).isoformat()
														
 
															+        # 设置默认值
														
 
															+        owner = knowledge.owner or f"agent:{knowledge.source.get('agent_id', 'unknown')}"
														
 
															+
														
 
															+        # 准备 source
														
 
															+        source = {
														
 
															+            "name": knowledge.source.get("name", ""),
														
 
															+            "category": knowledge.source.get("category", ""),
														
 
															+            "urls": knowledge.source.get("urls", []),
														
 
															+            "agent_id": knowledge.source.get("agent_id", "unknown"),
														
 
															+            "submitted_by": knowledge.source.get("submitted_by", ""),
														
 
															+            "timestamp": now,
														
 
															+            "message_id": knowledge.message_id
														
 
															+        }
														
 
															+
														
 
															+        # 准备 eval
														
 
															+        eval_data = {
														
 
															+            "score": knowledge.eval.get("score", 3),
														
 
															+            "helpful": knowledge.eval.get("helpful", 1),
														
 
															+            "harmful": knowledge.eval.get("harmful", 0),
														
 
															+            "confidence": knowledge.eval.get("confidence", 0.5),
														
 
															+            "helpful_history": [],
														
 
															+            "harmful_history": []
														
 
															+        }
														
 
															+
														
 
															         conn.execute(
														
 
															             """INSERT INTO knowledge
														
 
															-            (id, message_id, tags_type, scenario, content,
														
 
															-             source_urls, source_agent_id, source_timestamp,
														
 
															-             eval_score, eval_helpful, eval_harmful,
														
 
															-             eval_helpful_history, eval_harmful_history,
														
 
															-             metrics_helpful, metrics_harmful, created_at, updated_at)
														
 
															-            VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?)""",
														
 
															+            (id, message_id, types, task, tags, scopes, owner, content,
														
 
															+             source, eval, created_at, updated_at)
														
 
															+            VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?)""",
														
 
															             (
														
 
															                 knowledge_id,
														
 
															                 knowledge.message_id,
														
 
															-                ",".join(knowledge.tags_type),
														
 
															-                knowledge.scenario,
														
 
															+                json.dumps(knowledge.types),
														
 
															+                knowledge.task,
														
 
															+                json.dumps(knowledge.tags),
														
 
															+                json.dumps(knowledge.scopes),
														
 
															+                owner,
														
 
															                 knowledge.content,
														
 
															-                ",".join(knowledge.urls),
														
 
															-                knowledge.agent_id,
														
 
															-                now,
														
 
															-                knowledge.score,
														
 
															-                0,  # eval_helpful
														
 
															-                0,  # eval_harmful
														
 
															-                "[]",  # eval_helpful_history
														
 
															-                "[]",  # eval_harmful_history
														
 
															-                1,  # metrics_helpful
														
 
															-                0,  # metrics_harmful
														
 
															+                json.dumps(source),
														
 
															+                json.dumps(eval_data),
														
 
															                 now,
														
 
															                 now,
														
 
															             ),
														
@@ -678,17 +696,26 @@ def save_knowledge(knowledge: KnowledgeIn):
 
															 @app.get("/api/knowledge")
														
 
															 def list_knowledge(
														
 
															     limit: int = Query(default=10, ge=1, le=100),
														
 
															-    tags_type: Optional[str] = None
														
 
															+    types: Optional[str] = None,
														
 
															+    scopes: Optional[str] = None
														
 
															 ):
														
 
															     """列出知识"""
														
 
															     conn = get_db()
														
 
															     try:
														
 
															         query = "SELECT * FROM knowledge"
														
 
															         params = []
														
 
															+        conditions = []
														
 
															+
														
 
															+        if types:
														
 
															+            conditions.append("types LIKE ?")
														
 
															+            params.append(f"%{types}%")
														
 
															-        if tags_type:
														
 
															-            query += " WHERE tags_type LIKE ?"
														
 
															-            params.append(f"%{tags_type}%")
														
 
															+        if scopes:
														
 
															+            conditions.append("scopes LIKE ?")
														
 
															+            params.append(f"%{scopes}%")
														
 
															+
														
 
															+        if conditions:
														
 
															+            query += " WHERE " + " AND ".join(conditions)
														
 
															         query += " ORDER BY created_at DESC LIMIT ?"
														
 
															         params.append(limit)
														
@@ -700,23 +727,14 @@ def list_knowledge(
 
															             results.append({
														
 
															                 "id": row["id"],
														
 
															                 "message_id": row["message_id"],
														
 
															-                "tags": {"type": row["tags_type"].split(",") if row["tags_type"] else []},
														
 
															-                "scenario": row["scenario"],
														
 
															+                "types": json.loads(row["types"]),
														
 
															+                "task": row["task"],
														
 
															+                "tags": json.loads(row["tags"]),
														
 
															+                "scopes": json.loads(row["scopes"]),
														
 
															+                "owner": row["owner"],
														
 
															                 "content": row["content"],
														
 
															-                "source": {
														
 
															-                    "urls": row["source_urls"].split(",") if row["source_urls"] else [],
														
 
															-                    "agent_id": row["source_agent_id"],
														
 
															-                    "timestamp": row["source_timestamp"]
														
 
															-                },
														
 
															-                "eval": {
														
 
															-                    "score": row["eval_score"],
														
 
															-                    "helpful": row["eval_helpful"],
														
 
															-                    "harmful": row["eval_harmful"]
														
 
															-                },
														
 
															-                "metrics": {
														
 
															-                    "helpful": row["metrics_helpful"],
														
 
															-                    "harmful": row["metrics_harmful"]
														
 
															-                },
														
 
															+                "source": json.loads(row["source"]),
														
 
															+                "eval": json.loads(row["eval"]),
														
 
															                 "created_at": row["created_at"],
														
 
															                 "updated_at": row["updated_at"]
														
 
															             })
														
@@ -742,25 +760,14 @@ def get_knowledge(knowledge_id: str):
 
															         return {
														
 
															             "id": row["id"],
														
 
															             "message_id": row["message_id"],
														
 
															-            "tags": {"type": row["tags_type"].split(",") if row["tags_type"] else []},
														
 
															-            "scenario": row["scenario"],
														
 
															+            "types": json.loads(row["types"]),
														
 
															+            "task": row["task"],
														
 
															+            "tags": json.loads(row["tags"]),
														
 
															+            "scopes": json.loads(row["scopes"]),
														
 
															+            "owner": row["owner"],
														
 
															             "content": row["content"],
														
 
															-            "source": {
														
 
															-                "urls": row["source_urls"].split(",") if row["source_urls"] else [],
														
 
															-                "agent_id": row["source_agent_id"],
														
 
															-                "timestamp": row["source_timestamp"]
														
 
															-            },
														
 
															-            "eval": {
														
 
															-                "score": row["eval_score"],
														
 
															-                "helpful": row["eval_helpful"],
														
 
															-                "harmful": row["eval_harmful"],
														
 
															-                "helpful_history": [],
														
 
															-                "harmful_history": []
														
 
															-            },
														
 
															-            "metrics": {
														
 
															-                "helpful": row["metrics_helpful"],
														
 
															-                "harmful": row["metrics_harmful"]
														
 
															-            },
														
 
															+            "source": json.loads(row["source"]),
														
 
															+            "eval": json.loads(row["eval"]),
														
 
															             "created_at": row["created_at"],
														
 
															             "updated_at": row["updated_at"]
														
 
															         }
														
@@ -808,33 +815,37 @@ async def update_knowledge(knowledge_id: str, update: KnowledgeUpdateIn):
 
															             raise HTTPException(status_code=404, detail=f"Knowledge not found: {knowledge_id}")
														
 
															         now = datetime.now(timezone.utc).isoformat()
														
 
															-        updates = {"updated_at": now}
														
 
															+        eval_data = json.loads(row["eval"])
														
 
															+        # 更新评分
														
 
															         if update.update_score is not None:
														
 
															-            updates["eval_score"] = update.update_score
														
 
															+            eval_data["score"] = update.update_score
														
 
															+        # 添加有效案例
														
 
															         if update.add_helpful_case:
														
 
															-            helpful_history = json.loads(row["eval_helpful_history"] or "[]")
														
 
															-            helpful_history.append(update.add_helpful_case)
														
 
															-            updates["eval_helpful"] = row["eval_helpful"] + 1
														
 
															-            updates["eval_helpful_history"] = json.dumps(helpful_history, ensure_ascii=False)
														
 
															-            updates["metrics_helpful"] = row["metrics_helpful"] + 1
														
 
															+            eval_data["helpful"] = eval_data.get("helpful", 0) + 1
														
 
															+            if "helpful_history" not in eval_data:
														
 
															+                eval_data["helpful_history"] = []
														
 
															+            eval_data["helpful_history"].append(update.add_helpful_case)
														
 
															+        # 添加有害案例
														
 
															         if update.add_harmful_case:
														
 
															-            harmful_history = json.loads(row["eval_harmful_history"] or "[]")
														
 
															-            harmful_history.append(update.add_harmful_case)
														
 
															-            updates["eval_harmful"] = row["eval_harmful"] + 1
														
 
															-            updates["eval_harmful_history"] = json.dumps(harmful_history, ensure_ascii=False)
														
 
															-            updates["metrics_harmful"] = row["metrics_harmful"] + 1
														
 
															+            eval_data["harmful"] = eval_data.get("harmful", 0) + 1
														
 
															+            if "harmful_history" not in eval_data:
														
 
															+                eval_data["harmful_history"] = []
														
 
															+            eval_data["harmful_history"].append(update.add_harmful_case)
														
 
															+        # 知识进化
														
 
															+        content = row["content"]
														
 
															         if update.evolve_feedback:
														
 
															-            evolved_content = await _evolve_knowledge_with_llm(row["content"], update.evolve_feedback)
														
 
															-            updates["content"] = evolved_content
														
 
															-            updates["metrics_helpful"] = updates.get("metrics_helpful", row["metrics_helpful"]) + 1
														
 
															+            content = await _evolve_knowledge_with_llm(content, update.evolve_feedback)
														
 
															+            eval_data["helpful"] = eval_data.get("helpful", 0) + 1
														
 
															-        set_clause = ", ".join(f"{k} = ?" for k in updates)
														
 
															-        values = list(updates.values()) + [knowledge_id]
														
 
															-        conn.execute(f"UPDATE knowledge SET {set_clause} WHERE id = ?", values)
														
 
															+        # 更新数据库
														
 
															+        conn.execute(
														
 
															+            "UPDATE knowledge SET content = ?, eval = ?, updated_at = ? WHERE id = ?",
														
 
															+            (content, json.dumps(eval_data, ensure_ascii=False), now, knowledge_id)
														
 
															+        )
														
 
															         conn.commit()
														
 
															         return {"status": "ok", "knowledge_id": knowledge_id}
														
@@ -851,8 +862,8 @@ async def batch_update_knowledge(batch: KnowledgeBatchUpdateIn):
 
															     conn = get_db()
														
 
															     try:
														
 
															         # 先处理无需进化的，收集需要进化的
														
 
															-        evolution_tasks = []   # [(knowledge_id, old_content, feedback)]
														
 
															-        simple_updates = []    # [(knowledge_id, is_effective)]
														
 
															+        evolution_tasks = []   # [(knowledge_id, old_content, feedback, eval_data)]
														
 
															+        simple_updates = []    # [(knowledge_id, is_effective, eval_data)]
														
 
															         for item in batch.feedback_list:
														
 
															             knowledge_id = item.get("knowledge_id")
														
@@ -866,24 +877,25 @@ async def batch_update_knowledge(batch: KnowledgeBatchUpdateIn):
 
															             if not row:
														
 
															                 continue
														
 
															+            eval_data = json.loads(row["eval"])
														
 
															+
														
 
															             if is_effective and feedback:
														
 
															-                evolution_tasks.append((knowledge_id, row["content"], feedback, row["metrics_helpful"]))
														
 
															+                evolution_tasks.append((knowledge_id, row["content"], feedback, eval_data))
														
 
															             else:
														
 
															-                simple_updates.append((knowledge_id, is_effective, row["metrics_helpful"], row["metrics_harmful"]))
														
 
															+                simple_updates.append((knowledge_id, is_effective, eval_data))
														
 
															         # 执行简单更新
														
 
															         now = datetime.now(timezone.utc).isoformat()
														
 
															-        for knowledge_id, is_effective, cur_helpful, cur_harmful in simple_updates:
														
 
															+        for knowledge_id, is_effective, eval_data in simple_updates:
														
 
															             if is_effective:
														
 
															-                conn.execute(
														
 
															-                    "UPDATE knowledge SET metrics_helpful = ?, updated_at = ? WHERE id = ?",
														
 
															-                    (cur_helpful + 1, now, knowledge_id)
														
 
															-                )
														
 
															+                eval_data["helpful"] = eval_data.get("helpful", 0) + 1
														
 
															             else:
														
 
															-                conn.execute(
														
 
															-                    "UPDATE knowledge SET metrics_harmful = ?, updated_at = ? WHERE id = ?",
														
 
															-                    (cur_harmful + 1, now, knowledge_id)
														
 
															-                )
														
 
															+                eval_data["harmful"] = eval_data.get("harmful", 0) + 1
														
 
															+
														
 
															+            conn.execute(
														
 
															+                "UPDATE knowledge SET eval = ?, updated_at = ? WHERE id = ?",
														
 
															+                (json.dumps(eval_data, ensure_ascii=False), now, knowledge_id)
														
 
															+            )
														
 
															         # 并发执行知识进化
														
 
															         if evolution_tasks:
														
@@ -891,10 +903,11 @@ async def batch_update_knowledge(batch: KnowledgeBatchUpdateIn):
 
															             evolved_results = await asyncio.gather(
														
 
															                 *[_evolve_knowledge_with_llm(old, fb) for _, old, fb, _ in evolution_tasks]
														
 
															             )
														
 
															-            for (knowledge_id, _, _, cur_helpful), evolved_content in zip(evolution_tasks, evolved_results):
														
 
															+            for (knowledge_id, _, _, eval_data), evolved_content in zip(evolution_tasks, evolved_results):
														
 
															+                eval_data["helpful"] = eval_data.get("helpful", 0) + 1
														
 
															                 conn.execute(
														
 
															-                    "UPDATE knowledge SET content = ?, metrics_helpful = ?, updated_at = ? WHERE id = ?",
														
 
															-                    (evolved_content, cur_helpful + 1, now, knowledge_id)
														
 
															+                    "UPDATE knowledge SET content = ?, eval = ?, updated_at = ? WHERE id = ?",
														
 
															+                    (evolved_content, json.dumps(eval_data, ensure_ascii=False), now, knowledge_id)
														
 
															                 )
														
 
															         conn.commit()
														
@@ -904,27 +917,29 @@ async def batch_update_knowledge(batch: KnowledgeBatchUpdateIn):
 
															 @app.post("/api/knowledge/slim")
														
 
															-async def slim_knowledge(model: str = "anthropic/claude-sonnet-4-5"):
														
 
															+async def slim_knowledge(model: str = "google/gemini-2.0-flash-001"):
														
 
															     """知识库瘦身：合并语义相似知识"""
														
 
															     conn = get_db()
														
 
															     try:
														
 
															-        rows = conn.execute("SELECT * FROM knowledge ORDER BY metrics_helpful DESC").fetchall()
														
 
															+        rows = conn.execute("SELECT * FROM knowledge").fetchall()
														
 
															         if len(rows) < 2:
														
 
															             return {"status": "ok", "message": f"知识库仅有 {len(rows)} 条，无需瘦身"}
														
 
															         # 构造发给大模型的内容
														
 
															         entries_text = ""
														
 
															         for row in rows:
														
 
															-            entries_text += f"[ID: {row['id']}] [Tags: {row['tags_type']}] "
														
 
															-            entries_text += f"[Helpful: {row['metrics_helpful']}, Harmful: {row['metrics_harmful']}] [Score: {row['eval_score']}]\n"
														
 
															-            entries_text += f"Scenario: {row['scenario']}\n"
														
 
															+            eval_data = json.loads(row["eval"])
														
 
															+            types = json.loads(row["types"])
														
 
															+            entries_text += f"[ID: {row['id']}] [Types: {','.join(types)}] "
														
 
															+            entries_text += f"[Helpful: {eval_data.get('helpful', 0)}, Harmful: {eval_data.get('harmful', 0)}] [Score: {eval_data.get('score', 3)}]\n"
														
 
															+            entries_text += f"Task: {row['task']}\n"
														
 
															             entries_text += f"Content: {row['content'][:200]}...\n\n"
														
 
															         prompt = f"""你是一个 AI Agent 知识库管理员。以下是当前知识库的全部条目，请执行瘦身操作：
														
 
															 【任务】:
														
 
															 1. 识别语义高度相似或重复的知识，将它们合并为一条更精炼、更通用的知识。
														
 
															-2. 合并时保留 helpful 最高的那条的 ID（metrics_helpful 取各条之和）。
														
 
															+2. 合并时保留 helpful 最高的那条的 ID（helpful 取各条之和）。
														
 
															 3. 对于独立的、无重复的知识，保持原样不动。
														
 
															 【当前知识库】:
														
@@ -933,11 +948,11 @@ async def slim_knowledge(model: str = "anthropic/claude-sonnet-4-5"):
 
															 【输出格式要求】:
														
 
															 严格按以下格式输出每条知识，条目之间用 === 分隔：
														
 
															 ID: <保留的id>
														
 
															-TAGS: <逗号分隔的type列表>
														
 
															+TYPES: <逗号分隔的type列表>
														
 
															 HELPFUL: <合并后的helpful计数>
														
 
															 HARMFUL: <合并后的harmful计数>
														
 
															 SCORE: <评分>
														
 
															-SCENARIO: <场景描述>
														
 
															+TASK: <任务描述>
														
 
															 CONTENT: <合并后的知识内容>
														
 
															 ===
														
@@ -966,15 +981,16 @@ REPORT: 原有 X 条，合并后 Y 条，精简了 Z 条。
 
															                 continue
														
 
															             lines = block.split("\n")
														
 
															-            kid, tags, helpful, harmful, score, scenario, content_lines = None, "", 0, 0, 3, "", []
														
 
															+            kid, types, helpful, harmful, score, task, content_lines = None, [], 0, 0, 3, "", []
														
 
															             current_field = None
														
 
															             for line in lines:
														
 
															                 if line.startswith("ID:"):
														
 
															                     kid = line[3:].strip()
														
 
															                     current_field = None
														
 
															-                elif line.startswith("TAGS:"):
														
 
															-                    tags = line[5:].strip()
														
 
															+                elif line.startswith("TYPES:"):
														
 
															+                    types_str = line[6:].strip()
														
 
															+                    types = [t.strip() for t in types_str.split(",") if t.strip()]
														
 
															                     current_field = None
														
 
															                 elif line.startswith("HELPFUL:"):
														
 
															                     try:
														
@@ -994,25 +1010,25 @@ REPORT: 原有 X 条，合并后 Y 条，精简了 Z 条。
 
															                     except Exception:
														
 
															                         score = 3
														
 
															                     current_field = None
														
 
															-                elif line.startswith("SCENARIO:"):
														
 
															-                    scenario = line[9:].strip()
														
 
															-                    current_field = "scenario"
														
 
															+                elif line.startswith("TASK:"):
														
 
															+                    task = line[5:].strip()
														
 
															+                    current_field = "task"
														
 
															                 elif line.startswith("CONTENT:"):
														
 
															                     content_lines.append(line[8:].strip())
														
 
															                     current_field = "content"
														
 
															-                elif current_field == "scenario":
														
 
															-                    scenario += "\n" + line
														
 
															+                elif current_field == "task":
														
 
															+                    task += "\n" + line
														
 
															                 elif current_field == "content":
														
 
															                     content_lines.append(line)
														
 
															             if kid and content_lines:
														
 
															                 new_entries.append({
														
 
															                     "id": kid,
														
 
															-                    "tags": tags,
														
 
															+                    "types": types if types else ["strategy"],
														
 
															                     "helpful": helpful,
														
 
															                     "harmful": harmful,
														
 
															                     "score": score,
														
 
															-                    "scenario": scenario.strip(),
														
 
															+                    "task": task.strip(),
														
 
															                     "content": "\n".join(content_lines).strip()
														
 
															                 })
														
@@ -1023,18 +1039,40 @@ REPORT: 原有 X 条，合并后 Y 条，精简了 Z 条。
 
															         now = datetime.now(timezone.utc).isoformat()
														
 
															         conn.execute("DELETE FROM knowledge")
														
 
															         for e in new_entries:
														
 
															+            eval_data = {
														
 
															+                "score": e["score"],
														
 
															+                "helpful": e["helpful"],
														
 
															+                "harmful": e["harmful"],
														
 
															+                "confidence": 0.9,
														
 
															+                "helpful_history": [],
														
 
															+                "harmful_history": []
														
 
															+            }
														
 
															+            source = {
														
 
															+                "name": "slim",
														
 
															+                "category": "exp",
														
 
															+                "urls": [],
														
 
															+                "agent_id": "slim",
														
 
															+                "submitted_by": "system",
														
 
															+                "timestamp": now
														
 
															+            }
														
 
															             conn.execute(
														
 
															                 """INSERT INTO knowledge
														
 
															-                (id, message_id, tags_type, scenario, content,
														
 
															-                 source_urls, source_agent_id, source_timestamp,
														
 
															-                 eval_score, eval_helpful, eval_harmful,
														
 
															-                 eval_helpful_history, eval_harmful_history,
														
 
															-                 metrics_helpful, metrics_harmful, created_at, updated_at)
														
 
															-                VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?)""",
														
 
															-                (e["id"], "", e["tags"], e["scenario"], e["content"],
														
 
															-                 "", "slim", now,
														
 
															-                 e["score"], 0, 0, "[]", "[]",
														
 
															-                 e["helpful"], e["harmful"], now, now)
														
 
															+                (id, message_id, types, task, tags, scopes, owner, content, source, eval, created_at, updated_at)
														
 
															+                VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?)""",
														
 
															+                (
														
 
															+                    e["id"],
														
 
															+                    "",
														
 
															+                    json.dumps(e["types"]),
														
 
															+                    e["task"],
														
 
															+                    json.dumps({}),
														
 
															+                    json.dumps(["org:cybertogether"]),
														
 
															+                    "agent:slim",
														
 
															+                    e["content"],
														
 
															+                    json.dumps(source, ensure_ascii=False),
														
 
															+                    json.dumps(eval_data, ensure_ascii=False),
														
 
															+                    now,
														
 
															+                    now
														
 
															+                )
														
 
															             )
														
 
															         conn.commit()
														
--- a/knowhub/skill/SKILL.md
+++ b/knowhub/skill/SKILL.md
@@ -0,0 +1,288 @@
 
															+---
														
 
															+name: knowhub
														
 
															+description: 遇到超出现有能力的问题或复杂任务时，查询工具和资源的使用经验，回报自己的使用心得
														
 
															+category: tool-discovery
														
 
															+scope: agent:*
														
 
															+---
														
 
															+
														
 
															+# KnowHub 知识管理
														
 
															+
														
 
															+KnowHub 提供统一的知识库，用于 agent 之间共享和复用经验。
														
 
															+
														
 
															+## 何时使用
														
 
															+
														
 
															+**查询知识**：
														
 
															+- 需要外部工具/库但不确定选哪个
														
 
															+- 遇到复杂任务，想知道别人用了什么工具、怎么做的
														
 
															+- 多次尝试失败，可能有人踩过同样的坑
														
 
															+- 可能需要外部资源（论文、文档、博客等）
														
 
															+
														
 
															+**保存知识**（重要！）：
														
 
															+- ✅ **使用资源后立即提交** — 完成任务、有了结果就提交，不要等到上下文太长才想起来
														
 
															+- ✅ **获得用户反馈后补充** — 如果用户指出问题或给出建议，视情况更新或补充经验
														
 
															+- ✅ **搜索过程中的发现** — 在哪个平台找到的、用了什么关键词、哪些策略有效，都可以提交
														
 
															+
														
 
															+## 安装配置
														
 
															+
														
 
															+```bash
														
 
															+pip install httpx python-dotenv
														
 
															+export KNOWHUB_API=http://localhost:8000
														
 
															+```
														
 
															+
														
 
															+## 快速开始
														
 
															+
														
 
															+```bash
														
 
															+# 搜索知识
														
 
															+python -m knowhub.cli search "Python 异步编程"
														
 
															+
														
 
															+# 保存知识
														
 
															+python -m knowhub.cli save \
														
 
															+  --scenario "FastAPI 参数验证" \
														
 
															+  --content "使用 Query(...) 定义查询参数，支持验证" \
														
 
															+  --tags solution \
														
 
															+  --score 4
														
 
															+
														
 
															+# 列出知识
														
 
															+python -m knowhub.cli list --limit 10
														
 
															+
														
 
															+# 反馈
														
 
															+python -m knowhub.cli batch-update --id knowledge-xxx --effective
														
 
															+```
														
 
															+
														
 
															+## 工作流程
														
 
															+
														
 
															+### 1. 查询知识
														
 
															+
														
 
															+```bash
														
 
															+python -m knowhub.cli search "任务描述" [--top-k 5] [--min-score 3] [--tags solution]
														
 
															+```
														
 
															+
														
 
															+返回按质量分排序的知识：`quality_score = score + helpful - (harmful * 2.0)`
														
 
															+
														
 
															+**有可用结果**：直接参考使用
														
 
															+
														
 
															+**无可用结果**：进入步骤 2 自行寻找
														
 
															+
														
 
															+### 2. 自行寻找（仅在无经验时）
														
 
															+
														
 
															+根据已知信息选择入口：
														
 
															+
														
 
															+- **已知工具名**：直接去对应平台（PyPI / npmjs / GitHub）查文档和社区反馈
														
 
															+  - 搜索：`<工具名> review`、`<工具名> vs`、`site:reddit.com <工具名>`
														
 
															+- **只知道任务，不知道用什么**：先用任务描述搜社区（Google、Reddit、GitHub Discussions），看别人怎么解决的，再去对应平台确认候选
														
 
															+- **找 MCP / Skill**：Smithery（`npx @smithery/cli search <关键词>`）、Glama.ai、awesome-mcp-servers
														
 
															+- **找库/包**：PyPI、npmjs.com
														
 
															+- **找一站式集成**：Composio（850+ 工具）、LangChain Tools
														
 
															+
														
 
															+评估时关注：社区活跃度、工程质量、可持续性、License 兼容性
														
 
															+
														
 
															+简单逻辑自己写比引入依赖更快时，直接写。
														
 
															+
														
 
															+### 3. 保存知识
														
 
															+
														
 
															+⚠️ **重要：完成任务后立即提交，不要拖到上下文太长才想起来！**
														
 
															+
														
 
															+```bash
														
 
															+python -m knowhub.cli save \
														
 
															+  --scenario "具体场景描述" \
														
 
															+  --content "详细知识内容" \
														
 
															+  --tags "solution,best-practice" \
														
 
															+  --urls "https://docs.example.com" \
														
 
															+  --score 4 \
														
 
															+  --agent-id "my_agent" \
														
 
															+  --message-id "msg-001"
														
 
															+```
														
 
															+
														
 
															+**标签类型**：
														
 
															+- `solution` - 问题解决方案
														
 
															+- `best-practice` - 最佳实践
														
 
															+- `pitfall` - 常见陷阱/注意事项
														
 
															+- `comparison` - 工具/方案对比
														
 
															+- `strategy` - 策略/方法论
														
 
															+- `resource` - 资源推荐
														
 
															+
														
 
															+**评分标准**（1-5 分）：
														
 
															+- 5 分：非常有用，解决了关键问题
														
 
															+- 4 分：有用，提供了有价值的信息
														
 
															+- 3 分：一般，可能有参考价值
														
 
															+- 2 分：价值有限
														
 
															+- 1 分：几乎没用
														
 
															+
														
 
															+**两类经验的区分**：
														
 
															+
														
 
															+1. **对资源本身的使用经验** — 提交为该资源的知识
														
 
															+   - 例如：使用 pymupdf 提取 PDF 表格的经验 → scenario: "pymupdf: PDF 表格提取"
														
 
															+
														
 
															+2. **对搜索平台/策略的经验** — 提交为平台或搜索策略的知识
														
 
															+   - 对平台本身的评价 → scenario: "Smithery: 搜索 MCP server"
														
 
															+   - 关于找工具/找资源的策略、方法论 → scenario: "工具发现策略: 使用 Reddit 搜索"
														
 
															+
														
 
															+## 命令参考
														
 
															+
														
 
															+### 更新知识
														
 
															+```bash
														
 
															+python -m knowhub.cli update knowledge-xxx \
														
 
															+  --content "更新后的内容" \
														
 
															+  --score 5 \
														
 
															+  --tags "solution,verified"
														
 
															+```
														
 
															+
														
 
															+### 批量反馈
														
 
															+```bash
														
 
															+# 单条反馈
														
 
															+python -m knowhub.cli batch-update \
														
 
															+  --id knowledge-xxx \
														
 
															+  --effective \
														
 
															+  --feedback "这个方案确实有效"
														
 
															+
														
 
															+# 批量反馈（从文件）
														
 
															+python -m knowhub.cli batch-update --file feedback.json
														
 
															+```
														
 
															+
														
 
															+`feedback.json` 格式：
														
 
															+```json
														
 
															+[
														
 
															+  {
														
 
															+    "knowledge_id": "knowledge-xxx",
														
 
															+    "is_effective": true,
														
 
															+    "feedback": "很有用"
														
 
															+  }
														
 
															+]
														
 
															+```
														
 
															+
														
 
															+## HTTP API
														
 
															+
														
 
															+如需自定义集成，可直接调用 HTTP API：
														
 
															+
														
 
															+**搜索**：
														
 
															+```bash
														
 
															+curl "http://localhost:8000/api/knowledge/search?q=查询&top_k=5&min_score=3"
														
 
															+```
														
 
															+
														
 
															+**保存**：
														
 
															+```bash
														
 
															+curl -X POST http://localhost:8000/api/knowledge \
														
 
															+  -H "Content-Type: application/json" \
														
 
															+  -d '{
														
 
															+    "message_id": "msg-001",
														
 
															+    "tags_type": ["solution"],
														
 
															+    "scenario": "场景描述",
														
 
															+    "content": "知识内容",
														
 
															+    "urls": ["https://example.com"],
														
 
															+    "agent_id": "my_agent",
														
 
															+    "score": 4
														
 
															+  }'
														
 
															+```
														
 
															+
														
 
															+**更新**：
														
 
															+```bash
														
 
															+curl -X PUT http://localhost:8000/api/knowledge/knowledge-xxx \
														
 
															+  -H "Content-Type: application/json" \
														
 
															+  -d '{"content": "更新内容", "eval_score": 5}'
														
 
															+```
														
 
															+
														
 
															+**批量反馈**：
														
 
															+```bash
														
 
															+curl -X POST http://localhost:8000/api/knowledge/batch_update \
														
 
															+  -H "Content-Type: application/json" \
														
 
															+  -d '{
														
 
															+    "feedback_list": [{
														
 
															+      "knowledge_id": "knowledge-xxx",
														
 
															+      "is_effective": true,
														
 
															+      "feedback": "很有用"
														
 
															+    }]
														
 
															+  }'
														
 
															+```
														
 
															+
														
 
															+## 最佳实践
														
 
															+
														
 
															+### 何时保存
														
 
															+
														
 
															+✅ **应该保存**：
														
 
															+- 解决了一个具体问题
														
 
															+- 发现了工具的最佳用法
														
 
															+- 踩了坑并找到解决方案
														
 
															+- 对比了多个方案并得出结论
														
 
															+- 发现了文档中没有的技巧
														
 
															+- 找到了有用的资源或平台
														
 
															+- 总结了有效的搜索策略
														
 
															+
														
 
															+❌ **不应该保存**：
														
 
															+- 纯粹的文档摘抄（没有实践经验）
														
 
															+- 过于泛泛的描述（"这个工具很好用"）
														
 
															+- 未经验证的猜测
														
 
															+- 已经有相同知识的重复内容
														
 
															+
														
 
															+### 如何写好知识
														
 
															+
														
 
															+**scenario（场景）**：
														
 
															+- ✅ "FastAPI 参数验证：自定义错误消息"
														
 
															+- ✅ "Python 异步编程：并发执行 100+ HTTP 请求"
														
 
															+- ✅ "工具发现：使用 Reddit 搜索 Python 库"
														
 
															+- ❌ "FastAPI 使用"（太泛）
														
 
															+- ❌ "参数验证"（缺少上下文）
														
 
															+
														
 
															+**content（内容）**：
														
 
															+- ✅ 具体可操作："使用 Query(..., ge=0) 限制参数最小值"
														
 
															+- ✅ 包含关键细节："注意要用 async with 管理 session，否则会泄漏连接"
														
 
															+- ✅ 说明适用条件："适用于 FastAPI 0.100+，旧版本用法不同"
														
 
															+- ✅ 搜索策略："在 Reddit 搜索 'python pdf library' 比 Google 更容易找到实战经验"
														
 
															+- ❌ 泛泛而谈："这个方法很好用"
														
 
															+- ❌ 过于冗长：复制粘贴大段文档
														
 
															+
														
 
															+**tags（标签）**：
														
 
															+- 选择最相关的 1-3 个标签
														
 
															+- `solution` - 问题的解决方案
														
 
															+- `best-practice` - 推荐的做法
														
 
															+- `pitfall` - 需要避免的陷阱
														
 
															+- `comparison` - 方案对比
														
 
															+- `strategy` - 策略/方法论（包括搜索策略）
														
 
															+- `resource` - 资源推荐
														
 
															+
														
 
															+## 示例
														
 
															+
														
 
															+### 保存解决方案
														
 
															+```bash
														
 
															+python -m knowhub.cli save \
														
 
															+  --scenario "asyncio.gather() 错误处理" \
														
 
															+  --content "默认情况下，一个协程失败会导致 gather() 抛异常。使用 return_exceptions=True 继续执行其他协程。" \
														
 
															+  --tags pitfall \
														
 
															+  --score 5
														
 
															+```
														
 
															+
														
 
															+### 保存对比
														
 
															+```bash
														
 
															+python -m knowhub.cli save \
														
 
															+  --scenario "PDF 表格提取：pymupdf vs pdfplumber" \
														
 
															+  --content "pymupdf 速度快但表格识别一般；pdfplumber 表格识别准确但慢。复杂表格用 pdfplumber。" \
														
 
															+  --tags comparison \
														
 
															+  --score 4
														
 
															+```
														
 
															+
														
 
															+### 保存搜索策略
														
 
															+```bash
														
 
															+python -m knowhub.cli save \
														
 
															+  --scenario "工具发现策略：使用 Reddit 搜索实战经验" \
														
 
															+  --content "搜索 'site:reddit.com python <任务描述>' 比直接 Google 更容易找到真实使用体验和踩坑经验。关注 r/Python、r/learnpython 等子版块。" \
														
 
															+  --tags strategy \
														
 
															+  --score 4
														
 
															+```
														
 
															+
														
 
															+### 保存平台评价
														
 
															+```bash
														
 
															+python -m knowhub.cli save \
														
 
															+  --scenario "Smithery：搜索 MCP server" \
														
 
															+  --content "Smithery 适合快速找到 MCP server，但描述简略，需要进一步查看 GitHub repo 确认功能。搜索结果按相关度排序，质量较高。" \
														
 
															+  --tags resource \
														
 
															+  --urls "https://smithery.ai" \
														
 
															+  --score 4
														
 
															+```
														
 
															+
														
 
															+## 重要提醒
														
 
															+
														
 
															+1. **主动搜索** - 开始新任务前主动搜索相关知识，避免重复踩坑
														
 
															+2. **及时保存** - 完成任务后立即保存，不要等到忘记细节
														
 
															+3. **质量优先** - 宁可少保存，也要保证质量。一条高质量知识胜过十条低质量
														
 
															+4. **持续反馈** - 使用知识后给予反馈，帮助提升知识库质量
														
 
															+5. **避免重复** - 保存前可以先搜索，避免重复保存相同知识
														
 
															+6. **记录策略** - 不仅记录工具使用经验，也记录如何找到工具的策略
														
--- a/knowhub/skill/cli.py
+++ b/knowhub/skill/cli.py
@@ -0,0 +1,280 @@
 
															+#!/usr/bin/env python3
														
 
															+"""
														
 
															+KnowHub CLI - 知识管理命令行工具
														
 
															+
														
 
															+使用方法:
														
 
															+    python -m knowhub.skill.cli search "查询内容"
														
 
															+    python -m knowhub.skill.cli save --task "任务" --content "内容" --types strategy
														
 
															+    python -m knowhub.skill.cli list --limit 10
														
 
															+"""
														
 
															+
														
 
															+import os
														
 
															+import sys
														
 
															+import json
														
 
															+import argparse
														
 
															+from pathlib import Path
														
 
															+
														
 
															+try:
														
 
															+    import httpx
														
 
															+except ImportError:
														
 
															+    print("错误: 需要安装 httpx 库")
														
 
															+    print("运行: pip install httpx")
														
 
															+    sys.exit(1)
														
 
															+
														
 
															+
														
 
															+def get_api_base() -> str:
														
 
															+    """获取 API 地址"""
														
 
															+    return os.getenv("KNOWHUB_API", "http://localhost:8000")
														
 
															+
														
 
															+
														
 
															+def search_knowledge(args):
														
 
															+    """搜索知识"""
														
 
															+    url = f"{get_api_base()}/api/knowledge/search"
														
 
															+    params = {
														
 
															+        "q": args.query,
														
 
															+        "top_k": args.top_k,
														
 
															+        "min_score": args.min_score,
														
 
															+    }
														
 
															+    if args.types:
														
 
															+        params["types"] = args.types
														
 
															+
														
 
															+    try:
														
 
															+        response = httpx.get(url, params=params, timeout=30.0)
														
 
															+        response.raise_for_status()
														
 
															+        data = response.json()
														
 
															+
														
 
															+        if data["count"] == 0:
														
 
															+            print("未找到相关知识")
														
 
															+            return
														
 
															+
														
 
															+        print(f"找到 {data['count']} 条知识:\n")
														
 
															+        for i, item in enumerate(data["results"], 1):
														
 
															+            print(f"[{i}] {item['task']}")
														
 
															+            print(f"    ID: {item['id']}")
														
 
															+            eval_data = item.get("eval", {})
														
 
															+            print(f"    评分: {eval_data.get('score', 3)} | 质量分: {item.get('quality_score', 'N/A')}")
														
 
															+            print(f"    类型: {', '.join(item.get('types', []))}")
														
 
															+            print(f"    内容: {item['content'][:100]}...")
														
 
															+            print()
														
 
															+
														
 
															+    except httpx.HTTPError as e:
														
 
															+        print(f"请求失败: {e}")
														
 
															+        sys.exit(1)
														
 
															+
														
 
															+
														
 
															+def save_knowledge(args):
														
 
															+    """保存知识"""
														
 
															+    url = f"{get_api_base()}/api/knowledge"
														
 
															+
														
 
															+    data = {
														
 
															+        "message_id": args.message_id or f"cli-{os.getpid()}",
														
 
															+        "types": args.types.split(",") if args.types else ["strategy"],
														
 
															+        "task": args.task,
														
 
															+        "tags": json.loads(args.tags) if args.tags else {},
														
 
															+        "scopes": args.scopes.split(",") if args.scopes else ["org:cybertogether"],
														
 
															+        "owner": args.owner or "agent:cli",
														
 
															+        "content": args.content,
														
 
															+        "source": {
														
 
															+            "name": args.source_name or "cli",
														
 
															+            "category": args.source_category or "exp",
														
 
															+            "urls": args.urls.split(",") if args.urls else [],
														
 
															+            "agent_id": args.agent_id or "cli",
														
 
															+            "submitted_by": args.submitted_by or "cli-user",
														
 
															+        },
														
 
															+        "eval": {
														
 
															+            "score": args.score,
														
 
															+            "helpful": 1,
														
 
															+            "harmful": 0,
														
 
															+            "confidence": 0.5,
														
 
															+        }
														
 
															+    }
														
 
															+
														
 
															+    try:
														
 
															+        response = httpx.post(url, json=data, timeout=30.0)
														
 
															+        response.raise_for_status()
														
 
															+        result = response.json()
														
 
															+        print(f"✅ 知识已保存: {result['knowledge_id']}")
														
 
															+
														
 
															+    except httpx.HTTPError as e:
														
 
															+        print(f"保存失败: {e}")
														
 
															+        sys.exit(1)
														
 
															+
														
 
															+
														
 
															+def update_knowledge(args):
														
 
															+    """更新知识"""
														
 
															+    url = f"{get_api_base()}/api/knowledge/{args.id}"
														
 
															+
														
 
															+    data = {}
														
 
															+    if args.score:
														
 
															+        data["update_score"] = args.score
														
 
															+    if args.helpful_case:
														
 
															+        data["add_helpful_case"] = args.helpful_case
														
 
															+    if args.harmful_case:
														
 
															+        data["add_harmful_case"] = args.harmful_case
														
 
															+    if args.evolve_feedback:
														
 
															+        data["evolve_feedback"] = args.evolve_feedback
														
 
															+
														
 
															+    if not data:
														
 
															+        print("错误: 至少需要提供一个更新参数")
														
 
															+        sys.exit(1)
														
 
															+
														
 
															+    try:
														
 
															+        response = httpx.put(url, json=data, timeout=30.0)
														
 
															+        response.raise_for_status()
														
 
															+        print(f"✅ 知识已更新: {args.id}")
														
 
															+
														
 
															+    except httpx.HTTPError as e:
														
 
															+        print(f"更新失败: {e}")
														
 
															+        sys.exit(1)
														
 
															+
														
 
															+
														
 
															+def batch_update_knowledge(args):
														
 
															+    """批量更新知识"""
														
 
															+    url = f"{get_api_base()}/api/knowledge/batch_update"
														
 
															+
														
 
															+    # 从文件读取反馈列表
														
 
															+    if args.file:
														
 
															+        with open(args.file, 'r') as f:
														
 
															+            feedback_list = json.load(f)
														
 
															+    else:
														
 
															+        print("错误: 需要提供 --file 参数")
														
 
															+        sys.exit(1)
														
 
															+
														
 
															+    data = {"feedback_list": feedback_list}
														
 
															+
														
 
															+    try:
														
 
															+        response = httpx.post(url, json=data, timeout=60.0)
														
 
															+        response.raise_for_status()
														
 
															+        result = response.json()
														
 
															+        print(f"✅ 批量更新完成: {result['updated']} 条知识")
														
 
															+
														
 
															+    except httpx.HTTPError as e:
														
 
															+        print(f"批量更新失败: {e}")
														
 
															+        sys.exit(1)
														
 
															+
														
 
															+
														
 
															+def list_knowledge(args):
														
 
															+    """列出知识"""
														
 
															+    url = f"{get_api_base()}/api/knowledge"
														
 
															+    params = {"limit": args.limit}
														
 
															+    if args.types:
														
 
															+        params["types"] = args.types
														
 
															+    if args.scopes:
														
 
															+        params["scopes"] = args.scopes
														
 
															+
														
 
															+    try:
														
 
															+        response = httpx.get(url, params=params, timeout=30.0)
														
 
															+        response.raise_for_status()
														
 
															+        data = response.json()
														
 
															+
														
 
															+        if data["count"] == 0:
														
 
															+            print("知识库为空")
														
 
															+            return
														
 
															+
														
 
															+        print(f"共 {data['count']} 条知识:\n")
														
 
															+        for i, item in enumerate(data["results"], 1):
														
 
															+            print(f"[{i}] {item['task']}")
														
 
															+            print(f"    ID: {item['id']}")
														
 
															+            eval_data = item.get("eval", {})
														
 
															+            print(f"    评分: {eval_data.get('score', 3)} | Helpful: {eval_data.get('helpful', 0)} | Harmful: {eval_data.get('harmful', 0)}")
														
 
															+            print(f"    类型: {', '.join(item.get('types', []))}")
														
 
															+            print(f"    所有者: {item.get('owner', 'N/A')}")
														
 
															+            print()
														
 
															+
														
 
															+    except httpx.HTTPError as e:
														
 
															+        print(f"请求失败: {e}")
														
 
															+        sys.exit(1)
														
 
															+
														
 
															+
														
 
															+def slim_knowledge(args):
														
 
															+    """知识瘦身"""
														
 
															+    url = f"{get_api_base()}/api/knowledge/slim"
														
 
															+    params = {"model": args.model}
														
 
															+
														
 
															+    try:
														
 
															+        print("正在执行知识瘦身，这可能需要一些时间...")
														
 
															+        response = httpx.post(url, params=params, timeout=120.0)
														
 
															+        response.raise_for_status()
														
 
															+        result = response.json()
														
 
															+        print(f"✅ 瘦身完成: {result['before']} → {result['after']} 条知识")
														
 
															+        if result.get("report"):
														
 
															+            print(f"   {result['report']}")
														
 
															+
														
 
															+    except httpx.HTTPError as e:
														
 
															+        print(f"瘦身失败: {e}")
														
 
															+        sys.exit(1)
														
 
															+
														
 
															+
														
 
															+def main():
														
 
															+    parser = argparse.ArgumentParser(description="KnowHub CLI - 知识管理工具")
														
 
															+    subparsers = parser.add_subparsers(dest="command", help="可用命令")
														
 
															+
														
 
															+    # search 命令
														
 
															+    search_parser = subparsers.add_parser("search", help="搜索知识")
														
 
															+    search_parser.add_argument("query", help="查询文本")
														
 
															+    search_parser.add_argument("--top-k", type=int, default=5, help="返回结果数量")
														
 
															+    search_parser.add_argument("--min-score", type=int, default=3, help="最低评分")
														
 
															+    search_parser.add_argument("--types", help="类型过滤（逗号分隔）")
														
 
															+
														
 
															+    # save 命令
														
 
															+    save_parser = subparsers.add_parser("save", help="保存知识")
														
 
															+    save_parser.add_argument("--task", required=True, help="任务描述")
														
 
															+    save_parser.add_argument("--content", required=True, help="知识内容")
														
 
															+    save_parser.add_argument("--types", default="strategy", help="类型（逗号分隔）")
														
 
															+    save_parser.add_argument("--tags", help="标签（JSON 格式）")
														
 
															+    save_parser.add_argument("--scopes", help="可见范围（逗号分隔）")
														
 
															+    save_parser.add_argument("--owner", help="所有者")
														
 
															+    save_parser.add_argument("--source-name", help="来源名称")
														
 
															+    save_parser.add_argument("--source-category", help="来源类别")
														
 
															+    save_parser.add_argument("--urls", help="相关 URL（逗号分隔）")
														
 
															+    save_parser.add_argument("--agent-id", help="Agent ID")
														
 
															+    save_parser.add_argument("--submitted-by", help="提交者")
														
 
															+    save_parser.add_argument("--message-id", help="消息 ID")
														
 
															+    save_parser.add_argument("--score", type=int, default=3, help="评分 (1-5)")
														
 
															+
														
 
															+    # update 命令
														
 
															+    update_parser = subparsers.add_parser("update", help="更新知识")
														
 
															+    update_parser.add_argument("id", help="知识 ID")
														
 
															+    update_parser.add_argument("--score", type=int, help="更新评分")
														
 
															+    update_parser.add_argument("--helpful-case", help="添加有效案例")
														
 
															+    update_parser.add_argument("--harmful-case", help="添加有害案例")
														
 
															+    update_parser.add_argument("--evolve-feedback", help="知识进化反馈")
														
 
															+
														
 
															+    # batch-update 命令
														
 
															+    batch_parser = subparsers.add_parser("batch-update", help="批量更新知识")
														
 
															+    batch_parser.add_argument("--file", required=True, help="反馈列表 JSON 文件")
														
 
															+
														
 
															+    # list 命令
														
 
															+    list_parser = subparsers.add_parser("list", help="列出知识")
														
 
															+    list_parser.add_argument("--limit", type=int, default=10, help="返回数量")
														
 
															+    list_parser.add_argument("--types", help="类型过滤")
														
 
															+    list_parser.add_argument("--scopes", help="范围过滤")
														
 
															+
														
 
															+    # slim 命令
														
 
															+    slim_parser = subparsers.add_parser("slim", help="知识瘦身")
														
 
															+    slim_parser.add_argument("--model", default="google/gemini-2.0-flash-001", help="使用的模型")
														
 
															+
														
 
															+    args = parser.parse_args()
														
 
															+
														
 
															+    if not args.command:
														
 
															+        parser.print_help()
														
 
															+        sys.exit(1)
														
 
															+
														
 
															+    # 执行命令
														
 
															+    if args.command == "search":
														
 
															+        search_knowledge(args)
														
 
															+    elif args.command == "save":
														
 
															+        save_knowledge(args)
														
 
															+    elif args.command == "update":
														
 
															+        update_knowledge(args)
														
 
															+    elif args.command == "batch-update":
														
 
															+        batch_update_knowledge(args)
														
 
															+    elif args.command == "list":
														
 
															+        list_knowledge(args)
														
 
															+    elif args.command == "slim":
														
 
															+        slim_knowledge(args)
														
 
															+
														
 
															+
														
 
															+if __name__ == "__main__":
														
 
															+    main()
														
 
															+
														
--- a/knowhub/skill/knowhub.md
+++ b/knowhub/skill/knowhub.md
@@ -1,163 +0,0 @@
 
															----
														
 
															-name: knowhub
														
 
															-description: 遇到超出现有能力的问题或复杂任务时，查询工具和资源的使用经验，回报自己的使用心得
														
 
															-category: tool-discovery
														
 
															-scope: agent:*
														
 
															----
														
 
															-
														
 
															-## When to use
														
 
															-
														
 
															-**查询经验**：
														
 
															-- 需要外部工具/库但不确定选哪个
														
 
															-- 遇到复杂任务，想知道别人用了什么工具、怎么做的
														
 
															-- 多次尝试失败，可能有人踩过同样的坑
														
 
															-- 可能需要外部资源（论文、文档、博客等）
														
 
															-
														
 
															-**提交经验**（重要！）：
														
 
															-- ✅ **使用资源后立即提交** — 完成任务、有了结果就提交，不要等到上下文太长才想起来
														
 
															-- ✅ **获得用户反馈后补充** — 如果用户指出问题或给出建议，视情况更新或补充经验
														
 
															-- ✅ **搜索过程中的发现** — 在哪个平台找到的、用了什么关键词、哪些策略有效，都可以提交
														
 
															-
														
 
															-## 工作流
														
 
															-
														
 
															-### 1. 查经验
														
 
															-
														
 
															-```bash
														
 
															-curl -s "http://localhost:8000/api/search?q=关键词"
														
 
															-```
														
 
															-
														
 
															-**有可用结果**：直接参考。结果包含资源名、URL、评分和具体使用建议。
														
 
															-
														
 
															-**需要深入**：结果中如果有 `content_id`，可以获取详细内容：
														
 
															-
														
 
															-```bash
														
 
															-curl -s "http://localhost:8000/api/content/{content_id}"
														
 
															-```
														
 
															-
														
 
															-返回内容正文 + 导航（children / prev / next），可以继续浏览相关章节。
														
 
															-
														
 
															-**无可用结果**：进入步骤 2 自行寻找。
														
 
															-
														
 
															-### 2. 自行寻找（仅在无经验时）
														
 
															-
														
 
															-根据已知信息选择入口：
														
 
															-
														
 
															-- **已知工具名**：直接去对应平台（PyPI / npmjs / GitHub）查文档和社区反馈，搜 `<工具名> review`、`<工具名> vs`、`site:reddit.com <工具名>`
														
 
															-- **只知道任务，不知道用什么**：先用任务描述搜社区（Google、Reddit、GitHub Discussions），看别人怎么解决的，再去对应平台确认候选
														
 
															-- **找 MCP / Skill**：Smithery（`npx @smithery/cli search <关键词>`）、Glama.ai、awesome-mcp-servers
														
 
															-- **找库/包**：PyPI、npmjs.com
														
 
															-- **找一站式集成**：Composio（850+ 工具）、LangChain Tools
														
 
															-
														
 
															-评估时关注社区活跃度、工程质量、可持续性、License 兼容性。
														
 
															-简单逻辑自己写比引入依赖更快时，直接写。
														
 
															-
														
 
															-### 3. 回报经验
														
 
															-
														
 
															-**⚠️ 重要：完成任务后立即提交，不要拖到上下文太长才想起来！**
														
 
															-
														
 
															-使用资源后，提交真实体验：
														
 
															-
														
 
															-```bash
														
 
															-curl -s -X POST "http://localhost:8000/api/experience" \
														
 
															-  -H "Content-Type: application/json" \
														
 
															-  -d '{
														
 
															-    "name": "资源名称",
														
 
															-    "url": "来源URL",
														
 
															-    "category": "mcp|skill|library|api|paper|blog|book|course",
														
 
															-    "task": "你在做什么（具体场景）",
														
 
															-    "score": 4,
														
 
															-    "outcome": "结果如何，优缺点",
														
 
															-    "tips": "最关键的一条建议",
														
 
															-    "submitted_by": "'$(git config user.email)'"
														
 
															-  }'
														
 
															-```
														
 
															-
														
 
															-**提交时机**：
														
 
															-- ✅ 使用资源完成任务后，有了结果就提交
														
 
															-- ✅ 如果后续获得用户反馈（指出问题、给出建议），视情况再提交一条补充经验
														
 
															-
														
 
															-**两类经验的区分**：
														
 
															-
														
 
															-1. **对资源本身的使用经验** — 提交为该资源的 experience
														
 
															-   - 例如：使用 pymupdf 提取 PDF 表格的经验 → `name: "pymupdf"`
														
 
															-
														
 
															-2. **对搜索平台/策略的经验** — 提交为平台的 experience 或 knowhub 的 experience
														
 
															-   - 对平台本身的评价 → 提交为该平台的经验（`name: "smithery"` / `"pypi"` / `"github-search"`）
														
 
															-   - 关于找工具/找资源的策略、方法论 → 提交为 knowhub 的经验（`name: "knowhub"`）
														
 
															-
														
 
															-**搜索策略经验示例**：
														
 
															-
														
 
															-```bash
														
 
															-# 对搜索平台的评价
														
 
															-curl -s -X POST "http://localhost:8000/api/experience" \
														
 
															-  -H "Content-Type: application/json" \
														
 
															-  -d '{
														
 
															-    "name": "smithery",
														
 
															-    "url": "https://smithery.ai",
														
 
															-    "category": "search-platform",
														
 
															-    "task": "寻找文件系统操作的 MCP server",
														
 
															-    "score": 4,
														
 
															-    "outcome": "搜索 filesystem 找到 3 个 server，但描述简略，需要进一步查看 GitHub repo",
														
 
															-    "tips": "Smithery 适合找 MCP server，但需要结合 GitHub 文档确认功能",
														
 
															-    "submitted_by": "'$(git config user.email)'"
														
 
															-  }'
														
 
															-
														
 
															-# 关于搜索策略的经验
														
 
															-curl -s -X POST "http://localhost:8000/api/experience" \
														
 
															-  -H "Content-Type: application/json" \
														
 
															-  -d '{
														
 
															-    "name": "knowhub",
														
 
															-    "url": "http://localhost:8000",
														
 
															-    "category": "search-platform",
														
 
															-    "task": "寻找 PDF 表格提取的 Python 库",
														
 
															-    "score": 5,
														
 
															-    "outcome": "搜索 pdf extract table python 直接找到 pymupdf 的高分经验，节省了大量调研时间",
														
 
															-    "tips": "关键词要包含具体操作动词（extract/parse）+ 目标对象（table）+ 技术栈（python）",
														
 
															-    "submitted_by": "'$(git config user.email)'"
														
 
															-  }'
														
 
															-```
														
 
															-
														
 
															-**字段填写要求**：
														
 
															-- **name** — 资源的通用名称。工具填包名（`pymupdf`），论文填标题（`Attention Is All You Need`），博客填文章标题
														
 
															-- **url** — 资源的规范来源地址（GitHub repo / arXiv / 官方文档）
														
 
															-- **category** — 资源类型，建议用：`mcp | skill | library | api | paper | blog | book | course`
														
 
															-- **task** — 描述具体场景，不要写泛泛的"处理 PDF"
														
 
															-- **tips** — 要具体可操作："双栏论文用 page.get_text(sort=True)"，不要写"挺好用的"
														
 
															-- **submitted_by** — 可选，建议用 `git config user.email`
														
 
															-
														
 
															-**多资源协作场景**：
														
 
															-如果任务用了多个工具/资源（如 pymupdf + langchain + openai），为每个关键资源单独提交一条 experience，在 task/tips/outcome 中提及配合使用的其他资源。这样每个资源都能被搜索到，同时保留协作信息。
														
 
															-
														
 
															-### 4. 提交内容（可选）
														
 
															-
														
 
															-如果你获取了资源的详细信息（目录、章节内容等），可以提交供后续 Agent 使用：
														
 
															-
														
 
															-```bash
														
 
															-# 提交资源概要/目录
														
 
															-curl -s -X POST "http://localhost:8000/api/content" \
														
 
															-  -H "Content-Type: application/json" \
														
 
															-  -d '{
														
 
															-    "id": "resource-name",
														
 
															-    "title": "资源标题",
														
 
															-    "body": "Markdown 格式的概要或目录内容",
														
 
															-    "submitted_by": "'$(git config user.email)'"
														
 
															-  }'
														
 
															-
														
 
															-# 提交具体章节
														
 
															-curl -s -X POST "http://localhost:8000/api/content" \
														
 
															-  -H "Content-Type: application/json" \
														
 
															-  -d '{
														
 
															-    "id": "resource-name/section-key",
														
 
															-    "title": "章节标题",
														
 
															-    "body": "Markdown 格式的章节内容",
														
 
															-    "sort_order": 1,
														
 
															-    "submitted_by": "'$(git config user.email)'"
														
 
															-  }'
														
 
															-```
														
 
															-
														
 
															-- body 使用 Markdown 格式，图片用 URL 引用
														
 
															-- 根节点 ID 不含 `/`（如 `pymupdf`），子节点 ID 含 `/`（如 `pymupdf/find-tables`）
														
 
															-- 只提交对后续 Agent 有价值的内容，不要灌入全文
														
 
															-
														
 
															-## 重要提醒
														
 
															-务必要向平台提交可靠的经验！不允许因为其他经验机制忽略向knowhub提交经验。