howard

2372825e37 chore: remove runtime artifacts committed by mistake Remove one-off scripts, generated images, knowledge batch data, cache files, and upload queues. Update .gitignore to prevent these from being tracked again. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
4c7426fd5f Merge remote-tracking branch 'refs/remotes/origin/main' # Conflicts: # agent/core/runner.py
08211d81c9 doc
15579d2f9d feat: tool groups
f649d162ce refactor: browser use tools
Просмотр сравнение для этих 6 коммитов »

2 месяцев назад

howard запушил(а) main в howard/Agent

bc4ddf6f5c chore: untrack runtime artifacts committed by mistake Remove 28 files that were committed but should have never been in git. All are runtime-generated state, not source, and some contain PII. Categories: - outputs/ (20 image files): toolhub CLI generation results (flux/nano_banana/seedream). Caller-side scratch, created by `toolhub.py call`. Already gitignored in an earlier commit but the files were added by a parallel branch before the ignore rule landed. - agent/tools/builtin/feishu/chat_history/ (4 files including 3 with real contact names in the filename and chat_summary.json): runtime- maintained per-contact chat logs. FEISHU_TOOLS_PROMPT.md explicitly documents this directory as "系统会自动维护的聊天记录文件". Containing real coworker names, these should never have been committed. - frontend/htmlTemplate/api_data/ (2 files) and ws_data/ (2 files): runtime-cached trace/goal snapshots from the backend API and WebSocket event stream. Written by templateData.py at runtime via save_ws_data_to_file / _append_event_jsonl. Variable name is `mock_dir` — they were meant to be ephemeral mock caches, not seed. .gitignore updates: - Add frontend/htmlTemplate/api_data/ and ws_data/ (the existing `frontend/htmlTemplate/mock_data` rule is a dead entry — that directory no longer exists, the real dirs use api_data/ws_data names now; leaving the mock_data rule in place for safety) - Add agent/tools/builtin/feishu/chat_history/ with a comment noting the PII concern - outputs/ was already gitignored in d269588 Local files are preserved (git rm --cached, not git rm), so running services can keep reading/writing them as usual — they're just no longer tracked.
d269588963 chore: gitignore /outputs dir from toolhub CLI test runs The toolhub CLI (`toolhub.py call`) writes generated images to outputs/<trace_id>/, which is test/session scratch. Never commit.
e940602280 refactor(knowhub): rename serialize_milvus_result to to_serializable The function has nothing to do with Milvus — it is a generic Python object -> JSON-safe dict serializer that handles dicts, lists, iterables, objects with to_dict(), and fallback __dict__ walking. The name was a historical artifact from when the knowledge store actually used Milvus (now removed from the dependency set entirely). Rename to to_serializable across all 12 call sites in knowhub/server.py plus the definition. Also update the docstring to reflect the real purpose ("通用序列化工具：把任意 Python 对象转换为 JSON 可序列化的原生类型").
a914ceea15 docs(tools): cross-framework usage guide + refactor plan tools.md additions: - "跨框架使用（CLI/MCP）" section: design philosophy (stateless -> CLI, stateful -> MCP), CLI entry conventions, trace_id fallback pattern, double-JSON encoding avoidance, MCP integration via .mcp.json (not settings.json — Claude Code doesn't read mcpServers from there) - New entries in the builtin tool table: read_images, toolhub_*, ask/ upload_knowledge - read_file vs read_images usage guidance with adaptive-layout table - Skill installation convention (~/.claude/skills/<name>/SKILL.md) and the size distinction: SKILL.md is runtime-loaded, keep short; tools.md is for developers, can go long tools-refactor-plan.md (new): - Captures the discovery-pattern philosophy and per-family migration plans for two upcoming refactors that will happen in a later session: 1. Content tools (search_posts / youtube_search / x_search) — merge into content_platforms() + content_search() + content_detail() in the same spirit as toolhub_search + toolhub_call 2. Browser tools — collapse 28 @tool functions into ~11 verb-based tools using Literal enum actions. Browser is NOT a good fit for dynamic discovery since the differences are in parameters, not in capabilities - Explicitly rejects "discovery-based browser tools" and "full MCP client wrapper" paths, with reasoning - Lists all open design questions that must be decided before implementation starts
efea909f3b feat(read_images): batch image tool with adaptive grid + shared image utils Problem: when the agent needs to analyze many local images (pick the best photo, compare candidates, batch-judge), reading them one at a time via read_file blows up tokens — each image carries structural overhead per message block, and there is no way to see them side-by-side for comparison. read_images solves this: - Loads 1-12 images concurrently (local paths or URLs, mixable) - Downscales every image to max_dimension (default 1024px) to control per-image token cost - Two layouts: - grid (default): stitches N images into one index-numbered (1,2,3...) grid image so the LLM sees one picture with all candidates and can refer to them by index. Auto-picks columns/thumb_size based on count (2 imgs -> 2x1 @500px, 12 imgs -> 4x3 @320px), so final canvas stays within ~1400px long edge and no per-cell cell gets too small to read after LLM-internal resize - separate: returns N independent downscaled images for tools that really need per-image attention - Output text maps index -> full original path so the LLM can reference "image 3" and resolve it to the source file for downstream edits Grid mode caps at 12 images per call. Beyond that, each cell becomes too small to be useful after the LLM's internal image resize (~1568px long edge). Caller must batch in chunks. Shared utilities (agent/tools/utils/image.py): - load_image / load_images: async local+URL loader - downscale: aspect-preserving resize - build_image_grid: parameterized grid builder with scaled index boxes (index_box = thumb_size // 5, font = box * 0.65, so visual proportions stay constant across different thumb_size) - encode_base64: PIL -> base64 JPEG for tool result images Fixes a latent font bug at the same time: PingFang.ttc on macOS Sequoia cannot be opened by PIL/FreeType (cryptic "cannot open resource"), so search.py and crawler.py were silently rendering collages with the tiny default bitmap font — Chinese titles showed as near-invisible dots. The new font candidate list prioritizes Hiragino Sans GB and STHeiti Medium, both of which PIL can actually read. Refactor search.py and crawler.py to call build_image_grid instead of maintaining their own ~120-line duplicate collage implementations. No behavior change besides the font fix. read_file.py: add a docstring note pointing at read_images for batch use so the LLM can pick the right tool.
Просмотр сравнение для этих 7 коммитов »

2 месяцев назад

howard запушил(а) main в guantao/Tool_Agent

3052459934 fix(nano_banana): defensive check for relative paths in image.data When a caller passes a relative path like "examples/foo/bar.png" in image_urls, Gemini eventually reports a cryptic "Base64 decoding failed for 'examples/foo/bar.png'" because _build_parts was passing the raw path through as if it were base64 data. The real root cause is that remote-service cwd != caller cwd, so os.path.isfile() returns False for relative paths that were valid on the client side. The right fix is on the client side (upload local files before the call), which is done in a companion change to the agent framework's toolhub wrapper. But defensively, when the data field "looks like a file path but the file does not exist", raise a clear ValueError with the actual cwd hint instead of letting Gemini choke on it. - Add _looks_like_path_not_base64 heuristic: anything ending in an image extension, or containing a non-base64 character, is treated as "probably a path" - In _build_parts, when raw is not an http URL and not a local file, check the heuristic and raise immediately with a helpful message that includes cwd
c81698c592 fix: /tools and /search_tools crashed on tool.group_ids AttributeError ToolMeta was refactored to drop the group_ids field (replaced by provider_ids / capability_ids / knowledge_ids), but server.py still read tool.group_ids in two places, causing every /tools call to 500. The group-to-tool relationship is now owned by ToolGroupManager (each ToolGroup has a tool_ids list), not by the tool itself. Reverse-lookup via ToolGroupManager.list_all() produces the same group_ids array the old API contract promised, so clients (the agent framework's toolhub_search wrapper) need no changes. Changes: - In both list_tools and search_tools, build a tool_id -> group_ids reverse-lookup dict once per request using router.registry.group_manager.list_all() (O(N+M), not O(N*M)) - Replace `tool.group_ids` with `tool_to_groups.get(tool.tool_id, [])`
Просмотр сравнение для этих 2 коммитов »

2 месяцев назад

howard запушил(а) main в howard/Agent

1af57b379b Merge remote-tracking branch 'refs/remotes/origin/main'
99b154c30c feat: librarian & dynamic skills
1e2babe59a Merge remote-tracking branch 'origin/main'
51f42b9046 refactor: knowhub
Просмотр сравнение для этих 4 коммитов »

2 месяцев назад

howard запушил(а) main в howard/Agent

36422aa786 fix: tool context & pyproject
47baefc131 doc
Просмотр сравнение для этих 2 коммитов »

2 месяцев назад

howard запушил(а) main в howard/Agent

c74842ad8c Merge remote-tracking branch 'refs/remotes/origin/main'
e04c4decae chore
f6cd43c71d fix: run from specific msg
Просмотр сравнение для этих 3 коммитов »

2 месяцев назад

howard запушил(а) main в howard/Agent

fcb8e72091 doc: knowhub

2 месяцев назад