feat: harden knowledgebase retrieval and role skills#327
feat: harden knowledgebase retrieval and role skills#327OpenCodeEngineer wants to merge 13 commits into
Conversation
|
Added new Slack eval scenario in this branch:
It was executed against Slack ( Result summary:
Report path:
Failure reason is specifically tooling bootstrap evidence: agent reported jq missing and apt install permission error, so jq readiness was not proven. |
|
Rerun status after runtime sudo fix:
Run details:
Conclusion:
|
|
Addressed prompt source duplication identified in review. Root cause:
Changes:
Validation:
|
|
Update pushed on branch This addresses the prompt-source inconsistency you flagged:
Also fixed deploy convergence on AKS:
Operational notes from rollout:
Verification done in cluster
|
|
Post-change validation run:
Result:
Note: this eval exercises SupportEngineer quality heuristics and is not directly tied to the OpenClaw prompt mount change; OpenClaw rollout and health checks are green in-cluster. |
|
Pushed follow-up commit Scope:
Changes:
Runtime validation in AKS (
Post-change Slack eval run:
|
|
Docs update pushed in
This aligns the design doc with the current runtime behavior after the OpenClaw/OpenHands self-modification changes. |
|
Added KB management documentation update in
|
Summary
agent_service.openhands.agent.Agentand preload roles fromagents/agents.yamlknowledgebase-searchskills and inject required KB skill instructions into agent contextdocs_toolsretrieval:agents/shared/knowledgebasedocs_toolsverification flow (instead of rawrg)Validation
uv run python -m pytest tests/test_agents_md_loader.py tests/test_docs_tools.py tests/test_openclaw_docs_context.py tests/test_support_engineer_kb_response.py -vuv run python scripts/eval_slack_e2e.py --scenario knowledgebase_cross_agent_support_to_product --channel C0AATPSADB8 --timeout 600(re-run after merge/deploy)Fixes #328