Skip to content

Investigate integration-test tool harness failures on PR #8725 #8726

@roomote

Description

@roomote

Context

CI status

Failure pattern

  • Multiple assertions report tools not being detected by the e2e harness:
    • apply_diff, read_file, search_files, search_and_replace, list_files, insert_content, execute_command
  • Examples from logs:
    • Roo Code search_files Tool: expected execution with file pattern filter; see out/suite/tools/search-files.test.js:475
    • Roo Code search_and_replace Tool: expected tool execution; see out/suite/tools/search-and-replace.test.js:252
    • Roo Code read_file Tool: expected tool execution; see out/suite/tools/read-file.test.js:228
    • Roo Code apply_diff Tool: expected tool execution; see out/suite/tools/apply-diff.test.js:262
  • Environment in runner: Node 20.19.2, ubuntu-24.04 image

Local verification

  • webview-ui tests pass locally (92 files, 1100 tests)
  • Type check and lint pass in webview package

Hypothesis

  • The e2e tool mocking layer in apps/vscode-e2e may not be capturing tool call events in CI, independent of the UI change in ChatRow.
  • Potential causes: event wiring, test isolation, or dependency/version differences in the VS Code test runner or mocha.

Requests

  • Please triage and advise ownership for the e2e harness
  • Optionally re-run the failing job to check for flakiness

Suggested next steps

  • Inspect how tool call events are intercepted and asserted in apps/vscode-e2e out/suite/tools/*.test.js
  • Verify postMessage or proxy hooks used by the tool harness in CI environment
  • Consider pinning or bumping @vscode/test-electron and mocha if a recent change altered event timing

This issue is created to track integration-test stability separately from the UI-only fix in PR #8725.

Metadata

Metadata

Assignees

No one assigned

    Labels

    Issue/PR - TriageNew issue. Needs quick review to confirm validity and assign labels.supportNot verified as bug, currently just a support request

    Type

    No type

    Projects

    Status

    Done

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions