Skip to content

fix(task_10): support OpenClaw/Claude Code read tool name in read_config grader#54

Open
JoeProAI wants to merge 1 commit intopinchbench:mainfrom
JoeProAI:fix/task10-read-tool-compatibility
Open

fix(task_10): support OpenClaw/Claude Code read tool name in read_config grader#54
JoeProAI wants to merge 1 commit intopinchbench:mainfrom
JoeProAI:fix/task10-read-tool-compatibility

Conversation

@JoeProAI
Copy link

Problem

The task_10_workflow automated grader checks whether the agent used a file-reading tool to read config.json via the read_config criterion. Currently it only recognizes two tool name variants:

  • read_file
  • readFile

And only checks the files parameter for the filename.

OpenClaw and Claude Code record file reads with:

  • Tool name: read or Read
  • Parameters: path or file_path (not files)

This means agents running on these platforms always score 0.0 on read_config even when they correctly read config.json as the first step.

Fix

  • Case-insensitive tool name matching: accepts read_file, readfile, read (covers Cursor, Windsurf, OpenClaw, Claude Code)
  • Also checks path and file_path params alongside files

Impact

This only affects task_10_workflow automated scoring. No other tasks are impacted. The fix is backward-compatible — existing tool names still match.

…fig grader

The automated grader for task_10_workflow checks whether the agent used a
file-reading tool to read config.json. Currently it only recognizes
'read_file' and 'readFile' tool names with a 'files' parameter.

OpenClaw and Claude Code record file reads as tool name 'read' or 'Read'
with 'path' or 'file_path' parameters. This means agents on these
platforms always score 0.0 on read_config even when they correctly read
config.json.

Changes:
- Case-insensitive tool name matching (read_file, readfile, read)
- Also check 'path' and 'file_path' params alongside 'files'
@JoeProAI
Copy link
Author

Hey, just noting this is related to the broader OpenClaw compatibility work — PR #58 (merged today) fixed model ID casing, and PR #20 fixed config format. This one handles the grader side: OpenClaw and Claude Code use read with path/file_path params rather than read_file with files array. Happy to rebase if needed.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants