Question
Shall we default all the datasets and traces on HuggingFace to v2?
Right now it isn't obvious whether every link on our surfaces resolves to v2, and the version/ownership story is inconsistent. We've also had feedback that the website isn't clear about which version a visitor is looking at.
What I'm seeing on claw-bench.com
The links currently point at a mix of orgs and versions:
| Resource |
Current link |
| HF Dataset |
huggingface.co/datasets/NAIL-Group/ClawBench |
| HF Traces |
huggingface.co/datasets/TIGER-Lab/ClawBenchV2Trace |
| HF V1 Traces |
huggingface.co/datasets/NAIL-Group/ClawBenchV1Trace |
| GitHub |
github.com/TIGER-AI-Lab/ClawBench |
So the main dataset lives under NAIL-Group while traces live under TIGER-Lab, and v1/v2 are interleaved without a clear default.
Proposal
- Decide whether v2 is the canonical default everywhere (dataset + traces).
- Make every "Dataset" / "Traces" link on the website, README, and HF cards resolve to the v2 artifacts by default, with v1 clearly marked as legacy/archive.
- Consolidate to a single canonical org so the dataset and traces don't split across
NAIL-Group vs TIGER-Lab.
References
Feedback: a few people have said the website isn't very clear right now about which version is the default — worth addressing alongside this.
Question
Shall we default all the datasets and traces on HuggingFace to v2?
Right now it isn't obvious whether every link on our surfaces resolves to v2, and the version/ownership story is inconsistent. We've also had feedback that the website isn't clear about which version a visitor is looking at.
What I'm seeing on claw-bench.com
The links currently point at a mix of orgs and versions:
huggingface.co/datasets/NAIL-Group/ClawBenchhuggingface.co/datasets/TIGER-Lab/ClawBenchV2Tracehuggingface.co/datasets/NAIL-Group/ClawBenchV1Tracegithub.com/TIGER-AI-Lab/ClawBenchSo the main dataset lives under
NAIL-Groupwhile traces live underTIGER-Lab, and v1/v2 are interleaved without a clear default.Proposal
NAIL-GroupvsTIGER-Lab.References
Feedback: a few people have said the website isn't very clear right now about which version is the default — worth addressing alongside this.