First, thank you for building this benchmark suite.
However, it does not work smoothly on a freshly set up OpenClaw environment because the required configuration files/ standards are missing.
It might be helpful to provide a default configuration or setup guideline, so users can run the benchmark under a similar environment.