Port the new PostTrainBench-like eval shared in [this X thread](<https://x.com/jehyeoky248/status/2057103859927941153?s=46>) to verifiers.
Port the new PostTrainBench-like eval shared in this X thread to verifiers.