When I upload .yaml benchmark, I have to wait for the whole bench to finish.
It may take hours for local models.

Could you please add a preview for that like the current one a user sees at the end.
I could click a button "run tests" and check them in real time e.g. if I see many "red" boxes, I would change a prompt and restart without waiting for all tests to finish.