Thank you for developing this fantastic dataset for code generation!
I had a quick question about the evaluation script (compute_metric.py). In my experience, it takes about 17 hours to complete, even with the provided example generations. When I tried switching to parallel evaluation, the process finished much faster (in seconds), but all the results came back as zero. During evaluation, I used the original code and the generated solutions provided in this repo.
Is this expected behavior, or am I missing something? I would appreciate any guidance you could provide.
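For reference, the parallel variant I tried looked roughly like this. Note that `run_check` and the pass-rate aggregation below are simplified stand-ins I wrote for illustration, not the actual logic in compute_metric.py:

```python
# Minimal sketch of parallel evaluation, assuming each problem is a
# (solution_source, test_source) pair. `run_check` is a hypothetical
# stand-in for the real per-problem test harness.
import concurrent.futures

def run_check(problem):
    # Returns 1 if the generated solution passes its tests, else 0.
    solution, tests = problem
    namespace = {}
    try:
        exec(solution, namespace)  # load the generated solution
        exec(tests, namespace)     # run its assert-based tests
        return 1
    except Exception:
        return 0

def evaluate(problems, workers=8):
    # Run the checks in separate processes instead of sequentially.
    with concurrent.futures.ProcessPoolExecutor(max_workers=workers) as pool:
        results = list(pool.map(run_check, problems))
    return sum(results) / len(results)  # overall pass rate
```

With this structure I would expect the same pass rate as the sequential run, just faster, which is why the all-zero result surprised me.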