Thanks for your nice work!
You mention here that deepseek-coder models achieve SOTA performance on APPS, but I could not find the exact scores or the evaluation code for the APPS benchmark. Could you share the evaluation scripts for APPS?