Question about evaluation datasets #14

JasonZhu1313 · 2024-05-22T19:25:37Z

Hey,

Great observations and work on disentangling the format following from reasoning! Could we share details on evaluation dataset we used and how we can reproduce the result in the paper? I have fine tuned llama3 on the dataset and achieved worse performance in 30 questions curated from HotpotQA dataset. If you could share some light on this it would be super appreciated! Thanks,
Jason

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question about evaluation datasets #14

Question about evaluation datasets #14

JasonZhu1313 commented May 22, 2024

Question about evaluation datasets #14

Question about evaluation datasets #14

Comments

JasonZhu1313 commented May 22, 2024