Skip to content

Pull requests: openai/simple-evals

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Jules version of beginner friendly README
#81 opened May 20, 2025 by rarhs Loading…
feat: add len_var scorer (B-0)
#71 opened May 8, 2025 by Yuu6798 Loading…
fix regex bug in browsecomp
#67 opened Apr 22, 2025 by tengyaolong2000 Loading…
fix: import collision for types
#66 opened Apr 21, 2025 by Ithanil Loading…
Small typo in grader
#57 opened Mar 25, 2025 by chiruu12 Loading…
add aime task
#55 opened Mar 12, 2025 by jason9693 Loading…
Add the F-score metric from the simpleqa paper.
#53 opened Mar 10, 2025 by wbaek Loading…
Initial commit
#45 opened Feb 1, 2025 by osmanjamalfarag Loading…
Grok Sampler
#40 opened Jan 9, 2025 by rolandgvc Loading…
correct string spelling error
#37 opened Dec 27, 2024 by owos Loading…
Use correct _pack_message function name
#12 opened May 20, 2024 by andrewmbenton Loading…
fix typo
#10 opened May 20, 2024 by dongZheX Loading…
Added Chartqa Dataset
#6 opened Apr 14, 2024 by tarunamasa Loading…
Remove blobfile dep, load directly from URL
#4 opened Apr 12, 2024 by arkadyark-cohere Loading…
ProTip! Type g i on any issue or pull request to go back to the issue listing page.