Skip to content

Pull requests: huggingface/evaluate

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix/label distribution entropy
#733 opened Feb 13, 2026 by michaelellis003 Loading…
Add human-centered trust & uncertainty metrics to Hugging Face Evaluate
#728 opened Jan 26, 2026 by dyra-12 Loading…
7 of 8 tasks
Fix minor typos and wording in README
#720 opened Dec 10, 2025 by ajeetkartikay Loading…
add CLIPScore metric
#706 opened Sep 25, 2025 by Sunhill666 Loading…
Refactor IOU and accuracy calculations with np.divide
#700 opened Sep 10, 2025 by gboeer Loading…
Add politeness_score metricCreate politeness_score.py
#676 opened May 24, 2025 by epaunova Loading…
[Feature] Add G-Pass@k Metric
#657 opened Dec 23, 2024 by jnanliu Loading…
Add Diarization Error Rate (DER) metric
#557 opened Mar 4, 2024 by medahmedkrichen Loading…
Add Frechet Inception Distance (FID) Score
#556 opened Mar 2, 2024 by medahmedkrichen Loading…
support multilabel confusion matrix
#533 opened Jan 5, 2024 by 0ssamaak0 Loading…
Customize Gradio Interface that is Launched
#298 opened Sep 22, 2022 by abidlabs Member Loading…
ProTip! Mix and match filters to narrow down what you’re looking for.