feat: change default score display from 'best' to 'average'#34

Closed
ScuttleBot wants to merge 2 commits into pinchbench:main from ScuttleBot:feat/default-average-score

@ScuttleBot

The leaderboard now shows average scores by default when the page loads, rather than best scores. Users can still toggle to view best scores.

Changes

  • Default score mode changed from 'best' to 'average' in leaderboard-view.tsx
  • Updated URL parameter handling to treat 'average' as the default (keeping URLs clean by omitting the parameter when it's the default value)
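A minimal sketch of the URL handling described above, not the actual code in leaderboard-view.tsx. The names `ScoreMode`, `DEFAULT_SCORE_MODE`, `parseScoreMode`, `buildSearch`, and the `score` query parameter are assumptions made for illustration; the PR does not show the real identifiers.

```typescript
// Hypothetical sketch: every identifier and the "score" parameter name
// are assumptions, not taken from leaderboard-view.tsx.
type ScoreMode = "best" | "average";

// The default this PR proposes.
const DEFAULT_SCORE_MODE: ScoreMode = "average";
const SCORE_PARAM = "score"; // assumed query parameter name

// Read the score mode from a query string, falling back to the default
// when the parameter is missing or has an unrecognized value.
function parseScoreMode(search: string): ScoreMode {
  const raw = new URLSearchParams(search).get(SCORE_PARAM);
  return raw === "best" || raw === "average" ? raw : DEFAULT_SCORE_MODE;
}

// Write the score mode back into a query string. The default value is
// omitted so that the default view keeps a clean URL.
function buildSearch(search: string, mode: ScoreMode): string {
  const params = new URLSearchParams(search);
  if (mode === DEFAULT_SCORE_MODE) {
    params.delete(SCORE_PARAM);
  } else {
    params.set(SCORE_PARAM, mode);
  }
  const query = params.toString();
  return query ? `?${query}` : "";
}
```

Under these assumptions, a URL without the parameter resolves to the average view, and only the non-default mode (`best`) ever appears in the query string.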

Why

Average scores provide a more representative view of model performance across multiple runs, reducing the impact of lucky outliers.
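To illustrate the outlier effect, here is a small sketch comparing the two aggregations. The run scores are made-up illustrative numbers, not PinchBench results, and the helper names `bestScore` and `averageScore` are hypothetical.

```typescript
// Hypothetical helpers; the leaderboard's real aggregation code is not shown in this PR.
function bestScore(runs: number[]): number {
  if (runs.length === 0) throw new Error("no runs");
  return Math.max(...runs);
}

function averageScore(runs: number[]): number {
  if (runs.length === 0) throw new Error("no runs");
  return runs.reduce((sum, score) => sum + score, 0) / runs.length;
}

// Illustrative data only: three typical runs and one lucky outlier.
const exampleRuns = [0.5, 0.5, 0.5, 1.0];

const best = bestScore(exampleRuns);       // 1.0, driven entirely by the outlier
const average = averageScore(exampleRuns); // 0.625, closer to typical performance
```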

The leaderboard now shows average scores by default when the page loads,
rather than best scores. Users can still toggle to view best scores.

This change also updates the URL parameter handling to treat 'average'
as the default (removing it from URL when selected to keep URLs clean).
@vercel

vercel bot commented Mar 13, 2026

@olearycrew is attempting to deploy a commit to the brendan's projects Team on Vercel.

A member of the Team first needs to authorize it.

@vercel

vercel bot commented Mar 13, 2026

The latest updates on your projects. Learn more about Vercel for GitHub.

| Project | Deployment | Actions | Updated (UTC) |
| --- | --- | --- | --- |
| leaderboard | Ready | Preview, Comment | Mar 16, 2026 4:08pm |
| pinchbench-leaderboard | Ready | Preview, Comment | Mar 16, 2026 4:08pm |


Merged main into feat/default-average-score branch.
Conflict in leaderboard-view.tsx: main changed the URL cleanup default
to 'best' and added weights param cleanup. Resolved by keeping 'average'
as the default (the purpose of this PR) while incorporating the new
weights parameter cleanup line from main.
@olearycrew
Member

I want a discussion on this first: #42

@olearycrew olearycrew closed this Mar 19, 2026