
Description
The benchmark package does (at least) two questionable things:
- To decide whether a performance difference is significant, it uses the Mann-Whitney U test, which is meant for ordinal-scale data, instead of Student's t test, which is more appropriate for ratio-scale data such as the timing measurements we're taking here (see the sketch after this list).
- It recompiles the JavaScript code from a string on every test run in order to defeat engine optimizations. Usually we're interested in performance with engine optimizations enabled, since that is how code runs in the real world. Moreover, since most of the benchmark code is probably out of reach for this compilation trick, optimization is only partly disabled, producing inconsistent optimization characteristics. It's worth investigating whether this behavior can be turned off.
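
For illustration, here is a minimal sketch of how a t test could be computed directly on two sets of raw timing samples. I'm using Welch's variant of the t test (which doesn't assume equal variances) rather than the pooled Student's version; the function names and sample numbers are made up and not part of any existing package.

```js
// Minimal sketch: Welch's t test on two arrays of timing samples
// (e.g. ms per run). Purely illustrative, not an existing API.
function mean(xs) {
  return xs.reduce((a, b) => a + b, 0) / xs.length;
}

function variance(xs) {
  const m = mean(xs);
  return xs.reduce((a, b) => a + (b - m) ** 2, 0) / (xs.length - 1);
}

// Returns the t statistic for the difference between the two sample means.
// A |t| well above ~2 (for reasonable sample sizes) suggests a real
// performance difference rather than noise.
function welchT(samplesA, samplesB) {
  const se = Math.sqrt(
    variance(samplesA) / samplesA.length + variance(samplesB) / samplesB.length
  );
  return (mean(samplesA) - mean(samplesB)) / se;
}

console.log(welchT([105, 98, 102, 101, 99], [110, 112, 108, 111, 109]));
```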
I can think of a few options (from least to most effort):
- Accept the quirks and do nothing.
- Switch to an alternative benchmark framework that doesn't have these quirks. I'm not (yet) aware of an alternative.
- Forgo a convenient benchmark framework. Instead, repeat the benchmark code a fixed number of times (say, 10) for each version of Underscore and report each individual result, so that people can compute their own statistics (see the sketch after this list).
- Fork the benchmark package, fix the issues and submit a PR upstream. Use our own fork regardless of whether the PR is accepted; if it isn't, publish the fork as a separate package on npm.
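
As a rough sketch of what the "no framework" option could look like (the `workload` function, run count, and units here are placeholders, not an actual proposal for the benchmark suite):

```js
// Minimal sketch: run the same workload a fixed number of times and report
// every raw measurement, so readers can compute their own statistics.
const _ = require('underscore');

function workload() {
  // Representative code under test; purely illustrative.
  const xs = _.range(10000);
  return _.reduce(_.map(xs, x => x * 2), (a, b) => a + b, 0);
}

const RUNS = 10;
const results = [];
for (let i = 0; i < RUNS; i++) {
  const start = process.hrtime.bigint();
  workload();
  const end = process.hrtime.bigint();
  results.push(Number(end - start) / 1e6); // milliseconds
}

// Print each individual result rather than a single aggregate.
console.log(results.map(ms => ms.toFixed(3)).join('\n'));
```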