Skip to content

[CI][Benchmark] Merge benchmark suite presets implementation #17660

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 91 commits into from
Mar 27, 2025
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
91 commits
Select commit Hold shift + click to select a range
e6ca992
Move UR devops scripts to devops folder
ianayl Feb 27, 2025
3d42db2
Restrict number of cores used
ianayl Feb 28, 2025
fc70520
Merge branch 'sycl' of https://github.com/intel/llvm into unify-bench…
ianayl Mar 4, 2025
4f08dd6
Restore ur-benchmark*.yml
ianayl Mar 4, 2025
497dcce
[benchmarks] improve HTML and Markdown output
pbalcer Mar 5, 2025
3cbed5e
Test UR benchmarking suite
ianayl Mar 5, 2025
1936207
Merge branch 'unify-benchmark-ci' of https://github.com/intel/llvm in…
ianayl Mar 5, 2025
f79bbbf
Bump tolerance to 7%
ianayl Mar 5, 2025
ffc8139
Revert "Bump tolerance to 7%"
ianayl Mar 5, 2025
0a34e0d
[benchmarks] fix failing benchmarks, improve html output
pbalcer Mar 6, 2025
3f42420
[benchmarks] fix python formatting with black
pbalcer Mar 6, 2025
1c7b189
update driver version
pbalcer Mar 6, 2025
ad13e93
simplify preset implementation and fix normal preset
pbalcer Mar 6, 2025
68ed0c4
Add PVC and BMG as runners
ianayl Mar 6, 2025
18fff93
Merge branch 'unify-benchmark-ci' of https://github.com/intel/llvm in…
ianayl Mar 6, 2025
3a65b98
Install dependencies before running UR script
ianayl Mar 6, 2025
220121a
Use venv for python packages
ianayl Mar 6, 2025
37d361c
Install venv before using venv
ianayl Mar 6, 2025
07f1e10
[benchmarks] allow specifying custom results directories
pbalcer Mar 7, 2025
64cf79c
[benchmarks] sort runs by date for html output
pbalcer Mar 7, 2025
6c28d33
simplify presets, remove suites if all set
pbalcer Mar 10, 2025
e15b94f
[benchmarks] use python venv for scripts
pbalcer Mar 10, 2025
78fd037
Run apt with sudo
ianayl Mar 10, 2025
0ed1599
Merge branch 'unify-benchmark-ci' of https://github.com/intel/llvm in…
ianayl Mar 10, 2025
82b6e55
Ignore "missing" apt packages in workflow
ianayl Mar 10, 2025
162cba0
Change pip to install to user
ianayl Mar 10, 2025
848f741
Ignore system controlled python env
ianayl Mar 10, 2025
918604e
[CI] use realpaths when referring to SYCL
ianayl Mar 10, 2025
72d8730
[CI] use minimal preset when running benchmarks
ianayl Mar 10, 2025
066f5a6
[CI] Allow 2 bench scripts locations (#17394)
lukaszstolarczuk Mar 12, 2025
18e5291
add ulls compute benchmarks
pbalcer Mar 12, 2025
237750e
[CI][Benchmark] Decouple results from existing file structure, fetch …
ianayl Mar 11, 2025
ba1297f
[benchmark] Disabling UR test suites
ianayl Mar 12, 2025
cd6097f
update compute benchmarks and fix requirements
pbalcer Mar 13, 2025
c4e92c6
fix url updates
pbalcer Mar 13, 2025
ed8eecc
use timestamps in result file names
pbalcer Mar 13, 2025
130212d
add hostname to benchmark run
pbalcer Mar 13, 2025
a884df8
Merge branch 'sycl' of https://github.com/intel/llvm into unify-bench…
ianayl Mar 13, 2025
5323386
add SubmitGraph benchmark
pbalcer Mar 13, 2025
5bd1d56
Restore sycl-linux-run-tests benchmarking action
ianayl Mar 13, 2025
e9b1375
Restore old SYCL benchmarking CI
ianayl Mar 13, 2025
a3edf7a
Add benchmarking results to sycl-docs.yml
ianayl Mar 13, 2025
6620e4a
[CI] Bump compute bench (#17431)
lukaszstolarczuk Mar 13, 2025
f4a2e39
Initial implementation of unified benchmark workflow
ianayl Mar 13, 2025
5d3b0d9
Merge branch 'unify-benchmark-ci' of https://github.com/intel/llvm in…
ianayl Mar 13, 2025
38394bb
[CI] Use commit hash instead, fix issues with run
ianayl Mar 13, 2025
f232b93
add benchmark metadata
pbalcer Mar 14, 2025
30cd308
apply formatting
pbalcer Mar 14, 2025
5e0539a
fix multiple descriptions/notes
pbalcer Mar 14, 2025
137407a
fix benchmark descriptions
pbalcer Mar 14, 2025
e0f5ca6
fix remote html output
pbalcer Mar 14, 2025
1041db6
fix metadata collection with dry run
pbalcer Mar 14, 2025
fae04f4
cleanup compute bench, fix readme, use newer sycl-bench
pbalcer Mar 14, 2025
cfa4a9c
[CI] configure upload results
ianayl Mar 14, 2025
ca963e6
[CI] Change config to update during workflow run instead
ianayl Mar 14, 2025
45a02e1
[CI] Change save name depending on build
ianayl Mar 14, 2025
98f9d38
bump to 2024-2025
ianayl Mar 14, 2025
ef88ea0
[CI] Enforce commit hash to be string regardless
ianayl Mar 14, 2025
b7acba2
cleanup options in js scripts and fix ordering on bar charts
pbalcer Mar 18, 2025
e330a50
use day on x axis for timeseries
pbalcer Mar 18, 2025
cde744c
Merge branch 'sycl' of https://github.com/intel/llvm into unify-bench…
ianayl Mar 19, 2025
cae7049
[benchmarks] Undo merging in prior tests
ianayl Mar 19, 2025
6bff3d6
add an option to limit build parallelism
pbalcer Mar 20, 2025
3662b43
tiny tweaks for benchmark tags
pbalcer Mar 20, 2025
d2610c3
add support for benchmark tags
pbalcer Mar 20, 2025
ffc60bf
support for tags in html
pbalcer Mar 20, 2025
75dd229
better and more tags
pbalcer Mar 20, 2025
cec8f05
formatting
pbalcer Mar 20, 2025
a0d8370
fix fetching tags from remote json
pbalcer Mar 20, 2025
c7f8d10
fix results /w descriptions and add url/commit of benchmarks
pbalcer Mar 20, 2025
1dad513
fix git repo/hash for benchmarks
pbalcer Mar 20, 2025
8437b89
Merge branch 'sycl' of https://github.com/intel/llvm into unify-bench…
ianayl Mar 21, 2025
2dbf350
Revert changes to workflow files
ianayl Mar 21, 2025
a4a9907
Revert changes to composite actions
ianayl Mar 21, 2025
bdef08b
Revert changes to get_system_info.sh
ianayl Mar 21, 2025
9e51c86
Revert changes not related to metadata
ianayl Mar 22, 2025
6fa722b
Merge branch 'sycl' of https://github.com/intel/llvm into benchmark-m…
ianayl Mar 22, 2025
5cc02c5
Revert changes to html
ianayl Mar 22, 2025
b49ff88
Revert presets.py
ianayl Mar 22, 2025
9357df2
Revert benchmark.yml
ianayl Mar 22, 2025
115dd5e
Merge branch 'benchmark-metadata' of https://github.com/ianayl/sycl i…
ianayl Mar 24, 2025
03bfd15
Update imports to reflect result.py move
ianayl Mar 24, 2025
0ff0142
Add benchmark history updates
ianayl Mar 24, 2025
1ef9251
Correct bad conflict resolution over cudnn/cublas flags
ianayl Mar 25, 2025
31c6695
Remove use of typing to stay consistent across files
ianayl Mar 25, 2025
136f64e
Remove debug comments
ianayl Mar 25, 2025
ccb2a9c
Remove trailing spaces
ianayl Mar 25, 2025
f8ccc30
Specify that git metadata is modifiable
ianayl Mar 25, 2025
c54cd76
Remove unused metadata variable for now
ianayl Mar 25, 2025
c8b974e
Add presets from unify-benchmark-ci
ianayl Mar 25, 2025
15df86b
Merge branch 'benchmark-scripts-presets' of https://github.com/ianayl…
ianayl Mar 26, 2025
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
8 changes: 4 additions & 4 deletions devops/scripts/benchmarks/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -6,6 +6,8 @@ Scripts for running performance tests on SYCL and Unified Runtime.

- [Velocity Bench](https://github.com/oneapi-src/Velocity-Bench)
- [Compute Benchmarks](https://github.com/intel/compute-benchmarks/)
- [LlamaCpp Benchmarks](https://github.com/ggerganov/llama.cpp)
- [SYCL-Bench](https://github.com/unisa-hpc/sycl-bench)

## Running

Expand All @@ -27,8 +29,6 @@ You can also include additional benchmark parameters, such as environment variab

Once all the required information is entered, click the "Run workflow" button to initiate a new workflow run. This will execute the benchmarks and then post the results as a comment on the specified Pull Request.

By default, all benchmark runs are compared against `baseline`, which is a well-established set of the latest data.

You must be a member of the `oneapi-src` organization to access these features.

## Comparing results
Expand All @@ -37,8 +37,8 @@ By default, the benchmark results are not stored. To store them, use the option

You can compare benchmark results using `--compare` option. The comparison will be presented in a markdown output file (see below). If you want to calculate the relative performance of the new results against the previously saved data, use `--compare <previously_saved_data>` (i.e. `--compare baseline`). In case of comparing only stored data without generating new results, use `--dry-run --compare <name1> --compare <name2> --relative-perf <name1>`, where `name1` indicates the baseline for the relative performance calculation and `--dry-run` prevents the script for running benchmarks. Listing more than two `--compare` options results in displaying only execution time, without statistical analysis.

Baseline, as well as baseline-v2 (for the level-zero adapter v2) is updated automatically during a nightly job. The results
are stored [here](https://oneapi-src.github.io/unified-runtime/benchmark_results.html).
Baseline_L0, as well as Baseline_L0v2 (for the level-zero adapter v2) is updated automatically during a nightly job. The results
are stored [here](https://oneapi-src.github.io/unified-runtime/performance/).

## Output formats
You can display the results in the form of a HTML file by using `--ouptut-html` and a markdown file by using `--output-markdown`. Due to character limits for posting PR comments, the final content of the markdown file might be reduced. In order to obtain the full markdown output, use `--output-markdown full`.
Expand Down
12 changes: 12 additions & 0 deletions devops/scripts/benchmarks/main.py
Original file line number Diff line number Diff line change
Expand Up @@ -17,6 +17,7 @@
from history import BenchmarkHistory
from utils.utils import prepare_workdir
from utils.compute_runtime import *
from presets import enabled_suites, presets

import argparse
import re
Expand Down Expand Up @@ -175,6 +176,9 @@ def main(directory, additional_env_vars, save_name, compare_names, filter):
failures = {}

for s in suites:
if s.name() not in enabled_suites(options.preset):
continue

suite_benchmarks = s.benchmarks()
if filter:
suite_benchmarks = [
Expand Down Expand Up @@ -457,6 +461,13 @@ def validate_and_parse_env_args(env_args):
help="Directory for cublas library",
default=None,
)
parser.add_argument(
"--preset",
type=str,
choices=[p for p in presets.keys()],
help="Benchmark preset to run",
default=options.preset,
)
parser.add_argument(
"--results-dir",
type=str,
Expand Down Expand Up @@ -495,6 +506,7 @@ def validate_and_parse_env_args(env_args):
options.current_run_name = args.relative_perf
options.cudnn_directory = args.cudnn_directory
options.cublas_directory = args.cublas_directory
options.preset = args.preset
options.custom_results_dir = args.results_dir
options.build_jobs = args.build_jobs

Expand Down
2 changes: 2 additions & 0 deletions devops/scripts/benchmarks/options.py
Original file line number Diff line number Diff line change
Expand Up @@ -2,6 +2,7 @@
from enum import Enum
import multiprocessing

from presets import presets

class Compare(Enum):
LATEST = "latest"
Expand Down Expand Up @@ -42,6 +43,7 @@ class Options:
compute_runtime_tag: str = "25.05.32567.12"
build_igc: bool = False
current_run_name: str = "This PR"
preset: str = "Full"
custom_results_dir = None
build_jobs: int = multiprocessing.cpu_count()

Expand Down
38 changes: 38 additions & 0 deletions devops/scripts/benchmarks/presets.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,38 @@
# Copyright (C) 2025 Intel Corporation
# Part of the Unified-Runtime Project, under the Apache License v2.0 with LLVM Exceptions.
# See LICENSE.TXT
# SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception

presets: dict[str, list[str]] = {
"Full": [
"Compute Benchmarks",
"llama.cpp bench",
"SYCL-Bench",
"Velocity Bench",
"UMF",
],
"SYCL": [
"Compute Benchmarks",
"llama.cpp bench",
"SYCL-Bench",
"Velocity Bench",
],
"Minimal": [
"Compute Benchmarks",
],
"Normal": [
"Compute Benchmarks",
"llama.cpp bench",
"Velocity Bench",
],
"Test": [
"Test Suite",
],
}


def enabled_suites(preset: str) -> list[str]:
try:
return presets[preset]
except KeyError:
raise ValueError(f"Preset '{preset}' not found.")
Loading