feat(kaggle-cli): Add sdk for benchmark notebook by dolaameng · Pull Request #90 · Kaggle/kaggle-benchmarks

dolaameng · 2026-03-24T17:14:40Z

Kaggle Benchmark Client

This PR introduces the BenchmarkNotebookClient SDK to manage Kaggle benchmark tasks. These tasks execute as Kaggle notebooks tagged with the personal-benchmark keyword.

APIs

publish_and_run: Converts local Python benchmark scripts into .ipynb format, generates the kernel-metadata.json file (allowing users to persistently save and edit kernel configuration overrides), and pushes the payload to Kaggle. Implements concurrent run guards to block overlapping executions of the same benchmark (bypassable via force=True).
fork: Pulls pre-existing benchmark notebooks from Kaggle into local workspaces for editing.
get_results: Polls the Kaggle backend for execution status and retrieves one or more .run.json artifacts upon completion. Supports thread-based polling cancellation via threading.Event.

Testing

The code is validated by two test suites:

test_notebook_api.py: Uses API mocks to verify polling logic, artifact extraction, and cancellation offline.
test_kaggle_client.py: Runs against the live Kaggle backend (golden_tests) to test the full execution lifecycle.

src/kaggle_benchmarks/kaggle_client/notebook_api.py

s-alexey · 2026-03-25T10:26:52Z

src/kaggle_benchmarks/kaggle_client/utils.py

+# ---------------------------------------------------------------------------
+
+
+def normalize_status(status: object) -> str:


Might want a narrower type here. object seems not what you intended; see Any vs object.

Suggested change

def normalize_status(status: object) -> str:

def normalize_status(status: Any) -> str:

s-alexey · 2026-03-25T10:30:28Z

src/kaggle_benchmarks/kaggle_client/notebook_api.py

+from kaggle_benchmarks.kaggle_client.utils import (
+    KAGGLE_METADATA_MAP,
+    build_local_metadata,
+    convert_ipynb_to_py,
+    convert_py_to_ipynb,
+    normalize_status,
+    parse_remote_metadata,
+)


Might be clearer to import as a module. It usually makes the code easier to follow; style guide

Suggested change

from kaggle_benchmarks.kaggle_client.utils import (

KAGGLE_METADATA_MAP,

build_local_metadata,

convert_ipynb_to_py,

convert_py_to_ipynb,

normalize_status,

parse_remote_metadata,

)

from kaggle_benchmarks.kaggle_client import utils as kaggle_utils

s-alexey · 2026-03-25T10:36:23Z

src/kaggle_benchmarks/kaggle_client/notebook_api.py

Should we move most of this to the kagglesdk? It seems out of scope for the benchmark package and may overlap with https://github.com/Kaggle/kaggle-sdk-python/tree/main/kagglesdk/benchmarks

Yes that's another option.

Ideally I think we want most heavy works implemented on kagglesdk and we have thin wrappers here for users to directly use without jumping around.

Also it feels more nature to have all kinds of extensions like kaggle-client, vscode-exension together here for community contributions.

We could move part of the code to kagglesdk later, if people find this useful?

s-alexey · 2026-03-25T10:45:48Z

tests/kaggle_client/test_kaggle_client_utils.py

+    meta = MagicMock()
+    meta.ref = "alice/my-benchmark"
+    meta.title = "My Benchmark"
+    meta.language = "python"
+    meta.kernel_type = "notebook"
+    meta.is_private = False
+    meta.enable_gpu = True
+    meta.enable_internet = True
+    meta.enable_tpu = False
+    meta.dataset_data_sources = ["alice/dataset"]
+    meta.competition_data_sources = ["comp1"]
+    meta.kernel_data_sources = ["alice/kernel"]
+    meta.model_data_sources = ["alice/model"]
+    meta.category_ids = ["personal-benchmark", "nlp"]


Here and in other places I would advise against mocking when you can use dependency injections [go/python-tips/013]

Yes here meta is a mocked data object, and parse_remote_metadata takes it as a parameter so it's already DI?

rosbo

Link to my comment thread about adding this to our existing CLI rather than having a separate CLI tool. Let's discuss on that thread: https://docs.google.com/document/d/1xvOIzSAyYVNtff4S7aqELPNEpbAwmiem16jM68pzqJI/edit?disco=AAAB2hylHpA

Adds a kaggle-bench CLI with two subcommands: - run: publish and run a local benchmark script on Kaggle - fork: pull an existing Kaggle benchmark notebook for local editing Extends the BenchmarkNotebookClient SDK from PR Kaggle#90 with a command-line interface so users can trigger benchmark runs directly from the terminal without writing Python boilerplate. Tests: 5 new unit tests covering help output, argument parsing, and correct delegation to BenchmarkNotebookClient.

feat(kaggle-cli): Add sdk for task run.

22853b6

dolaameng commented Mar 24, 2026

View reviewed changes

src/kaggle_benchmarks/kaggle_client/notebook_api.py Outdated Show resolved Hide resolved

migrate to kaggle-sdk

f5c1ce1

dolaameng changed the title ~~feat(kaggle-cli): Add sdk for task run.~~ feat(kaggle-cli): Add sdk for benchmark notebook Mar 25, 2026

dolaameng requested review from andrewmwang, rosbo, s-alexey and yibinlin-google March 25, 2026 03:56

rephrase

b65ff0b

s-alexey reviewed Mar 25, 2026

View reviewed changes

rosbo reviewed Mar 25, 2026

View reviewed changes

dolaameng added 2 commits March 26, 2026 01:08

address comments

1196c9b

improve tests

5bec919

gastondana627 mentioned this pull request Mar 28, 2026

feat(kaggle-client): add kaggle-bench CLI entry point #98

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(kaggle-cli): Add sdk for benchmark notebook#90

feat(kaggle-cli): Add sdk for benchmark notebook#90
dolaameng wants to merge 5 commits intocifrom
dolaameng/kaggle_client_2

dolaameng commented Mar 24, 2026 •

edited

Loading

Uh oh!

Uh oh!

s-alexey Mar 25, 2026

Uh oh!

dolaameng Mar 26, 2026

Uh oh!

s-alexey Mar 25, 2026

Uh oh!

dolaameng Mar 26, 2026

Uh oh!

s-alexey Mar 25, 2026

Uh oh!

dolaameng Mar 25, 2026

Uh oh!

s-alexey Mar 25, 2026 •

edited

Loading

Uh oh!

dolaameng Mar 26, 2026

Uh oh!

rosbo left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		# ---------------------------------------------------------------------------


		def normalize_status(status: object) -> str:

	def normalize_status(status: object) -> str:
	def normalize_status(status: Any) -> str:

Conversation

dolaameng commented Mar 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Kaggle Benchmark Client

APIs

Testing

Uh oh!

Uh oh!

s-alexey Mar 25, 2026

Choose a reason for hiding this comment

Uh oh!

dolaameng Mar 26, 2026

Choose a reason for hiding this comment

Uh oh!

s-alexey Mar 25, 2026

Choose a reason for hiding this comment

Uh oh!

dolaameng Mar 26, 2026

Choose a reason for hiding this comment

Uh oh!

s-alexey Mar 25, 2026

Choose a reason for hiding this comment

Uh oh!

dolaameng Mar 25, 2026

Choose a reason for hiding this comment

Uh oh!

s-alexey Mar 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

dolaameng Mar 26, 2026

Choose a reason for hiding this comment

Uh oh!

rosbo left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

dolaameng commented Mar 24, 2026 •

edited

Loading

s-alexey Mar 25, 2026 •

edited

Loading