feat: add a LLM TCO calculator tool #445
base: main
Conversation
Try out this PR

Quick install:

```bash
pip install --upgrade --force-reinstall git+https://github.com/ai-dynamo/aiperf.git@main
```

Recommended with a virtual environment (using uv):

```bash
uv venv --python 3.12 && source .venv/bin/activate
uv pip install --upgrade --force-reinstall git+https://github.com/ai-dynamo/aiperf.git@main
```
Codecov Report

✅ All modified and coverable lines are covered by tests.
Walkthrough

The PR introduces two new files: a README documenting the AIPerf utility notebooks, and a comprehensive Jupyter notebook that integrates NVIDIA AIPerf benchmarking with NIM LLM server performance-metrics collection, exporting results to Excel for TCO calculator integration.
Estimated code review effort: 🎯 3 (Moderate) | ⏱️ ~30 minutes
Pre-merge checks

✅ Passed checks (2 passed)
Actionable comments posted: 4
🧹 Nitpick comments (1)
notebooks/TCO_calculator.ipynb (1)
772-775: Consider automating the Excel import process.

The current workflow requires manually copying data from `data.xlsx` to the TCO calculator spreadsheet. This manual step is error-prone and could be automated using a Python library like `openpyxl`. If you'd like to automate this, I can help generate a script that:

- Opens the `LLM_TCO_Calculator.xlsx` workbook
- Locates the "data" sheet
- Appends the benchmark data programmatically
- Saves the updated workbook

Would you like me to create this automation?
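A minimal sketch of what such a script could look like, assuming the exported benchmark rows live in the active sheet of `data.xlsx` and the calculator workbook has a sheet named "data" (both assumptions, not verified against the notebook):

```python
from openpyxl import load_workbook

# Source: benchmark rows exported by the notebook (active sheet assumed).
src_ws = load_workbook("data.xlsx").active

# Destination: the TCO calculator's "data" sheet (sheet name assumed).
dst_wb = load_workbook("LLM_TCO_Calculator.xlsx")
dst_ws = dst_wb["data"]

# Skip the source header row and append each benchmark row in order.
for row in src_ws.iter_rows(min_row=2, values_only=True):
    dst_ws.append(row)

dst_wb.save("LLM_TCO_Calculator.xlsx")
```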
📜 Review details
Configuration used: Path: .coderabbit.yaml
Review profile: CHILL
Plan: Pro
⛔ Files ignored due to path filters (1)
`notebooks/LLM_TCO_Calculator.xlsx` is excluded by `!**/*.xlsx`
📒 Files selected for processing (2)
- `notebooks/README.md` (1 hunks)
- `notebooks/TCO_calculator.ipynb` (1 hunks)
🧰 Additional context used
🪛 GitHub Actions: Pre-commit
notebooks/README.md
[error] 1-1: Trailing whitespace check failed. Files were modified by this hook.
[error] 1-1: Add-license hook modified the file to include license header.
notebooks/TCO_calculator.ipynb
[warning] 1-1: No handler registered for file: notebooks/TCO_calculator.ipynb. Please add a new handler to /home/runner/work/aiperf/aiperf/tools/add_copyright.py!
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (10)
- GitHub Check: integration-tests (ubuntu-latest, 3.12)
- GitHub Check: integration-tests (ubuntu-latest, 3.11)
- GitHub Check: integration-tests (macos-latest, 3.11)
- GitHub Check: integration-tests (macos-latest, 3.10)
- GitHub Check: integration-tests (macos-latest, 3.12)
- GitHub Check: integration-tests (ubuntu-latest, 3.10)
- GitHub Check: build (macos-latest, 3.10)
- GitHub Check: build (macos-latest, 3.11)
- GitHub Check: build (ubuntu-latest, 3.10)
- GitHub Check: build (ubuntu-latest, 3.11)
🔇 Additional comments (2)
notebooks/TCO_calculator.ipynb (2)
744-765: LGTM!

The Excel export logic is clean and well-structured. Column ordering is logical, with metadata fields first followed by performance metrics.
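For reference, a sketch of the ordering pattern being praised; the column names and values here are illustrative, not the notebook's actual fields:

```python
import pandas as pd

# Illustrative only: metadata columns first, performance metrics after.
records = [{
    "model": "meta/llama3-8b-instruct",  # metadata
    "concurrency": 10,                   # metadata (illustrative value)
    "ttft_ms": 42.0,                     # performance metric (hypothetical)
    "tokens_per_sec": 1234.5,            # performance metric (hypothetical)
}]
pd.DataFrame(records).to_excel("data.xlsx", index=False)
```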
324-373: No fixes required; the model identifier formats are correct for their respective purposes.

The apparent inconsistency between model identifiers is not problematic. AIPerf's `-m` parameter (line 349: `meta/llama3-8b-instruct`) correctly identifies the model deployed at the NIM endpoint, matching the Docker image specification (line 300: `nvcr.io/nim/meta/llama3-8b-instruct:latest`). The `--tokenizer` parameter (line 363: `meta-llama/Meta-Llama-3-8B-Instruct`) correctly uses HuggingFace's model identifier format for token counting; the two serve different purposes and do not need to match.

Per the AIPerf documentation, the `-m` parameter must match the deployed endpoint model, which it does. The `REQUEST_COUNT = CONCURRENCY * 3` multiplier aligns with NVIDIA's documented benchmarking best practice for obtaining stable measurements.
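Restated as a small sketch (identifier strings copied from the cited cells; the concurrency value is illustrative, not the notebook's):

```python
# The two identifiers serve different purposes and need not match.
ENDPOINT_MODEL = "meta/llama3-8b-instruct"            # aiperf -m: model served at the NIM endpoint
HF_TOKENIZER = "meta-llama/Meta-Llama-3-8B-Instruct"  # aiperf --tokenizer: HuggingFace ID for token counting

CONCURRENCY = 10                  # illustrative value
REQUEST_COUNT = CONCURRENCY * 3   # NVIDIA's recommended multiplier for stable measurements
```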
debermudez left a comment:
Just a question on the Triton container version.
debermudez left a comment:
Let's go ahead with this now.
I will add an item to our backlog to test this in the near future on updated containers.
Signed-off-by: Vinh Nguyen <[email protected]>
Signed-off-by: Vinh Nguyen <[email protected]>
Thanks @matthewkotila and @debermudez. Resolved some minor issues in the PR in this thread. Could this be merged now?
The Mac test should be fine.
Implementing an LLM TCO Calculator per the blog: https://developer.nvidia.com/blog/llm-inference-benchmarking-how-much-does-your-llm-inference-cost/
This notebook allows you to:

- Benchmark an LLM served by NVIDIA NIM using AIPerf
- Collect server performance metrics
- Export the results to Excel for use with the LLM TCO calculator
Summary by CodeRabbit

Release Notes

Documentation

- Added a README documenting the AIPerf utility notebooks.

New Features

- Added a Jupyter notebook that benchmarks NIM LLM servers with AIPerf and exports the results to Excel for TCO calculator integration.