
TroyGarden (Contributor) commented Oct 5, 2025

Summary:

context

  • add a benchmark for torch.distributed.all_to_all_single.
  • in this first version, the all_to_all_single call runs synchronously on the same stream as the compute.
  • basic operations (see the sketch after this list):
    pre-comms compute (GPU compute heavy) ==> all_to_all_single comms ==> irrelevant compute (GPU compute heavy, does not depend on the comms data) ==> post-comms compute (GPU compute heavy, uses the comms data)
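A minimal sketch of this pattern, assuming an initialized default process group and a matmul as the stand-in for the "GPU compute heavy" stages (the benchmark's actual helpers, names, and shapes may differ):

```python
import torch
import torch.distributed as dist

# Sketch only: assumes dist.init_process_group(...) has already run and that
# x's first dim is divisible by the world size (equal all-to-all splits).
def a2a_sync_step(x: torch.Tensor, w: torch.Tensor) -> torch.Tensor:
    pre = x @ w                       # pre-comms compute (GPU compute heavy)
    out = torch.empty_like(pre)
    # sync comms: async_op defaults to False, and the op is enqueued on the
    # same (default) stream as the surrounding compute kernels
    dist.all_to_all_single(out, pre)
    _ = x @ w                         # irrelevant compute (does not depend on comms data)
    return out @ w                    # post-comms compute (uses the comms data)
```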

other changes

  • extend the cmd_conf decorator so that it supports selecting among multiple programs on the command line (a dispatch sketch follows this list):
python -m torchrec.distributed.benchmark.benchmark_comms \
  a2a_single --name=a2a_sync_base-$(git rev-parse --short HEAD || echo $USER)
  • add a config (dataclass) class BenchmarkFunc for benchmark_func, which bundles the most common arguments passed to benchmark_func (a sketch also follows this list)

  • the trace shows that all_to_all_single (comms) runs in the same stream as compute (screenshot below).
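The dispatch idea, as a hypothetical sketch (cmd_conf's actual implementation is not shown in this PR description; the registry below is illustrative): the first positional token selects a registered program, and the remaining flags are parsed by that program's config.

```python
import sys
from typing import Callable, Dict

# Hypothetical registry; the real cmd_conf decorator may work differently.
_PROGRAMS: Dict[str, Callable[[], None]] = {}

def register(name: str) -> Callable[[Callable[[], None]], Callable[[], None]]:
    """Register a benchmark entry point under a CLI-selectable name."""
    def deco(fn: Callable[[], None]) -> Callable[[], None]:
        _PROGRAMS[name] = fn
        return fn
    return deco

@register("a2a_single")
def a2a_single() -> None:
    ...  # run the all_to_all_single benchmark; flags like --name are parsed by the config layer

if __name__ == "__main__":
    prog_name = sys.argv[1]   # e.g. "a2a_single"
    _PROGRAMS[prog_name]()    # remaining argv entries carry the program's flags
```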

[trace screenshot: the all_to_all_single comms kernel on the same stream as the compute kernels]
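For the BenchmarkFunc config mentioned above, a minimal sketch of the idea; the field names below are illustrative guesses, not the actual definition:

```python
from dataclasses import dataclass

# Illustrative fields only; the real BenchmarkFunc may define a different set.
@dataclass
class BenchmarkFunc:
    name: str = "unnamed"     # run label, e.g. "a2a_sync_base-<git sha>"
    num_benchmarks: int = 10  # timed iterations of the benchmarked function
    num_profiles: int = 2     # additional iterations captured in the trace
    world_size: int = 2       # ranks participating in the comms
    profile_dir: str = ""     # where to dump the trace, if set
```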

Differential Revision: D83900855

meta-codesync bot (Contributor) commented Oct 5, 2025

@TroyGarden has exported this pull request. If you are a Meta employee, you can view the originating Diff in D83900855.

meta-cla bot added the CLA Signed label Oct 5, 2025
TroyGarden added a commit to TroyGarden/torchrec that referenced this pull request Oct 5, 2025
TroyGarden added a commit to TroyGarden/torchrec that referenced this pull request Oct 6, 2025
TroyGarden added a commit to TroyGarden/torchrec that referenced this pull request Oct 6, 2025
TroyGarden changed the title from "benchmark for comms used in TorchRec" to "[benchmark] add a benchmark file for comms used in TorchRec" Oct 6, 2025
TroyGarden added a commit to TroyGarden/torchrec that referenced this pull request Oct 6, 2025
meta-codesync bot closed this in 8cd65b1 Oct 6, 2025
TroyGarden deleted the export-D83900855 branch October 7, 2025 03:27

Labels

CLA Signed · fb-exported · meta-exported