Problem
When testing LEM on a single RTX 4090, I cannot find a combination of num_cpus and orientation_batch_size that quite saturates the GPU, based on the GPU utilization reported by nvidia-smi.
Best config so far
- orientation_batch_size=48
- num_cpus=[ this doesn't seem to have an effect ]
I'm using the prebuilt version installed with pip in a Python 3.10 venv, as suggested on the homepage under the pre-packaged releases.
- As an aside, it would be cool if the example code blocks were copyable. There are many mechanisms to do this.
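For instance, if the docs happen to be built with Sphinx (an assumption on my part, not something I've verified), the sphinx-copybutton extension adds a copy button to every code block with a small change to conf.py:

```python
# conf.py -- sketch assuming a Sphinx-built docs site (hypothetical here).
# Requires: pip install sphinx-copybutton
extensions = [
    # ... existing extensions ...
    "sphinx_copybutton",
]
# Strip leading prompts (">>> " and "$ ") so only the command itself is copied.
copybutton_prompt_text = r">>> |\$ "
copybutton_prompt_is_regexp = True
```

Other doc generators (MkDocs Material, Docusaurus) ship equivalent copy buttons out of the box.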

Questions
- What is the orientation_batch_size you typically recommend?
- Is this just sending a stack of images off in a batch to PyTorch? If so, does that make the answer to the previous question effectively "as many as will fit in memory"?
- Is num_cpus actually doing anything right now? It doesn't seem to be, based on either my hardware resource monitors or what I can make out from the code.
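If the answer really is "as many as will fit in memory", one practical way to tune orientation_batch_size is a double-then-bisect search. Below is a minimal sketch; the fits() callback is a stand-in I made up, which in practice would run one forward pass at that batch size and catch torch.cuda.OutOfMemoryError:

```python
def find_max_batch_size(fits, start=1, limit=4096):
    """Find the largest batch size for which fits(n) is True.

    fits(n) is assumed to be monotone: once a batch size fails,
    all larger ones fail too. In a real setup it would attempt a
    forward pass with n images and return False on a CUDA OOM.
    """
    # Phase 1: exponential growth to bracket the failure point.
    lo, n = 0, start
    while n <= limit and fits(n):
        lo = n
        n *= 2
    hi = min(n, limit + 1)
    # Phase 2: binary search between last success and first failure.
    while hi - lo > 1:
        mid = (lo + hi) // 2
        if fits(mid):
            lo = mid
        else:
            hi = mid
    return lo

# Stand-in memory model: pretend batches of up to 112 images fit.
print(find_max_batch_size(lambda n: n <= 112))  # → 112
```

Note that filling memory maximizes occupancy but not necessarily throughput; it's worth timing a few sizes below the limit as well.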
Any tips or tricks on how to get the most out of a given hardware setup would be very helpful. Thanks!