[SYCL] Optimize use of shared_ptr on kernel enqueue fast path #17569

Open
wants to merge 9 commits into
base: sycl

slawekptak
Contributor

Introduce a series of optimizations to how shared_ptr variables are passed, minimizing the number of reference counter increments and decrements.

Use references to shared_ptr as local variables.
Pass the ContextImplPtr by reference to selected functions (on the kernel enqueue fast path).
@slawekptak slawekptak changed the title Optimize use of shared_ptr on kernel enqueue fast path [SYCL] Optimize use of shared_ptr on kernel enqueue fast path Mar 24, 2025
@slawekptak slawekptak requested a review from a team as a code owner March 25, 2025 09:31
@slawekptak slawekptak requested a review from fabiomestre March 25, 2025 09:31
@slawekptak slawekptak force-pushed the shared_ptrs_refs_local_vars_2 branch from 12430be to bdcd30e Compare March 25, 2025 09:56
directly, instead of creating the device object instance.
in another PR.
4 participants