[SYCL][L0][CUDA][HIP] Fix PI_KERNEL_GROUP_INFO_GLOBAL_WORK_SIZE queries #8769

abagusetty · 2023-03-24T17:30:18Z

Address kernel query global_work_size for L0, CUDA, HIP from PI_KERNEL_GROUP_INFO_GLOBAL_WORK_SIZE
Fixes #8766
For instance (for X-dimension)
L0: maxGroupSizeX * maxGroupCountX
CUDA: CU_DEVICE_ATTRIBUTE_MAX_BLOCK_DIM_X * CU_DEVICE_ATTRIBUTE_MAX_GRID_DIM_X

jinz2014 · 2023-03-24T18:11:18Z

I have a question. Is the max global work size independent of the global work size set in a host program for a kernel ?

abagusetty · 2023-03-24T18:11:27Z

/verify with intel/llvm-test-suite#1694

bader · 2023-03-24T18:19:45Z

@abagusetty, FYI. "verify with" command do not validate on CUDA/HIP platforms.
I recommend adding this test to sycl/test-e2e to validate on CUDA/HIP platforms.
+@aelovikov-intel.

abagusetty · 2023-03-24T18:25:14Z

Thanks, I stumbled upon that too and looked at the wording in Spec, which made me think it could be the max global limits.

The exact semantics of this descriptor are defined by each SYCL backend specification, but the intent is to return the kernel’s maximum global work size.

jinz2014 · 2023-03-27T22:30:06Z

The global work sizes from the query will be the same for any kernels. Right ?

abagusetty · 2023-03-28T16:56:12Z

The global work sizes from the query will be the same for any kernels. Right ?

Yes, since the descriptor is a kernel_device_specific one: Any kernel from (custom device type or a built-in kernel) possibly returns the info of device specific global-work-sizes which in turn should be the same for all the kernels IMO.

…m device-types appropriately

sycl/plugins/level_zero/pi_level_zero.cpp

jchlanda · 2023-03-29T06:07:07Z

sycl/plugins/cuda/pi_cuda.hpp

@@ -42,6 +42,11 @@
 #include <unordered_map>
 #include <vector>

+// Helper for one-liner validation
+#define PI_ASSERT(condition, error)                                            \


It's a bit misleading, as it does not assert on the condition, maybe consider renaming it?

PI_ASSERT to PI_ERR_CHECK

abagusetty · 2023-04-07T15:18:00Z

Gentle ping @smaslov-intel @jchlanda

jandres742

+1 on L0 changes.

steffenlarsen

Sorry for the delay. I think these changes look good. I am a little curious what built-in kernels they would apply to, but I assume CUDA, HIP and L0 guarantee full possible work-sizes either way.

abagusetty · 2023-04-12T23:32:07Z

Sorry for the delay. I think these changes look good. I am a little curious what built-in kernels they would apply to, but I assume CUDA, HIP and L0 guarantee full possible work-sizes either way.

Thanks for the feed back on the built-ins, I too stumbled upon that a bit: Just convinced myself that they see the complete device limits.

intel#8769 Signed-off-by: Jaime Arteaga <[email protected]>

[SYCL] fix global_work_size kernel query descriptor

dc09fd9

abagusetty temporarily deployed to aws March 24, 2023 17:46 — with GitHub Actions Inactive

[HIP] add support for HIP plugin

4498c9a

abagusetty mentioned this pull request Mar 24, 2023

[SYCL] enabled kernelinfo (global_work_size) support for CUDA, HIP intel/llvm-test-suite#1694

Draft

abagusetty temporarily deployed to aws March 24, 2023 18:30 — with GitHub Actions Inactive

added a convience assert similar to UR

64da35a

abagusetty temporarily deployed to aws March 24, 2023 19:39 — with GitHub Actions Inactive

abagusetty had a problem deploying to aws March 24, 2023 20:41 — with GitHub Actions Failure

Merge branch 'sycl' into fix8766

290b00e

abagusetty temporarily deployed to aws March 27, 2023 17:54 — with GitHub Actions Inactive

abagusetty temporarily deployed to aws March 27, 2023 19:00 — with GitHub Actions Inactive

Merge branch 'sycl' into fix8766

d872cb4

fix the failing unit test to query for device-built-in kernels, custo…

28393ab

…m device-types appropriately

abagusetty marked this pull request as ready for review March 28, 2023 18:20

abagusetty requested review from a team as code owners March 28, 2023 18:20

abagusetty requested a review from steffenlarsen March 28, 2023 18:20

smaslov-intel reviewed Mar 28, 2023

View reviewed changes

sycl/plugins/level_zero/pi_level_zero.cpp Show resolved Hide resolved

abagusetty temporarily deployed to aws March 28, 2023 19:36 — with GitHub Actions Inactive

abagusetty temporarily deployed to aws March 28, 2023 22:14 — with GitHub Actions Inactive

jchlanda reviewed Mar 29, 2023

View reviewed changes

abagusetty temporarily deployed to aws March 29, 2023 15:00 — with GitHub Actions Inactive

abagusetty temporarily deployed to aws March 29, 2023 17:09 — with GitHub Actions Inactive

abagusetty temporarily deployed to aws April 4, 2023 14:27 — with GitHub Actions Inactive

jandres742 reviewed Apr 10, 2023

View reviewed changes

smaslov-intel approved these changes Apr 10, 2023

View reviewed changes

steffenlarsen approved these changes Apr 12, 2023

View reviewed changes

steffenlarsen merged commit d666b95 into intel:sycl Apr 12, 2023

abagusetty deleted the fix8766 branch April 12, 2023 23:32

jandres742 pushed a commit to jandres742/llvm that referenced this pull request Apr 21, 2023

Port Fix PI_KERNEL_GROUP_INFO_GLOBAL_WORK_SIZE queries

80d88a3

intel#8769 Signed-off-by: Jaime Arteaga <[email protected]>

jandres742 pushed a commit to jandres742/llvm that referenced this pull request Apr 24, 2023

Port Fix PI_KERNEL_GROUP_INFO_GLOBAL_WORK_SIZE queries

4c7a82e

intel#8769 Signed-off-by: Jaime Arteaga <[email protected]>

jandres742 pushed a commit to jandres742/llvm that referenced this pull request May 3, 2023

Port Fix PI_KERNEL_GROUP_INFO_GLOBAL_WORK_SIZE queries

cec80a6

intel#8769 Signed-off-by: Jaime Arteaga <[email protected]>

jandres742 pushed a commit to jandres742/llvm that referenced this pull request May 16, 2023

Port Fix PI_KERNEL_GROUP_INFO_GLOBAL_WORK_SIZE queries

da1e57d

intel#8769 Signed-off-by: Jaime Arteaga <[email protected]>

jandres742 pushed a commit to jandres742/llvm that referenced this pull request May 23, 2023

Port Fix PI_KERNEL_GROUP_INFO_GLOBAL_WORK_SIZE queries

4f0b293

intel#8769 Signed-off-by: Jaime Arteaga <[email protected]>

jandres742 pushed a commit to jandres742/llvm that referenced this pull request May 26, 2023

Port Fix PI_KERNEL_GROUP_INFO_GLOBAL_WORK_SIZE queries

77af538

intel#8769 Signed-off-by: Jaime Arteaga <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SYCL][L0][CUDA][HIP] Fix PI_KERNEL_GROUP_INFO_GLOBAL_WORK_SIZE queries #8769

[SYCL][L0][CUDA][HIP] Fix PI_KERNEL_GROUP_INFO_GLOBAL_WORK_SIZE queries #8769

Uh oh!

abagusetty commented Mar 24, 2023 •

edited

Loading

Uh oh!

jinz2014 commented Mar 24, 2023

Uh oh!

abagusetty commented Mar 24, 2023

Uh oh!

bader commented Mar 24, 2023

Uh oh!

abagusetty commented Mar 24, 2023

Uh oh!

jinz2014 commented Mar 27, 2023

Uh oh!

abagusetty commented Mar 28, 2023

Uh oh!

Uh oh!

jchlanda Mar 29, 2023

Uh oh!

abagusetty Mar 29, 2023

Uh oh!

abagusetty commented Apr 7, 2023

Uh oh!

jandres742 left a comment

Uh oh!

steffenlarsen left a comment

Uh oh!

abagusetty commented Apr 12, 2023

Uh oh!

Uh oh!

[SYCL][L0][CUDA][HIP] Fix PI_KERNEL_GROUP_INFO_GLOBAL_WORK_SIZE queries #8769

[SYCL][L0][CUDA][HIP] Fix PI_KERNEL_GROUP_INFO_GLOBAL_WORK_SIZE queries #8769

Uh oh!

Conversation

abagusetty commented Mar 24, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jinz2014 commented Mar 24, 2023

Uh oh!

abagusetty commented Mar 24, 2023

Uh oh!

bader commented Mar 24, 2023

Uh oh!

abagusetty commented Mar 24, 2023

Uh oh!

jinz2014 commented Mar 27, 2023

Uh oh!

abagusetty commented Mar 28, 2023

Uh oh!

Uh oh!

jchlanda Mar 29, 2023

Choose a reason for hiding this comment

Uh oh!

abagusetty Mar 29, 2023

Choose a reason for hiding this comment

Uh oh!

abagusetty commented Apr 7, 2023

Uh oh!

jandres742 left a comment

Choose a reason for hiding this comment

Uh oh!

steffenlarsen left a comment

Choose a reason for hiding this comment

Uh oh!

abagusetty commented Apr 12, 2023

Uh oh!

Uh oh!

abagusetty commented Mar 24, 2023 •

edited

Loading