Instantiate VSMemHelper with correct policy for merge sort #3897

Draft
wants to merge 4 commits into base: main

NaderAlAwar
Contributor

Description

Closes #3895

Checklist

  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

@NaderAlAwar NaderAlAwar requested a review from a team as a code owner February 21, 2025 16:02
@NaderAlAwar NaderAlAwar requested a review from fbusato February 21, 2025 16:02
@NaderAlAwar NaderAlAwar marked this pull request as draft February 21, 2025 16:02

copy-pr-bot bot commented Feb 21, 2025

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

@NaderAlAwar
Contributor Author

/ok to test

Contributor

🟩 CI finished in 1h 43m: Pass: 100%/93 | Total: 2d 07h | Avg: 35m 32s | Max: 1h 18m | Hits: 80%/133653
  • 🟩 cub: Pass: 100%/45 | Total: 1d 12h | Avg: 48m 15s | Max: 1h 18m | Hits: 74%/53221

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  1d 10h | Avg: 48m 01s | Max:  1h 18m | Hits:  74%/50803 
      🟩 arm64              Pass: 100%/2   | Total:  1h 45m | Avg: 52m 56s | Max: 53m 04s | Hits:  76%/2418  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  4h 26m | Avg: 53m 22s | Max:  1h 01m | Hits:  64%/5879  
      🟩 12.5               Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 03m | Hits:  71%/2236  
      🟩 12.8               Pass: 100%/38  | Total:  1d 05h | Avg: 46m 54s | Max:  1h 18m | Hits:  75%/45106 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 53m | Avg: 56m 43s | Max:  1h 00m | Hits:  83%/2090  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  4h 26m | Avg: 53m 22s | Max:  1h 01m | Hits:  64%/5879  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 03m | Hits:  71%/2236  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  1d 03h | Avg: 46m 21s | Max:  1h 18m | Hits:  75%/43016 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 53m | Avg: 56m 43s | Max:  1h 00m | Hits:  83%/2090  
      🟩 nvcc               Pass: 100%/43  | Total:  1d 10h | Avg: 47m 51s | Max:  1h 18m | Hits:  73%/51131 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  3h 24m | Avg: 51m 12s | Max: 53m 15s | Hits:  75%/4844  
      🟩 Clang15            Pass: 100%/2   | Total:  1h 38m | Avg: 49m 10s | Max: 49m 16s | Hits:  74%/2418  
      🟩 Clang16            Pass: 100%/2   | Total:  1h 42m | Avg: 51m 07s | Max: 53m 25s | Hits:  73%/2418  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 37m | Avg: 48m 53s | Max: 49m 49s | Hits:  74%/2418  
      🟩 Clang18            Pass: 100%/7   | Total:  5h 15m | Avg: 45m 03s | Max:  1h 00m | Hits:  84%/8135  
      🟩 GCC7               Pass: 100%/2   | Total:  1h 42m | Avg: 51m 24s | Max: 52m 35s | Hits:  74%/2422  
      🟩 GCC8               Pass: 100%/1   | Total: 48m 28s | Avg: 48m 28s | Max: 48m 28s | Hits:  73%/1211  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 42m | Avg: 51m 05s | Max: 52m 12s | Hits:  73%/2422  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 47m | Avg: 53m 47s | Max: 55m 05s | Hits:  73%/2422  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 40m | Avg: 50m 29s | Max: 51m 37s | Hits:  73%/2418  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 44m | Avg: 52m 21s | Max: 55m 01s | Hits:  73%/2418  
      🟩 GCC13              Pass: 100%/11  | Total:  6h 14m | Avg: 34m 03s | Max:  1h 05m | Hits:  87%/13299 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 18m | Avg:  1h 09m | Max:  1h 17m | Hits:  14%/2070  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  2h 30m | Avg:  1h 15m | Max:  1h 18m | Hits:  14%/2070  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 03m | Hits:  71%/2236  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total: 13h 38m | Avg: 48m 08s | Max:  1h 00m | Hits:  78%/20233 
      🟩 GCC                Pass: 100%/22  | Total: 15h 41m | Avg: 42m 47s | Max:  1h 05m | Hits:  80%/26612 
      🟩 MSVC               Pass: 100%/4   | Total:  4h 49m | Avg:  1h 12m | Max:  1h 18m | Hits:  14%/4140  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 03m | Hits:  71%/2236  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total:  1h 09m | Avg: 23m 03s | Max: 25m 19s | Hits:  91%/3627  
      🟩 rtx2080            Pass: 100%/34  | Total:  1d 07h | Avg: 55m 01s | Max:  1h 18m | Hits:  68%/39922 
      🟩 rtxa6000           Pass: 100%/8   | Total:  3h 51m | Avg: 28m 54s | Max: 54m 22s | Hits:  92%/9672  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  1d 09h | Avg: 54m 02s | Max:  1h 18m | Hits:  68%/43549 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 21m 22s | Avg: 21m 22s | Max: 21m 22s | Hits:  99%/1209  
      🟩 GraphCapture       Pass: 100%/1   | Total: 16m 24s | Avg: 16m 24s | Max: 16m 24s | Hits:  99%/1209  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 13m | Avg: 24m 25s | Max: 25m 19s | Hits:  99%/3627  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 00m | Avg: 20m 16s | Max: 20m 53s | Hits:  99%/3627  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 09m | Avg: 23m 03s | Max: 25m 19s | Hits:  91%/3627  
      🟩 90;90a;100         Pass: 100%/1   | Total:  1h 05m | Avg:  1h 05m | Max:  1h 05m | Hits:  73%/1209  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 18h 14m | Avg: 54m 44s | Max:  1h 17m | Hits:  66%/23419 
      🟩 20                 Pass: 100%/25  | Total: 17h 56m | Avg: 43m 03s | Max:  1h 18m | Hits:  80%/29802 
    
  • 🟩 thrust: Pass: 100%/45 | Total: 18h 11m | Avg: 24m 14s | Max: 56m 16s | Hits: 84%/80136

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 31m 54s | Avg: 15m 57s | Max: 20m 43s | Hits:  92%/3564  
    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total: 17h 30m | Avg: 24m 25s | Max: 56m 16s | Hits:  84%/76573 
      🟩 arm64              Pass: 100%/2   | Total: 40m 58s | Avg: 20m 29s | Max: 21m 53s | Hits:  87%/3563  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  2h 15m | Avg: 27m 09s | Max: 50m 53s | Hits:  81%/8901  
      🟩 12.5               Pass: 100%/2   | Total:  1h 20m | Avg: 40m 27s | Max: 41m 08s | Hits:  79%/3562  
      🟩 12.8               Pass: 100%/38  | Total: 14h 34m | Avg: 23m 00s | Max: 56m 16s | Hits:  85%/67673 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 33m 30s | Avg: 16m 45s | Max: 17m 02s | Hits:  88%/3562  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  2h 15m | Avg: 27m 09s | Max: 50m 53s | Hits:  81%/8901  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 20m | Avg: 40m 27s | Max: 41m 08s | Hits:  79%/3562  
      🟩 nvcc12.8           Pass: 100%/36  | Total: 14h 00m | Avg: 23m 21s | Max: 56m 16s | Hits:  85%/64111 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 33m 30s | Avg: 16m 45s | Max: 17m 02s | Hits:  88%/3562  
      🟩 nvcc               Pass: 100%/43  | Total: 17h 37m | Avg: 24m 35s | Max: 56m 16s | Hits:  84%/76574 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  1h 29m | Avg: 22m 16s | Max: 24m 12s | Hits:  87%/7124  
      🟩 Clang15            Pass: 100%/2   | Total: 42m 49s | Avg: 21m 24s | Max: 21m 47s | Hits:  86%/3562  
      🟩 Clang16            Pass: 100%/2   | Total: 43m 11s | Avg: 21m 35s | Max: 21m 45s | Hits:  86%/3562  
      🟩 Clang17            Pass: 100%/2   | Total: 45m 36s | Avg: 22m 48s | Max: 23m 33s | Hits:  86%/3562  
      🟩 Clang18            Pass: 100%/7   | Total:  1h 58m | Avg: 16m 54s | Max: 25m 13s | Hits:  90%/12467 
      🟩 GCC7               Pass: 100%/2   | Total: 45m 03s | Avg: 22m 31s | Max: 23m 25s | Hits:  86%/3564  
      🟩 GCC8               Pass: 100%/1   | Total: 26m 18s | Avg: 26m 18s | Max: 26m 18s | Hits:  84%/1782  
      🟩 GCC9               Pass: 100%/2   | Total: 47m 16s | Avg: 23m 38s | Max: 24m 49s | Hits:  86%/3564  
      🟩 GCC10              Pass: 100%/2   | Total: 44m 42s | Avg: 22m 21s | Max: 22m 27s | Hits:  86%/3564  
      🟩 GCC11              Pass: 100%/2   | Total: 48m 51s | Avg: 24m 25s | Max: 25m 08s | Hits:  86%/3564  
      🟩 GCC12              Pass: 100%/2   | Total: 51m 43s | Avg: 25m 51s | Max: 25m 55s | Hits:  85%/3564  
      🟩 GCC13              Pass: 100%/10  | Total:  2h 58m | Avg: 17m 52s | Max: 28m 26s | Hits:  91%/17820 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 37m | Avg: 48m 43s | Max: 50m 53s | Hits:  55%/3550  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  2h 11m | Avg: 43m 43s | Max: 56m 16s | Hits:  60%/5325  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 20m | Avg: 40m 27s | Max: 41m 08s | Hits:  79%/3562  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  5h 39m | Avg: 19m 56s | Max: 25m 13s | Hits:  88%/30277 
      🟩 GCC                Pass: 100%/21  | Total:  7h 22m | Avg: 21m 04s | Max: 28m 26s | Hits:  88%/37422 
      🟩 MSVC               Pass: 100%/5   | Total:  3h 48m | Avg: 45m 43s | Max: 56m 16s | Hits:  58%/8875  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 20m | Avg: 40m 27s | Max: 41m 08s | Hits:  79%/3562  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 24m 32s | Avg: 12m 16s | Max: 13m 18s | Hits:  93%/3564  
      🟩 rtx2080            Pass: 100%/33  | Total: 14h 19m | Avg: 26m 02s | Max: 50m 53s | Hits:  83%/58769 
      🟩 rtx4090            Pass: 100%/10  | Total:  3h 27m | Avg: 20m 44s | Max: 56m 16s | Hits:  87%/17803 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total: 16h 43m | Avg: 26m 23s | Max: 56m 16s | Hits:  82%/67671 
      🟩 TestCPU            Pass: 100%/3   | Total: 44m 09s | Avg: 14m 43s | Max: 28m 54s | Hits:  90%/5338  
      🟩 TestGPU            Pass: 100%/4   | Total: 43m 51s | Avg: 10m 57s | Max: 11m 18s | Hits:  99%/7127  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 24m 32s | Avg: 12m 16s | Max: 13m 18s | Hits:  93%/3564  
      🟩 90;90a;100         Pass: 100%/1   | Total: 27m 29s | Avg: 27m 29s | Max: 27m 29s | Hits:  85%/1782  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  9h 07m | Avg: 27m 23s | Max: 50m 53s | Hits:  81%/35611 
      🟩 20                 Pass: 100%/23  | Total:  8h 31m | Avg: 22m 14s | Max: 56m 16s | Hits:  87%/40961 
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 13m 40s | Avg: 6m 50s | Max: 11m 07s | Hits: 98%/296

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 13m 40s | Avg:  6m 50s | Max: 11m 07s | Hits:  98%/296   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 13m 40s | Avg:  6m 50s | Max: 11m 07s | Hits:  98%/296   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 13m 40s | Avg:  6m 50s | Max: 11m 07s | Hits:  98%/296   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 13m 40s | Avg:  6m 50s | Max: 11m 07s | Hits:  98%/296   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 13m 40s | Avg:  6m 50s | Max: 11m 07s | Hits:  98%/296   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 13m 40s | Avg:  6m 50s | Max: 11m 07s | Hits:  98%/296   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 13m 40s | Avg:  6m 50s | Max: 11m 07s | Hits:  98%/296   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 33s | Avg:  2m 33s | Max:  2m 33s | Hits:  97%/148   
      🟩 Test               Pass: 100%/1   | Total: 11m 07s | Avg: 11m 07s | Max: 11m 07s | Hits:  98%/148   
    
  • 🟩 python: Pass: 100%/1 | Total: 28m 48s | Avg: 28m 48s | Max: 28m 48s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 28m 48s | Avg: 28m 48s | Max: 28m 48s
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total: 28m 48s | Avg: 28m 48s | Max: 28m 48s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total: 28m 48s | Avg: 28m 48s | Max: 28m 48s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 28m 48s | Avg: 28m 48s | Max: 28m 48s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 28m 48s | Avg: 28m 48s | Max: 28m 48s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 28m 48s | Avg: 28m 48s | Max: 28m 48s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total: 28m 48s | Avg: 28m 48s | Max: 28m 48s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 28m 48s | Avg: 28m 48s | Max: 28m 48s
    

👃 Inspect Changes

Modifications in project?

    Project
    CCCL Infrastructure
    libcu++
+/- CUB
    Thrust
    CUDA Experimental
    python
+/- CCCL C Parallel Library
    Catch2Helper

Modifications in project or dependencies?

    Project
    CCCL Infrastructure
    libcu++
+/- CUB
+/- Thrust
    CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 93)

# Runner
66 linux-amd64-cpu16
9 windows-amd64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
4 linux-arm64-cpu16
3 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1
2 linux-amd64-gpu-rtx2080-latest-1

@NaderAlAwar
Contributor Author

/ok to test

Contributor

🟩 CI finished in 1h 13m: Pass: 100%/93 | Total: 1d 18h | Avg: 27m 25s | Max: 1h 05m | Hits: 91%/133929
  • 🟩 cub: Pass: 100%/45 | Total: 1d 06h | Avg: 40m 23s | Max: 1h 05m | Hits: 89%/53485

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  1d 04h | Avg: 40m 01s | Max:  1h 05m | Hits:  89%/51055 
      🟩 arm64              Pass: 100%/2   | Total:  1h 36m | Avg: 48m 19s | Max: 48m 51s | Hits:  94%/2430  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 36m | Avg: 43m 13s | Max: 54m 30s | Hits:  80%/5908  
      🟩 12.5               Pass: 100%/2   | Total:  1h 32m | Avg: 46m 05s | Max: 47m 22s | Hits:  93%/2248  
      🟩 12.8               Pass: 100%/38  | Total:  1d 01h | Avg: 39m 43s | Max:  1h 05m | Hits:  90%/45329 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 48m | Avg: 54m 28s | Max: 56m 20s | Hits:  95%/2100  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 36m | Avg: 43m 13s | Max: 54m 30s | Hits:  80%/5908  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 32m | Avg: 46m 05s | Max: 47m 22s | Hits:  93%/2248  
      🟩 nvcc12.8           Pass: 100%/36  | Total: 23h 20m | Avg: 38m 54s | Max:  1h 05m | Hits:  90%/43229 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 48m | Avg: 54m 28s | Max: 56m 20s | Hits:  95%/2100  
      🟩 nvcc               Pass: 100%/43  | Total:  1d 04h | Avg: 39m 44s | Max:  1h 05m | Hits:  89%/51385 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 47m | Avg: 41m 57s | Max: 43m 24s | Hits:  94%/4868  
      🟩 Clang15            Pass: 100%/2   | Total:  1h 25m | Avg: 42m 52s | Max: 44m 21s | Hits:  94%/2430  
      🟩 Clang16            Pass: 100%/2   | Total:  1h 22m | Avg: 41m 23s | Max: 42m 52s | Hits:  94%/2430  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 20m | Avg: 40m 21s | Max: 40m 56s | Hits:  94%/2430  
      🟩 Clang18            Pass: 100%/7   | Total:  4h 39m | Avg: 39m 54s | Max: 56m 20s | Hits:  96%/8175  
      🟩 GCC7               Pass: 100%/2   | Total:  1h 21m | Avg: 40m 34s | Max: 40m 51s | Hits:  94%/2434  
      🟩 GCC8               Pass: 100%/1   | Total: 41m 41s | Avg: 41m 41s | Max: 41m 41s | Hits:  94%/1217  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 20m | Avg: 40m 05s | Max: 40m 38s | Hits:  94%/2434  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 23m | Avg: 41m 31s | Max: 42m 10s | Hits:  94%/2434  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 22m | Avg: 41m 24s | Max: 41m 33s | Hits:  94%/2430  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 23m | Avg: 41m 47s | Max: 42m 36s | Hits:  94%/2430  
      🟩 GCC13              Pass: 100%/11  | Total:  5h 30m | Avg: 30m 04s | Max: 50m 28s | Hits:  97%/13365 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 59m | Avg: 59m 45s | Max:  1h 05m | Hits:  15%/2080  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 04m | Hits:  15%/2080  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 32m | Avg: 46m 05s | Max: 47m 22s | Hits:  93%/2248  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total: 11h 36m | Avg: 40m 57s | Max: 56m 20s | Hits:  95%/20333 
      🟩 GCC                Pass: 100%/22  | Total: 13h 03m | Avg: 35m 36s | Max: 50m 28s | Hits:  96%/26744 
      🟩 MSVC               Pass: 100%/4   | Total:  4h 06m | Avg:  1h 01m | Max:  1h 05m | Hits:  15%/4160  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 32m | Avg: 46m 05s | Max: 47m 22s | Hits:  93%/2248  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total:  1h 03m | Avg: 21m 09s | Max: 23m 35s | Hits:  97%/3645  
      🟩 rtx2080            Pass: 100%/34  | Total:  1d 01h | Avg: 45m 29s | Max:  1h 05m | Hits:  86%/40120 
      🟩 rtxa6000           Pass: 100%/8   | Total:  3h 27m | Avg: 25m 56s | Max: 44m 19s | Hits:  98%/9720  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  1d 03h | Avg: 44m 31s | Max:  1h 05m | Hits:  87%/43765 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 21m 19s | Avg: 21m 19s | Max: 21m 19s | Hits:  99%/1215  
      🟩 GraphCapture       Pass: 100%/1   | Total: 16m 35s | Avg: 16m 35s | Max: 16m 35s | Hits:  99%/1215  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 08m | Avg: 22m 56s | Max: 23m 31s | Hits:  99%/3645  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 03m | Avg: 21m 14s | Max: 23m 35s | Hits:  99%/3645  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 03m | Avg: 21m 09s | Max: 23m 35s | Hits:  97%/3645  
      🟩 90;90a;100         Pass: 100%/1   | Total: 50m 28s | Avg: 50m 28s | Max: 50m 28s | Hits:  94%/1215  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 14h 56m | Avg: 44m 48s | Max:  1h 05m | Hits:  84%/23535 
      🟩 20                 Pass: 100%/25  | Total: 15h 21m | Avg: 36m 52s | Max:  1h 04m | Hits:  93%/29950 
    
  • 🟩 thrust: Pass: 100%/45 | Total: 11h 07m | Avg: 14m 49s | Max: 35m 14s | Hits: 92%/80136

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 22m 29s | Avg: 11m 14s | Max: 11m 18s | Hits:  97%/3564  
    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total: 10h 44m | Avg: 14m 59s | Max: 35m 14s | Hits:  92%/76573 
      🟩 arm64              Pass: 100%/2   | Total: 22m 45s | Avg: 11m 22s | Max: 12m 03s | Hits:  94%/3563  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  1h 19m | Avg: 15m 54s | Max: 30m 44s | Hits:  89%/8901  
      🟩 12.5               Pass: 100%/2   | Total: 48m 22s | Avg: 24m 11s | Max: 25m 10s | Hits:  93%/3562  
      🟩 12.8               Pass: 100%/38  | Total:  8h 59m | Avg: 14m 11s | Max: 35m 14s | Hits:  92%/67673 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 22m 30s | Avg: 11m 15s | Max: 11m 15s | Hits:  94%/3562  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  1h 19m | Avg: 15m 54s | Max: 30m 44s | Hits:  89%/8901  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 48m 22s | Avg: 24m 11s | Max: 25m 10s | Hits:  93%/3562  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  8h 37m | Avg: 14m 21s | Max: 35m 14s | Hits:  92%/64111 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 22m 30s | Avg: 11m 15s | Max: 11m 15s | Hits:  94%/3562  
      🟩 nvcc               Pass: 100%/43  | Total: 10h 44m | Avg: 14m 59s | Max: 35m 14s | Hits:  92%/76574 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 49m 03s | Avg: 12m 15s | Max: 12m 45s | Hits:  94%/7124  
      🟩 Clang15            Pass: 100%/2   | Total: 25m 53s | Avg: 12m 56s | Max: 13m 38s | Hits:  94%/3562  
      🟩 Clang16            Pass: 100%/2   | Total: 25m 43s | Avg: 12m 51s | Max: 13m 19s | Hits:  94%/3562  
      🟩 Clang17            Pass: 100%/2   | Total: 24m 44s | Avg: 12m 22s | Max: 12m 27s | Hits:  94%/3562  
      🟩 Clang18            Pass: 100%/7   | Total:  1h 14m | Avg: 10m 41s | Max: 12m 16s | Hits:  96%/12467 
      🟩 GCC7               Pass: 100%/2   | Total: 24m 27s | Avg: 12m 13s | Max: 12m 39s | Hits:  94%/3564  
      🟩 GCC8               Pass: 100%/1   | Total: 13m 21s | Avg: 13m 21s | Max: 13m 21s | Hits:  94%/1782  
      🟩 GCC9               Pass: 100%/2   | Total: 26m 47s | Avg: 13m 23s | Max: 13m 36s | Hits:  94%/3564  
      🟩 GCC10              Pass: 100%/2   | Total: 25m 03s | Avg: 12m 31s | Max: 12m 40s | Hits:  94%/3564  
      🟩 GCC11              Pass: 100%/2   | Total: 25m 09s | Avg: 12m 34s | Max: 12m 40s | Hits:  94%/3564  
      🟩 GCC12              Pass: 100%/2   | Total: 26m 16s | Avg: 13m 08s | Max: 13m 14s | Hits:  94%/3564  
      🟩 GCC13              Pass: 100%/10  | Total:  1h 53m | Avg: 11m 22s | Max: 14m 01s | Hits:  96%/17820 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 01m | Avg: 30m 47s | Max: 30m 50s | Hits:  66%/3550  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  1h 42m | Avg: 34m 11s | Max: 35m 14s | Hits:  67%/5325  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 48m 22s | Avg: 24m 11s | Max: 25m 10s | Hits:  93%/3562  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  3h 20m | Avg: 11m 46s | Max: 13m 38s | Hits:  95%/30277 
      🟩 GCC                Pass: 100%/21  | Total:  4h 14m | Avg: 12m 08s | Max: 14m 01s | Hits:  95%/37422 
      🟩 MSVC               Pass: 100%/5   | Total:  2h 44m | Avg: 32m 49s | Max: 35m 14s | Hits:  67%/8875  
      🟩 NVHPC              Pass: 100%/2   | Total: 48m 22s | Avg: 24m 11s | Max: 25m 10s | Hits:  93%/3562  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 19m 18s | Avg:  9m 39s | Max: 11m 34s | Hits:  97%/3564  
      🟩 rtx2080            Pass: 100%/33  | Total:  8h 14m | Avg: 14m 59s | Max: 33m 23s | Hits:  92%/58769 
      🟩 rtx4090            Pass: 100%/10  | Total:  2h 33m | Avg: 15m 21s | Max: 35m 14s | Hits:  92%/17803 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total:  9h 33m | Avg: 15m 04s | Max: 33m 58s | Hits:  91%/67671 
      🟩 TestCPU            Pass: 100%/3   | Total: 50m 14s | Avg: 16m 44s | Max: 35m 14s | Hits:  90%/5338  
      🟩 TestGPU            Pass: 100%/4   | Total: 44m 14s | Avg: 11m 03s | Max: 11m 34s | Hits:  99%/7127  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 19m 18s | Avg:  9m 39s | Max: 11m 34s | Hits:  97%/3564  
      🟩 90;90a;100         Pass: 100%/1   | Total: 13m 24s | Avg: 13m 24s | Max: 13m 24s | Hits:  94%/1782  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  5h 21m | Avg: 16m 05s | Max: 33m 23s | Hits:  90%/35611 
      🟩 20                 Pass: 100%/23  | Total:  5h 23m | Avg: 14m 03s | Max: 35m 14s | Hits:  93%/40961 
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 15m 06s | Avg: 7m 33s | Max: 12m 45s | Hits: 98%/308

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 15m 06s | Avg:  7m 33s | Max: 12m 45s | Hits:  98%/308   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 15m 06s | Avg:  7m 33s | Max: 12m 45s | Hits:  98%/308   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 15m 06s | Avg:  7m 33s | Max: 12m 45s | Hits:  98%/308   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 15m 06s | Avg:  7m 33s | Max: 12m 45s | Hits:  98%/308   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 15m 06s | Avg:  7m 33s | Max: 12m 45s | Hits:  98%/308   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 15m 06s | Avg:  7m 33s | Max: 12m 45s | Hits:  98%/308   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 15m 06s | Avg:  7m 33s | Max: 12m 45s | Hits:  98%/308   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 21s | Avg:  2m 21s | Max:  2m 21s | Hits:  98%/154   
      🟩 Test               Pass: 100%/1   | Total: 12m 45s | Avg: 12m 45s | Max: 12m 45s | Hits:  98%/154   
    
  • 🟩 python: Pass: 100%/1 | Total: 50m 42s | Avg: 50m 42s | Max: 50m 42s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 50m 42s | Avg: 50m 42s | Max: 50m 42s
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total: 50m 42s | Avg: 50m 42s | Max: 50m 42s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total: 50m 42s | Avg: 50m 42s | Max: 50m 42s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 50m 42s | Avg: 50m 42s | Max: 50m 42s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 50m 42s | Avg: 50m 42s | Max: 50m 42s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 50m 42s | Avg: 50m 42s | Max: 50m 42s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total: 50m 42s | Avg: 50m 42s | Max: 50m 42s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 50m 42s | Avg: 50m 42s | Max: 50m 42s
    

👃 Inspect Changes

Modifications in project?

    Project
    CCCL Infrastructure
    libcu++
+/- CUB
    Thrust
    CUDA Experimental
    python
+/- CCCL C Parallel Library
    Catch2Helper

Modifications in project or dependencies?

    Project
    CCCL Infrastructure
    libcu++
+/- CUB
+/- Thrust
    CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 93)

# Runner
66 linux-amd64-cpu16
9 windows-amd64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
4 linux-arm64-cpu16
3 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1
2 linux-amd64-gpu-rtx2080-latest-1

Labels
None yet
Projects
Status: In Review
Development

Successfully merging this pull request may close these issues.

[BUG]: Tuning policy will not be selected properly for CUB Merge Sort
1 participant