-
Notifications
You must be signed in to change notification settings - Fork 108
batched - dense: Testing and fixing Serial QR #2342
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
94496d2
to
4bfb715
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
edit - duplicated comment
18fc349
to
b24686b
Compare
2b4e03b
to
f4dfbe9
Compare
cefa092
to
e2771fd
Compare
The serial QR algorithms does not have unit-tests and is failing for non square matrices. See issue kokkos#2328. This first commit fixes the issue with rectangular matrices and adds a basic test for that use case. Next will work on adding a test that exercises the interfaces on multiple matrices of different sizes within a parallel_for. Finally equivalent tests will be added for the square case as well. Fixing unused variable error It looks like the Left NoTranspose ApplyQ is not doing the correct thing. Will have a look at that next. Spliting the tests a bit better, looking at Q and Qt together to detect issues with consistency between the two. Also eventually allows to use GEMM to figure out which one is wrong. Final fixes and adjusting the tolerance Signed-off-by: Luc Berger-Vergiat <[email protected]>
Signed-off-by: Luc Berger-Vergiat <[email protected]>
Signed-off-by: Luc Berger-Vergiat <[email protected]>
Signed-off-by: Luc Berger-Vergiat <[email protected]>
Signed-off-by: Luc Berger-Vergiat <[email protected]>
Signed-off-by: Luc Berger-Vergiat <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I requested small changes. I also have three additional points.
- It seems that
Side::Right
specialization ofKokkosBatched::SerialApplyQ
is not tested. Is it possible to add a test case for this specialization? - Could you also consider to add docstrings in APIs under
KokkosBatched_ApplyQ_Decl.hpp
andKokkosBatched_QR_Decl.hpp
? It would be helpful to generate API reference - Could you also add
static_assertion
to check the template argument is valid, e.g., check ifArgTrans
isNoTranspose
orTranspose
.
Checking that the side, mode and algo are all valid. Cleaning up the test by using create_mirror_view_and_copy when appropriate. Somehow the semantic of create_mirror_view_and_copy is different from the regular deep_copy so need to make a few changes after testing on GPUs... Signed-off-by: Luc Berger-Vergiat <[email protected]>
Signed-off-by: Luc Berger-Vergiat <[email protected]>
Signed-off-by: Luc Berger-Vergiat <[email protected]>
Signed-off-by: Luc Berger-Vergiat <[email protected]>
I addressed 1. and 3. however for the documentation I will delay until we have it setup in the repo for batched algorithms. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM.
As for the documentation, I can contribute if needed.
Let me know if you decided how to implement.
@lucbv some of these changes were incompatible with usage in Trilinos (intrepid2), this is showing in the nightly integration builds:
Should I be able to add a guard on version and include updates to include the Impl namespace to resolve? |
Update in Intrepid2_ProjectionTools.hpp for compatibility changes in kokkos/kokkos-kernels#2342 Signed-off-by: Nathan Ellingwood <[email protected]>
Yeah, simple namespace update was all that was needed, PR up trilinos/Trilinos#13876 |
Update in Intrepid2_ProjectionTools.hpp for compatibility changes in kokkos/kokkos-kernels#2342 Signed-off-by: Nathan Ellingwood <[email protected]> Signed-off-by: Chris Siefert <[email protected]>
Update in Intrepid2_ProjectionTools.hpp for compatibility changes in kokkos/kokkos-kernels#2342 Signed-off-by: Nathan Ellingwood <[email protected]>
Update in Intrepid2_ProjectionTools.hpp for compatibility changes in kokkos/kokkos-kernels#2342 Signed-off-by: Nathan Ellingwood <[email protected]>
* deprecate `__KOKKOSBATCHED_PROMOTION__` (#2392) * deprecate __KOKKOSBATCHED_PROMOTION__ Signed-off-by: Carl Pearson <[email protected]> * __KOKKOSBATCHED_PROMOTION___DEPRECATED_MACRO -> KOKKOSBATCHED_IMPL_PROMOTION Signed-off-by: Carl Pearson <[email protected]> --------- Signed-off-by: Carl Pearson <[email protected]> * Reduce duplicated code in trsv (#2388) * Reduce duplicated code in trsv A slightly different for loop can avoid having to separate out the last iteration. * Fix inner loops underflow * Change to a more-readable loop Signed-off-by: James Foucar <[email protected]> * Implement batched serial laswp (#2395) * Implement batched serial laswp Signed-off-by: Yuuichi Asahi <[email protected]> * fix: integer types in laswp test Signed-off-by: Yuuichi Asahi <[email protected]> * change loop order based on base layout Signed-off-by: Yuuichi Asahi <[email protected]> * update serial laswp test based on review Signed-off-by: Yuuichi Asahi <[email protected]> --------- Signed-off-by: Yuuichi Asahi <[email protected]> Co-authored-by: Yuuichi Asahi <[email protected]> * 2392 missed a case where __KOKKOSBATCHED_PROMOTION__ was defined (#2396) Signed-off-by: Carl Pearson <[email protected]> * Rename reserved identifiers (#2373) * clang-tidy file for reserved identifiers Signed-off-by: Carl Pearson <[email protected]> * Batched: don't use reserved names in include guards Signed-off-by: Carl Pearson <[email protected]> * BLAS: don't use reserved names in include guards Signed-off-by: Carl Pearson <[email protected]> --------- Signed-off-by: Carl Pearson <[email protected]> * implement batched serial pbtrs (#2330) * implement batched serial pbtrs Signed-off-by: Yuuichi Asahi <[email protected]> * format Signed-off-by: Yuuichi Asahi <[email protected]> * fix: docstrings for pbtrs Signed-off-by: Yuuichi Asahi <[email protected]> * move implementation details under Impl namespace Signed-off-by: Yuuichi Asahi <[email protected]> * Add missing check for pbtrs Signed-off-by: Yuuichi Asahi <[email protected]> * fix: conflicts Signed-off-by: Yuuichi Asahi <[email protected]> * fix: use EXPECT_NEAR_KK_REL for check Signed-off-by: Yuuichi Asahi <[email protected]> * remove unused variable xm from pbtrs impl Signed-off-by: Yuuichi Asahi <[email protected]> --------- Signed-off-by: Yuuichi Asahi <[email protected]> Co-authored-by: Yuuichi Asahi <[email protected]> * Fixing potential overflow issue in inner product trait (#2397) * Fixing potential overflow issue in inner product trait When result type is double and inputs are floats, one input has to be cast to double so the multiplication operator for double is used instead of the float multiplication operator that could overflow for valid double values. Handle the complex case for mixed input/output fp types Signed-off-by: Luc Berger-Vergiat <[email protected]> * Adding fixes for various integer overflow fixes. Signed-off-by: Luc Berger-Vergiat <[email protected]> --------- Signed-off-by: Luc Berger-Vergiat <[email protected]> * Bump actions/dependency-review-action from 4.3.4 to 4.3.5 (#2400) Bumps [actions/dependency-review-action](https://github.com/actions/dependency-review-action) from 4.3.4 to 4.3.5. - [Release notes](https://github.com/actions/dependency-review-action/releases) - [Commits](https://github.com/actions/dependency-review-action/compare/5a2ce3f5b92ee19cbb1541a4984c76d921601d7c...a6993e2c61fd5dc440b409aa1d6904921c5e1894) --- updated-dependencies: - dependency-name: actions/dependency-review-action dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * Bump github/codeql-action from 3.26.13 to 3.27.0 (#2402) Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.26.13 to 3.27.0. - [Release notes](https://github.com/github/codeql-action/releases) - [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md) - [Commits](https://github.com/github/codeql-action/compare/f779452ac5af1c261dce0346a8f964149f49322b...662472033e021d55d94146f66f6058822b0b39fd) --- updated-dependencies: - dependency-name: github/codeql-action dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * Bump actions/checkout from 4.2.1 to 4.2.2 (#2401) Bumps [actions/checkout](https://github.com/actions/checkout) from 4.2.1 to 4.2.2. - [Release notes](https://github.com/actions/checkout/releases) - [Changelog](https://github.com/actions/checkout/blob/main/CHANGELOG.md) - [Commits](https://github.com/actions/checkout/compare/eef61447b9ff4aafe5dcd4e0bbf5d482be7e7871...11bd71901bbe5b1630ceea73d27597364c9af683) --- updated-dependencies: - dependency-name: actions/checkout dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * Deprecate __KOKKOSBATCHED_ENABLE_INTEL_MKL__ (#2403) In favor of KOKKOSBATCHED_IMPL_ENABLE_INTEL_MKL Signed-off-by: Carl Pearson <[email protected]> * Fix batched serial laswp (#2404) * fix include path of Impl Signed-off-by: Yuuichi Asahi <[email protected]> * improve batched serial laswp tests Signed-off-by: Yuuichi Asahi <[email protected]> * fix comments in Test_Batched_SerialLaswp.hpp Signed-off-by: Yuuichi Asahi <[email protected]> --------- Signed-off-by: Yuuichi Asahi <[email protected]> Co-authored-by: Yuuichi Asahi <[email protected]> * implement batched serial iamax (#2399) * implement batched serial iamax Signed-off-by: Yuuichi Asahi <[email protected]> * Add missing static_assertion in iamax Signed-off-by: Yuuichi Asahi <[email protected]> * fix: CodeQL Signed-off-by: Yuuichi Asahi <[email protected]> * fix: reintroduce RealType in impl_test_batched_iamax Signed-off-by: Yuuichi Asahi <[email protected]> * fix: use view size_type as a return type of iamax Signed-off-by: Yuuichi Asahi <[email protected]> --------- Signed-off-by: Yuuichi Asahi <[email protected]> Co-authored-by: Yuuichi Asahi <[email protected]> * Mult result conversion (#2405) * CodeQL: trying to fix issues with multiplication results conversion This avoids potential overflow when low precision data is multiplied and then store in higher precision variable: size_t = int * int Focusing on issues in the library for now, unit-tests will be fixed later. Signed-off-by: Luc Berger-Vergiat <[email protected]> * Applying clang-format Signed-off-by: Luc Berger-Vergiat <[email protected]> * Switching a few static_cast to size_t for clarity After discussion in the PR, these changes should not result in issues when passed to the view constructors and improve clarity for future maintenance. Signed-off-by: Luc Berger-Vergiat <[email protected]> --------- Signed-off-by: Luc Berger-Vergiat <[email protected]> * Deprecate __KOKKOSBATCHED_ENABLE_INTEL_MKL_BATCHED__ (#2406) In favor of KOKKOSBATCHED_IMPL_ENABLE_INTEL_MKL_BATCHED Signed-off-by: Carl Pearson <[email protected]> * Free allocated `MatrixPrec` (#2407) Let's set a good example in our examples Signed-off-by: Carl Pearson <[email protected]> * Fix potential overflow issue in spiluk (#2409) Just like the previous round of fixes related to multiplication overflowing when result type has wider range, this should get CodeQL to be a little happier. Signed-off-by: Luc Berger-Vergiat <[email protected]> * SerialInverseLU: fix overflow in integer multiplication (#2410) Last one of a series of fixes to clean-up the CodeQL safety issues, after that we should be all clean! Signed-off-by: Luc Berger-Vergiat <[email protected]> * CI: `address` sanitizer and most of `undefined` sanitizer (#2408) * Add address sanitizer and most of undefined sanitizer. Exclude vptr due to Preconditioner visibility. Exclude signed integer overflow because we do this all over the place. Signed-off-by: Carl Pearson <[email protected]> * Reducing ETI scope a lot to improve build size and time This is not a permanent fix, we probably need to set this build on a different platform but should be enough to get one set of results and observe how good/bad we are doing... Signed-off-by: Carl Pearson <[email protected]> * ci: osx-ci -> ubuntu-asan-ubsan-ci Signed-off-by: Carl Pearson <[email protected]> * ci: drop compiler warnings on ci sanitizers build Signed-off-by: Carl Pearson <[email protected]> * ci: Kokkos_DIR -> Kokkos_ROOT Signed-off-by: Carl Pearson <[email protected]> * ci: ditch relative paths and working directories Signed-off-by: Carl Pearson <[email protected]> * ci: drop Kokkos_ENABLE_DEPRECATED_CODE_3 Signed-off-by: Carl Pearson <[email protected]> * ci: fix kokkos kernels source path Signed-off-by: Carl Pearson <[email protected]> * ci: add UBSAN_OPTIONS to get stack trace Signed-off-by: Carl Pearson <[email protected]> --------- Signed-off-by: Carl Pearson <[email protected]> Co-authored-by: Luc Berger <[email protected]> * Bump actions/dependency-review-action from 4.3.5 to 4.4.0 (#2411) Bumps [actions/dependency-review-action](https://github.com/actions/dependency-review-action) from 4.3.5 to 4.4.0. - [Release notes](https://github.com/actions/dependency-review-action/releases) - [Commits](https://github.com/actions/dependency-review-action/compare/a6993e2c61fd5dc440b409aa1d6904921c5e1894...4081bf99e2866ebe428fc0477b69eb4fcda7220a) --- updated-dependencies: - dependency-name: actions/dependency-review-action dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * Bump softprops/action-gh-release from 2.0.8 to 2.0.9 (#2412) Bumps [softprops/action-gh-release](https://github.com/softprops/action-gh-release) from 2.0.8 to 2.0.9. - [Release notes](https://github.com/softprops/action-gh-release/releases) - [Changelog](https://github.com/softprops/action-gh-release/blob/master/CHANGELOG.md) - [Commits](https://github.com/softprops/action-gh-release/compare/c062e08bd532815e2082a85e87e3ef29c3e6d191...e7a8f85e1c67a31e6ed99a94b41bd0b71bbee6b8) --- updated-dependencies: - dependency-name: softprops/action-gh-release dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * ODE - RK: fixing small issues reported by Yaro (#2229) * ODE - RK: fixing small issues reported by Yaro 1. fix integer division to floating point division 2. fix evaluation of max scaled error 3. increase or decrease time step using uniform formula 4. use num_steps instead of max_steps for dt calculation 5. add a time step when using constant dt to avoid issues with round-off errors 6. fixing exponent and moving adaptivity computation out of RKStep 7. adding time step counter 8. adding more tests and keep track of time steps if wanted Signed-off-by: Luc Berger-Vergiat <[email protected]> * RK: fixing variable name after rebase Signed-off-by: Luc Berger-Vergiat <[email protected]> * RK: enabling most methods after fixing test related issues Signed-off-by: Luc Berger-Vergiat <[email protected]> * RK: passing new unit-tests Signed-off-by: Luc Berger-Vergiat <[email protected]> * Applying clang-format Signed-off-by: Luc Berger-Vergiat <[email protected]> * RK: fix bad subview creation Signed-off-by: Luc Berger-Vergiat <[email protected]> * RK: fix bug that computes the inital step size for non-adaptive case This prevents having the user defined time step and leads to wrong results. The rate of convergence tests are now passing! Signed-off-by: Luc Berger-Vergiat <[email protected]> * clang-format... Signed-off-by: Luc Berger-Vergiat <[email protected]> * RK: tweaking the tolerances a bit On GPU the lowest order method (RK1-2) is accumulating a bit more errors than on CPU. Only an issue when comparing values to zero where the absolute tolerance is needed to detect good conv. Signed-off-by: Luc Berger-Vergiat <[email protected]> * Adding reference for some implementation details and heuristic values Signed-off-by: Luc Berger-Vergiat <[email protected]> --------- Signed-off-by: Luc Berger-Vergiat <[email protected]> * remove unused member lower_only (#2415) Signed-off-by: Carl Pearson <[email protected]> * Fix remaining UB sanitizer errors (#2417) * D2 coloring, VBBIT: fix color interval test * Fix UB sanitizer error in graph coarsening Initialize bool member of struct. * Fix some issues with create_reverse_map * replaceSumInto tests: initialize bool atomic_ member Signed-off-by: Brian Kelley <[email protected]> * Use fused gtest v1.11.0 from Kokkos (#2416) * fused gtest v1.11.0 Signed-off-by: Carl Pearson <[email protected]> * gtest: suppress unused-function for intel compilers Signed-off-by: Carl Pearson <[email protected]> * gtest: suppress warnings for EDG-based compilers Signed-off-by: Carl Pearson <[email protected]> * gtest: remove TestNameIs class Signed-off-by: Carl Pearson <[email protected]> * gtest: fix guards for C++20 Signed-off-by: Carl Pearson <[email protected]> * unit_tests: tighten up types in comparisons Signed-off-by: Carl Pearson <[email protected]> --------- Signed-off-by: Carl Pearson <[email protected]> * Two-stage GS: initialize functor bool members (#2420) Signed-off-by: Brian Kelley <[email protected]> * Remove unused KokkosBatched_InverseLU_Serial_Impl.hpp (#2422) It's contents are replicated in KokkosBatched_LU_Serial_Impl.hpp. Signed-off-by: Carl Pearson <[email protected]> * Ode sycl fix (#2423) * ODE: skipping autocatalytic test on SYCL For the time being it is unclear why this particular case leads to a runtime error from the SYCL API? Signed-off-by: Luc Berger-Vergiat <[email protected]> * ODE: formatting Signed-off-by: Luc Berger-Vergiat <[email protected]> * ODE: forgot to check if the SYCL space is enabled in Kokkos Signed-off-by: Luc Berger-Vergiat <[email protected]> --------- Signed-off-by: Luc Berger-Vergiat <[email protected]> * Bump github/codeql-action from 3.27.0 to 3.27.1 (#2425) Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.27.0 to 3.27.1. - [Release notes](https://github.com/github/codeql-action/releases) - [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md) - [Commits](https://github.com/github/codeql-action/compare/662472033e021d55d94146f66f6058822b0b39fd...4f3212b61783c3c68e8309a0f18a699764811cda) --- updated-dependencies: - dependency-name: github/codeql-action dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * Bump softprops/action-gh-release from 2.0.9 to 2.1.0 (#2430) Bumps [softprops/action-gh-release](https://github.com/softprops/action-gh-release) from 2.0.9 to 2.1.0. - [Release notes](https://github.com/softprops/action-gh-release/releases) - [Changelog](https://github.com/softprops/action-gh-release/blob/master/CHANGELOG.md) - [Commits](https://github.com/softprops/action-gh-release/compare/e7a8f85e1c67a31e6ed99a94b41bd0b71bbee6b8...01570a1f39cb168c169c802c3bceb9e93fb10974) --- updated-dependencies: - dependency-name: softprops/action-gh-release dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * Bump github/codeql-action from 3.27.1 to 3.27.4 (#2431) Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.27.1 to 3.27.4. - [Release notes](https://github.com/github/codeql-action/releases) - [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md) - [Commits](https://github.com/github/codeql-action/compare/4f3212b61783c3c68e8309a0f18a699764811cda...ea9e4e37992a54ee68a9622e985e60c8e8f12d9f) --- updated-dependencies: - dependency-name: github/codeql-action dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * Enable block support for sptrsv streams (#2429) Signed-off-by: James Foucar <[email protected]> * Bump github/codeql-action from 3.27.4 to 3.27.5 (#2438) Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.27.4 to 3.27.5. - [Release notes](https://github.com/github/codeql-action/releases) - [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md) - [Commits](https://github.com/github/codeql-action/compare/ea9e4e37992a54ee68a9622e985e60c8e8f12d9f...f09c1c0a94de965c15400f5634aa42fac8fb8f88) --- updated-dependencies: - dependency-name: github/codeql-action dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * Bump step-security/harden-runner from 2.10.1 to 2.10.2 (#2437) Bumps [step-security/harden-runner](https://github.com/step-security/harden-runner) from 2.10.1 to 2.10.2. - [Release notes](https://github.com/step-security/harden-runner/releases) - [Commits](https://github.com/step-security/harden-runner/compare/91182cccc01eb5e619899d80e4e971d6181294a7...0080882f6c36860b6ba35c610c98ce87d4e2f26f) --- updated-dependencies: - dependency-name: step-security/harden-runner dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * Bump actions/dependency-review-action from 4.4.0 to 4.5.0 (#2436) Bumps [actions/dependency-review-action](https://github.com/actions/dependency-review-action) from 4.4.0 to 4.5.0. - [Release notes](https://github.com/actions/dependency-review-action/releases) - [Commits](https://github.com/actions/dependency-review-action/compare/4081bf99e2866ebe428fc0477b69eb4fcda7220a...3b139cfc5fae8b618d3eae3675e383bb1769c019) --- updated-dependencies: - dependency-name: actions/dependency-review-action dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * Update version, changelog, checks for 4.5 (#2441) * Update develop version to 4.5.99 Signed-off-by: Nathan Ellingwood <[email protected]> * Update kokkos version for codequl and docs checks Signed-off-by: Nathan Ellingwood <[email protected]> --------- Signed-off-by: Nathan Ellingwood <[email protected]> * workflow: updating kokkos version for CI testing (#2440) Updating our workflows to test develop against 4.5.00. Signed-off-by: Luc Berger-Vergiat <[email protected]> * ci: Disable PR tests which ran on `sogpu01` at Sandia (#2439) * ci: Disable PR_BDW_CLANG1001_THREADS_SERIAL_LEFT_REL This job ran on sogpu01, which is now used for another purpose. Signed-off-by: Carl Pearson <[email protected]> * ci: Disable VOLTA70 jobs These jobs ran on sogpu01, which is now used for another purpose. Signed-off-by: Carl Pearson <[email protected]> --------- Signed-off-by: Carl Pearson <[email protected]> * Sptrsv stream test fixes (#2444) SPTRSV_CUSPARSE algorithm is not supported for streams, so it made no sense to add it to the list of tested algs inside test_sptrsv_streams. The result was tri_solve_streams being a no-op for this algorithm. Somehow, this was not caught until the block algorithm was being used. Also, some minor cleanup of alg enum handling in the sptrsv handle. Use a switch statement with a default to catch unhandled enum vals. print_algorithm should just use the alg string to avoid a duplicated switch/ifelseif chain. StringToSPTRSVAlgorithm was not handling several of the enum vals and also returning strings inconsistent with the strings in return_algorithm_string. Grep revealed no one using this function, so I removed it. I added a check in tri_solve_streams to throw an error if an unsupported alg is used. Signed-off-by: James Foucar <[email protected]> * Test_Sparse_TestUtils_RandCsMat.hpp: tighten up types (#2443) Signed-off-by: Carl Pearson <[email protected]> * Improve batched serial trsm implementation and testing (#2432) * Use Trsv instead of Trsm if X is a rank 1 matrix Signed-off-by: Yuuichi Asahi <[email protected]> * Add missing specialization of Trsm serial implementation Signed-off-by: Yuuichi Asahi <[email protected]> * Add missing tests for serial Trsm Signed-off-by: Yuuichi Asahi <[email protected]> * fix: format Signed-off-by: Yuuichi Asahi <[email protected]> * fix: initialization order based on codeQL Signed-off-by: Yuuichi Asahi <[email protected]> * Allow trsm serial to work on rank 1 matrix Signed-off-by: Yuuichi Asahi <[email protected]> * fix: MKL interface of Serial Trsm Signed-off-by: Yuuichi Asahi <[email protected]> * simplify do_conj logic in trsm serial internal Signed-off-by: Yuuichi Asahi <[email protected]> --------- Signed-off-by: Yuuichi Asahi <[email protected]> Co-authored-by: Yuuichi Asahi <[email protected]> * Bump github/codeql-action from 3.27.5 to 3.27.6 (#2453) Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.27.5 to 3.27.6. - [Release notes](https://github.com/github/codeql-action/releases) - [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md) - [Commits](https://github.com/github/codeql-action/compare/f09c1c0a94de965c15400f5634aa42fac8fb8f88...aa578102511db1f4524ed59b8cc2bae4f6e88195) --- updated-dependencies: - dependency-name: github/codeql-action dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * Adopt StaticCrsGraph and move it to KokkosSparse:: (#2419) * Moved Static{Ccs,Crs}Graph to namespace KokkosSparse and into their own new header files * Deprecate symbols in Kokkos namespace * CRS counterpart used to come from the Containers subpackage in Kokkos "Core" but now is transitioned to Kokkos Kernels. Fix a couple typos Giving up on raising warnings Avoid deprecation warnings when including deprecated header Kokkos_StaticCrsGraph.hpp Adjust test category per review Fix copy/pasta error third -> fourth Update sparse/unit_test/Test_Sparse_StaticCrsGraph.hpp Fixup deprecate code off Fix typo allocat[i]on Rely on ADL for creat_mirror(GRAPH) Update header and namespace triangle counting test Fixing a small issue with the triangle counting performance test for the upcoming move of StaticCrsGraph Sparse - StaticCrsGraph: fixing issues with perf tests and docs A few performance tests had not been updated to use the new namespace for the graph overload of create_mirror. Additionally some of the inline documentation for StaticCrsGraph is wrong... Sparse - StaticCrsGraph: apply clang-format Signed-off-by: Luc Berger-Vergiat <[email protected]> * Improve batched serial trsv (#2452) * Add missing specialization of Serial Trsv Signed-off-by: Yuuichi Asahi <[email protected]> * Improve testing of serial trsv Signed-off-by: Yuuichi Asahi <[email protected]> * format Signed-off-by: Yuuichi Asahi <[email protected]> * fix code QR Signed-off-by: Yuuichi Asahi <[email protected]> * simplify tests for serial trsv Signed-off-by: Yuuichi Asahi <[email protected]> * format Signed-off-by: Yuuichi Asahi <[email protected]> * static-assertion check outside debug mode Signed-off-by: Yuuichi Asahi <[email protected]> * disallow vector_length=8 if AVX512 is unavailable Signed-off-by: Yuuichi Asahi <[email protected]> --------- Signed-off-by: Yuuichi Asahi <[email protected]> Co-authored-by: Yuuichi Asahi <[email protected]> * update changelog for 4.5 (#2454) Signed-off-by: Nathan Ellingwood <[email protected]> * Workaround issue #2455 (#2457) Signed-off-by: Nathan Ellingwood <[email protected]> * fix: alias for serial trsv (#2458) Signed-off-by: Yuuichi Asahi <[email protected]> Co-authored-by: Yuuichi Asahi <[email protected]> * Fix the package version (#2460) Signed-off-by: Thomas Padioleau <[email protected]> * Deprecated remarks to trsv serial impl (#2461) * Add deprecated warnings for older interfaces of serial trsv Signed-off-by: Yuuichi Asahi <[email protected]> * use if constexpr for selective interface of trsv Signed-off-by: Yuuichi Asahi <[email protected]> * format Signed-off-by: Yuuichi Asahi <[email protected]> * format Signed-off-by: Yuuichi Asahi <[email protected]> --------- Signed-off-by: Yuuichi Asahi <[email protected]> Co-authored-by: Yuuichi Asahi <[email protected]> * Bump github/codeql-action from 3.27.6 to 3.27.9 (#2462) Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.27.6 to 3.27.9. - [Release notes](https://github.com/github/codeql-action/releases) - [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md) - [Commits](https://github.com/github/codeql-action/compare/aa578102511db1f4524ed59b8cc2bae4f6e88195...df409f7d9260372bd5f19e5b04e83cb3c43714ae) --- updated-dependencies: - dependency-name: github/codeql-action dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * Bump softprops/action-gh-release from 2.1.0 to 2.2.0 (#2463) Bumps [softprops/action-gh-release](https://github.com/softprops/action-gh-release) from 2.1.0 to 2.2.0. - [Release notes](https://github.com/softprops/action-gh-release/releases) - [Changelog](https://github.com/softprops/action-gh-release/blob/master/CHANGELOG.md) - [Commits](https://github.com/softprops/action-gh-release/compare/01570a1f39cb168c169c802c3bceb9e93fb10974...7b4da11513bf3f43f9999e90eabced41ab8bb048) --- updated-dependencies: - dependency-name: softprops/action-gh-release dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * Fix unit test headers installation (#2334) Signed-off-by: Thomas Padioleau <[email protected]> * Allow dyn-rank-view in serial trsv (#2464) * Allow dyn-rank-view in serial trsv Signed-off-by: Yuuichi Asahi <[email protected]> * suppress shape checks Signed-off-by: Yuuichi Asahi <[email protected]> --------- Signed-off-by: Yuuichi Asahi <[email protected]> Co-authored-by: Yuuichi Asahi <[email protected]> * Bump github/codeql-action from 3.27.9 to 3.28.0 (#2470) Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.27.9 to 3.28.0. - [Release notes](https://github.com/github/codeql-action/releases) - [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md) - [Commits](https://github.com/github/codeql-action/compare/df409f7d9260372bd5f19e5b04e83cb3c43714ae...48ab28a6f5dbc2a99bf1e0131198dd8f1df78169) --- updated-dependencies: - dependency-name: github/codeql-action dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * Bump actions/upload-artifact from 4.4.3 to 4.5.0 (#2471) Bumps [actions/upload-artifact](https://github.com/actions/upload-artifact) from 4.4.3 to 4.5.0. - [Release notes](https://github.com/actions/upload-artifact/releases) - [Commits](https://github.com/actions/upload-artifact/compare/b4b15b8c7c6ac21ea08fcf65892d2ee8f75cf882...6f51ac03b9356f520e9adb1b1b7802705f340c2b) --- updated-dependencies: - dependency-name: actions/upload-artifact dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * Add dot, iamax, nrm1, nrm2 examples (#2467) * Add iamax, nrm1, nrm2 examples Signed-off-by: Carl Pearson <[email protected]> * iamax example: printf -> iostream Signed-off-by: Carl Pearson <[email protected]> * add dot wiki example Signed-off-by: Carl Pearson <[email protected]> --------- Signed-off-by: Carl Pearson <[email protected]> * nrm1: assert vector accessibility (#2465) Signed-off-by: Carl Pearson <[email protected]> * Fixes while documenting (#2466) * BLAS - scal: removing check on assignable memory spaces That check is stricter than required as we will values by reference to perform copies and won't try to reassign pointers. Signed-off-by: Luc Berger-Vergiat <[email protected]> * BLAS - rot: check at runtime that X and Y have same extent Signed-off-by: Luc Berger-Vergiat <[email protected]> * BLAS - rot: improving static assertions Signed-off-by: Luc Berger-Vergiat <[email protected]> * BLAS - rotg: check for non-complex types Signed-off-by: Luc Berger-Vergiat <[email protected]> * BLAS - ger: check that matrix stores values as non-const Signed-off-by: Luc Berger-Vergiat <[email protected]> * BLAS - trmm: check for valid execution space type. Signed-off-by: Luc Berger-Vergiat <[email protected]> * BLAS: fix missing semi-colon at end of static_assert Signed-off-by: Luc Berger-Vergiat <[email protected]> * Applying clang-format Signed-off-by: Luc Berger-Vergiat <[email protected]> * More clang-format Signed-off-by: Luc Berger-Vergiat <[email protected]> * Blas - rot: fixing interface of rot The cosine coefficient is strictly real while the sine coefficient can be real or complex leading to a bug in the current API. This commit should fix that for the native and TPL implementation and the associated unit-test is also fixed accordingly. Signed-off-by: Luc Berger-Vergiat <[email protected]> * BLAS - ROT: fixing types for Host TPL calls to ROT function The types for the arguments c and s are actually different and need to be appropriately propagated through the TPL layers of the library. Signed-off-by: Luc Berger-Vergiat <[email protected]> --------- Signed-off-by: Luc Berger-Vergiat <[email protected]> * BLAS - ROT: calling appropriate cuBLAS function (#2473) After changing the rot interface from using all real to a mix of real and complex coefficients, the cuBLAS call needs to be made to {Z,C}rot instead of {Zd,Cs}rot. Signed-off-by: Luc Berger-Vergiat <[email protected]> * rotg/rot/rotmg/rotm docs and examples (#2472) * rotg/rot/rotmg/rotm docs and examples - Add wiki examples for rotg/rot and rotmg/rotm - Improve inline/doxygen comments for routines Signed-off-by: Brian Kelley <[email protected]> * Small example change, formatting Signed-off-by: Brian Kelley <[email protected]> * Remove unused include Signed-off-by: Brian Kelley <[email protected]> * Suppress clang-format on doxygen section Signed-off-by: Brian Kelley <[email protected]> * rot: add missing include Signed-off-by: Brian Kelley <[email protected]> --------- Signed-off-by: Brian Kelley <[email protected]> * update changelog and readme to 4.5.1 with develop (#2475) Signed-off-by: Nathan Ellingwood <[email protected]> * Improve batched serial gemm (#2469) * Add ConjTrans to Serial Gemm Signed-off-by: Yuuichi Asahi <[email protected]> * improve checks in serial Gemm Signed-off-by: Yuuichi Asahi <[email protected]> * improve selective interface of batched gemm Signed-off-by: Yuuichi Asahi <[email protected]> * check info in serial gemm testing Signed-off-by: Yuuichi Asahi <[email protected]> * fix: op type of serial invoke Signed-off-by: Yuuichi Asahi <[email protected]> * format Signed-off-by: Yuuichi Asahi <[email protected]> * remove the global namespace Signed-off-by: Yuuichi Asahi <[email protected]> --------- Signed-off-by: Yuuichi Asahi <[email protected]> Co-authored-by: Yuuichi Asahi <[email protected]> * Fix -Werror in a couple wiki examples (#2476) Resolves warnings error: unused parameter 'argc' [-Werror=unused-parameter] etc. Signed-off-by: Nathan Ellingwood <[email protected]> * ODE: fix potentially uninitialized warning from CodeQL (#2477) * ODE: fix potentially uninitialized warning from CodeQL Signed-off-by: Luc Berger-Vergiat <[email protected]> * Clang-format Signed-off-by: Luc Berger-Vergiat <[email protected]> --------- Signed-off-by: Luc Berger-Vergiat <[email protected]> * Bump github/codeql-action from 3.28.0 to 3.28.1 (#2479) Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.28.0 to 3.28.1. - [Release notes](https://github.com/github/codeql-action/releases) - [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md) - [Commits](https://github.com/github/codeql-action/compare/48ab28a6f5dbc2a99bf1e0131198dd8f1df78169...b6a472f63d85b9c78a3ac5e89422239fc15e9b3c) --- updated-dependencies: - dependency-name: github/codeql-action dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * Bump softprops/action-gh-release from 2.2.0 to 2.2.1 (#2482) Bumps [softprops/action-gh-release](https://github.com/softprops/action-gh-release) from 2.2.0 to 2.2.1. - [Release notes](https://github.com/softprops/action-gh-release/releases) - [Changelog](https://github.com/softprops/action-gh-release/blob/master/CHANGELOG.md) - [Commits](https://github.com/softprops/action-gh-release/compare/7b4da11513bf3f43f9999e90eabced41ab8bb048...c95fe1489396fe8a9eb87c0abf8aa5b2ef267fda) --- updated-dependencies: - dependency-name: softprops/action-gh-release dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * Bump step-security/harden-runner from 2.10.2 to 2.10.3 (#2481) Bumps [step-security/harden-runner](https://github.com/step-security/harden-runner) from 2.10.2 to 2.10.3. - [Release notes](https://github.com/step-security/harden-runner/releases) - [Commits](https://github.com/step-security/harden-runner/compare/0080882f6c36860b6ba35c610c98ce87d4e2f26f...c95a14d0e5bab51a9f56296a4eb0e416910cd350) --- updated-dependencies: - dependency-name: step-security/harden-runner dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * Bump actions/upload-artifact from 4.5.0 to 4.6.0 (#2480) Bumps [actions/upload-artifact](https://github.com/actions/upload-artifact) from 4.5.0 to 4.6.0. - [Release notes](https://github.com/actions/upload-artifact/releases) - [Commits](https://github.com/actions/upload-artifact/compare/6f51ac03b9356f520e9adb1b1b7802705f340c2b...65c4c4a1ddee5b72f698fdd19549f0f0fb45cf08) --- updated-dependencies: - dependency-name: actions/upload-artifact dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * implement batched serial getrf (#2331) * fix: conflicts Signed-off-by: Yuuichi Asahi <[email protected]> * fix: gpu version Signed-off-by: Yuuichi Asahi <[email protected]> * fix: docstring for getrf Signed-off-by: Yuuichi Asahi <[email protected]> * fix: conflicts Signed-off-by: Yuuichi Asahi <[email protected]> * format Signed-off-by: Yuuichi Asahi <[email protected]> * fix: errors from code style Signed-off-by: Yuuichi Asahi <[email protected]> * fix: format Signed-off-by: Yuuichi Asahi <[email protected]> * fix: conflicts Signed-off-by: Yuuichi Asahi <[email protected]> * Improve implementation details of getrf Signed-off-by: Yuuichi Asahi <[email protected]> * format Signed-off-by: Yuuichi Asahi <[email protected]> * Update create_triangular_matrix function Signed-off-by: Yuuichi Asahi <[email protected]> * Merging Test_Batched_SerialGetrf.hpp and Test_Batched_SerialGetrf_Real.hpp Signed-off-by: Yuuichi Asahi <[email protected]> * remove the global namespace Signed-off-by: Yuuichi Asahi <[email protected]> * Add missing maybe_unused for checkGetrfInput Signed-off-by: Yuuichi Asahi <[email protected]> * Improve docstrings and comments to describe getrf algo Signed-off-by: Yuuichi Asahi <[email protected]> * Add a complicated analytical test based on review Signed-off-by: Yuuichi Asahi <[email protected]> --------- Signed-off-by: Yuuichi Asahi <[email protected]> Co-authored-by: Yuuichi Asahi <[email protected]> * Bump step-security/harden-runner from 2.10.3 to 2.10.4 (#2484) Bumps [step-security/harden-runner](https://github.com/step-security/harden-runner) from 2.10.3 to 2.10.4. - [Release notes](https://github.com/step-security/harden-runner/releases) - [Commits](https://github.com/step-security/harden-runner/compare/c95a14d0e5bab51a9f56296a4eb0e416910cd350...cb605e52c26070c328afc4562f0b4ada7618a84e) --- updated-dependencies: - dependency-name: step-security/harden-runner dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * implement batched serial getrs (#2483) * implement batched serial getrs Signed-off-by: Yuuichi Asahi <[email protected]> * unuse getrf in the getrs analytical test Signed-off-by: Yuuichi Asahi <[email protected]> --------- Signed-off-by: Yuuichi Asahi <[email protected]> Co-authored-by: Yuuichi Asahi <[email protected]> * Bump github/codeql-action from 3.28.1 to 3.28.5 (#2487) Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.28.1 to 3.28.5. - [Release notes](https://github.com/github/codeql-action/releases) - [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md) - [Commits](https://github.com/github/codeql-action/compare/b6a472f63d85b9c78a3ac5e89422239fc15e9b3c...f6091c0113d1dcf9b98e269ee48e8a7e51b7bdd4) --- updated-dependencies: - dependency-name: github/codeql-action dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * fix: getrs serial internal implementations (#2488) * fix: getrs serial internal implementations Signed-off-by: Yuuichi Asahi <[email protected]> * disallow optimization to fix serial getrs Signed-off-by: Yuuichi Asahi <[email protected]> * no unroll in Laswp Signed-off-by: Yuuichi Asahi <[email protected]> --------- Signed-off-by: Yuuichi Asahi <[email protected]> Co-authored-by: Yuuichi Asahi <[email protected]> * Bump github/codeql-action from 3.28.5 to 3.28.8 (#2490) Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.28.5 to 3.28.8. - [Release notes](https://github.com/github/codeql-action/releases) - [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md) - [Commits](https://github.com/github/codeql-action/compare/f6091c0113d1dcf9b98e269ee48e8a7e51b7bdd4...dd746615b3b9d728a6a37ca2045b68ca76d4841a) --- updated-dependencies: - dependency-name: github/codeql-action dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * SPRASE- SpADD: moving spadd out of experimental (#2492) * SPRASE- SpADD: moving spadd out of experimental Signed-off-by: Luc Berger-Vergiat <[email protected]> * applying clang-format Signed-off-by: Luc Berger-Vergiat <[email protected]> --------- Signed-off-by: Luc Berger-Vergiat <[email protected]> * Sparse - spgemm: moving symbolic and numeric phases out of the experimental namespace (#2493) * Sparse - spgemm: moving symbolic and numeric phases out of the experimental namespace Signed-off-by: Luc Berger-Vergiat <[email protected]> * Applying clang-format Signed-off-by: Luc Berger-Vergiat <[email protected]> --------- Signed-off-by: Luc Berger-Vergiat <[email protected]> * implement batched serial ger (#2491) * implement batched serial ger Signed-off-by: Yuuichi Asahi <[email protected]> * fixx: view constructors for x and y in ger test Signed-off-by: Yuuichi Asahi <[email protected]> --------- Signed-off-by: Yuuichi Asahi <[email protected]> Co-authored-by: Yuuichi Asahi <[email protected]> * Sparse - Gauss Seidel: moving GS out of experimental namespace (#2494) * Sparse - Gauss Seidel: moving GS out of experimental namespace Signed-off-by: Luc Berger-Vergiat <[email protected]> * Applying clang-format Signed-off-by: Luc Berger-Vergiat <[email protected]> * Sparse - GS: excluding deprecated functions from doxygen parsing Signed-off-by: Luc Berger-Vergiat <[email protected]> * Removing no longer used namespace alias Signed-off-by: Luc Berger-Vergiat <[email protected]> --------- Signed-off-by: Luc Berger-Vergiat <[email protected]> * Iluk deprecate experimental (#2495) * Sparse - SpILUK: move kernels out of Experimental namespace Will make other commits to remove the usage through out the library. Signed-off-by: Luc Berger-Vergiat <[email protected]> * Sparse - SpILUK: updating unit and performance tests After moving the functions out of experimental we need to update tests so we do not call it ourselves anymore. Signed-off-by: Luc Berger-Vergiat <[email protected]> * Applying clang-format Signed-off-by: Luc Berger-Vergiat <[email protected]> --------- Signed-off-by: Luc Berger-Vergiat <[email protected]> * deprecate KOKKOS_CUSPARSE_SAFE_CALL -> KOKKOSPARSE_IMPL_CUSPARSE_SAFE_CALL (#2426) Signed-off-by: Carl Pearson <[email protected]> * Sparse - SpTRSV: moving kenerls out of Experimental (#2496) * Sparse - SpTRSV: moving kenerls out of Experimental Moving the generic SpTRSV kenrels out of experimental but keeping the supernodal and stream variants in experimental as they are less tested and more recent respectively. Signed-off-by: Luc Berger-Vergiat <[email protected]> * Applying clang-format and removing deprecated call from doxygen Signed-off-by: Luc Berger-Vergiat <[email protected]> --------- Signed-off-by: Luc Berger-Vergiat <[email protected]> * Bump github/codeql-action from 3.28.8 to 3.28.9 (#2498) Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.28.8 to 3.28.9. - [Release notes](https://github.com/github/codeql-action/releases) - [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md) - [Commits](https://github.com/github/codeql-action/compare/dd746615b3b9d728a6a37ca2045b68ca76d4841a...9e8d0789d4a0fa9ceb6b1738f7e269594bdd67f0) --- updated-dependencies: - dependency-name: github/codeql-action dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * Implement batched serial syr (#2497) * Introduce OpReal functor to provide real operator Signed-off-by: Yuuichi Asahi <[email protected]> * implement batched serial syr Signed-off-by: Yuuichi Asahi <[email protected]> * remove unused variable Signed-off-by: Yuuichi Asahi <[email protected]> * fix view constructor Signed-off-by: Yuuichi Asahi <[email protected]> * Add docstring and assertion for ArgUplo and ArgTrans parameters Signed-off-by: Yuuichi Asahi <[email protected]> --------- Signed-off-by: Yuuichi Asahi <[email protected]> Co-authored-by: Yuuichi Asahi <[email protected]> * Adding ctest config for cdash submissions (#2500) Signed-off-by: Luc Berger-Vergiat <[email protected]> * Bump step-security/harden-runner from 2.10.4 to 2.11.0 (#2501) Bumps [step-security/harden-runner](https://github.com/step-security/harden-runner) from 2.10.4 to 2.11.0. - [Release notes](https://github.com/step-security/harden-runner/releases) - [Commits](https://github.com/step-security/harden-runner/compare/cb605e52c26070c328afc4562f0b4ada7618a84e...4d991eb9b905ef189e4c376166672c3f2f230481) --- updated-dependencies: - dependency-name: step-security/harden-runner dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * implement batched serial lacgv (#2499) Signed-off-by: Yuuichi Asahi <[email protected]> Co-authored-by: Yuuichi Asahi <[email protected]> * Refactor serial tbsv implementation details and tests (#2478) * refactor serial tbsv implementation details and tests Signed-off-by: Yuuichi Asahi <[email protected]> * fix: test names in SerialTbsv complex Signed-off-by: Yuuichi Asahi <[email protected]> * use EXPECT_NEAR_KK_REL for comparison Signed-off-by: Yuuichi Asahi <[email protected]> * use EXPECT_NEAR_KK_REL for general tests Signed-off-by: Yuuichi Asahi <[email protected]> * Add docstring and assertion for Arg parameters Signed-off-by: Yuuichi Asahi <[email protected]> --------- Signed-off-by: Yuuichi Asahi <[email protected]> Co-authored-by: Yuuichi Asahi <[email protected]> * Worflow: refactoring SPR to standard cmake and suppress warnings (#2502) The -Wpass-fail warnings are poluting the output of the compiler this does not really tell us anything either so let us suppress these. Addtionally, suppressing linker warnging with -Wno-unused-command-line-argument. The build comes out a lot cleaner now. Signed-off-by: Luc Berger-Vergiat <[email protected]> * refactor serial pbtrf implementation details and tests (#2503) Signed-off-by: Yuuichi Asahi <[email protected]> Co-authored-by: Yuuichi Asahi <[email protected]> * Bump ossf/scorecard-action from 2.4.0 to 2.4.1 (#2508) Bumps [ossf/scorecard-action](https://github.com/ossf/scorecard-action) from 2.4.0 to 2.4.1. - [Release notes](https://github.com/ossf/scorecard-action/releases) - [Changelog](https://github.com/ossf/scorecard-action/blob/main/RELEASE.md) - [Commits](https://github.com/ossf/scorecard-action/compare/62b2cac7ed8198b15735ed49ab1e5cf35480ba46...f49aabe0b5af0936a0987cfb85d86b75731b0186) --- updated-dependencies: - dependency-name: ossf/scorecard-action dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * Bump actions/upload-artifact from 4.6.0 to 4.6.1 (#2509) Bumps [actions/upload-artifact](https://github.com/actions/upload-artifact) from 4.6.0 to 4.6.1. - [Release notes](https://github.com/actions/upload-artifact/releases) - [Commits](https://github.com/actions/upload-artifact/compare/65c4c4a1ddee5b72f698fdd19549f0f0fb45cf08...4cec3d8aa04e39d1a68397de0c4cd6fb9dce8ec1) --- updated-dependencies: - dependency-name: actions/upload-artifact dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * Bump github/codeql-action from 3.28.9 to 3.28.10 (#2507) Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.28.9 to 3.28.10. - [Release notes](https://github.com/github/codeql-action/releases) - [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md) - [Commits](https://github.com/github/codeql-action/compare/9e8d0789d4a0fa9ceb6b1738f7e269594bdd67f0...b56ba49b26e50535fa1e7f7db0f4f7b4bf65d80d) --- updated-dependencies: - dependency-name: github/codeql-action dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * Batched svd stability (#2505) * Batched - SVD: adding a check for numerical stability When computing the eigenvalues of a symmetric 2x2 system we know check for potential cancelation error in the discriminant computation. Added an associated test. Signed-off-by: Luc Berger-Vergiat <[email protected]> * Adding appropriate coverage for layout and scalar types... Signed-off-by: Luc Berger-Vergiat <[email protected]> * clang-format Signed-off-by: Luc Berger-Vergiat <[email protected]> --------- Signed-off-by: Luc Berger-Vergiat <[email protected]> * SYCL - SIMD: preventing AVX512 code to be compiled when targeting GPU (#2510) * SYCL - SIMD: preventing AVX512 code to be compiled when targeting GPU There is a check missing to verify that we are not trying to compile AVX512 intrinsic in GPU code which is obviously not a good idea. Signed-off-by: Luc Berger-Vergiat <[email protected]> * Switching to a more appropriate compiler macro Signed-off-by: Luc Berger-Vergiat <[email protected]> --------- Signed-off-by: Luc Berger-Vergiat <[email protected]> * SYCL: Fix Test_Common_AlignPtrTo.hpp (#2506) * SYCL: Fix Test_Common_AlignPtrTo.hpp Signed-off-by: Daniel Arndt <[email protected]> * Check compiler version Signed-off-by: Daniel Arndt <[email protected]> * Check for ExecutionSpace again Signed-off-by: Daniel Arndt <[email protected]> --------- Signed-off-by: Daniel Arndt <[email protected]> * Same fix as PR #2435 but rebased on develop (#2511) Signed-off-by: Berger Vergiat Luc <[email protected]> Co-authored-by: Berger Vergiat Luc <[email protected]> * docs: updating to the new documentation and refactoring workflow (#2512) * docs: updating to the new documentation and refactoring workflow Updating sphinx version to allow automated build with python 3.13 Update furo version to work with sphinx Okay, checked locally so should work now! --------- Signed-off-by: Luc Berger-Vergiat <[email protected]> * Docs: deploy documentation after PR merge (#2513) Signed-off-by: Luc Berger-Vergiat <[email protected]> * Bump sphinx-design from 0.5.0 to 0.6.1 in /docs (#2515) Bumps [sphinx-design](https://github.com/executablebooks/sphinx-design) from 0.5.0 to 0.6.1. - [Release notes](https://github.com/executablebooks/sphinx-design/releases) - [Changelog](https://github.com/executablebooks/sphinx-design/blob/main/CHANGELOG.md) - [Commits](https://github.com/executablebooks/sphinx-design/compare/v0.5.0...v0.6.1) --- updated-dependencies: - dependency-name: sphinx-design dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * Bump actions/download-artifact from 4.1.8 to 4.1.9 (#2520) Bumps [actions/download-artifact](https://github.com/actions/download-artifact) from 4.1.8 to 4.1.9. - [Release notes](https://github.com/actions/download-artifact/releases) - [Commits](https://github.com/actions/download-artifact/compare/fa0a91b85d4f404e444e00e005971372dc801d16...cc203385981b70ca67e1cc392babf9cc229d5806) --- updated-dependencies: - dependency-name: actions/download-artifact dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * Bump slsa-framework/slsa-github-generator from 2.0.0 to 2.1.0 (#2521) Bumps [slsa-framework/slsa-github-generator](https://github.com/slsa-framework/slsa-github-generator) from 2.0.0 to 2.1.0. - [Release notes](https://github.com/slsa-framework/slsa-github-generator/releases) - [Changelog](https://github.com/slsa-framework/slsa-github-generator/blob/main/CHANGELOG.md) - [Commits](https://github.com/slsa-framework/slsa-github-generator/compare/v2.0.0...v2.1.0) --- updated-dependencies: - dependency-name: slsa-framework/slsa-github-generator dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * Bump sphinx from 6.2.1 to 8.2.3 in /docs (#2516) Bumps [sphinx](https://github.com/sphinx-doc/sphinx) from 6.2.1 to 8.2.3. - [Release notes](https://github.com/sphinx-doc/sphinx/releases) - [Changelog](https://github.com/sphinx-doc/sphinx/blob/master/CHANGES.rst) - [Commits](https://github.com/sphinx-doc/sphinx/compare/v6.2.1...v8.2.3) --- updated-dependencies: - dependency-name: sphinx dependency-type: direct:production update-type: version-update:semver-major ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * Bump furo from 2024.4.27 to 2024.8.6 in /docs (#2519) Bumps [furo](https://github.com/pradyunsg/furo) from 2024.4.27 to 2024.8.6. - [Release notes](https://github.com/pradyunsg/furo/releases) - [Changelog](https://github.com/pradyunsg/furo/blob/main/docs/changelog.md) - [Commits](https://github.com/pradyunsg/furo/compare/2024.04.27...2024.08.06) --- updated-dependencies: - dependency-name: furo dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * Remove old docker images (#2514) Signed-off-by: Luc Berger-Vergiat <[email protected]> * Bump myst-parser from 3.0.1 to 4.0.1 in /docs (#2517) Bumps [myst-parser](https://github.com/executablebooks/MyST-Parser) from 3.0.1 to 4.0.1. - [Release notes](https://github.com/executablebooks/MyST-Parser/releases) - [Changelog](https://github.com/executablebooks/MyST-Parser/blob/master/CHANGELOG.md) - [Commits](https://github.com/executablebooks/MyST-Parser/compare/v3.0.1...v4.0.1) --- updated-dependencies: - dependency-name: myst-parser dependency-type: direct:production update-type: version-update:semver-major ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * Bump sphinx-copybutton from 0.5.0 to 0.5.2 in /docs (#2518) Bumps [sphinx-copybutton](https://github.com/executablebooks/sphinx-copybutton) from 0.5.0 to 0.5.2. - [Release notes](https://github.com/executablebooks/sphinx-copybutton/releases) - [Changelog](https://github.com/executablebooks/sphinx-copybutton/blob/master/CHANGELOG.md) - [Commits](https://github.com/executablebooks/sphinx-copybutton/compare/v0.5.0...v0.5.2) --- updated-dependencies: - dependency-name: sphinx-copybutton dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * Reverting build_requirements to previous versions (#2522) Next week when dependabot asks to update the versions of our documentation tools let's ask it to stop tracking them for the time being as the new suggested versions are failing... Signed-off-by: Luc Berger-Vergiat <[email protected]> * docs: pinning github actions by SHA (#2523) Signed-off-by: Luc Berger-Vergiat <[email protected]> * docs: adding hashes for requirements.txt (#2524) * dosc: adding hashes for requirements.txt Signed-off-by: Luc Berger-Vergiat <[email protected]> * Adding more hashes... Signed-off-by: Luc Berger-Vergiat <[email protected]> --------- Signed-off-by: Luc Berger-Vergiat <[email protected]> * Fixing issue with pragma unroll in laswp (#2525) Signed-off-by: Luc Berger-Vergiat <[email protected]> * fix typos in BUILD.md (#2526) Signed-off-by: Timo Heister <[email protected]> * docs: modifying the workflow to add --require-hases for pip (#2527) This forces pip to check hashes when it installs our dependencies as part of the workflow. Also adding hashes for source and binary versions of the dependencies. Finally modifying the link of the documentation badge in README so it points to the actual documentation instead of pointing to the last action run. Signed-off-by: Luc Berger-Vergiat <[email protected]> * Bump github/codeql-action from 3.28.10 to 3.28.11 (#2528) Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.28.10 to 3.28.11. - [Release notes](https://github.com/github/codeql-action/releases) - [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md) - [Commits](https://github.com/github/codeql-action/compare/b56ba49b26e50535fa1e7f7db0f4f7b4bf65d80d...6bb031afdd8eb862ea3fc1848194185e076637e5) --- updated-dependencies: - dependency-name: github/codeql-action dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * refactor batched serial pbtrs implementation details and tests (#2504) Signed-off-by: Yuuichi Asahi <[email protected]> Co-authored-by: Yuuichi Asahi <[email protected]> * batched - dense: Testing and fixing Serial QR (#2342) * batched - dense: Testing and fixing Serial QR The serial QR algorithms does not have unit-tests and is failing for non square matrices. See issue #2328. This first commit fixes the issue with rectangular matrices and adds a basic test for that use case. Next will work on adding a test that exercises the interfaces on multiple matrices of different sizes within a parallel_for. Finally equivalent tests will be added for the square case as well. Fixing unused variable error It looks like the Left NoTranspose ApplyQ is not doing the correct thing. Will have a look at that next. Spliting the tests a bit better, looking at Q and Qt together to detect issues with consistency between the two. Also eventually allows to use GEMM to figure out which one is wrong. Final fixes and adjusting the tolerance Signed-off-by: Luc Berger-Vergiat <[email protected]> * Removing the stride for the workspace Signed-off-by: Luc Berger-Vergiat <[email protected]> * Applying clang-format Signed-off-by: Luc Berger-Vergiat <[email protected]> * Adding checks on inputs Signed-off-by: Luc Berger-Vergiat <[email protected]> * More clean-ups Signed-off-by: Luc Berger-Vergiat <[email protected]> * clang-format Signed-off-by: Luc Berger-Vergiat <[email protected]> * Batched - QR: adding static assertion on struct templates Checking that the side, mode and algo are all valid. Cleaning up the test by using create_mirror_view_and_copy when appropriate. Somehow the semantic of create_mirror_view_and_copy is different from the regular deep_copy so need to make a few changes after testing on GPUs... Signed-off-by: Luc Berger-Vergiat <[email protected]> * Batched - QR: adding check for apply Q on the right side. Signed-off-by: Luc Berger-Vergiat <[email protected]> * clang-format Signed-off-by: Luc Berger-Vergiat <[email protected]> * clang-format Signed-off-by: Luc Berger-Vergiat <[email protected]> --------- Signed-off-by: Luc Berger-Vergiat <[email protected]> * deprecate __KOKKOSBATCHED_ENABLE_LAPACKE__ (#2450) In favor of KOKKOSBATCHED_IMPL_ENABLE_LAPACKE Signed-off-by: Carl Pearson <[email protected]> * don't run clang-tidy on google benchmark code (#2533) Signed-off-by: Carl Pearson <[email protected]> * Remove use of some deprecated macros (#2532) Replaces ``` \#ifdef KOKKOS_ACTIVE_EXECUTION_MEMORY_SPACE_SYCL x \#else y \#endif ``` With ``` \#ifdef KOKKOS_ENABLE_SYCL KOKKOS_IF_ON_DEVICE((x)) KOKKOS_IF_ON_HOST((y)) \#else y \#endif ``` Signed-off-by: Carl Pearson <[email protected]> * Remove some use of some deprecated macros (#2531) * KOKKOS_ACTIVE_EXECUTION_MEMORY_SPACE_SYCL * KOKKOS_ACTIVE_EXECUTION_MEMORY_SPACE_HOST These uses are guarded by KOKKOS_ENABLE_SYCL, so the removal is relatively straightforward. Other sites may require some rethinking. Signed-off-by: Carl Pearson <[email protected]> * Sparse - sptrsv: Adjusting the perf test CUDA guard see issue #2428 (#2535) The csrsv2 functions actually got removed from CUDA 12 as it was a major release. These are still available up to CUDA 11.8 though. Signed-off-by: Luc Berger-Vergiat <[email protected]> * Implement batched serial gbtrf (#2489) * fix: conflicts Signed-off-by: Yuuichi Asahi <[email protected]> * improve gbtrf unit-test to deal with non-rectangular cases Signed-off-by: Yuuichi Asahi <[email protected]> * improve gbtrf unit-test Signed-off-by: Yuuichi Asahi <[email protected]> * fix: errors from codeQL Signed-off-by: Yuuichi Asahi <[email protected]> * remove unused View2DType Signed-off-by: Yuuichi Asahi <[email protected]> * use ger internal to simplify the gbtrf implementation details Signed-off-by: Yuuichi Asahi <[email protected]> * Add docstring and assertion for ArgAlgo parameter in gbtrf Signed-off-by: Yuuichi Asahi <[email protected]> * format Test_Batched_Dense.hpp Signed-off-by: Yuuichi Asahi <[email protected]> * fix check function for gbtrf Signed-off-by: Yuuichi Asahi <[email protected]> * rename full matrix to dense matrix Signed-off-by: Yuuichi Asahi <[email protected]> --------- Signed-off-by: Yuuichi Asahi <[email protected]> Co-authored-by: Yuuichi Asahi <[email protected]> * Add nightly testing on Frontier (#2536) * Remove more reserved identifiers (#2534) * __A -> A Signed-off-by: Carl Pearson <[email protected]> * __B -> B_ Signed-off-by: Carl Pearson <[email protected]> * __C -> C_ Signed-off-by: Carl Pearson <[email protected]> * Rename _-prefixed include guards Signed-off-by: Carl Pearson <[email protected]> --------- Signed-off-by: Carl Pearson <[email protected]> * std::filesystem library for gcc < 9.1 and Clang < 9.0 (#2545) Signed-off-by: Carl Pearson <[email protected]> * Rework dot performance tests (#2544) * cleanup KokkosBlas_dot Signed-off-by: Carl Pearson <[email protected]> * format perf_test/blas/blas1/KokkosBlas_dot_perf_test_benchmark.cpp Signed-off-by: Carl Pearson <[email protected]> * clean up KokkosBlas_dot_mv Signed-off-by: Carl Pearson <[email protected]> * KokkosBlas_dot: fix FLOP count Signed-off-by: Carl Pearson <[email protected]> * KokkosBlas_dot_mv: fix FLOP count Signed-off-by: Carl Pearson <[email protected]> * clean up KokkosBlas_team_dot Signed-off-by: Carl Pearson <[email protected]> --------- Signed-off-by: Carl Pearson <[email protected]> * update develop version to 4.6.99 (#2540) Signed-off-by: Nathan Ellingwood <[email protected]> * Remove `KokkosBlas_dot_mv_perf_test` (#2548) * Add DefaultHostExecutionSpace dot perf_test Signed-off-by: Carl Pearson <[email protected]> * Remove KokkosBlas_dot_mv_perf_test Duplicates functionality in benchmark-based perf test Signed-off-by: Carl Pearson <[email protected]> --------- Signed-off-by: Carl Pearson <cwpea…
The serial QR algorithms does not have unit-tests and is failing for non square matrices. See issue #2328.
This first commit fixes the issue with rectangular matrices and adds a basic test for that use case.
Next will work on adding a test that exercises the interfaces on multiple matrices of different sizes within a parallel_for. Finally equivalent tests will be added for the square case as well.
Tasks: