Contributor

@lucbv lucbv commented Sep 19, 2024

The serial QR algorithm does not have unit tests and fails for non-square matrices. See issue #2328.
This first commit fixes the issue with rectangular matrices and adds a basic test for that use case.
Next, I will work on adding a test that exercises the interfaces on multiple matrices of different sizes within a parallel_for. Finally, equivalent tests will be added for the square case as well.

Tasks:

  • analytical test on rectangular matrix (SerialQR, ApplyQ, FormQ, Q*Q^t=I)
    • SerialQR
    • ApplyQ
    • FormQ
  • multiple variable size rectangular matrices (same API as above)
    • SerialQR
    • ApplyQ
    • FormQ
  • analytical test on square matrix
    • SerialQR
    • ApplyQ
    • FormQ
  • multiple variable size square matrices
    • SerialQR
    • ApplyQ
    • FormQ
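The analytical check named in the task list (factor a rectangular matrix, form Q, verify Q*Q^t=I) can be illustrated without Kokkos. The following is a minimal plain C++ sketch of that idea, not the test added in this PR: `householder_qr`, `orthogonality_error`, and `reconstruction_error` are hypothetical helper names, and the factorization is a textbook Householder QR rather than the KokkosBatched implementation.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

using Matrix = std::vector<std::vector<double>>;

// Textbook Householder QR of an m x n matrix with m >= n (hypothetical helper).
// On return, R is the m x n upper-triangular factor and Q the full m x m
// orthogonal factor, so that A = Q * R.
void householder_qr(const Matrix& A, Matrix& Q, Matrix& R) {
  assert(!A.empty() && A.size() >= A[0].size());
  const std::size_t m = A.size(), n = A[0].size();
  R = A;
  Q.assign(m, std::vector<double>(m, 0.0));
  for (std::size_t i = 0; i < m; ++i) Q[i][i] = 1.0;

  for (std::size_t k = 0; k < n && k + 1 < m; ++k) {
    // Build the Householder vector v that zeroes R[k+1..m-1][k].
    double norm = 0.0;
    for (std::size_t i = k; i < m; ++i) norm += R[i][k] * R[i][k];
    norm = std::sqrt(norm);
    if (norm == 0.0) continue;  // column already zero, nothing to reflect
    std::vector<double> v(m, 0.0);
    for (std::size_t i = k; i < m; ++i) v[i] = R[i][k];
    v[k] += (R[k][k] >= 0.0 ? norm : -norm);
    double vtv = 0.0;
    for (std::size_t i = k; i < m; ++i) vtv += v[i] * v[i];

    // R <- H * R with H = I - 2 v v^T / (v^T v).
    for (std::size_t j = 0; j < n; ++j) {
      double dot = 0.0;
      for (std::size_t i = k; i < m; ++i) dot += v[i] * R[i][j];
      const double s = 2.0 * dot / vtv;
      for (std::size_t i = k; i < m; ++i) R[i][j] -= s * v[i];
    }
    // Q <- Q * H, so that Q accumulates H_1 * H_2 * ... * H_k.
    for (std::size_t i = 0; i < m; ++i) {
      double dot = 0.0;
      for (std::size_t j = k; j < m; ++j) dot += Q[i][j] * v[j];
      const double s = 2.0 * dot / vtv;
      for (std::size_t j = k; j < m; ++j) Q[i][j] -= s * v[j];
    }
  }
}

// Largest entry of |Q * Q^T - I|.
double orthogonality_error(const Matrix& Q) {
  const std::size_t m = Q.size();
  double err = 0.0;
  for (std::size_t i = 0; i < m; ++i)
    for (std::size_t j = 0; j < m; ++j) {
      double s = 0.0;
      for (std::size_t k = 0; k < m; ++k) s += Q[i][k] * Q[j][k];
      err = std::max(err, std::fabs(s - (i == j ? 1.0 : 0.0)));
    }
  return err;
}

// Largest entry of |Q * R - A|.
double reconstruction_error(const Matrix& A, const Matrix& Q, const Matrix& R) {
  const std::size_t m = A.size(), n = A[0].size();
  double err = 0.0;
  for (std::size_t i = 0; i < m; ++i)
    for (std::size_t j = 0; j < n; ++j) {
      double s = 0.0;
      for (std::size_t k = 0; k < m; ++k) s += Q[i][k] * R[k][j];
      err = std::max(err, std::fabs(s - A[i][j]));
    }
  return err;
}
```

For a 3x2 input, both error measures should come out near machine precision. A test along these lines compares them against a tolerance rather than against exact zero.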

Contributor

@cwpearson cwpearson left a comment

edit - duplicated comment

@lucbv lucbv self-assigned this Oct 15, 2024
@lucbv lucbv force-pushed the batched_QR_fixes branch 2 times, most recently from 18fc349 to b24686b Compare November 6, 2024 20:45
@lucbv lucbv force-pushed the batched_QR_fixes branch 2 times, most recently from 2b4e03b to f4dfbe9 Compare December 4, 2024 20:39
@lucbv lucbv removed the AT: WIP label Dec 4, 2024
The serial QR algorithm does not have unit tests and fails
for non-square matrices. See issue kokkos#2328.
This first commit fixes the issue with rectangular matrices and
adds a basic test for that use case. Next, I will work on adding a
test that exercises the interfaces on multiple matrices of different
sizes within a parallel_for. Finally, equivalent tests will be added
for the square case as well.
Fixing unused variable error
It looks like the Left NoTranspose ApplyQ is not doing the correct
thing. Will have a look at that next.

Splitting the tests a bit better, looking at Q and Qt
together to detect consistency issues between the two.
This also eventually allows using GEMM to figure out which one
is wrong.

Final fixes and adjusting the tolerance

Signed-off-by: Luc Berger-Vergiat <[email protected]>
@lucbv lucbv force-pushed the batched_QR_fixes branch from 6aa3beb to f3460b6 Compare March 7, 2025 22:11
lucbv added 5 commits March 7, 2025 15:32
Signed-off-by: Luc Berger-Vergiat <[email protected]>
Signed-off-by: Luc Berger-Vergiat <[email protected]>
Signed-off-by: Luc Berger-Vergiat <[email protected]>
Signed-off-by: Luc Berger-Vergiat <[email protected]>
Signed-off-by: Luc Berger-Vergiat <[email protected]>
@lucbv lucbv requested review from cwpearson and yasahi-hpc March 8, 2025 00:08
Contributor

@yasahi-hpc yasahi-hpc left a comment

I requested small changes. I also have three additional points.

  1. It seems that the Side::Right specialization of KokkosBatched::SerialApplyQ is not tested. Is it possible to add a test case for this specialization?
  2. Could you also consider adding docstrings to the APIs in KokkosBatched_ApplyQ_Decl.hpp and KokkosBatched_QR_Decl.hpp? They would help generate the API reference.
  3. Could you also add a static_assert to check that the template arguments are valid, e.g., check that ArgTrans is NoTranspose or Transpose?
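Point 3 can be sketched with compile-time traits. The following is a minimal illustration under stated assumptions, not the code merged in this PR: the `demo` namespace, the tag structs, and the `is_valid_side` / `is_valid_trans` traits are stand-ins, and rejecting `ConjTranspose` only mirrors the reviewer's example of accepting NoTranspose or Transpose.

```cpp
#include <type_traits>

namespace demo {

// Stand-in tag types, modeled loosely on the Side and Trans tags mentioned above.
struct Side {
  struct Left {};
  struct Right {};
};
struct Trans {
  struct NoTranspose {};
  struct Transpose {};
  struct ConjTranspose {};
};

// True only for the two side tags.
template <typename T>
struct is_valid_side
    : std::integral_constant<bool, std::is_same<T, Side::Left>::value ||
                                       std::is_same<T, Side::Right>::value> {};

// True only for NoTranspose and Transpose (an assumption of this sketch).
template <typename T>
struct is_valid_trans
    : std::integral_constant<bool, std::is_same<T, Trans::NoTranspose>::value ||
                                       std::is_same<T, Trans::Transpose>::value> {};

// Hypothetical front end: invalid tag combinations fail at compile time
// with a readable message instead of a missing-specialization error.
template <typename ArgSide, typename ArgTrans>
struct ApplyQ {
  static_assert(is_valid_side<ArgSide>::value,
                "ApplyQ: ArgSide must be Side::Left or Side::Right");
  static_assert(is_valid_trans<ArgTrans>::value,
                "ApplyQ: ArgTrans must be Trans::NoTranspose or Trans::Transpose");
  static constexpr int invoke() { return 0; }  // placeholder body
};

}  // namespace demo
```

Instantiating `demo::ApplyQ<demo::Side::Left, demo::Trans::ConjTranspose>` would stop compilation at the second `static_assert`, which is the behavior the review comment asks for.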

Checking that the side, mode and algo are all valid.
Cleaning up the test by using create_mirror_view_and_copy
when appropriate.

Somehow the semantics of create_mirror_view_and_copy differ
from those of the regular deep_copy, so a few changes were needed
after testing on GPUs...

Signed-off-by: Luc Berger-Vergiat <[email protected]>
@lucbv lucbv force-pushed the batched_QR_fixes branch from dd9ecda to a9b516e Compare March 10, 2025 21:11
lucbv added 2 commits March 10, 2025 17:18
Signed-off-by: Luc Berger-Vergiat <[email protected]>
Signed-off-by: Luc Berger-Vergiat <[email protected]>
@lucbv lucbv requested a review from yasahi-hpc March 10, 2025 23:22
Contributor Author

lucbv commented Mar 10, 2025

I addressed 1. and 3. For the documentation, however, I will wait until we have it set up in the repo for batched algorithms.
This should not take too long, as I am in the process of writing the main page for that. We will then be able to add the APIs one by one as we get to them.

Contributor

@yasahi-hpc yasahi-hpc left a comment

LGTM.
As for the documentation, I can contribute if needed.
Let me know once you have decided how to implement it.

@lucbv lucbv merged commit d343008 into kokkos:develop Mar 11, 2025
19 checks passed
Contributor

ndellingwood commented Mar 12, 2025

@lucbv some of these changes were incompatible with usage in Trilinos (intrepid2); this is showing up in the nightly integration builds:

17:53:53 [ 66%] Building CXX object packages/intrepid2/unit-test/Projection/CMakeFiles/Intrepid2_unit-test_Projection_test_project_fields_Serial_DOUBLE.dir/test_project_fields_Serial_DOUBLE.cpp.o
17:53:56 In file included from /home/jenkins/blake-new/workspace/KokkosEco_Trilinos_Blake_OneAPI2023_2_0_ICPX_Serial/Build/packages/intrepid2/unit-test/Projection/test_project_fields_Serial_DOUBLE.cpp:15:
17:53:56 In file included from /home/jenkins/blake-new/workspace/KokkosEco_Trilinos_Blake_OneAPI2023_2_0_ICPX_Serial/Trilinos/packages/intrepid2/unit-test/Projection/test_project_fields.hpp:34:
17:53:56 /home/jenkins/blake-new/workspace/KokkosEco_Trilinos_Blake_OneAPI2023_2_0_ICPX_Serial/Trilinos/packages/intrepid2/src/Projection/Intrepid2_ProjectionTools.hpp:527:9: error: no member named 'SerialQR_Internal' in namespace 'KokkosBatched'; did you mean 'KokkosBatched::Impl::SerialQR_Internal'?
17:53:56         KokkosBatched::SerialQR_Internal::invoke(A0_host.extent(0), A0_host.extent(1),
17:53:56         ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
17:53:56         KokkosBatched::Impl::SerialQR_Internal
17:53:56 /home/jenkins/blake-new/workspace/KokkosEco_Trilinos_Blake_OneAPI2023_2_0_ICPX_Serial/Trilinos/kokkos-kernels/batched/dense/impl/KokkosBatched_QR_Serial_Internal.hpp:34:8: note: 'KokkosBatched::Impl::SerialQR_Internal' declared here
17:53:56 struct SerialQR_Internal {
17:53:56        ^
17:53:56 In file included from /home/jenkins/blake-new/workspace/KokkosEco_Trilinos_Blake_OneAPI2023_2_0_ICPX_Serial/Build/packages/intrepid2/unit-test/Projection/test_project_fields_Serial_DOUBLE.cpp:15:
17:53:56 In file included from /home/jenkins/blake-new/workspace/KokkosEco_Trilinos_Blake_OneAPI2023_2_0_ICPX_Serial/Trilinos/packages/intrepid2/unit-test/Projection/test_project_fields.hpp:34:
17:53:56 /home/jenkins/blake-new/workspace/KokkosEco_Trilinos_Blake_OneAPI2023_2_0_ICPX_Serial/Trilinos/packages/intrepid2/src/Projection/Intrepid2_ProjectionTools.hpp:583:11: error: no member named 'SerialQR_Internal' in namespace 'KokkosBatched'; did you mean 'KokkosBatched::Impl::SerialQR_Internal'?
17:53:56           KokkosBatched::SerialQR_Internal::invoke(A.extent(0), A.extent(1),
17:53:56           ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
17:53:56           KokkosBatched::Impl::SerialQR_Internal
17:53:56 /home/jenkins/blake-new/workspace/KokkosEco_Trilinos_Blake_OneAPI2023_2_0_ICPX_Serial/Trilinos/kokkos-kernels/batched/dense/impl/KokkosBatched_QR_Serial_Internal.hpp:34:8: note: 'KokkosBatched::Impl::SerialQR_Internal' declared here
17:53:56 struct SerialQR_Internal {

I should be able to resolve this by adding a version guard and updating the calls to use the Impl namespace, correct?
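The kind of guard discussed here can be sketched as follows. This is a self-contained illustration only: `demo_lib`, `DEMO_LIB_VERSION`, and the threshold `40600` are hypothetical placeholders, not the actual Kokkos Kernels version macro or the release in which `SerialQR_Internal` moved into `Impl`.

```cpp
#include <cassert>

// Hypothetical version macro; a real build would rely on whatever version
// macro the library itself provides.
#ifndef DEMO_LIB_VERSION
#define DEMO_LIB_VERSION 40600
#endif

namespace demo_lib {
namespace Impl {
// Stand-in for an internal routine that now lives in the Impl namespace.
struct SerialQR_Internal {
  static int invoke(int /*m*/, int /*n*/) { return 0; }  // placeholder body
};
}  // namespace Impl
}  // namespace demo_lib

namespace downstream {

// Select the namespace once, so call sites stay unchanged on both sides
// of the version boundary.
#if DEMO_LIB_VERSION >= 40600
using SerialQR_Internal = demo_lib::Impl::SerialQR_Internal;
#else
using SerialQR_Internal = demo_lib::SerialQR_Internal;
#endif

int factorize(int m, int n) {
  assert(m >= n);
  return SerialQR_Internal::invoke(m, n);
}

}  // namespace downstream
```

Keeping the conditional in a single alias means only one line needs to change when support for the older layout is dropped.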

ndellingwood added a commit to ndellingwood/Trilinos that referenced this pull request Mar 12, 2025
Update in Intrepid2_ProjectionTools.hpp for compatibility changes
in kokkos/kokkos-kernels#2342

Signed-off-by: Nathan Ellingwood <[email protected]>
@ndellingwood
Contributor

Yeah, a simple namespace update was all that was needed. The PR is up: trilinos/Trilinos#13876

csiefer2 pushed a commit to trilinos/Trilinos that referenced this pull request Mar 14, 2025
Update in Intrepid2_ProjectionTools.hpp for compatibility changes
in kokkos/kokkos-kernels#2342

Signed-off-by: Nathan Ellingwood <[email protected]>
Signed-off-by: Chris Siefert <[email protected]>
achauphan pushed a commit to trilinos/Trilinos that referenced this pull request Mar 17, 2025
Update in Intrepid2_ProjectionTools.hpp for compatibility changes
in kokkos/kokkos-kernels#2342

Signed-off-by: Nathan Ellingwood <[email protected]>
fryeguy52 pushed a commit to fryeguy52/Trilinos that referenced this pull request Jul 23, 2025
Update in Intrepid2_ProjectionTools.hpp for compatibility changes
in kokkos/kokkos-kernels#2342

Signed-off-by: Nathan Ellingwood <[email protected]>
lucbv added a commit that referenced this pull request Aug 11, 2025
* deprecate `__KOKKOSBATCHED_PROMOTION__` (#2392)

* deprecate __KOKKOSBATCHED_PROMOTION__

Signed-off-by: Carl Pearson <[email protected]>

* __KOKKOSBATCHED_PROMOTION___DEPRECATED_MACRO -> KOKKOSBATCHED_IMPL_PROMOTION

Signed-off-by: Carl Pearson <[email protected]>

---------

Signed-off-by: Carl Pearson <[email protected]>

* Reduce duplicated code in trsv (#2388)

* Reduce duplicated code in trsv
A slightly different for loop can avoid having to
separate out the last iteration.

* Fix inner loops underflow
* Change to a more-readable loop

Signed-off-by: James Foucar <[email protected]>

* Implement batched serial laswp (#2395)

* Implement batched serial laswp

Signed-off-by: Yuuichi Asahi <[email protected]>

* fix: integer types in laswp test

Signed-off-by: Yuuichi Asahi <[email protected]>

* change loop order based on base layout

Signed-off-by: Yuuichi Asahi <[email protected]>

* update serial laswp test based on review

Signed-off-by: Yuuichi Asahi <[email protected]>

---------

Signed-off-by: Yuuichi Asahi <[email protected]>
Co-authored-by: Yuuichi Asahi <[email protected]>

* 2392 missed a case where __KOKKOSBATCHED_PROMOTION__ was defined (#2396)

Signed-off-by: Carl Pearson <[email protected]>

* Rename reserved identifiers (#2373)

* clang-tidy file for reserved identifiers

Signed-off-by: Carl Pearson <[email protected]>

* Batched: don't use reserved names in include guards

Signed-off-by: Carl Pearson <[email protected]>

* BLAS: don't use reserved names in include guards

Signed-off-by: Carl Pearson <[email protected]>

---------

Signed-off-by: Carl Pearson <[email protected]>

* implement batched serial pbtrs (#2330)

* implement batched serial pbtrs

Signed-off-by: Yuuichi Asahi <[email protected]>

* format

Signed-off-by: Yuuichi Asahi <[email protected]>

* fix: docstrings for pbtrs

Signed-off-by: Yuuichi Asahi <[email protected]>

* move implementation details under Impl namespace

Signed-off-by: Yuuichi Asahi <[email protected]>

* Add missing check for pbtrs

Signed-off-by: Yuuichi Asahi <[email protected]>

* fix: conflicts

Signed-off-by: Yuuichi Asahi <[email protected]>

* fix: use EXPECT_NEAR_KK_REL for check

Signed-off-by: Yuuichi Asahi <[email protected]>

* remove unused variable xm from pbtrs impl

Signed-off-by: Yuuichi Asahi <[email protected]>

---------

Signed-off-by: Yuuichi Asahi <[email protected]>
Co-authored-by: Yuuichi Asahi <[email protected]>

* Fixing potential overflow issue in inner product trait (#2397)

* Fixing potential overflow issue in inner product trait

When the result type is double and the inputs are floats, one input has
to be cast to double so that the double multiplication operator is
used instead of the float one, which can overflow even though the
product is a valid double value.

Handle the complex case for mixed input/output fp types

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* Adding fixes for various integer overflow issues.

Signed-off-by: Luc Berger-Vergiat <[email protected]>

---------

Signed-off-by: Luc Berger-Vergiat <[email protected]>
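The inner product issue described in this entry can be reproduced in a few lines of plain C++. The sketch below is an illustration of the general technique, assuming float inputs and a double result; `dot_float_products` and `dot_double_products` are hypothetical names and do not correspond to the Kokkos Kernels inner product trait.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>

// Accumulates in double, but each product is computed and rounded in float,
// so a large product overflows to +inf before it ever reaches the sum.
inline double dot_float_products(const float* x, const float* y, std::size_t n) {
  assert(x != nullptr && y != nullptr);
  double sum = 0.0;
  for (std::size_t i = 0; i < n; ++i) {
    const float p = x[i] * y[i];  // float multiplication
    sum += p;
  }
  return sum;
}

// Casts one operand to double so the multiplication itself is done in double.
inline double dot_double_products(const float* x, const float* y, std::size_t n) {
  assert(x != nullptr && y != nullptr);
  double sum = 0.0;
  for (std::size_t i = 0; i < n; ++i) {
    sum += static_cast<double>(x[i]) * y[i];  // double multiplication
  }
  return sum;
}
```

With inputs around 1e30f, the product is about 1e60, which is representable in double but far above the largest float, so only the second variant returns a finite value.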

* Bump actions/dependency-review-action from 4.3.4 to 4.3.5 (#2400)

Bumps [actions/dependency-review-action](https://github.com/actions/dependency-review-action) from 4.3.4 to 4.3.5.
- [Release notes](https://github.com/actions/dependency-review-action/releases)
- [Commits](https://github.com/actions/dependency-review-action/compare/5a2ce3f5b92ee19cbb1541a4984c76d921601d7c...a6993e2c61fd5dc440b409aa1d6904921c5e1894)

---
updated-dependencies:
- dependency-name: actions/dependency-review-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump github/codeql-action from 3.26.13 to 3.27.0 (#2402)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.26.13 to 3.27.0.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/f779452ac5af1c261dce0346a8f964149f49322b...662472033e021d55d94146f66f6058822b0b39fd)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump actions/checkout from 4.2.1 to 4.2.2 (#2401)

Bumps [actions/checkout](https://github.com/actions/checkout) from 4.2.1 to 4.2.2.
- [Release notes](https://github.com/actions/checkout/releases)
- [Changelog](https://github.com/actions/checkout/blob/main/CHANGELOG.md)
- [Commits](https://github.com/actions/checkout/compare/eef61447b9ff4aafe5dcd4e0bbf5d482be7e7871...11bd71901bbe5b1630ceea73d27597364c9af683)

---
updated-dependencies:
- dependency-name: actions/checkout
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Deprecate __KOKKOSBATCHED_ENABLE_INTEL_MKL__ (#2403)

In favor of KOKKOSBATCHED_IMPL_ENABLE_INTEL_MKL

Signed-off-by: Carl Pearson <[email protected]>

* Fix batched serial laswp (#2404)

* fix include path of Impl

Signed-off-by: Yuuichi Asahi <[email protected]>

* improve batched serial laswp tests

Signed-off-by: Yuuichi Asahi <[email protected]>

* fix comments in Test_Batched_SerialLaswp.hpp

Signed-off-by: Yuuichi Asahi <[email protected]>

---------

Signed-off-by: Yuuichi Asahi <[email protected]>
Co-authored-by: Yuuichi Asahi <[email protected]>

* implement batched serial iamax (#2399)

* implement batched serial iamax

Signed-off-by: Yuuichi Asahi <[email protected]>

* Add missing static_assertion in iamax

Signed-off-by: Yuuichi Asahi <[email protected]>

* fix: CodeQL

Signed-off-by: Yuuichi Asahi <[email protected]>

* fix: reintroduce RealType in impl_test_batched_iamax

Signed-off-by: Yuuichi Asahi <[email protected]>

* fix: use view size_type as a return type of iamax

Signed-off-by: Yuuichi Asahi <[email protected]>

---------

Signed-off-by: Yuuichi Asahi <[email protected]>
Co-authored-by: Yuuichi Asahi <[email protected]>

* Mult result conversion (#2405)

* CodeQL: trying to fix issues with multiplication results conversion

This avoids potential overflow when low-precision data is multiplied
and the result is then stored in a higher-precision variable: size_t = int * int
Focusing on issues in the library for now; unit tests will be fixed
later.

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* Applying clang-format

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* Switching a few static_cast to size_t for clarity

After discussion in the PR, these changes should not result in issues
when passed to the view constructors and improve clarity for future
maintenance.

Signed-off-by: Luc Berger-Vergiat <[email protected]>

---------

Signed-off-by: Luc Berger-Vergiat <[email protected]>
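The `size_t = int * int` pattern mentioned in this entry is worth spelling out: the multiplication is carried out in `int` and can overflow before the result is widened. A minimal sketch of the cast-before-multiply fix follows, assuming non-negative extents and a 64-bit `std::size_t`; `num_entries` is a hypothetical helper, not a library function.

```cpp
#include <cassert>
#include <cstddef>

// Widen both operands first so the multiplication happens in std::size_t.
// Writing `std::size_t n = rows * cols;` would multiply in int and only
// then convert, which is too late to avoid the overflow.
inline std::size_t num_entries(int rows, int cols) {
  assert(rows >= 0 && cols >= 0);
  return static_cast<std::size_t>(rows) * static_cast<std::size_t>(cols);
}
```

For example, 100000 x 100000 does not fit in a 32-bit `int`, while the widened product is well within the 64-bit range.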

* Deprecate __KOKKOSBATCHED_ENABLE_INTEL_MKL_BATCHED__ (#2406)

In favor of KOKKOSBATCHED_IMPL_ENABLE_INTEL_MKL_BATCHED

Signed-off-by: Carl Pearson <[email protected]>

* Free allocated `MatrixPrec` (#2407)

Let's set a good example in our examples

Signed-off-by: Carl Pearson <[email protected]>

* Fix potential overflow issue in spiluk (#2409)

Just like the previous round of fixes related to multiplication
overflowing when result type has wider range, this should get
CodeQL to be a little happier.

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* SerialInverseLU: fix overflow in integer multiplication (#2410)

Last one of a series of fixes to clean-up the CodeQL
safety issues, after that we should be all clean!

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* CI: `address` sanitizer and most of `undefined` sanitizer (#2408)

* Add address sanitizer and most of undefined sanitizer.

Exclude vptr due to Preconditioner visibility.
Exclude signed integer overflow because we do this all over the place.

Signed-off-by: Carl Pearson <[email protected]>

* Reducing ETI scope a lot to improve build size and time

This is not a permanent fix. We probably need to move this build to a different platform, but it should be enough to get one set of results and see how well we are doing...

Signed-off-by: Carl Pearson <[email protected]>

* ci: osx-ci -> ubuntu-asan-ubsan-ci

Signed-off-by: Carl Pearson <[email protected]>

* ci: drop compiler warnings on ci sanitizers build

Signed-off-by: Carl Pearson <[email protected]>

* ci: Kokkos_DIR -> Kokkos_ROOT

Signed-off-by: Carl Pearson <[email protected]>

* ci: ditch relative paths and working directories

Signed-off-by: Carl Pearson <[email protected]>

* ci: drop Kokkos_ENABLE_DEPRECATED_CODE_3

Signed-off-by: Carl Pearson <[email protected]>

* ci: fix kokkos kernels source path

Signed-off-by: Carl Pearson <[email protected]>

* ci: add UBSAN_OPTIONS to get stack trace

Signed-off-by: Carl Pearson <[email protected]>

---------

Signed-off-by: Carl Pearson <[email protected]>
Co-authored-by: Luc Berger <[email protected]>

* Bump actions/dependency-review-action from 4.3.5 to 4.4.0 (#2411)

Bumps [actions/dependency-review-action](https://github.com/actions/dependency-review-action) from 4.3.5 to 4.4.0.
- [Release notes](https://github.com/actions/dependency-review-action/releases)
- [Commits](https://github.com/actions/dependency-review-action/compare/a6993e2c61fd5dc440b409aa1d6904921c5e1894...4081bf99e2866ebe428fc0477b69eb4fcda7220a)

---
updated-dependencies:
- dependency-name: actions/dependency-review-action
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump softprops/action-gh-release from 2.0.8 to 2.0.9 (#2412)

Bumps [softprops/action-gh-release](https://github.com/softprops/action-gh-release) from 2.0.8 to 2.0.9.
- [Release notes](https://github.com/softprops/action-gh-release/releases)
- [Changelog](https://github.com/softprops/action-gh-release/blob/master/CHANGELOG.md)
- [Commits](https://github.com/softprops/action-gh-release/compare/c062e08bd532815e2082a85e87e3ef29c3e6d191...e7a8f85e1c67a31e6ed99a94b41bd0b71bbee6b8)

---
updated-dependencies:
- dependency-name: softprops/action-gh-release
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* ODE - RK: fixing small issues reported by Yaro (#2229)

* ODE - RK: fixing small issues reported by Yaro

1. fix integer division to floating point division
2. fix evaluation of max scaled error
3. increase or decrease time step using uniform formula
4. use num_steps instead of max_steps for dt calculation
5. add a time step when using constant dt to avoid issues with round-off errors
6. fixing exponent and moving adaptivity computation out of RKStep
7. adding time step counter
8. adding more tests and keep track of time steps if wanted

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* RK: fixing variable name after rebase

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* RK: enabling most methods after fixing test related issues

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* RK: passing new unit-tests

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* Applying clang-format

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* RK: fix bad subview creation

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* RK: fix bug that computes the initial step size for non-adaptive case

This prevents having the user defined time step and leads to
wrong results. The rate of convergence tests are now passing!

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* clang-format...

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* RK: tweaking the tolerances a bit

On GPU the lowest order method (RK1-2) accumulates a bit more
error than on CPU. This is only an issue when comparing values to zero,
where the absolute tolerance is needed to detect good convergence.

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* Adding reference for some implementation details and heuristic values

Signed-off-by: Luc Berger-Vergiat <[email protected]>

---------

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* remove unused member lower_only (#2415)

Signed-off-by: Carl Pearson <[email protected]>

* Fix remaining UB sanitizer errors (#2417)

* D2 coloring, VBBIT: fix color interval test
* Fix UB sanitizer error in graph coarsening
Initialize bool member of struct.

* Fix some issues with create_reverse_map
* replaceSumInto tests: initialize bool atomic_ member

Signed-off-by: Brian Kelley <[email protected]>

* Use fused gtest v1.11.0 from Kokkos (#2416)

* fused gtest v1.11.0

Signed-off-by: Carl Pearson <[email protected]>

* gtest: suppress unused-function for intel compilers

Signed-off-by: Carl Pearson <[email protected]>

* gtest: suppress warnings for EDG-based compilers

Signed-off-by: Carl Pearson <[email protected]>

* gtest: remove TestNameIs class

Signed-off-by: Carl Pearson <[email protected]>

* gtest: fix guards for C++20

Signed-off-by: Carl Pearson <[email protected]>

* unit_tests: tighten up types in comparisons

Signed-off-by: Carl Pearson <[email protected]>

---------

Signed-off-by: Carl Pearson <[email protected]>

* Two-stage GS: initialize functor bool members (#2420)

Signed-off-by: Brian Kelley <[email protected]>

* Remove unused KokkosBatched_InverseLU_Serial_Impl.hpp (#2422)

Its contents are replicated in KokkosBatched_LU_Serial_Impl.hpp.

Signed-off-by: Carl Pearson <[email protected]>

* Ode sycl fix (#2423)

* ODE: skipping autocatalytic test on SYCL

For the time being it is unclear why this particular case
leads to a runtime error from the SYCL API.

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* ODE: formatting

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* ODE: forgot to check if the SYCL space is enabled in Kokkos

Signed-off-by: Luc Berger-Vergiat <[email protected]>

---------

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* Bump github/codeql-action from 3.27.0 to 3.27.1 (#2425)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.27.0 to 3.27.1.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/662472033e021d55d94146f66f6058822b0b39fd...4f3212b61783c3c68e8309a0f18a699764811cda)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump softprops/action-gh-release from 2.0.9 to 2.1.0 (#2430)

Bumps [softprops/action-gh-release](https://github.com/softprops/action-gh-release) from 2.0.9 to 2.1.0.
- [Release notes](https://github.com/softprops/action-gh-release/releases)
- [Changelog](https://github.com/softprops/action-gh-release/blob/master/CHANGELOG.md)
- [Commits](https://github.com/softprops/action-gh-release/compare/e7a8f85e1c67a31e6ed99a94b41bd0b71bbee6b8...01570a1f39cb168c169c802c3bceb9e93fb10974)

---
updated-dependencies:
- dependency-name: softprops/action-gh-release
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump github/codeql-action from 3.27.1 to 3.27.4 (#2431)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.27.1 to 3.27.4.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/4f3212b61783c3c68e8309a0f18a699764811cda...ea9e4e37992a54ee68a9622e985e60c8e8f12d9f)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Enable block support for sptrsv streams (#2429)

Signed-off-by: James Foucar <[email protected]>

* Bump github/codeql-action from 3.27.4 to 3.27.5 (#2438)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.27.4 to 3.27.5.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/ea9e4e37992a54ee68a9622e985e60c8e8f12d9f...f09c1c0a94de965c15400f5634aa42fac8fb8f88)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump step-security/harden-runner from 2.10.1 to 2.10.2 (#2437)

Bumps [step-security/harden-runner](https://github.com/step-security/harden-runner) from 2.10.1 to 2.10.2.
- [Release notes](https://github.com/step-security/harden-runner/releases)
- [Commits](https://github.com/step-security/harden-runner/compare/91182cccc01eb5e619899d80e4e971d6181294a7...0080882f6c36860b6ba35c610c98ce87d4e2f26f)

---
updated-dependencies:
- dependency-name: step-security/harden-runner
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump actions/dependency-review-action from 4.4.0 to 4.5.0 (#2436)

Bumps [actions/dependency-review-action](https://github.com/actions/dependency-review-action) from 4.4.0 to 4.5.0.
- [Release notes](https://github.com/actions/dependency-review-action/releases)
- [Commits](https://github.com/actions/dependency-review-action/compare/4081bf99e2866ebe428fc0477b69eb4fcda7220a...3b139cfc5fae8b618d3eae3675e383bb1769c019)

---
updated-dependencies:
- dependency-name: actions/dependency-review-action
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Update version, changelog, checks for 4.5 (#2441)

* Update develop version to 4.5.99

Signed-off-by: Nathan Ellingwood <[email protected]>

* Update kokkos version for codeql and docs checks

Signed-off-by: Nathan Ellingwood <[email protected]>

---------

Signed-off-by: Nathan Ellingwood <[email protected]>

* workflow: updating kokkos version for CI testing (#2440)

Updating our workflows to test develop against 4.5.00.

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* ci: Disable PR tests which ran on `sogpu01` at Sandia (#2439)

* ci: Disable PR_BDW_CLANG1001_THREADS_SERIAL_LEFT_REL

This job ran on sogpu01, which is now used for another purpose.

Signed-off-by: Carl Pearson <[email protected]>

* ci: Disable VOLTA70 jobs

These jobs ran on sogpu01, which is now used for another purpose.

Signed-off-by: Carl Pearson <[email protected]>

---------

Signed-off-by: Carl Pearson <[email protected]>

* Sptrsv stream test fixes (#2444)

SPTRSV_CUSPARSE algorithm is not supported for streams, so it
made no sense to add it to the list of tested algs inside
test_sptrsv_streams. The result was tri_solve_streams being
a no-op for this algorithm. Somehow, this was not caught until
the block algorithm was being used.

Also, some minor cleanup of alg enum handling in the sptrsv handle.
Use a switch statement with a default to catch unhandled enum vals.
print_algorithm should just use the alg string to avoid a duplicated
switch/ifelseif chain. StringToSPTRSVAlgorithm was not handling
several of the enum vals and also returning strings inconsistent
with the strings in return_algorithm_string. Grep revealed no one
using this function, so I removed it.

I added a check in tri_solve_streams to throw an error if an unsupported
alg is used.

Signed-off-by: James Foucar <[email protected]>
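The two defensive patterns described in this entry, a switch with a default branch for unhandled enum values and an explicit error for an algorithm that does not support streams, can be sketched as follows. The enum, its values, and the function names are hypothetical and only loosely modeled on the description above; they are not the sptrsv handle API.

```cpp
#include <cassert>
#include <stdexcept>
#include <string>

// Hypothetical algorithm enum for illustration.
enum class SolveAlgorithm { SeqLevelSched, TeamLevelSched, VendorLibrary };

// Single source of truth for the algorithm name; the default branch turns
// an unhandled enum value into an error instead of a silent fall-through.
inline std::string algorithm_name(SolveAlgorithm alg) {
  switch (alg) {
    case SolveAlgorithm::SeqLevelSched: return "SEQ_LEVEL_SCHED";
    case SolveAlgorithm::TeamLevelSched: return "TEAM_LEVEL_SCHED";
    case SolveAlgorithm::VendorLibrary: return "VENDOR_LIBRARY";
    default: throw std::invalid_argument("unhandled SolveAlgorithm value");
  }
}

// Throws instead of silently doing nothing when the selected algorithm
// cannot run on streams (the no-op behavior the commit message describes).
inline void require_stream_support(SolveAlgorithm alg) {
  if (alg == SolveAlgorithm::VendorLibrary) {
    throw std::runtime_error(algorithm_name(alg) + " is not supported with streams");
  }
}
```

Printing code can then call `algorithm_name` directly rather than keeping a second switch or if/else chain in sync.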

* Test_Sparse_TestUtils_RandCsMat.hpp: tighten up types (#2443)

Signed-off-by: Carl Pearson <[email protected]>

* Improve batched serial trsm implementation and testing (#2432)

* Use Trsv instead of Trsm if X is a rank 1 matrix

Signed-off-by: Yuuichi Asahi <[email protected]>

* Add missing specialization of Trsm serial implementation

Signed-off-by: Yuuichi Asahi <[email protected]>

* Add missing tests for serial Trsm

Signed-off-by: Yuuichi Asahi <[email protected]>

* fix: format

Signed-off-by: Yuuichi Asahi <[email protected]>

* fix: initialization order based on codeQL

Signed-off-by: Yuuichi Asahi <[email protected]>

* Allow trsm serial to work on rank 1 matrix

Signed-off-by: Yuuichi Asahi <[email protected]>

* fix: MKL interface of Serial Trsm

Signed-off-by: Yuuichi Asahi <[email protected]>

* simplify do_conj logic in trsm serial internal

Signed-off-by: Yuuichi Asahi <[email protected]>

---------

Signed-off-by: Yuuichi Asahi <[email protected]>
Co-authored-by: Yuuichi Asahi <[email protected]>

* Bump github/codeql-action from 3.27.5 to 3.27.6 (#2453)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.27.5 to 3.27.6.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/f09c1c0a94de965c15400f5634aa42fac8fb8f88...aa578102511db1f4524ed59b8cc2bae4f6e88195)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Adopt StaticCrsGraph and move it to KokkosSparse:: (#2419)

* Moved Static{Ccs,Crs}Graph to namespace KokkosSparse and into their
  own new header files
* Deprecate symbols in Kokkos namespace
* CRS counterpart used to come from the Containers subpackage in Kokkos
  "Core" but now is transitioned to Kokkos Kernels.



Fix a couple typos



Giving up on raising warnings



Avoid deprecation warnings when including deprecated header Kokkos_StaticCrsGraph.hpp



Adjust test category per review





Fix copy/pasta error third -> fourth





Update sparse/unit_test/Test_Sparse_StaticCrsGraph.hpp

Fixup deprecate code off



Fix typo allocat[i]on



Rely on ADL for create_mirror(GRAPH)



Update header and namespace triangle counting test

Fixing a small issue with the triangle counting performance test for the upcoming move of StaticCrsGraph

Sparse - StaticCrsGraph: fixing issues with perf tests and docs

A few performance tests had not been updated to use the new namespace
for the graph overload of create_mirror. Additionally some of the
inline documentation for StaticCrsGraph is wrong...



Sparse - StaticCrsGraph: apply clang-format

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* Improve batched serial trsv (#2452)

* Add missing specialization of Serial Trsv

Signed-off-by: Yuuichi Asahi <[email protected]>

* Improve testing of serial trsv

Signed-off-by: Yuuichi Asahi <[email protected]>

* format

Signed-off-by: Yuuichi Asahi <[email protected]>

* fix code QR

Signed-off-by: Yuuichi Asahi <[email protected]>

* simplify tests for serial trsv

Signed-off-by: Yuuichi Asahi <[email protected]>

* format

Signed-off-by: Yuuichi Asahi <[email protected]>

* static-assertion check outside debug mode

Signed-off-by: Yuuichi Asahi <[email protected]>

* disallow vector_length=8 if AVX512 is unavailable

Signed-off-by: Yuuichi Asahi <[email protected]>

---------

Signed-off-by: Yuuichi Asahi <[email protected]>
Co-authored-by: Yuuichi Asahi <[email protected]>

* update changelog for 4.5 (#2454)

Signed-off-by: Nathan Ellingwood <[email protected]>

* Workaround issue #2455 (#2457)

Signed-off-by: Nathan Ellingwood <[email protected]>

* fix: alias for serial trsv (#2458)

Signed-off-by: Yuuichi Asahi <[email protected]>
Co-authored-by: Yuuichi Asahi <[email protected]>

* Fix the package version (#2460)

Signed-off-by: Thomas Padioleau <[email protected]>

* Deprecated remarks to trsv serial impl (#2461)

* Add deprecated warnings for older interfaces of serial trsv

Signed-off-by: Yuuichi Asahi <[email protected]>

* use if constexpr for selective interface of trsv

Signed-off-by: Yuuichi Asahi <[email protected]>

* format

Signed-off-by: Yuuichi Asahi <[email protected]>

* format

Signed-off-by: Yuuichi Asahi <[email protected]>

---------

Signed-off-by: Yuuichi Asahi <[email protected]>
Co-authored-by: Yuuichi Asahi <[email protected]>

* Bump github/codeql-action from 3.27.6 to 3.27.9 (#2462)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.27.6 to 3.27.9.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/aa578102511db1f4524ed59b8cc2bae4f6e88195...df409f7d9260372bd5f19e5b04e83cb3c43714ae)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump softprops/action-gh-release from 2.1.0 to 2.2.0 (#2463)

Bumps [softprops/action-gh-release](https://github.com/softprops/action-gh-release) from 2.1.0 to 2.2.0.
- [Release notes](https://github.com/softprops/action-gh-release/releases)
- [Changelog](https://github.com/softprops/action-gh-release/blob/master/CHANGELOG.md)
- [Commits](https://github.com/softprops/action-gh-release/compare/01570a1f39cb168c169c802c3bceb9e93fb10974...7b4da11513bf3f43f9999e90eabced41ab8bb048)

---
updated-dependencies:
- dependency-name: softprops/action-gh-release
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Fix unit test headers installation (#2334)

Signed-off-by: Thomas Padioleau <[email protected]>

* Allow dyn-rank-view in serial trsv (#2464)

* Allow dyn-rank-view in serial trsv

Signed-off-by: Yuuichi Asahi <[email protected]>

* suppress shape checks

Signed-off-by: Yuuichi Asahi <[email protected]>

---------

Signed-off-by: Yuuichi Asahi <[email protected]>
Co-authored-by: Yuuichi Asahi <[email protected]>

* Bump github/codeql-action from 3.27.9 to 3.28.0 (#2470)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.27.9 to 3.28.0.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/df409f7d9260372bd5f19e5b04e83cb3c43714ae...48ab28a6f5dbc2a99bf1e0131198dd8f1df78169)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump actions/upload-artifact from 4.4.3 to 4.5.0 (#2471)

Bumps [actions/upload-artifact](https://github.com/actions/upload-artifact) from 4.4.3 to 4.5.0.
- [Release notes](https://github.com/actions/upload-artifact/releases)
- [Commits](https://github.com/actions/upload-artifact/compare/b4b15b8c7c6ac21ea08fcf65892d2ee8f75cf882...6f51ac03b9356f520e9adb1b1b7802705f340c2b)

---
updated-dependencies:
- dependency-name: actions/upload-artifact
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Add dot, iamax, nrm1, nrm2 examples (#2467)

* Add iamax, nrm1, nrm2 examples

Signed-off-by: Carl Pearson <[email protected]>

* iamax example: printf -> iostream

Signed-off-by: Carl Pearson <[email protected]>

* add dot wiki example

Signed-off-by: Carl Pearson <[email protected]>

---------

Signed-off-by: Carl Pearson <[email protected]>

* nrm1: assert vector accessibility (#2465)

Signed-off-by: Carl Pearson <[email protected]>

* Fixes while documenting (#2466)

* BLAS - scal: removing check on assignable memory spaces

That check is stricter than required, as we will access values by reference
to perform copies and won't try to reassign pointers.

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* BLAS - rot: check at runtime that X and Y have same extent

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* BLAS - rot: improving static assertions

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* BLAS - rotg: check for non-complex types

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* BLAS - ger: check that matrix stores values as non-const

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* BLAS - trmm: check for valid execution space type.

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* BLAS: fix missing semi-colon at end of static_assert

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* Applying clang-format

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* More clang-format

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* Blas - rot: fixing interface of rot

The cosine coefficient is strictly real while the sine coefficient
can be real or complex leading to a bug in the current API. This
commit should fix that for the native and TPL implementation and
the associated unit-test is also fixed accordingly.

Signed-off-by: Luc Berger-Vergiat <[email protected]>
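
The rot interface fix above hinges on the cosine being real while the sine may be complex. A minimal sketch of that update rule, written with plain `std::vector` and `std::complex` rather than Kokkos views; `apply_rot` is a hypothetical helper for illustration and is not the KokkosBlas API. It follows the usual BLAS/LAPACK convention for complex plane rotations.

```cpp
#include <cassert>
#include <complex>
#include <cstddef>
#include <vector>

// Hypothetical helper (not the KokkosBlas interface): applies a plane
// rotation with a real cosine c and a possibly complex sine s:
//   x_i <- c * x_i + s * y_i
//   y_i <- c * y_i - conj(s) * x_i
template <class Real>
void apply_rot(std::vector<std::complex<Real>>& x,
               std::vector<std::complex<Real>>& y,
               Real c, std::complex<Real> s) {
  // Runtime check that both vectors have the same extent.
  assert(x.size() == y.size());
  for (std::size_t i = 0; i < x.size(); ++i) {
    const std::complex<Real> xi = x[i];  // keep old values before overwriting
    const std::complex<Real> yi = y[i];
    x[i] = c * xi + s * yi;
    y[i] = c * yi - std::conj(s) * xi;
  }
}
```

Typing `c` as `Real` and `s` as `std::complex<Real>` makes the distinction part of the signature, so passing a complex cosine would fail to compile.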

* BLAS - ROT: fixing types for Host TPL calls to ROT function

The types for the arguments c and s are actually different and need
to be appropriately propagated through the TPL layers of the library.

Signed-off-by: Luc Berger-Vergiat <[email protected]>

---------

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* BLAS - ROT: calling appropriate cuBLAS function (#2473)

After changing the rot interface from using all real to a mix
of real and complex coefficients, the cuBLAS call needs to be made
to {Z,C}rot instead of {Zd,Cs}rot.

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* rotg/rot/rotmg/rotm docs and examples (#2472)

* rotg/rot/rotmg/rotm docs and examples

- Add wiki examples for rotg/rot and rotmg/rotm
- Improve inline/doxygen comments for routines

Signed-off-by: Brian Kelley <[email protected]>

* Small example change, formatting

Signed-off-by: Brian Kelley <[email protected]>

* Remove unused include

Signed-off-by: Brian Kelley <[email protected]>

* Suppress clang-format on doxygen section

Signed-off-by: Brian Kelley <[email protected]>

* rot: add missing include

Signed-off-by: Brian Kelley <[email protected]>

---------

Signed-off-by: Brian Kelley <[email protected]>

* update changelog and readme to 4.5.1 with develop (#2475)

Signed-off-by: Nathan Ellingwood <[email protected]>

* Improve batched serial gemm (#2469)

* Add ConjTrans to Serial Gemm

Signed-off-by: Yuuichi Asahi <[email protected]>

* improve checks in serial Gemm

Signed-off-by: Yuuichi Asahi <[email protected]>

* improve selective interface of batched gemm

Signed-off-by: Yuuichi Asahi <[email protected]>

* check info in serial gemm testing

Signed-off-by: Yuuichi Asahi <[email protected]>

* fix: op type of serial invoke

Signed-off-by: Yuuichi Asahi <[email protected]>

* format

Signed-off-by: Yuuichi Asahi <[email protected]>

* remove the global namespace

Signed-off-by: Yuuichi Asahi <[email protected]>

---------

Signed-off-by: Yuuichi Asahi <[email protected]>
Co-authored-by: Yuuichi Asahi <[email protected]>

* Fix -Werror in a couple wiki examples (#2476)

Resolves warnings error: unused parameter 'argc'
[-Werror=unused-parameter] etc.

Signed-off-by: Nathan Ellingwood <[email protected]>

* ODE: fix potentially uninitialized warning from CodeQL (#2477)

* ODE: fix potentially uninitialized warning from CodeQL

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* Clang-format

Signed-off-by: Luc Berger-Vergiat <[email protected]>

---------

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* Bump github/codeql-action from 3.28.0 to 3.28.1 (#2479)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.28.0 to 3.28.1.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/48ab28a6f5dbc2a99bf1e0131198dd8f1df78169...b6a472f63d85b9c78a3ac5e89422239fc15e9b3c)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump softprops/action-gh-release from 2.2.0 to 2.2.1 (#2482)

Bumps [softprops/action-gh-release](https://github.com/softprops/action-gh-release) from 2.2.0 to 2.2.1.
- [Release notes](https://github.com/softprops/action-gh-release/releases)
- [Changelog](https://github.com/softprops/action-gh-release/blob/master/CHANGELOG.md)
- [Commits](https://github.com/softprops/action-gh-release/compare/7b4da11513bf3f43f9999e90eabced41ab8bb048...c95fe1489396fe8a9eb87c0abf8aa5b2ef267fda)

---
updated-dependencies:
- dependency-name: softprops/action-gh-release
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump step-security/harden-runner from 2.10.2 to 2.10.3 (#2481)

Bumps [step-security/harden-runner](https://github.com/step-security/harden-runner) from 2.10.2 to 2.10.3.
- [Release notes](https://github.com/step-security/harden-runner/releases)
- [Commits](https://github.com/step-security/harden-runner/compare/0080882f6c36860b6ba35c610c98ce87d4e2f26f...c95a14d0e5bab51a9f56296a4eb0e416910cd350)

---
updated-dependencies:
- dependency-name: step-security/harden-runner
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump actions/upload-artifact from 4.5.0 to 4.6.0 (#2480)

Bumps [actions/upload-artifact](https://github.com/actions/upload-artifact) from 4.5.0 to 4.6.0.
- [Release notes](https://github.com/actions/upload-artifact/releases)
- [Commits](https://github.com/actions/upload-artifact/compare/6f51ac03b9356f520e9adb1b1b7802705f340c2b...65c4c4a1ddee5b72f698fdd19549f0f0fb45cf08)

---
updated-dependencies:
- dependency-name: actions/upload-artifact
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* implement batched serial getrf (#2331)

* fix: conflicts

Signed-off-by: Yuuichi Asahi <[email protected]>

* fix: gpu version

Signed-off-by: Yuuichi Asahi <[email protected]>

* fix: docstring for getrf

Signed-off-by: Yuuichi Asahi <[email protected]>

* fix: conflicts

Signed-off-by: Yuuichi Asahi <[email protected]>

* format

Signed-off-by: Yuuichi Asahi <[email protected]>

* fix: errors from code style

Signed-off-by: Yuuichi Asahi <[email protected]>

* fix: format

Signed-off-by: Yuuichi Asahi <[email protected]>

* fix: conflicts

Signed-off-by: Yuuichi Asahi <[email protected]>

* Improve implementation details of getrf

Signed-off-by: Yuuichi Asahi <[email protected]>

* format

Signed-off-by: Yuuichi Asahi <[email protected]>

* Update create_triangular_matrix function

Signed-off-by: Yuuichi Asahi <[email protected]>

* Merging Test_Batched_SerialGetrf.hpp and Test_Batched_SerialGetrf_Real.hpp

Signed-off-by: Yuuichi Asahi <[email protected]>

* remove the global namespace

Signed-off-by: Yuuichi Asahi <[email protected]>

* Add missing maybe_unused for checkGetrfInput

Signed-off-by: Yuuichi Asahi <[email protected]>

* Improve docstrings and comments to describe getrf algo

Signed-off-by: Yuuichi Asahi <[email protected]>

* Add a complicated analytical test based on review

Signed-off-by: Yuuichi Asahi <[email protected]>

---------

Signed-off-by: Yuuichi Asahi <[email protected]>
Co-authored-by: Yuuichi Asahi <[email protected]>

* Bump step-security/harden-runner from 2.10.3 to 2.10.4 (#2484)

Bumps [step-security/harden-runner](https://github.com/step-security/harden-runner) from 2.10.3 to 2.10.4.
- [Release notes](https://github.com/step-security/harden-runner/releases)
- [Commits](https://github.com/step-security/harden-runner/compare/c95a14d0e5bab51a9f56296a4eb0e416910cd350...cb605e52c26070c328afc4562f0b4ada7618a84e)

---
updated-dependencies:
- dependency-name: step-security/harden-runner
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* implement batched serial getrs (#2483)

* implement batched serial getrs

Signed-off-by: Yuuichi Asahi <[email protected]>

* do not use getrf in the getrs analytical test

Signed-off-by: Yuuichi Asahi <[email protected]>

---------

Signed-off-by: Yuuichi Asahi <[email protected]>
Co-authored-by: Yuuichi Asahi <[email protected]>

* Bump github/codeql-action from 3.28.1 to 3.28.5 (#2487)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.28.1 to 3.28.5.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/b6a472f63d85b9c78a3ac5e89422239fc15e9b3c...f6091c0113d1dcf9b98e269ee48e8a7e51b7bdd4)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* fix: getrs serial internal implementations (#2488)

* fix: getrs serial internal implementations

Signed-off-by: Yuuichi Asahi <[email protected]>

* disallow optimization to fix serial getrs

Signed-off-by: Yuuichi Asahi <[email protected]>

* no unroll in Laswp

Signed-off-by: Yuuichi Asahi <[email protected]>

---------

Signed-off-by: Yuuichi Asahi <[email protected]>
Co-authored-by: Yuuichi Asahi <[email protected]>

* Bump github/codeql-action from 3.28.5 to 3.28.8 (#2490)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.28.5 to 3.28.8.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/f6091c0113d1dcf9b98e269ee48e8a7e51b7bdd4...dd746615b3b9d728a6a37ca2045b68ca76d4841a)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Sparse - SpADD: moving spadd out of experimental (#2492)

* Sparse - SpADD: moving spadd out of experimental

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* applying clang-format

Signed-off-by: Luc Berger-Vergiat <[email protected]>

---------

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* Sparse - spgemm: moving symbolic and numeric phases out of the experimental namespace (#2493)

* Sparse - spgemm: moving symbolic and numeric phases out of the
experimental namespace

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* Applying clang-format

Signed-off-by: Luc Berger-Vergiat <[email protected]>

---------

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* implement batched serial ger (#2491)

* implement batched serial ger

Signed-off-by: Yuuichi Asahi <[email protected]>

* fix: view constructors for x and y in ger test

Signed-off-by: Yuuichi Asahi <[email protected]>

---------

Signed-off-by: Yuuichi Asahi <[email protected]>
Co-authored-by: Yuuichi Asahi <[email protected]>

* Sparse - Gauss Seidel: moving GS out of experimental namespace (#2494)

* Sparse - Gauss Seidel: moving GS out of experimental namespace

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* Applying clang-format

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* Sparse - GS: excluding deprecated functions from doxygen parsing

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* Removing no longer used namespace alias

Signed-off-by: Luc Berger-Vergiat <[email protected]>

---------

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* Iluk deprecate experimental (#2495)

* Sparse - SpILUK: move kernels out of Experimental namespace

Will make other commits to remove the usage throughout the library.

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* Sparse - SpILUK: updating unit and performance tests

After moving the functions out of experimental we need to update
tests so we do not call it ourselves anymore.

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* Applying clang-format

Signed-off-by: Luc Berger-Vergiat <[email protected]>

---------

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* deprecate KOKKOS_CUSPARSE_SAFE_CALL -> KOKKOSPARSE_IMPL_CUSPARSE_SAFE_CALL (#2426)

Signed-off-by: Carl Pearson <[email protected]>

* Sparse - SpTRSV: moving kernels out of Experimental (#2496)

* Sparse - SpTRSV: moving kernels out of Experimental

Moving the generic SpTRSV kernels out of experimental but
keeping the supernodal and stream variants in experimental
as they are less tested and more recent respectively.

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* Applying clang-format and removing deprecated call from doxygen

Signed-off-by: Luc Berger-Vergiat <[email protected]>

---------

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* Bump github/codeql-action from 3.28.8 to 3.28.9 (#2498)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.28.8 to 3.28.9.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/dd746615b3b9d728a6a37ca2045b68ca76d4841a...9e8d0789d4a0fa9ceb6b1738f7e269594bdd67f0)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Implement batched serial syr (#2497)

* Introduce OpReal functor to provide real operator

Signed-off-by: Yuuichi Asahi <[email protected]>

* implement batched serial syr

Signed-off-by: Yuuichi Asahi <[email protected]>

* remove unused variable

Signed-off-by: Yuuichi Asahi <[email protected]>

* fix view constructor

Signed-off-by: Yuuichi Asahi <[email protected]>

* Add docstring and assertion for ArgUplo and ArgTrans parameters

Signed-off-by: Yuuichi Asahi <[email protected]>

---------

Signed-off-by: Yuuichi Asahi <[email protected]>
Co-authored-by: Yuuichi Asahi <[email protected]>

* Adding ctest config for cdash submissions (#2500)

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* Bump step-security/harden-runner from 2.10.4 to 2.11.0 (#2501)

Bumps [step-security/harden-runner](https://github.com/step-security/harden-runner) from 2.10.4 to 2.11.0.
- [Release notes](https://github.com/step-security/harden-runner/releases)
- [Commits](https://github.com/step-security/harden-runner/compare/cb605e52c26070c328afc4562f0b4ada7618a84e...4d991eb9b905ef189e4c376166672c3f2f230481)

---
updated-dependencies:
- dependency-name: step-security/harden-runner
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* implement batched serial lacgv (#2499)

Signed-off-by: Yuuichi Asahi <[email protected]>
Co-authored-by: Yuuichi Asahi <[email protected]>

* Refactor serial tbsv implementation details and tests (#2478)

* refactor serial tbsv implementation details and tests

Signed-off-by: Yuuichi Asahi <[email protected]>

* fix: test names in SerialTbsv complex

Signed-off-by: Yuuichi Asahi <[email protected]>

* use EXPECT_NEAR_KK_REL for comparison

Signed-off-by: Yuuichi Asahi <[email protected]>

* use EXPECT_NEAR_KK_REL for general tests

Signed-off-by: Yuuichi Asahi <[email protected]>

* Add docstring and assertion for Arg parameters

Signed-off-by: Yuuichi Asahi <[email protected]>

---------

Signed-off-by: Yuuichi Asahi <[email protected]>
Co-authored-by: Yuuichi Asahi <[email protected]>

* Workflow: refactoring SPR to standard cmake and suppress warnings (#2502)

The -Wpass-fail warnings are polluting the output of the compiler
and do not really tell us anything either, so let us suppress
them. Additionally, suppressing linker warnings with
-Wno-unused-command-line-argument. The build comes out a lot
cleaner now.

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* refactor serial pbtrf implementation details and tests (#2503)

Signed-off-by: Yuuichi Asahi <[email protected]>
Co-authored-by: Yuuichi Asahi <[email protected]>

* Bump ossf/scorecard-action from 2.4.0 to 2.4.1 (#2508)

Bumps [ossf/scorecard-action](https://github.com/ossf/scorecard-action) from 2.4.0 to 2.4.1.
- [Release notes](https://github.com/ossf/scorecard-action/releases)
- [Changelog](https://github.com/ossf/scorecard-action/blob/main/RELEASE.md)
- [Commits](https://github.com/ossf/scorecard-action/compare/62b2cac7ed8198b15735ed49ab1e5cf35480ba46...f49aabe0b5af0936a0987cfb85d86b75731b0186)

---
updated-dependencies:
- dependency-name: ossf/scorecard-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump actions/upload-artifact from 4.6.0 to 4.6.1 (#2509)

Bumps [actions/upload-artifact](https://github.com/actions/upload-artifact) from 4.6.0 to 4.6.1.
- [Release notes](https://github.com/actions/upload-artifact/releases)
- [Commits](https://github.com/actions/upload-artifact/compare/65c4c4a1ddee5b72f698fdd19549f0f0fb45cf08...4cec3d8aa04e39d1a68397de0c4cd6fb9dce8ec1)

---
updated-dependencies:
- dependency-name: actions/upload-artifact
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump github/codeql-action from 3.28.9 to 3.28.10 (#2507)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.28.9 to 3.28.10.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/9e8d0789d4a0fa9ceb6b1738f7e269594bdd67f0...b56ba49b26e50535fa1e7f7db0f4f7b4bf65d80d)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Batched svd stability (#2505)

* Batched - SVD: adding a check for numerical stability

When computing the eigenvalues of a symmetric 2x2
system we now check for potential cancellation error
in the discriminant computation. Added an associated
test.

Signed-off-by: Luc Berger-Vergiat <[email protected]>
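
To illustrate the kind of cancellation the commit above guards against: for a symmetric 2x2 matrix [[a, b], [b, c]], the discriminant written as trace^2 - 4*det can round to zero (or even go negative) when the eigenvalues are close together. The sketch below uses the algebraically equivalent form ((a - c)/2)^2 + b^2, which is a sum of squares. This is only one common remedy and is an assumption for illustration; the actual check in the batched SVD code may differ, and `sym_eig_2x2` is a hypothetical name.

```cpp
#include <cassert>
#include <cmath>
#include <utility>

// Hypothetical helper (not the KokkosBatched implementation).
// Returns the eigenvalues of [[a, b], [b, c]] as (smaller, larger).
inline std::pair<double, double> sym_eig_2x2(double a, double b, double c) {
  const double mean      = 0.5 * (a + c);
  const double half_diff = 0.5 * (a - c);
  // sqrt(half_diff^2 + b^2): a sum of squares, so no subtraction of
  // nearly equal quantities as in trace^2 - 4 * det.
  const double radius = std::hypot(half_diff, b);
  assert(radius >= 0.0);  // holds for any non-NaN input
  return {mean - radius, mean + radius};
}
```

With a = c = 1 and b = 1e-9, the naive discriminant 4 - 4*(1 - 1e-18) evaluates to zero in double precision, whereas the form above still separates the two eigenvalues.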

* Adding appropriate coverage for layout and scalar types...

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* clang-format

Signed-off-by: Luc Berger-Vergiat <[email protected]>

---------

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* SYCL - SIMD: preventing AVX512 code to be compiled when targeting GPU (#2510)

* SYCL - SIMD: preventing AVX512 code to be compiled when targeting GPU

A check was missing to verify that we are not trying to compile
AVX512 intrinsics in GPU code, which is obviously not a good idea.

Signed-off-by: Luc Berger-Vergiat <[email protected]>
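
A minimal sketch of the kind of guard described above, assuming the SYCL device pass is detected through `__SYCL_DEVICE_ONLY__`. The macro actually used in the library is not stated in this log, and `EXAMPLE_HAS_AVX512` and `add8` are hypothetical names introduced only for illustration.

```cpp
#include <cassert>
#include <cstddef>

// Only enable the AVX512 path when the host compiler targets AVX512 and
// we are not in a SYCL device compilation pass (assumed macro).
#if defined(__AVX512F__) && !defined(__SYCL_DEVICE_ONLY__)
#include <immintrin.h>
#define EXAMPLE_HAS_AVX512 1
#else
#define EXAMPLE_HAS_AVX512 0
#endif

// Hypothetical helper: adds two packs of 8 doubles.
inline void add8(const double* a, const double* b, double* out) {
  assert(a != nullptr && b != nullptr && out != nullptr);
#if EXAMPLE_HAS_AVX512
  // Vectorized path, compiled only for AVX512-capable host code.
  _mm512_storeu_pd(out, _mm512_add_pd(_mm512_loadu_pd(a), _mm512_loadu_pd(b)));
#else
  // Portable fallback used on GPUs and on hosts without AVX512.
  for (std::size_t i = 0; i < 8; ++i) out[i] = a[i] + b[i];
#endif
}
```

Both branches produce the same result, so callers do not need to know which path was compiled.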

* Switching to a more appropriate compiler macro

Signed-off-by: Luc Berger-Vergiat <[email protected]>

---------

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* SYCL: Fix Test_Common_AlignPtrTo.hpp (#2506)

* SYCL: Fix Test_Common_AlignPtrTo.hpp

Signed-off-by: Daniel Arndt <[email protected]>

* Check compiler version

Signed-off-by: Daniel Arndt <[email protected]>

* Check for ExecutionSpace again

Signed-off-by: Daniel Arndt <[email protected]>

---------

Signed-off-by: Daniel Arndt <[email protected]>

* Same fix as PR #2435 but rebased on develop (#2511)

Signed-off-by: Berger Vergiat Luc <[email protected]>
Co-authored-by: Berger Vergiat Luc <[email protected]>

* docs: updating to the new documentation and refactoring workflow (#2512)

* docs: updating to the new documentation and refactoring workflow
Updating sphinx version to allow automated build with python 3.13
Update furo version to work with sphinx
Okay, checked locally so should work now!

---------

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* Docs: deploy documentation after PR merge (#2513)

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* Bump sphinx-design from 0.5.0 to 0.6.1 in /docs (#2515)

Bumps [sphinx-design](https://github.com/executablebooks/sphinx-design) from 0.5.0 to 0.6.1.
- [Release notes](https://github.com/executablebooks/sphinx-design/releases)
- [Changelog](https://github.com/executablebooks/sphinx-design/blob/main/CHANGELOG.md)
- [Commits](https://github.com/executablebooks/sphinx-design/compare/v0.5.0...v0.6.1)

---
updated-dependencies:
- dependency-name: sphinx-design
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump actions/download-artifact from 4.1.8 to 4.1.9 (#2520)

Bumps [actions/download-artifact](https://github.com/actions/download-artifact) from 4.1.8 to 4.1.9.
- [Release notes](https://github.com/actions/download-artifact/releases)
- [Commits](https://github.com/actions/download-artifact/compare/fa0a91b85d4f404e444e00e005971372dc801d16...cc203385981b70ca67e1cc392babf9cc229d5806)

---
updated-dependencies:
- dependency-name: actions/download-artifact
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump slsa-framework/slsa-github-generator from 2.0.0 to 2.1.0 (#2521)

Bumps [slsa-framework/slsa-github-generator](https://github.com/slsa-framework/slsa-github-generator) from 2.0.0 to 2.1.0.
- [Release notes](https://github.com/slsa-framework/slsa-github-generator/releases)
- [Changelog](https://github.com/slsa-framework/slsa-github-generator/blob/main/CHANGELOG.md)
- [Commits](https://github.com/slsa-framework/slsa-github-generator/compare/v2.0.0...v2.1.0)

---
updated-dependencies:
- dependency-name: slsa-framework/slsa-github-generator
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump sphinx from 6.2.1 to 8.2.3 in /docs (#2516)

Bumps [sphinx](https://github.com/sphinx-doc/sphinx) from 6.2.1 to 8.2.3.
- [Release notes](https://github.com/sphinx-doc/sphinx/releases)
- [Changelog](https://github.com/sphinx-doc/sphinx/blob/master/CHANGES.rst)
- [Commits](https://github.com/sphinx-doc/sphinx/compare/v6.2.1...v8.2.3)

---
updated-dependencies:
- dependency-name: sphinx
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump furo from 2024.4.27 to 2024.8.6 in /docs (#2519)

Bumps [furo](https://github.com/pradyunsg/furo) from 2024.4.27 to 2024.8.6.
- [Release notes](https://github.com/pradyunsg/furo/releases)
- [Changelog](https://github.com/pradyunsg/furo/blob/main/docs/changelog.md)
- [Commits](https://github.com/pradyunsg/furo/compare/2024.04.27...2024.08.06)

---
updated-dependencies:
- dependency-name: furo
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Remove old docker images (#2514)

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* Bump myst-parser from 3.0.1 to 4.0.1 in /docs (#2517)

Bumps [myst-parser](https://github.com/executablebooks/MyST-Parser) from 3.0.1 to 4.0.1.
- [Release notes](https://github.com/executablebooks/MyST-Parser/releases)
- [Changelog](https://github.com/executablebooks/MyST-Parser/blob/master/CHANGELOG.md)
- [Commits](https://github.com/executablebooks/MyST-Parser/compare/v3.0.1...v4.0.1)

---
updated-dependencies:
- dependency-name: myst-parser
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump sphinx-copybutton from 0.5.0 to 0.5.2 in /docs (#2518)

Bumps [sphinx-copybutton](https://github.com/executablebooks/sphinx-copybutton) from 0.5.0 to 0.5.2.
- [Release notes](https://github.com/executablebooks/sphinx-copybutton/releases)
- [Changelog](https://github.com/executablebooks/sphinx-copybutton/blob/master/CHANGELOG.md)
- [Commits](https://github.com/executablebooks/sphinx-copybutton/compare/v0.5.0...v0.5.2)

---
updated-dependencies:
- dependency-name: sphinx-copybutton
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Reverting build_requirements to previous versions (#2522)

Next week when dependabot asks to update the versions
of our documentation tools let's ask it to stop tracking
them for the time being as the new suggested versions
are failing...

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* docs: pinning github actions by SHA (#2523)

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* docs: adding hashes for requirements.txt (#2524)

* docs: adding hashes for requirements.txt

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* Adding more hashes...

Signed-off-by: Luc Berger-Vergiat <[email protected]>

---------

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* Fixing issue with pragma unroll in laswp (#2525)

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* fix typos in BUILD.md (#2526)

Signed-off-by: Timo Heister <[email protected]>

* docs: modifying the workflow to add --require-hashes for pip (#2527)

This forces pip to check hashes when it installs our dependencies
as part of the workflow. Also adding hashes for source and binary
versions of the dependencies. Finally modifying the link of the
documentation badge in README so it points to the actual
documentation instead of pointing to the last action run.

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* Bump github/codeql-action from 3.28.10 to 3.28.11 (#2528)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.28.10 to 3.28.11.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/b56ba49b26e50535fa1e7f7db0f4f7b4bf65d80d...6bb031afdd8eb862ea3fc1848194185e076637e5)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* refactor batched serial pbtrs implementation details and tests (#2504)

Signed-off-by: Yuuichi Asahi <[email protected]>
Co-authored-by: Yuuichi Asahi <[email protected]>

* batched - dense: Testing and fixing Serial QR (#2342)

* batched - dense: Testing and fixing Serial QR

The serial QR algorithm has no unit tests and fails
for non-square matrices. See issue #2328.
This first commit fixes the issue with rectangular matrices and
adds a basic test for that use case. The next step is to add a
test that exercises the interfaces on multiple matrices of different
sizes within a parallel_for. Finally, equivalent tests will be added
for the square case as well.
Fixing unused variable error.
It looks like the Left NoTranspose ApplyQ is not doing the correct
thing. Will have a look at that next.

Splitting the tests a bit better, looking at Q and Qt
together to detect consistency issues between the two.
This also eventually allows using GEMM to figure out which one
is wrong.

Final fixes and adjusting the tolerance

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* Removing the stride for the workspace

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* Applying clang-format

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* Adding checks on inputs

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* More clean-ups

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* clang-format

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* Batched - QR: adding static assertion on struct templates

Checking that the side, mode and algo are all valid.
Cleaning up the test by using create_mirror_view_and_copy
when appropriate.

Somehow the semantics of create_mirror_view_and_copy differ
from the regular deep_copy, so a few changes are needed after
testing on GPUs...

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* Batched - QR: adding check for apply Q on the right side.

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* clang-format

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* clang-format

Signed-off-by: Luc Berger-Vergiat <[email protected]>

---------

Signed-off-by: Luc Berger-Vergiat <[email protected]>
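
The checks described for the rectangular case (factorize, form Q, verify Q^t*Q = I) can be illustrated with a small host-only sketch. This is not the KokkosBatched implementation or its test: `Matrix`, `householder_qr`, `orthogonality_error` and `reconstruction_error` are hypothetical names, and a plain Householder QR on `std::vector` storage stands in for the batched serial kernels.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Dense row-major matrix; a hypothetical stand-in for a Kokkos::View.
struct Matrix {
  std::size_t rows, cols;
  std::vector<double> data;
  Matrix(std::size_t r, std::size_t c) : rows(r), cols(c), data(r * c, 0.0) {}
  double& operator()(std::size_t i, std::size_t j) { return data[i * cols + j]; }
  double operator()(std::size_t i, std::size_t j) const { return data[i * cols + j]; }
};

// Householder QR of an m x n matrix with m >= n.
// On return A holds R (upper triangular up to round-off); the m x m Q is returned.
Matrix householder_qr(Matrix& A) {
  const std::size_t m = A.rows, n = A.cols;
  assert(m >= n);
  Matrix Q(m, m);
  for (std::size_t i = 0; i < m; ++i) Q(i, i) = 1.0;
  std::vector<double> v(m, 0.0);
  for (std::size_t k = 0; k < n; ++k) {
    // Norm of the part of column k on and below the diagonal.
    double norm = 0.0;
    for (std::size_t i = k; i < m; ++i) norm += A(i, k) * A(i, k);
    norm = std::sqrt(norm);
    if (norm == 0.0) continue;  // nothing to eliminate in this column
    // Choose the sign of alpha opposite to A(k,k) to avoid cancellation.
    const double alpha = (A(k, k) > 0.0) ? -norm : norm;
    double vnorm2 = 0.0;
    for (std::size_t i = k; i < m; ++i) {
      v[i] = A(i, k) - (i == k ? alpha : 0.0);
      vnorm2 += v[i] * v[i];
    }
    // A <- H A with H = I - 2 v v^T / (v^T v), applied to columns k..n-1.
    for (std::size_t j = k; j < n; ++j) {
      double dot = 0.0;
      for (std::size_t i = k; i < m; ++i) dot += v[i] * A(i, j);
      const double s = 2.0 * dot / vnorm2;
      for (std::size_t i = k; i < m; ++i) A(i, j) -= s * v[i];
    }
    // Q <- Q H, so that Q accumulates H_1 H_2 ... H_n.
    for (std::size_t i = 0; i < m; ++i) {
      double dot = 0.0;
      for (std::size_t l = k; l < m; ++l) dot += Q(i, l) * v[l];
      const double s = 2.0 * dot / vnorm2;
      for (std::size_t l = k; l < m; ++l) Q(i, l) -= s * v[l];
    }
  }
  return Q;
}

// Largest entry of |Q^T Q - I|.
double orthogonality_error(const Matrix& Q) {
  double err = 0.0;
  for (std::size_t i = 0; i < Q.cols; ++i)
    for (std::size_t j = 0; j < Q.cols; ++j) {
      double sum = 0.0;
      for (std::size_t l = 0; l < Q.rows; ++l) sum += Q(l, i) * Q(l, j);
      err = std::max(err, std::abs(sum - (i == j ? 1.0 : 0.0)));
    }
  return err;
}

// Largest entry of |Q R - A0|.
double reconstruction_error(const Matrix& Q, const Matrix& R, const Matrix& A0) {
  double err = 0.0;
  for (std::size_t i = 0; i < A0.rows; ++i)
    for (std::size_t j = 0; j < A0.cols; ++j) {
      double sum = 0.0;
      for (std::size_t l = 0; l < Q.cols; ++l) sum += Q(i, l) * R(l, j);
      err = std::max(err, std::abs(sum - A0(i, j)));
    }
  return err;
}
```

Copying a 3x2 matrix into `R`, calling `householder_qr(R)`, and checking that both error functions return values near machine precision mirrors the Q and Qt consistency checks mentioned above.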

* deprecate __KOKKOSBATCHED_ENABLE_LAPACKE__ (#2450)

In favor of KOKKOSBATCHED_IMPL_ENABLE_LAPACKE

Signed-off-by: Carl Pearson <[email protected]>

* don't run clang-tidy on google benchmark code (#2533)

Signed-off-by: Carl Pearson <[email protected]>

* Remove use of some deprecated macros (#2532)

Replaces

```
#ifdef KOKKOS_ACTIVE_EXECUTION_MEMORY_SPACE_SYCL
x
#else
y
#endif
```

With

```
#ifdef KOKKOS_ENABLE_SYCL
KOKKOS_IF_ON_DEVICE((x))
KOKKOS_IF_ON_HOST((y))
#else
y
#endif
```

Signed-off-by: Carl Pearson <[email protected]>
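
A minimal sketch of how the replacement pattern reads in a function body. The double parentheses pass the whole statement to the macro as a single argument. The stand-in macro definitions below are an assumption made only so the snippet compiles on a host without Kokkos; in a real build both macros come from Kokkos itself, and `running_on_device` is a hypothetical name.

```cpp
#include <cassert>

// Host-only stand-ins (assumption) so this sketch compiles without Kokkos.
// In a real build, <Kokkos_Core.hpp> provides both macros.
#ifndef KOKKOS_IF_ON_HOST
#define SKETCH_STRIP_PARENS(...) __VA_ARGS__
#define KOKKOS_IF_ON_HOST(CODE) { SKETCH_STRIP_PARENS CODE }
#define KOKKOS_IF_ON_DEVICE(CODE) {}
#endif

// Hypothetical helper: yields 1 when compiled for the device, 0 on the host.
inline int running_on_device() {
  int on_device = -1;
  KOKKOS_IF_ON_DEVICE((on_device = 1;))
  KOKKOS_IF_ON_HOST((on_device = 0;))
  return on_device;
}
```

With the stand-ins, only the host branch is compiled, so the helper returns 0.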

* Remove some uses of deprecated macros (#2531)

* KOKKOS_ACTIVE_EXECUTION_MEMORY_SPACE_SYCL
* KOKKOS_ACTIVE_EXECUTION_MEMORY_SPACE_HOST

These uses are guarded by KOKKOS_ENABLE_SYCL, so the removal
is relatively straightforward. Other sites may require some
rethinking.

Signed-off-by: Carl Pearson <[email protected]>

* Sparse - sptrsv: Adjusting the perf test CUDA guard see issue #2428 (#2535)

The csrsv2 functions were removed in CUDA 12, as it was a
major release. They are still available up to CUDA 11.8.

Signed-off-by: Luc Berger-Vergiat <[email protected]>
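
A sketch of what such a version guard can look like, not the actual guard used in the perf test. `CUDA_VERSION` is the macro from `<cuda.h>` (11080 for CUDA 11.8, 12000 for CUDA 12.0); `PERF_TEST_HAS_CSRSV2` and `has_csrsv2` are hypothetical names introduced for illustration.

```cpp
#include <cassert>

// Compile-time guard: enable the csrsv2 code path only before CUDA 12.
// Without CUDA, CUDA_VERSION is undefined and the path stays disabled.
#if defined(CUDA_VERSION) && (CUDA_VERSION < 12000)
#define PERF_TEST_HAS_CSRSV2 1
#else
#define PERF_TEST_HAS_CSRSV2 0
#endif

// Same rule as a function, handy for checking the boundary values.
constexpr bool has_csrsv2(int cuda_version) {
  return cuda_version > 0 && cuda_version < 12000;
}

static_assert(has_csrsv2(11080), "CUDA 11.8 still ships csrsv2");
static_assert(!has_csrsv2(12000), "CUDA 12.0 removed csrsv2");
```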

* Implement batched serial gbtrf (#2489)

* fix: conflicts

Signed-off-by: Yuuichi Asahi <[email protected]>

* improve gbtrf unit-test to deal with non-rectangular cases

Signed-off-by: Yuuichi Asahi <[email protected]>

* improve gbtrf unit-test

Signed-off-by: Yuuichi Asahi <[email protected]>

* fix: errors from codeQL

Signed-off-by: Yuuichi Asahi <[email protected]>

* remove unused View2DType

Signed-off-by: Yuuichi Asahi <[email protected]>

* use ger internal to simplify the gbtrf implementation details

Signed-off-by: Yuuichi Asahi <[email protected]>

* Add docstring and assertion for ArgAlgo parameter in gbtrf

Signed-off-by: Yuuichi Asahi <[email protected]>

* format Test_Batched_Dense.hpp

Signed-off-by: Yuuichi Asahi <[email protected]>

* fix check function for gbtrf

Signed-off-by: Yuuichi Asahi <[email protected]>

* rename full matrix to dense matrix

Signed-off-by: Yuuichi Asahi <[email protected]>

---------

Signed-off-by: Yuuichi Asahi <[email protected]>
Co-authored-by: Yuuichi Asahi <[email protected]>

* Add nightly testing on Frontier (#2536)

* Remove more reserved identifiers (#2534)

* __A -> A

Signed-off-by: Carl Pearson <[email protected]>

* __B -> B_

Signed-off-by: Carl Pearson <[email protected]>

* __C -> C_

Signed-off-by: Carl Pearson <[email protected]>

* Rename _-prefixed include guards

Signed-off-by: Carl Pearson <[email protected]>

---------

Signed-off-by: Carl Pearson <[email protected]>

* std::filesystem library for gcc < 9.1 and Clang < 9.0 (#2545)

Signed-off-by: Carl Pearson <[email protected]>

* Rework dot performance tests (#2544)

* cleanup KokkosBlas_dot

Signed-off-by: Carl Pearson <[email protected]>

* format perf_test/blas/blas1/KokkosBlas_dot_perf_test_benchmark.cpp

Signed-off-by: Carl Pearson <[email protected]>

* clean up KokkosBlas_dot_mv

Signed-off-by: Carl Pearson <[email protected]>

* KokkosBlas_dot: fix FLOP count

Signed-off-by: Carl Pearson <[email protected]>

* KokkosBlas_dot_mv: fix FLOP count

Signed-off-by: Carl Pearson <[email protected]>

* clean up KokkosBlas_team_dot

Signed-off-by: Carl Pearson <[email protected]>

---------

Signed-off-by: Carl Pearson <[email protected]>

* update develop version to 4.6.99 (#2540)

Signed-off-by: Nathan Ellingwood <[email protected]>

* Remove `KokkosBlas_dot_mv_perf_test` (#2548)

* Add DefaultHostExecutionSpace dot perf_test

Signed-off-by: Carl Pearson <[email protected]>

* Remove KokkosBlas_dot_mv_perf_test

Duplicates functionality in benchmark-based perf test

Signed-off-by: Carl Pearson <[email protected]>

---------

Signed-off-by: Carl Pearson <cwpea…