Implement `gesdd` #899

jmachado-amd · 2025-02-25T19:57:27Z

This PR adds a minimal implementation of gesdd, using syevd as a backend. A complete implementation, using bdsdc, will appear in the future.

clients/common/lapack/testing_gesdd.hpp

library/src/lapack/roclapack_gesdd.cpp

EdDAzevedo

I wonder whether these different algorithms (say, using Jacobi, forming A'*A, or reducing to bidiagonal form) each have advantages and disadvantages, and whether they could all be retained and selected by the application through an algorithm option. Just a thought.

EdDAzevedo · 2025-02-26T05:27:32Z

library/src/lapack/roclapack_gesdd.hpp

+
+        for(j = tid; j < n; j += gridDim.x * blockDim.x)
+        {
+            for(k = 0; k < n; k++)


I wonder whether the code can use more threads or more parallelism, say with the batch index b in the z-dimension, j in the x-dimension (as the inner loop), and k in the y-dimension. Just a thought.

It can be improved, but this is intended as an intermediate step and not the final algorithm.
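
The index mapping suggested above can be checked on the host before touching the kernel. The following is only a sketch: `emulate_3d_mapping` and its parameters are hypothetical names, not code from this PR, and the loops stand in for the global thread ids that a real launch would derive from `blockIdx`, `blockDim`, and `threadIdx`. It counts how often each (b, j, k) entry is visited when b is mapped to z, k to y, and j to x with strided loops.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Host-side emulation of a 3D launch (hypothetical helper, not part of the PR):
// z covers the batch index b, y covers k, x covers j.
// Returns the number of visits to each (b, k, j) entry.
std::vector<int> emulate_3d_mapping(int n, int batch_count, int threads_x, int threads_y)
{
    assert(n >= 0 && batch_count >= 0 && threads_x > 0 && threads_y > 0);
    std::vector<int> visits(static_cast<std::size_t>(batch_count) * n * n, 0);

    for(int b = 0; b < batch_count; b++) // stands in for the z index
        for(int ty = 0; ty < threads_y; ty++) // stands in for the global y thread id
            for(int tx = 0; tx < threads_x; tx++) // stands in for the global x thread id
                // Strided loops, same pattern as `for(j = tid; j < n; j += stride)`.
                for(int k = ty; k < n; k += threads_y)
                    for(int j = tx; j < n; j += threads_x)
                        visits[(static_cast<std::size_t>(b) * n + k) * n + j]++;

    return visits;
}
```

If every count equals one, the mapping covers the whole index space without overlap, even when n is not a multiple of the thread counts.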

EdDAzevedo · 2025-02-26T05:30:26Z

library/src/lapack/roclapack_gesdd.hpp

+                                   size_t* size_workArr2)
+{
+    // If quick return, set workspace to zero
+    if(n == 0 || m == 0 || batch_count == 0)


Minor comment: perhaps set all sizes to zero first, then check if (n == 0) and return. This is just defensive programming to make sure all the size variables are initialized, with no undefined values. Just a minor suggestion.
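
A minimal sketch of the pattern suggested above, assuming a simplified signature: `example_getMemorySize` and its two outputs are hypothetical, and the non-zero formulas are placeholders rather than the sizes computed in this PR.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical workspace query (not the PR's gesdd_getMemorySize).
void example_getMemorySize(int m,
                           int n,
                           int batch_count,
                           std::size_t* size_work,
                           std::size_t* size_workArr)
{
    assert(size_work != nullptr && size_workArr != nullptr);

    // Initialize every output first, so no path leaves a size undefined.
    *size_work = 0;
    *size_workArr = 0;

    // Quick return: all sizes are already zero.
    if(n == 0 || m == 0 || batch_count == 0)
        return;

    // Placeholder formulas for illustration only.
    *size_work = sizeof(double) * static_cast<std::size_t>(n) * batch_count;
    *size_workArr = sizeof(double*) * static_cast<std::size_t>(batch_count);
}
```

With this ordering, adding a new size output later only requires one more zero-initialization at the top; the quick-return branch does not need to change.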

EdDAzevedo · 2025-02-26T05:33:23Z

library/src/lapack/roclapack_gesdd.hpp

+    *size_splits = std::max({f1});
+    *size_tmptau_W = std::max({g1});
+    *size_tau = std::max({h1});
+    *size_workArr = sizeof(T) * std::max({m, n}) * std::max({m, n}) * batch_count;


Just double-checking whether size_workArr is really meant to be max(m,n)^2 * batch_count. It might be a very big number.

Yes, this looks like a typo, and I'll check it later. Thanks for catching that!

As it turns out, that code was semantically incorrect: it was supposed to reserve space for the off-diagonal elements of the input matrix, as required by stedc. Thanks again for catching it!
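
To give a sense of scale for the concern raised in this thread, the sketch below compares the quadratic size from the diff with a size that is linear in the matrix dimension. The linear formula is an assumption made only for comparison (one vector of off-diagonal entries per batch instance); the corrected formula used in the PR is not shown here, and both function names are hypothetical.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>

// Size in bytes of elem_size * max(m, n)^2 * batch_count, as in the diff above.
std::uint64_t quadratic_workspace_bytes(std::uint64_t elem_size,
                                        std::uint64_t m,
                                        std::uint64_t n,
                                        std::uint64_t batch_count)
{
    assert(elem_size > 0);
    const std::uint64_t d = std::max(m, n);
    return elem_size * d * d * batch_count;
}

// Assumed alternative for comparison: one vector of min(m, n) entries per batch instance.
std::uint64_t linear_workspace_bytes(std::uint64_t elem_size,
                                     std::uint64_t m,
                                     std::uint64_t n,
                                     std::uint64_t batch_count)
{
    assert(elem_size > 0);
    const std::uint64_t d = std::min(m, n);
    return elem_size * d * batch_count;
}
```

For double precision (8 bytes), m = n = 8192 and batch_count = 100, the quadratic size is 53,687,091,200 bytes (50 GiB), while the linear one is 6,553,600 bytes (6.25 MiB).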

amd-jnovotny · 2025-02-26T13:21:58Z

@jmachado-amd: Would this feature require a changelog update? If you're planning on adding that as a separate PR, then no worries.

jmachado-amd · 2025-03-05T23:37:35Z

@EdDAzevedo, my opinion is that SVD algorithms that explicitly form the products A A^* or A^* A are always an intermediate step and should not stay in the long run. However, I can't ignore the fact that they can be quite efficient sometimes, and some users will prefer to have the option of selecting them at runtime.
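
For context, the textbook relation behind product-based SVD algorithms is sketched below. It is the standard identity, not necessarily the exact sequence of steps taken in this PR. Given the SVD of A, the Hermitian product A^* A has an eigendecomposition that a symmetric eigensolver such as syevd can compute:

```math
A = U \Sigma V^{*}
\quad\Longrightarrow\quad
A^{*} A = V \Sigma^{2} V^{*},
\qquad
\sigma_i = \sqrt{\lambda_i(A^{*} A)},
\qquad
u_i = \frac{1}{\sigma_i} A v_i \quad (\sigma_i \neq 0).
```

The known drawback is conditioning: for a full-rank A, the 2-norm condition number satisfies \kappa(A^{*} A) = \kappa(A)^{2}, so small singular values lose accuracy compared with methods that work on A directly, such as bidiagonalization or one-sided Jacobi.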

jmachado-amd added 6 commits February 13, 2025 23:46

Add necessary files for gesdd, stub implementation

8963eea

Minimal working implementation

ec890ef

Tidy-up sources

f68d422

Small changes

1c16d43

Update tests

c33d0f3

Update main gesdd method

4e9d288

jmachado-amd added the noOptimizations label (Disable optimized kernels for small sizes for some routines) Feb 25, 2025

jmachado-amd requested review from jzuniga-amd, tfalders, cgmb, qjojo, EdDAzevedo, AGonzales-amd and a team as code owners February 25, 2025 19:57

jmachado-amd added 2 commits February 25, 2025 14:59

Merge remote-tracking branch 'origin/develop' into implement-minimal-…

0612272

…gesdd

Update error bounds of orthogonality tests

67bf799

EdDAzevedo reviewed Feb 26, 2025

clients/common/lapack/testing_gesdd.hpp (outdated)

EdDAzevedo reviewed Feb 26, 2025

library/src/lapack/roclapack_gesdd.cpp

EdDAzevedo reviewed Feb 26, 2025

library/src/lapack/roclapack_gesdd.cpp (outdated)

EdDAzevedo approved these changes Feb 26, 2025

jmachado-amd added the ci:no-ccache label (Disable ccache) Feb 26, 2025

EdDAzevedo reviewed Feb 26, 2025

jmachado-amd removed the ci:no-ccache label (Disable ccache) Feb 26, 2025

Update test tolerance

197b844

jmachado-amd added 2 commits March 5, 2025 21:48

Update gesdd_getMemorySize method

61b9a44

Update changelog and code comments

8876006

amd-jnovotny approved these changes Mar 5, 2025

Implement review comment

53f3558
