
[Operator] Add diagonal backward #329

Merged
5 commits merged on Dec 5, 2024

Conversation

@awayzjj (Collaborator) commented Nov 26, 2024

PR Category

Type of Change

Description

Issue

Close #314

Progress

  • Change is properly reviewed (1 reviewer required, 2 recommended).
  • Change responds to an issue.
  • Change is fully covered by a UT.

Performance

(two benchmark result screenshots attached)

@awayzjj (Collaborator, Author) commented Nov 26, 2024

@StrongSpoon Hi, I have 2 questions.

  1. How do I register the backward independently into the aten library? I could not find a reference in the repo, so for now I implemented a class with forward and backward functions as a draft.
  2. I implemented the backward function as below, and the UT passed:
    def backward(ctx, out_grad):
        logging.debug("GEMS DIAGONAL BACKWARD")
        (inp,) = ctx.saved_tensors
        grad_input = torch.zeros_like(inp)
        diag = torch.diagonal(grad_input, ctx.offset, ctx.dim1, ctx.dim2)
        diag.copy_(out_grad)
        return grad_input, None, None, None

It is simple: torch.diagonal gives a view into grad_input, and the values are copied from out_grad into that view. I wonder whether we need a Triton kernel here.

Thank you very much!
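
(For reference, a minimal sketch of the draft class described above. The backward is the one quoted in the comment; the forward shown here is an assumption inferred from the ctx fields the backward uses, and the names are illustrative only.)

import logging

import torch


class Diagonal(torch.autograd.Function):
    @staticmethod
    def forward(ctx, inp, offset=0, dim1=0, dim2=1):
        logging.debug("GEMS DIAGONAL")
        ctx.save_for_backward(inp)
        # stash the arguments so backward can rebuild the diagonal view
        ctx.offset = offset
        ctx.dim1 = dim1
        ctx.dim2 = dim2
        # clone so the output does not alias the input; the real aten::diagonal
        # returns a view, this is simplified for the sketch
        return torch.diagonal(inp, offset, dim1, dim2).clone()

    @staticmethod
    def backward(ctx, out_grad):
        logging.debug("GEMS DIAGONAL BACKWARD")
        (inp,) = ctx.saved_tensors
        grad_input = torch.zeros_like(inp)
        diag = torch.diagonal(grad_input, ctx.offset, ctx.dim1, ctx.dim2)
        diag.copy_(out_grad)
        return grad_input, None, None, None


# usage: out = Diagonal.apply(x, 1, 0, 1)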

@awayzjj (Collaborator, Author) commented Nov 28, 2024

@StrongSpoon A gentle reminder.


@StrongSpoon (Collaborator) commented:

Hi awayzjj,

We used to implement an operator with both forward and backward as a subclass of torch.autograd.Function and register it to the forward interface under the AutogradCUDA key. But we recently found that AutogradCUDA does not work perfectly with torch.compile. As a solution, we now recommend implementing the forward and backward as separate functions and registering each of them into the aten library.
Take tanh as an example: its forward interface is defined in https://github.com/pytorch/pytorch/blob/main/aten/src/ATen/native/native_functions.yaml. In the previous practice, we implemented a class Tanh and registered it to the tanh interface. It is better to register the forward function to tanh and the backward function to tanh_backward, using CUDA as the dispatch key.

- func: tanh(Tensor self) -> Tensor
  device_check: NoCheck   # TensorIterator
  structured_delegate: tanh.out
  variants: function, method
  dispatch:
    QuantizedCPU: tanh_quantized_cpu
    MkldnnCPU: mkldnn_tanh
    SparseCPU, SparseCUDA: tanh_sparse
    SparseCsrCPU, SparseCsrCUDA, SparseCsrMeta: tanh_sparse_csr
    NestedTensorCPU, NestedTensorCUDA: NestedTensor_tanh
  tags: [core, pointwise]
- func: tanh_backward(Tensor grad_output, Tensor output) -> Tensor
  python_module: nn
  structured_delegate: tanh_backward.grad_input
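
(For illustration, a minimal sketch of this registration pattern using torch.library; this is not FlagGems' actual code, and the tanh_forward / tanh_backward bodies below are plain-PyTorch placeholders standing in for Triton-backed implementations.)

import torch


def tanh_forward(self):
    # placeholder for a Triton-backed forward kernel
    e2x = torch.exp(2 * self)
    return (e2x - 1) / (e2x + 1)


def tanh_backward(grad_output, output):
    # placeholder for a Triton-backed backward kernel: d tanh(x)/dx = 1 - tanh(x)^2
    return grad_output * (1 - output * output)


# register each function to its own aten entry under the CUDA dispatch key
aten_lib = torch.library.Library("aten", "IMPL")
aten_lib.impl("tanh", tanh_forward, "CUDA")
aten_lib.impl("tanh_backward", tanh_backward, "CUDA")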

@StrongSpoon (Collaborator) commented:

In the same way, native_functions.yaml defines a diagonal_backward interface, which developers are expected to reimplement.

@StrongSpoon (Collaborator) commented:

Besides, we require developers to implement the function by writing a Triton kernel rather than composing torch APIs. If you are unsure about the format, please refer to our source code.
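
(For concreteness, a rough sketch of what a Triton-based diagonal_backward could look like, restricted to the 2-D case; the names and signature here are illustrative and not the code that ended up in this PR.)

import torch
import triton
import triton.language as tl


@triton.jit
def diag_fill_kernel(
    grad_in_ptr,             # (rows, cols) zero-filled grad_input
    grad_out_ptr,            # contiguous 1-D grad_output of length diag_len
    diag_len,
    row_start, col_start,    # position of the first diagonal element
    stride_row, stride_col,
    BLOCK: tl.constexpr,
):
    pid = tl.program_id(0)
    idx = pid * BLOCK + tl.arange(0, BLOCK)
    mask = idx < diag_len
    val = tl.load(grad_out_ptr + idx, mask=mask)
    ptrs = grad_in_ptr + (row_start + idx) * stride_row + (col_start + idx) * stride_col
    tl.store(ptrs, val, mask=mask)


def diagonal_backward_2d(grad_output, input_sizes, offset=0):
    rows, cols = input_sizes
    grad_input = torch.zeros(rows, cols, dtype=grad_output.dtype,
                             device=grad_output.device)
    diag_len = grad_output.numel()
    if diag_len > 0:
        grid = (triton.cdiv(diag_len, 1024),)
        diag_fill_kernel[grid](
            grad_input, grad_output.contiguous(), diag_len,
            max(-offset, 0), max(offset, 0),
            grad_input.stride(0), grad_input.stride(1),
            BLOCK=1024,
        )
    return grad_input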

@StrongSpoon StrongSpoon self-assigned this Dec 4, 2024
@awayzjj awayzjj requested a review from StrongSpoon December 5, 2024 01:31
@awayzjj (Collaborator, Author) commented Dec 5, 2024


Please review my PR, thanks!

@StrongSpoon (Collaborator) left a review:

please provide the benchmark results.

return grad_input


def diagonal_backward(grad_output, input_sizes, offset, dim1, dim2):
@StrongSpoon (Collaborator) commented on the code above:

it's okay to fuse backward and diagonal_backward into one.

input_sizes, dtype=grad_output.dtype, device=grad_output.device
)
diag = torch.diagonal(grad_input, offset, dim1, dim2)
copy_func.instantiate(grad_output.ndim)(grad_output, out0=diag)
@StrongSpoon (Collaborator) commented on the code above:

since torch.zeros also calls for a kernel, there exist two kernels indeed. I wonder if it's feasible to initialize grad_input as an empty tensor, and assign it in one kernel function.

@awayzjj (Collaborator, Author) replied:

Got it, I'll give it a try.
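
(One way the reviewer's single-kernel suggestion could be realized, sketched for the contiguous 2-D case with illustrative names, not the code that ended up in this PR: the kernel covers every element of grad_input, writing the matching grad_output value on the diagonal and zero everywhere else, so the separate torch.zeros launch is no longer needed.)

import torch
import triton
import triton.language as tl


@triton.jit
def diag_scatter_kernel(
    grad_in_ptr,        # contiguous (rows, cols) grad_input, written in full
    grad_out_ptr,       # contiguous 1-D grad_output
    rows, cols, offset,
    BLOCK: tl.constexpr,
):
    pid = tl.program_id(0)
    flat = pid * BLOCK + tl.arange(0, BLOCK)
    mask = flat < rows * cols
    row = flat // cols
    col = flat % cols
    on_diag = mask & (col - row == offset)
    # on the diagonal the matching grad_output index is min(row, col)
    diag_idx = tl.minimum(row, col)
    # the masked load yields 0.0 off the diagonal, which is exactly the fill value
    val = tl.load(grad_out_ptr + diag_idx, mask=on_diag, other=0.0)
    tl.store(grad_in_ptr + flat, val, mask=mask)


def diagonal_backward_fused(grad_output, input_sizes, offset=0):
    rows, cols = input_sizes
    grad_input = torch.empty(rows, cols, dtype=grad_output.dtype,
                             device=grad_output.device)
    if rows * cols > 0:
        grid = (triton.cdiv(rows * cols, 1024),)
        diag_scatter_kernel[grid](grad_input, grad_output.contiguous(),
                                  rows, cols, offset, BLOCK=1024)
    return grad_input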

res_out = to_reference(res_out)
res_in_grad = to_reference(res_in_grad)
gems_assert_equal(res_out, ref_out)
gems_assert_close(res_in_grad, ref_in_grad, dtype)
@StrongSpoon (Collaborator) commented on the test code above:

Why not require them to be equal? I thought the backward function doesn't change the values.

@awayzjj (Collaborator, Author) replied:

Yes, I fixed it.

@awayzjj (Collaborator, Author) commented Dec 5, 2024

Hi, @StrongSpoon
CI failed, but the failing UT is not relevant to my PR (I can reproduce it after running the UT several times on the latest master branch).
(screenshot of the failing CI check)

@StrongSpoon (Collaborator) left a review:

lgtm

@StrongSpoon merged commit 923567e into FlagOpen:master on Dec 5, 2024
8 of 9 checks passed
StrongSpoon pushed a commit that referenced this pull request Dec 12, 2024
* diagonal v0

* impl triton version

* fix code format

* fix --ref cpu failed

* use gems_assert_equal to validate res_in_grad
DuanYaQi pushed a commit that referenced this pull request Dec 17, 2024 (same commit messages as above)
Gxiandy pushed a commit to Gxiandy/FlagGems that referenced this pull request Jan 12, 2025 (same commit messages as above)
Successfully merging this pull request may close these issues:

Code Contribution: 【Lv1】【Operator Development】diagonal_backward (#314)