
add conv2d conv1d forward function #166

Open · wants to merge 17 commits into master from dev_xcoresigma_jiangbin_conv

Conversation

@FatJhon (Collaborator) commented Aug 16, 2024

We have completed the forward function of the conv1d, conv2d, and conv2d_depthwise operators.

@tongxin (Contributor) commented Aug 19, 2024

There are new conflicts that need to be resolved. Please see to that.

@Bowen12992 (Collaborator) left a comment

Update the code to the latest base so that all the checks pass.

@StrongSpoon (Collaborator) left a comment

review ongoing

src/flag_gems/ops/conv1d.py (outdated; resolved)
+ (tl.arange(0, BLOCK_C_IN) * padded_width_input)[:, None]
+ i * stride_width
+ tl.arange(0, BLOCK_W)
)
Collaborator

So it doesn't support a width_kernel that is not a power of 2?

Collaborator Author

This is due to the contiguous data loading: a block currently only supports power-of-2 sizes, so the kernel's height and width are restricted to powers of 2 as well.
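For context, the usual workaround in Triton is to pad the block extent up to the next power of 2 and mask off the extra lanes, so a non-power-of-2 kernel width could still be loaded. A minimal, self-contained sketch (illustrative only, not code from this PR; the names are made up):

import torch
import triton
import triton.language as tl

@triton.jit
def masked_row_sum(x_ptr, out_ptr, n, BLOCK: tl.constexpr):
    # BLOCK must be a power of 2; lanes beyond n are disabled by the mask.
    offs = tl.arange(0, BLOCK)
    mask = offs < n
    x = tl.load(x_ptr + offs, mask=mask, other=0.0)
    tl.store(out_ptr, tl.sum(x, axis=0))

def row_sum(x):
    n = x.numel()
    out = torch.empty(1, device=x.device, dtype=x.dtype)
    # e.g. a kernel width of 17 is padded to a block of 32
    masked_row_sum[(1,)](x, out, n, BLOCK=triton.next_power_of_2(n))
    return out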

weight_value = tl.load(weight + weight_offset, mask=mask_weight, other=0)
input_value = tl.reshape(input_value, (BLOCK_N, BLOCK_OUT_WEIGHT))
weight_value = tl.reshape(weight_value, (BLOCK_OUT_WEIGHT, BLOCK_O))
accumulator = tl.dot(input_value, weight_value, allow_tf32=False)
Collaborator

accumulator redefined
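If the intent is to reduce over a loop, the usual pattern is to define the accumulator once before the loop and add each tl.dot result into it, rather than rebinding it each iteration. A self-contained sketch (illustrative only, not this PR's kernel; it assumes the matrices exactly fit one block):

import torch
import triton
import triton.language as tl

@triton.jit
def small_matmul(a_ptr, b_ptr, c_ptr, K, BLOCK_M: tl.constexpr,
                 BLOCK_N: tl.constexpr, BLOCK_K: tl.constexpr):
    offs_m = tl.arange(0, BLOCK_M)
    offs_n = tl.arange(0, BLOCK_N)
    acc = tl.zeros((BLOCK_M, BLOCK_N), dtype=tl.float32)  # defined once
    for k in range(0, K, BLOCK_K):
        offs_k = k + tl.arange(0, BLOCK_K)
        a = tl.load(a_ptr + offs_m[:, None] * K + offs_k[None, :])
        b = tl.load(b_ptr + offs_k[:, None] * BLOCK_N + offs_n[None, :])
        acc += tl.dot(a, b, allow_tf32=False)  # accumulate, don't overwrite
    tl.store(c_ptr + offs_m[:, None] * BLOCK_N + offs_n[None, :], acc)

a = torch.randn(16, 32, device="cuda")
b = torch.randn(32, 16, device="cuda")
c = torch.empty(16, 16, device="cuda")
small_matmul[(1,)](a, b, c, 32, BLOCK_M=16, BLOCK_N=16, BLOCK_K=16)
assert torch.allclose(c, a @ b, atol=1e-3)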

@pytest.mark.parametrize("kernel", [(17, 2, 2)])
@pytest.mark.parametrize("stride", [2])
@pytest.mark.parametrize("padding", [1])
@pytest.mark.parametrize("dtype", [torch.float32])
Collaborator

more test cases

Collaborator Author

added

@StrongSpoon (Collaborator) left a comment

please implement the performance tests.

)
input_ci_offset = offset_ci[:, None, None] * height_input * width_input
input_group_offset = (
pid_group[None, None, None, None] * c_input * width_input * height_input
Collaborator

initializing input_group_offset as a scalar is okay

input_col = mm(out_grad, weight_reshape.T)

# return dx,None,None,None,None,None,None
conv2d_col2img[grid](
Collaborator

Maybe we could fuse mm and col2img together into one backward kernel.

Collaborator

but not necessarily

Collaborator Author

done

@FatJhon force-pushed the dev_xcoresigma_jiangbin_conv branch from 121285e to 5adc6e6 on October 6, 2024 02:50
@@ -16,6 +16,9 @@
 from .bmm import bmm
 from .cat import cat
 from .clamp import clamp, clamp_tensor
+from .conv1d import conv1d
+from .conv2d import conv2d
+from .conv_depthwise2d import _conv_depthwise2d
Collaborator

Is conv_depthwise2d implemented?

Collaborator Author

added

    padding_width = padding[0]
else:
    padding_width = padding
return conv2d(
Collaborator

I suggest implementing a kernel function for conv1d specifically. Calling conv2d will incur additional runtime overhead.
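For reference, this is the pattern under discussion, sketched with torch's reference ops (illustrative only, not the PR's Triton code): a conv1d can be expressed through conv2d by inserting a dummy height dimension, which is correct but pays conv2d's extra indexing and dispatch cost.

import torch
import torch.nn.functional as F

def conv1d_via_conv2d(x, w, bias=None, stride=1, padding=0):
    # x: (N, C_in, W), w: (C_out, C_in, K)
    x2 = x.unsqueeze(2)                       # (N, C_in, 1, W)
    w2 = w.unsqueeze(2)                       # (C_out, C_in, 1, K)
    y2 = F.conv2d(x2, w2, bias, stride=(1, stride), padding=(0, padding))
    return y2.squeeze(2)                      # (N, C_out, W_out)

x = torch.randn(2, 3, 32)
w = torch.randn(4, 3, 5)
assert torch.allclose(conv1d_via_conv2d(x, w, padding=2),
                      F.conv1d(x, w, padding=2), atol=1e-4)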

accum = tl.zeros((BLOCK_NI_HO_WO, BLOCK_CO), dtype=tl.float32)

for h in range(kernel_height):
    for w in range(kernel_width):
Collaborator

Since kernel_height and kernel_width are processed as loop iteration ranges, why not support non-power-of-2 values for them?


class Conv2d(torch.autograd.Function):
    @staticmethod
    def forward(ctx, input, weight, bias, stride, padding, dilation, groups):
Collaborator

Dilation is not used at all, so it does not support dilation other than 1?

Collaborator Author

Dilation is not used.

Collaborator

I think we should raise an error if dilation > 1.
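A minimal sketch of the suggested guard (hypothetical, not the PR's final code):

def check_dilation(dilation):
    # dilation may be an int or an (h, w) pair, as in torch.nn.functional.conv2d
    if isinstance(dilation, int):
        dilation = (dilation, dilation)
    if any(d != 1 for d in dilation):
        raise NotImplementedError(
            f"dilation={dilation} is not supported; only dilation=1 is implemented"
        )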

Collaborator Author

added dilation function

    Returns:
        Output size of 2D convolution.
    """
    return (in_size + 2 * padding - kernel_size) // stride + 1
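As a quick sanity check of the quoted formula (illustrative only, not part of the thread): with in_size=32, kernel_size=3, padding=1, stride=2 it gives (32 + 2 - 3) // 2 + 1 = 16, which matches torch.

import torch
import torch.nn.functional as F

def conv_out_size(in_size, kernel_size, padding, stride):
    return (in_size + 2 * padding - kernel_size) // stride + 1

y = F.conv2d(torch.randn(1, 1, 32, 32), torch.randn(1, 1, 3, 3),
             stride=2, padding=1)
assert y.shape[-1] == conv_out_size(32, 3, 1, 2)  # 16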
@iclementine (Collaborator) commented Nov 6, 2024

Does it support asymmetric padding? I suppose not. Alright, torch's convolution does not support asymmetric padding.


return (
    input,
    None,
Collaborator

Gradients of the weight and bias should also be computed.
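For reference, the torch.autograd.Function contract this refers to, shown on a tiny made-up op (illustrative only, not the conv kernel): backward must return one gradient per forward argument, so gradients for weight and bias belong alongside the input gradient, with None for non-differentiable arguments such as stride or padding.

import torch

class ScaleShift(torch.autograd.Function):
    @staticmethod
    def forward(ctx, input, weight, bias):
        ctx.save_for_backward(input, weight)
        return input * weight + bias

    @staticmethod
    def backward(ctx, out_grad):
        input, weight = ctx.saved_tensors
        input_grad = out_grad * weight
        weight_grad = (out_grad * input).sum()
        bias_grad = out_grad.sum()
        # one return value per forward argument
        return input_grad, weight_grad, bias_grad

x = torch.randn(4, requires_grad=True)
w = torch.tensor(2.0, requires_grad=True)
b = torch.tensor(0.5, requires_grad=True)
ScaleShift.apply(x, w, b).sum().backward()
assert torch.allclose(w.grad, x.detach().sum())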

# default conv shapes for input and weight, stride, padding, groups
# fields: Ni Ci Hi Wi Co Hk Wk stride padding groups
ConvBenchmark:
  shapes:
Collaborator

5 shapes are enough for core mode.
