Support Dynamically Quantized Convolutions #9021


Closed
mcr229 opened this issue Mar 6, 2025 · 3 comments · Fixed by #10347
Assignees
Labels
good first issue Good for newcomers module: xnnpack Issues related to xnnpack delegation and the code under backends/xnnpack/ triaged This issue has been looked at by a team member, and triaged and prioritized into an appropriate module
Milestone

Comments

@mcr229
Contributor

mcr229 commented Mar 6, 2025

We wish to have some initial support for Dynamically Quantized Convolutions.

Let's first write a test to drive development of the feature, in backends.xnnpack.test.ops.test_conv2d (the module we'll run below). Let's just do 2d convolutions for now. Take a look at how we test dqlinears in general, and let's try to add a similar test here. Since we're adding quantizer support, we should make sure that after

.quantize()
.export()

we should check that a choose_qparams node is in the graph. Now, after we've added this test, when we run it with
python -m unittest backends.xnnpack.test.ops.test_conv2d.... it should fail because it can't find the choose_qparams node. Let's start by enabling the quantizer to properly annotate convolutions.
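The intended test flow can be sketched as follows. This is not the real Tester API from the XNNPACK test harness; the class and helper below are hypothetical stand-ins that only illustrate the assertion pattern of "after quantize + export, a choose_qparams node must appear in the graph":

```python
import unittest

# Hypothetical stand-in for an exported graph. In the real test, the Tester
# pipeline's .quantize().export() stages produce a torch.export graph whose
# node targets can be checked; here a plain list of target names models that.
class FakeExportStage:
    def __init__(self, node_targets):
        self.node_targets = node_targets

    def check(self, expected_targets):
        # Mirrors the "check" idea: every expected target must occur
        # somewhere in the graph's node targets.
        for target in expected_targets:
            assert any(
                target in node for node in self.node_targets
            ), f"missing node: {target}"
        return self


class TestDQConv2d(unittest.TestCase):
    def test_dq_conv2d_has_choose_qparams(self):
        # Once dynamic-quantization annotation works, export should insert a
        # choose_qparams node that computes scale/zero-point at runtime.
        exported = FakeExportStage(
            [
                "quantized_decomposed.choose_qparams.tensor",
                "aten.convolution.default",
            ]
        )
        exported.check(["choose_qparams"])
```

Before the quantizer change lands, the equivalent real check fails because no choose_qparams node is produced for convolutions.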

Since we're starting with conv2d, we should only annotate dynamically quantized convs if they are 2d. We can add a check that len(outputpadding) == 2 somewhere here:
https://github.com/pytorch/executorch/blob/main/backends/xnnpack/quantizer/xnnpack_quantizer_utils.py#L295
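A minimal sketch of that dimensionality guard, assuming the annotator can read a per-spatial-dimension list (such as padding or output_padding) off the conv node's args; the function name is hypothetical:

```python
# Hypothetical guard for the annotator: only treat a conv as dynamically
# quantizable when it is 2-D. Conv2d carries length-2 per-dimension lists
# (padding, stride, output_padding for transposed convs), while conv1d
# carries length-1 lists.
def should_annotate_dq_conv(per_dim_attr):
    # per_dim_attr would come from the conv node's args in the real annotator.
    return len(per_dim_attr) == 2
```

With this in place, dq annotation is applied to conv2d and silently skipped for conv1d until 1d support is added.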

Now that we have it annotated, it should pass through the test that's checking for the choose_qparams node. Now we just need to update our partitioner to allow dynamically quantized convolutions:

ConfigPrecisionType.STATIC_QUANT,
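The shape of that change can be sketched with stand-ins. The real ConfigPrecisionType enum and the conv config live in the XNNPACK partitioner config; the exact class and values below are assumptions used only to show where DYNAMIC_QUANT gets added alongside STATIC_QUANT:

```python
from enum import Enum, auto

# Stand-in for the partitioner's precision-type enum (values hypothetical).
class ConfigPrecisionType(Enum):
    FP32 = auto()
    STATIC_QUANT = auto()
    DYNAMIC_QUANT = auto()

# Stand-in for the convolution partitioner config: the fix is to include
# DYNAMIC_QUANT in the precisions the config reports as supported, so the
# partitioner will claim dynamically quantized convolutions.
class ConvolutionConfig:
    def supported_precision_types(self):
        return [
            ConfigPrecisionType.FP32,
            ConfigPrecisionType.STATIC_QUANT,
            ConfigPrecisionType.DYNAMIC_QUANT,  # newly allowed
        ]
```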

It would also be nice to check in our constraints that if we detect a dynamically quantized convolution and it is 1d, we don't partition it. After this, the test should pass. There may be some lingering issues with the wiring; if that's the case, feel free to reach out in the Discord group:

https://discord.com/channels/1334270993966825602/1336777807509979188
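The constraint above can be sketched as follows; the function name and argument shapes are hypothetical, since the real check would live with the partitioner config's other conv constraints:

```python
# Hypothetical partitioner constraint: refuse to partition a dynamically
# quantized convolution unless it is 2-D, since only dq conv2d is being
# enabled for now. FP32 and statically quantized convs are unaffected.
def conv_partition_constraint(is_dynamic_quant, num_spatial_dims):
    if is_dynamic_quant and num_spatial_dims != 2:
        return False  # don't partition dq conv1d (or conv3d)
    return True
```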

cc @digantdesai @cbilgin

@mcr229 mcr229 moved this to Backlog in ExecuTorch - CPU Mar 6, 2025
@mcr229 mcr229 added this to the 0.6.0 milestone Mar 6, 2025
@mcr229 mcr229 added the module: xnnpack Issues related to xnnpack delegation and the code under backends/xnnpack/ label Mar 6, 2025
@iseeyuan iseeyuan added the triaged This issue has been looked at by a team member, and triaged and prioritized into an appropriate module label Mar 7, 2025
@mcr229 mcr229 added the good first issue Good for newcomers label Apr 4, 2025
@keyprocedure
Contributor

I’d be happy to work on this and learn as I go!

@mcr229
Contributor Author

mcr229 commented Apr 4, 2025

Thank you @keyprocedure, I assigned it to you. Feel free to ask lots of questions in the XNNPACK Discord channel. This one is definitely more involved, so I'm happy to discuss and answer any questions you may have.

@iliasslasri

@keyprocedure I would love to help! We can chat on Discord @ilqqq.
