improve batching sumcheck concurrency to dominate by max num var polynomial #895

hero78119 · 2025-04-03T06:49:48Z

Previously (devirgo) sumcheck concurrency when batching with different number of variables, are dominated by the smallest num_vars. The reason is because for the num_var < log_2(threads) we can NOT divide it's evaluation into #thread region.

This PR addressed the issue by introducing a extra meta information for each polynomial, so we are able to differentiate those small poly and calculate its multiplicity correctly

Design rationale

For some extreme small polynomial (num_var << log(threads)), we define a new PolyType::Phase2Only to differentiate them. These type of small polynomial will be handle by main_worker only during phase 1 sumcheck. And when moving to phase2 sumcheck, those small poly will expand the size to match with all. This cost is negligible giving phase2 only need to run on #thread evaluations.

benchmark

There is no different with before/after change, since we dont have batch sumcheck with different num_vars in critical path, which meet expected

e2e Fibonacci 2^20

fibonacci_max_steps_1048576/prove_fibonacci/fibonacci_max_steps_1048576
                        time:   [2.8894 s 2.9028 s 2.9180 s]
                        change: [+0.1414% +0.8059% +1.5474%] (p = 0.05 < 0.05)
                        Change within noise threshold.

e2e Fibonacci 2^21

fibonacci_max_steps_2097152/prove_fibonacci/fibonacci_max_steps_2097152
                        time:   [5.1785 s 5.2061 s 5.2291 s]
                        change: [+0.7256% +1.3249% +1.9085%] (p = 0.00 < 0.05)
                        Change within noise threshold.

e2e Fibonacci 2^22

fibonacci_max_steps_4194304/prove_fibonacci/fibonacci_max_steps_4194304
                        time:   [10.308 s 10.330 s 10.353 s]
                        change: [+0.9437% +1.2216% +1.4899%] (p = 0.00 < 0.05)
                        Change within noise threshold.

Extract from PR #895 Remove forked transcript to simplify design. ### benchmark To assure no impact for e2e Fibonacci 2^20 ``` fibonacci_max_steps_1048576/prove_fibonacci/fibonacci_max_steps_1048576 time: [2.8696 s 2.8901 s 2.9127 s] change: [-1.0773% -0.1480% +0.8449%] (p = 0.79 > 0.05) No change in performance detected. ``` Fibonacci 2^21 ``` fibonacci_max_steps_2097152/prove_fibonacci/fibonacci_max_steps_2097152 time: [5.2648 s 5.2801 s 5.2959 s] change: [+0.4221% +0.9989% +1.5705%] (p = 0.00 < 0.05) Change within noise threshold. ```

spherel

LGTM!

improve batching sumcheck parallism dominated by max num var

5aced4a

hero78119 marked this pull request as draft April 3, 2025 06:49

hero78119 changed the title ~~improve batching sumcheck parallism dominated by max num var~~ improve batching sumcheck concurrency to dominate by max num var Apr 3, 2025

hero78119 changed the title ~~improve batching sumcheck concurrency to dominate by max num var~~ improve batching sumcheck concurrency to dominate by max num var polynomial Apr 3, 2025

hero78119 added 9 commits April 3, 2025 20:42

pass on normal poly logic

6f7c695

all compile & test pass

3a83908

refactor unittest

93b38f6

rename api

73c2396

sumcheck refactor and rename

46fdbf2

all test passed

3f07940

fix mpcs sumcheck usage

61a3bf8

more comment

c6aa233

rewrite unittest to fit into arbitrary cores

fcba46e

hero78119 marked this pull request as ready for review April 4, 2025 06:52

hero78119 requested a review from spherel April 4, 2025 06:58

hero78119 added 2 commits April 4, 2025 14:58

fmt

2138b52

Merge branch 'master' into feat/optimize_sumcheck_concurrency

7da125e

hero78119 mentioned this pull request Apr 7, 2025

simplify design without transcript fork #899

Merged

spherel approved these changes Apr 11, 2025

View reviewed changes

hero78119 added this pull request to the merge queue Apr 12, 2025

Merged via the queue into scroll-tech:master with commit a4def62 Apr 12, 2025
4 checks passed

hero78119 deleted the feat/optimize_sumcheck_concurrency branch April 12, 2025 01:26

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

improve batching sumcheck concurrency to dominate by max num var polynomial #895

improve batching sumcheck concurrency to dominate by max num var polynomial #895

hero78119 commented Apr 3, 2025 •

edited

Loading

spherel left a comment

improve batching sumcheck concurrency to dominate by max num var polynomial #895

improve batching sumcheck concurrency to dominate by max num var polynomial #895

Conversation

hero78119 commented Apr 3, 2025 • edited Loading

Design rationale

benchmark

spherel left a comment

Choose a reason for hiding this comment

hero78119 commented Apr 3, 2025 •

edited

Loading