Initial version of sp1-prover component for testing #12

akonring · 2024-07-25T13:57:50Z

Closes: EspressoSystems/zkrollup-integration/issues/4

tl;dr

This PR merely lays the foundation for an sp1-prover Rust component which may be able to replace the existing zkevm-prover in the future. The new sp1-prover component consists of three (mocked) services: aggregator, executor_service and hashdb_service.
Testing with the new sp1-prover component can be done in 2 different configurations:
1. make run-sp1
  This runs the full node test but with the new sp1-prover in addition to the existing zkevm-prover. The new sp1-prover will announce itself to the aggregator and will eventually receive proof requests. sp1-prover can be configured to emulate the zkevm-prover’s aggregator-client-mock.cpp using data/mocked_data.json as mock data, with which verification will succeed. Alternatively, it can use other data (e.g. data/mocked_sp1_data.json), which will cause the demo (aggregator) to eventually fail in verification. See more details below in: "Running e2e test with additional sp1-prover".
2. make run-sp1-only
  This configuration is for exploring the work needed to completely replace the zkevm-prover, including executor_service and hashdb_service. Upon startup the synchronizer needs to connect to both executor and hashdb to be able to make progress.
  For now, only a few RPC calls are stubbed for these services and the rest remains unimplemented. See more details below: "Running e2e test with sp1-prover only".
The overall goal of make run-sp1 (and this PR) is to unblock any potential contracts work further up the stack (Aggregator/EthTxManager/L1). Fully replacing the zkevm-prover (blackbox) seems like a very heavy lift, mainly because of the prover’s responsibilities as executor and hashdb. It might be worth investigating whether we can extract the “prover” part of the zkevm-prover (communicating with the aggregator) while keeping the rest of the logic (executor/hashdb) intact. However, the intricacies of the prover logic make this seem non-trivial. Finally, since the purpose of this PR is mainly to unblock other work and explore different design strategies for replacing the prover, the code is unpolished and lacks basic error handling and logging infra.

Common Setup

Make sure that cargo is able to cross compile targeting musl libc (for MacOS/M1 see e.g. https://github.com/FiloSottile/homebrew-musl-cross) and add the correct linker to your cargo configuration. For example:

[target.x86_64-unknown-linux-musl]
linker = "x86_64-linux-musl-gcc"

Build the binary:

cdk-validium-node % cd sp1-prover
sp1-prover % cargo build --release --target=x86_64-unknown-linux-musl

Build the docker images (sp1-prover and zkevm-node):

cdk-validium-node % make build-docker-sp1

Running e2e test with additional sp1-prover

(Make sure that the Common Setup has been completed)

To test that the verification succeeds with the default mock data and indeed fails when we replace the proof, we first run the e2e test as is:

cdk-validium-node % cd test
test % make run-sp1

This will run the e2e test with both sp1-prover and zkevm-prover. The sp1-prover will connect to the aggregator and await any proof requests. We can check if sp1-prover has been asked for a final proof:

% docker logs -f sp1-prover |& grep -e "GenFinalProofRequest" -e "GetProofRequest"
Received request: GetProofRequest { id: "aNMsMmQDUOh3KBOuCS9A0mvFIaORcEx2bzhru", timeout: 0 }
Received request: GenFinalProofRequest { recursive_proof: "88888670604050723159190639550237390237901487387303122609079617855313706601738", aggregator_addr: "0x70997970c51812dc3a010c7d01b50e0d17dc79c8" }
Received request: GetProofRequest { id: "RZkQ5d14SrB6hBIxTLGzwM2iCguw8qzl9uuNl", timeout: 0 }
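
The shape of these log lines can be reproduced with a minimal sketch. The structs below are hypothetical stand-ins for the generated protobuf types: only the field names are taken from the log output above, while the field types, the Request enum and the log_line helper are assumptions made for illustration.

```rust
// Hypothetical stand-ins for the generated protobuf request types.
// Field names come from the log output; field types are assumptions.
#[allow(dead_code)]
#[derive(Debug)]
struct GetProofRequest {
    id: String,
    timeout: u64,
}

#[allow(dead_code)]
#[derive(Debug)]
struct GenFinalProofRequest {
    recursive_proof: String,
    aggregator_addr: String,
}

// Illustrative wrapper so one helper can log either request kind.
enum Request {
    GetProof(GetProofRequest),
    GenFinalProof(GenFinalProofRequest),
}

/// Builds the log line for an incoming request using the derived Debug output.
fn log_line(req: &Request) -> String {
    match req {
        Request::GetProof(r) => format!("Received request: {:?}", r),
        Request::GenFinalProof(r) => format!("Received request: {:?}", r),
    }
}

fn main() {
    let requests = vec![
        Request::GetProof(GetProofRequest {
            id: "abc".to_string(),
            timeout: 0,
        }),
        Request::GenFinalProof(GenFinalProofRequest {
            recursive_proof: "123".to_string(),
            aggregator_addr: "0x70997970c51812dc3a010c7d01b50e0d17dc79c8".to_string(),
        }),
    ];
    for req in &requests {
        println!("{}", log_line(req));
    }
}
```

Because the lines are plain Debug output, the type names (GetProofRequest, GenFinalProofRequest) are stable strings to grep for.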

To check that the zkevm-aggregator has received the proof and it has been correctly verified, we check:

test % docker logs -f zkevm-aggregator |& grep -e "Final proof verified"
2024-07-25T13:10:12.783Z	INFO	aggregator/aggregator.go:1300	Final proof verified	{"pid": 1, "version": "923b75a8", "txId": "proof-from-1-to-1", "batches": "1-1"}

Making the verification fail:

We can make the verification fail by submitting a different proof to the aggregator.

Change line 11 in sp1-prover/src/aggregator from:

static MOCKED_DATA: &str = include_str!("data/mocked_data.json");

to

static MOCKED_DATA: &str = include_str!("data/mocked_sp1_data.json");

such that the aggregator fetches the mock data from mocked_sp1_data.json instead.
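
As a possible follow-up (not part of this PR), the data set could be selected at startup instead of by editing the source. The sketch below is only an illustration: SP1_MOCK_DATA is a hypothetical environment variable, select_mock_data is a hypothetical helper, and the two constants are inline stand-ins for what include_str! would embed from the real files.

```rust
use std::env;

// Inline stand-ins for the embedded files. In the component these would be
// include_str!("data/mocked_data.json") and include_str!("data/mocked_sp1_data.json").
const MOCKED_DATA: &str = r#"{"source": "mocked_data.json"}"#;
const MOCKED_SP1_DATA: &str = r#"{"source": "mocked_sp1_data.json"}"#;

/// Picks the mock data set from an optional selector value.
/// "sp1" selects the data that fails verification; anything else
/// falls back to the data that passes.
fn select_mock_data(selector: Option<&str>) -> &'static str {
    match selector {
        Some("sp1") => MOCKED_SP1_DATA,
        _ => MOCKED_DATA,
    }
}

fn main() {
    // SP1_MOCK_DATA is a hypothetical variable name, not one this PR defines.
    let selector = env::var("SP1_MOCK_DATA").ok();
    let data = select_mock_data(selector.as_deref());
    println!("{}", data);
}
```

With something like this, switching between the passing and failing configuration would not require rebuilding the binary, only restarting the container with a different environment.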

Rebuild the binary and the docker image (see: Common Setup section).

Run the demo again and check that the verification fails:

test % make run-sp1
test % docker logs -f zkevm-aggregator |& grep -e "ERROR"
2024-07-25T13:34:53.258Z	ERROR	etherman/etherman.go:1027	error converting proof. Error: invalid proof length. Length: 1730, Proof:[...]
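
The error above is a length check on the encoded proof. A minimal sketch of such a check follows, under a loudly stated assumption: the expected encoding is taken to be a 0x-prefixed hex string of 24 words of 32 bytes each (1538 characters). That figure is an assumption for illustration and is not stated anywhere in this PR; check_proof_len and EXPECTED_PROOF_LEN are hypothetical names.

```rust
// Assumed expected length: "0x" prefix + 24 words * 32 bytes * 2 hex chars.
// This constant is an assumption, not a value taken from this PR.
const EXPECTED_PROOF_LEN: usize = 2 + 24 * 32 * 2;

/// Rejects any proof string whose length differs from the assumed one.
fn check_proof_len(proof: &str) -> Result<(), String> {
    if proof.len() != EXPECTED_PROOF_LEN {
        return Err(format!("invalid proof length. Length: {}", proof.len()));
    }
    Ok(())
}

fn main() {
    // A 1730-character string, the length reported in the log above.
    let too_long = format!("0x{}", "0".repeat(1728));
    println!("{:?}", check_proof_len(&too_long));

    // A string of the assumed expected length passes the check.
    let expected = format!("0x{}", "0".repeat(EXPECTED_PROOF_LEN - 2));
    println!("{:?}", check_proof_len(&expected));
}
```

Under that assumption, the failure is a format mismatch caught before any cryptographic verification takes place.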

Running e2e test with sp1-prover only

~~Todo~~
Postponing further description of this configuration because (as of now) replacing the full prover (incl. executor and hashdb) seems infeasible to do within a reasonable timeline.

This PR does not:

Let nix support the cross compilation through the use of cross-shell.nix. For now, the cross compilation can be done outside of the nix environment. This should be fixed in the near future to avoid inconsistencies between environments.

philippecamacho · 2024-08-20T14:38:05Z

Thanks for this analysis @akonring. Seems to me that we should go for the easiest route as we are working on a demo for now. So having both provers work side by side sounds like the way to go. What is the input to our SP1 zkVM prover? The list of EVM transactions, something more?

akonring · 2024-08-22T07:47:03Z

Thanks for taking a look. The original plan of the issue was to swap the prover to SP1. This turns out to be complicated because fully replacing the prover (incl. state transition proofs) is just a lot of work. On the other hand, merely setting up a demo with a single SP1 prover (producing only namespace proofs and ignoring state transition proofs) is also a heavy lift because the existing CDK-validium-node stack relies on the zkEVM-prover running its other components (executor, hashdb).

Naturally, this led to the idea of having the two provers work in parallel, which is what this PR explores.

Currently the aggregator naively distributes the proof requests between available provers (provers that have initially connected to aggregator). Going forward, we might be able to modify the interface between aggregator and prover such that the aggregator can be aware of the type of prover that is connected to it such that only requests for namespace proofs will be distributed to our new SP1-prover while the original state-transition proofs will be handled by another zkEVM prover.
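
A type-aware distribution could look roughly like the sketch below. Everything in it is hypothetical: the ProofKind and ProverKind enums, the Prover struct and the pick_prover function are illustrative names and do not exist in the aggregator or in aggregator.proto today.

```rust
// Hypothetical proof and prover kinds; the current interface has no such notion.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum ProofKind {
    StateTransition,
    Namespace,
}

#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum ProverKind {
    ZkEvm,
    Sp1,
}

#[derive(Debug)]
struct Prover {
    id: String,
    kind: ProverKind,
    busy: bool,
}

/// Which prover kind is allowed to serve which proof kind.
fn handles(prover: ProverKind, proof: ProofKind) -> bool {
    matches!(
        (prover, proof),
        (ProverKind::ZkEvm, ProofKind::StateTransition) | (ProverKind::Sp1, ProofKind::Namespace)
    )
}

/// Returns the first idle prover that can serve the requested proof kind.
fn pick_prover(provers: &[Prover], proof: ProofKind) -> Option<&Prover> {
    provers.iter().find(|p| !p.busy && handles(p.kind, proof))
}

/// Two connected provers, mirroring the make run-sp1 setup.
fn demo_provers() -> Vec<Prover> {
    vec![
        Prover { id: "zkevm-prover".to_string(), kind: ProverKind::ZkEvm, busy: false },
        Prover { id: "sp1-prover".to_string(), kind: ProverKind::Sp1, busy: false },
    ]
}

fn main() {
    let provers = demo_provers();
    for kind in [ProofKind::StateTransition, ProofKind::Namespace] {
        let chosen = pick_prover(&provers, kind).map(|p| p.id.as_str());
        println!("{:?} -> {:?}", kind, chosen);
    }
}
```

In practice, the prover would have to announce its kind when it first connects to the aggregator, which is the interface change mentioned above.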

> What is the input to our SP1 zkVM prover?

As of now, the new SP1 prover is just an additional prover mimicking the functionality of the existing mock zkEVM prover so the API is the same but we might want to change this (see above comment).

The L2 batch and other input that is necessary to compute the proofs can be seen in the interface description here:

cdk-validium-node/proto/src/proto/aggregator/v1/aggregator.proto

Line 263 in 1ed2273

bytes batch_l2_data = 6;

alxiong · 2024-08-22T13:23:33Z

> Going forward, we might be able to modify the interface between aggregator and prover such that the aggregator can be aware of the type of prover that is connected to it such that only requests for namespace proofs will be distributed to our new SP1-prover while the original state-transition proofs will be handled by another zkEVM prover.

completely agree

> What is the input to our SP1 zkVM prover?

> As of now, the new SP1 prover is just an additional prover mimicking the functionality of the existing mock zkEVM prover so the API is the same but we might want to change this (see above comment).

we should start with the Fibonacci program input, which is basically n, i.e. which value of the sequence (the n-th) to compute:
https://github.com/EspressoSystems/zkrollup-integration/blob/da52d5a82aa9e2458dad6546aace299a4127c239/sp1/script/src/bin/prove.rs#L52-L54

and once Chengyu finishes his part, we update the input based on his script here:
https://github.com/EspressoSystems/zkrollup-integration/blob/0903049888eba153a056a140644cab2cff04629d/sp1/script/src/bin/prove.rs#L59
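
For reference, the computation behind the Fibonacci input is small. The sketch below is plain Rust without the SP1 SDK, so it only illustrates the input/output relation (n in, n-th value out) and is not the linked script. The fibonacci function, its u32 types and the use of wrapping addition are assumptions.

```rust
/// Returns the n-th Fibonacci number with fib(0) = 0 and fib(1) = 1.
/// Wrapping addition is used so that large n do not panic on overflow.
fn fibonacci(n: u32) -> u32 {
    let (mut a, mut b) = (0u32, 1u32);
    for _ in 0..n {
        let next = a.wrapping_add(b);
        a = b;
        b = next;
    }
    a
}

fn main() {
    // n is the only input the program needs.
    let n = 20;
    println!("fib({}) = {}", n, fibonacci(n));
}
```

A single integer input keeps the prover interface trivial while the aggregator wiring is being tested.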

akonring added 2 commits July 25, 2024 12:22

add sp1-prover rust project

a320c1a

adjust docker and make files

923b75a

akonring self-assigned this Jul 25, 2024

akonring requested review from mrain, philippecamacho, ggutoski and alxiong July 25, 2024 14:08

fix data and make file

e6ff6d5

akonring marked this pull request as ready for review July 26, 2024 15:21

add new entry to make file to avoid breaking e2e

36cf9a6

akonring force-pushed the ak/sp1-prover-init branch from c0a68e1 to 36cf9a6 Compare July 28, 2024 09:50

akonring mentioned this pull request Sep 3, 2024

Move SP1 prover from cdk-validium-node to zkrollup-integration EspressoSystems/zkrollup-integration#50

Open
