Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

refactor(levm): maintain cached state between executions in levm #2371

Merged
merged 30 commits into from
Apr 3, 2025

Conversation

jrchatruc
Copy link
Collaborator

@jrchatruc jrchatruc commented Apr 1, 2025

Motivation

This PR refactors levm introducing a GeneralizedDatabase (the name is not the best, suggestions are welcome), which is simply the combination of the regular store (i.e. the on-disk database) and the cache (the in memory structure we use to keep track of changes done to accounts as execution happens).

More importantly, it makes levm (the VM struct) hold a mutable reference to said database instead of a regular owned value. This is to bring the levm api more in line to what we need it to be for ethrex; the main problem we are solving is to be able to run multiple transactions while keeping the underlying cache changes from those transactions. Once we are done with execution (because we've finished executing or building the block or group of blocks), calling get_state_transitions will drain the cachedb for all the account updates and return them.

This change allowed, among other things, to implement for levm the previously uninplemented function execute_block_without_clearing_state, which is used for syncing.

Additionally, this PR introduces error handling for the levm Database trait; previously its member functions did not return errors and we were just unwrapping on some of their implementations.

Description

Closes #issue_number

Copy link

github-actions bot commented Apr 1, 2025

Lines of code report

Total lines added: 230
Total lines removed: 341
Total lines changed: 571

Detailed view
+----------------------------------------------------------+-------+------+
| File                                                     | Lines | Diff |
+----------------------------------------------------------+-------+------+
| ethrex/cmd/ef_tests/state/report.rs                      | 923   | +1   |
+----------------------------------------------------------+-------+------+
| ethrex/cmd/ef_tests/state/runner/levm_runner.rs          | 366   | +8   |
+----------------------------------------------------------+-------+------+
| ethrex/cmd/ef_tests/state/runner/mod.rs                  | 287   | +6   |
+----------------------------------------------------------+-------+------+
| ethrex/cmd/ef_tests/state/runner/revm_runner.rs          | 525   | -7   |
+----------------------------------------------------------+-------+------+
| ethrex/cmd/ef_tests/state/utils.rs                       | 80    | +3   |
+----------------------------------------------------------+-------+------+
| ethrex/cmd/ethrex/initializers.rs                        | 337   | +1   |
+----------------------------------------------------------+-------+------+
| ethrex/cmd/ethrex/networks.rs                            | 22    | -1   |
+----------------------------------------------------------+-------+------+
| ethrex/crates/networking/p2p/p2p.rs                      | 11    | +1   |
+----------------------------------------------------------+-------+------+
| ethrex/crates/networking/p2p/sync.rs                     | 576   | +14  |
+----------------------------------------------------------+-------+------+
| ethrex/crates/networking/p2p/sync/bytecode_fetcher.rs    | 45    | +2   |
+----------------------------------------------------------+-------+------+
| ethrex/crates/networking/p2p/sync/storage_fetcher.rs     | 238   | +3   |
+----------------------------------------------------------+-------+------+
| ethrex/crates/networking/p2p/sync_manager.rs             | 124   | +124 |
+----------------------------------------------------------+-------+------+
| ethrex/crates/networking/rpc/engine/fork_choice.rs       | 372   | -12  |
+----------------------------------------------------------+-------+------+
| ethrex/crates/networking/rpc/eth/filter.rs               | 602   | -1   |
+----------------------------------------------------------+-------+------+
| ethrex/crates/networking/rpc/eth/gas_price.rs            | 140   | -1   |
+----------------------------------------------------------+-------+------+
| ethrex/crates/networking/rpc/eth/max_priority_fee.rs     | 125   | -1   |
+----------------------------------------------------------+-------+------+
| ethrex/crates/networking/rpc/rpc.rs                      | 704   | -17  |
+----------------------------------------------------------+-------+------+
| ethrex/crates/vm/backends/levm/db.rs                     | 130   | +36  |
+----------------------------------------------------------+-------+------+
| ethrex/crates/vm/backends/levm/mod.rs                    | 481   | -71  |
+----------------------------------------------------------+-------+------+
| ethrex/crates/vm/backends/mod.rs                         | 255   | -85  |
+----------------------------------------------------------+-------+------+
| ethrex/crates/vm/errors.rs                               | 119   | +2   |
+----------------------------------------------------------+-------+------+
| ethrex/crates/vm/levm/src/db/error.rs                    | 6     | +6   |
+----------------------------------------------------------+-------+------+
| ethrex/crates/vm/levm/src/db/mod.rs                      | 23    | +2   |
+----------------------------------------------------------+-------+------+
| ethrex/crates/vm/levm/src/execution_handlers.rs          | 234   | -11  |
+----------------------------------------------------------+-------+------+
| ethrex/crates/vm/levm/src/hooks/default_hook.rs          | 289   | -21  |
+----------------------------------------------------------+-------+------+
| ethrex/crates/vm/levm/src/opcode_handlers/block.rs       | 206   | -2   |
+----------------------------------------------------------+-------+------+
| ethrex/crates/vm/levm/src/opcode_handlers/environment.rs | 343   | -16  |
+----------------------------------------------------------+-------+------+
| ethrex/crates/vm/levm/src/opcode_handlers/system.rs      | 652   | -75  |
+----------------------------------------------------------+-------+------+
| ethrex/crates/vm/levm/src/utils.rs                       | 534   | -20  |
+----------------------------------------------------------+-------+------+
| ethrex/crates/vm/levm/src/vm.rs                          | 439   | +21  |
+----------------------------------------------------------+-------+------+

@jrchatruc jrchatruc marked this pull request as ready for review April 1, 2025 12:28
@jrchatruc jrchatruc requested a review from a team as a code owner April 1, 2025 12:28
store_wrapper,
block_cache,
} => LEVM::create_access_list(tx.clone(), header, store_wrapper.clone(), block_cache)?,
Evm::LEVM { db } => LEVM::create_access_list(tx.clone(), header, db)?,
};
match result {
(
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

❤️ all the changes in this file.

Copy link
Collaborator

@mpaulucci mpaulucci left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🚀

@@ -151,7 +151,8 @@ init-l2-no-metrics: ## 🚀 Initializes an L2 Lambda ethrex Client
--http.addr 0.0.0.0 \
--authrpc.port ${L2_AUTH_PORT} \
--metrics.port ${L2_PROMETHEUS_METRICS_PORT} \
--datadir ${ethrex_L2_DEV_LIBMDBX}
--datadir ${ethrex_L2_DEV_LIBMDBX} \
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do we want to initialize with LEVM?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I removed this levm default here 175dcf9 for now until we fixed the state reconstruct issues that arise on a separate PR

Copy link
Contributor

@JereSalo JereSalo left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This PR is gold. I will try to do my best for merging it working ASAP. Great work!

InternalError::Custom("Error at LEVM::get_state_transitions()".to_owned())
})?;
let mut db = load_initial_state_levm(test);
let levm_account_updates = backends::levm::LEVM::get_state_transitions(&mut db, *fork)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think this may be wrong.
When loading the initial state now we are also clearing up the cache. So load_initial_state_levm doesn't do the same thing it used to do.
If I'm not mistaken, the execution of get_state_transitions in this scenario is going to result in no account updates because the Cache is empty.

Now, the thing gets messy here, because we don't have the new_state anymore in the LEVM's ExecutionReport, therefore, we lost the cache result.
Maybe now EFTestRunnerError::FailedToEnsurePostState(ExecutionReport, String) needs to contain the new state separately;
EFTestRunnerError::FailedToEnsurePostState(ExecutionReport, String, CacheDB) or something similar should work, so that in ensure_post_state() we return the updated cache with LEVM's changes to the state.

It would be something like:

return Err(EFTestRunnerError::FailedToEnsurePostState(
    execution_report.clone(),
    error_reason,
    db.cache,
));

That is just a suggestion, other alternatives can work too. Maybe I'm complicating things more than necessary!

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

That makes sense, if I understand correctly this would mean adding the cache as a return value only to the errors used for EF tests; if that's so then the approach makes sense to me

@@ -105,37 +81,8 @@ impl LEVM {
receipts.push(receipt);
}

// Here we update block_cache with balance increments caused by withdrawals.
if let Some(withdrawals) = &block.body.withdrawals {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I have yet to take a closer look but I think that maybe we need to call process_withdrawals() here so that we update the cache with those changes.
If I'm not wrong this may be a possible culprit behind the failing of the Hive tests. (I hope so)

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah that's correct, I readded withdrawal processing here f5f0c88 but there are still hive test regressions so there's more to it than this

@@ -436,38 +362,35 @@ impl LEVM {
pub fn create_access_list(
mut tx: GenericTransaction,
header: &BlockHeader,
store: Arc<dyn LevmDatabase>,
block_cache: &CacheDB,
db: &mut GeneralizedDatabase,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hmm, I think that mutating the Cache in these scenarios may be wrong. Here we execute with the VM just to get valuable information, but don't actually want to alter the state.
I am not 100% sure yet but I will look for cases like this, because they may cause some trouble. A quick fix would be to clone the database when we don't actually want to mutate it, and avoid sending it as mutable so that it's clear that this isn't our intention.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This should be ok since the cache we are mutating is one that was instantiated just for the purpose of running the create_access_list flow from the RPC endpoint.

// CREATE tx

let sender_nonce = get_account(&mut cache, db.clone(), env.origin).info.nonce;
let sender_nonce = get_account_no_push_cache(db, env.origin)?.info.nonce;
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is there a particular reason for changing get_account() for get_account_no_push_cache() here? Or is it just a mistake?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nice catch, this was indeed a mistake; restored the call to get_account here 67e52c6

for (address, pre_value) in &test.pre.0 {
let account = world_state.get_account_info(*address);
let account = world_state.get_account_info(*address).unwrap();
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should these be catched? Either way I'd change it with expects

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Changed to a map_err here 0ec0b4b

Comment on lines 55 to 61
fn get_block_hash(&self, block_number: u64) -> Result<Option<CoreH256>, DatabaseError> {
let a = self
.store
.get_block_header(block_number)
.map_err(|e| DatabaseError::Custom(e.to_string()))?;

a.map(|a| CoreH256::from(a.compute_block_hash().0))
Ok(a.map(|a| CoreH256::from(a.compute_block_hash().0)))
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think something like this could be done. Else I'd change the a name to something more descriptive

        Ok(self
            .store
            .get_block_header(block_number)
            .map_err(|e| DatabaseError::Custom(e.to_string()))?
            .map(|header| CoreH256::from(header.compute_block_hash().0)))

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Improved the code with your suggestion here fbbbf49

.get_storage_at_hash(self.block_hash, address, key)
.unwrap()
.unwrap_or_default()
.map_err(|e| DatabaseError::Custom(e.to_string()))?
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We might use more specific error types. Maybe in a later PR

**Motivation**
- Fix hive tests that were failing (tested all RPC tests and they pass)
<!-- Why does this pull request exist? What are its goals? -->

**Description**
- Created `stateless_execute()`, a wrapper of execute() that doesn't
alter the state. It actually backs up the cache before executing and
then restores it. I know cloning the cache is not ideal but
optimizations to that can be performed later. We use this method when
creating access list.
- Clear the cache when instantiating a new VM: our storage slots have
"original value" and "current value", this is for gas costs, but when
executing a new transaction it should happen that all the original
values and current values have to be the same. Before the database
refactor we were doing this outside of LEVM, now it's embedded in the
code!

Update: I'm currently trying to fix the failing test from
`ethereum/engine` suite. It is the test with name "Replace Blob
Transactions (Cancun) (ethrex)"

Closes #issue_number
Copy link

github-actions bot commented Apr 3, 2025

Benchmark Results Comparison

PR Results

Benchmark Results: Factorial

Command Mean [ms] Min [ms] Max [ms] Relative
revm_Factorial 236.4 ± 0.8 235.1 238.3 1.00
levm_Factorial 800.6 ± 7.6 792.4 818.3 3.39 ± 0.03

Benchmark Results: Factorial - Recursive

Command Mean [s] Min [s] Max [s] Relative
revm_FactorialRecursive 1.466 ± 0.108 1.330 1.600 1.00
levm_FactorialRecursive 13.853 ± 0.209 13.659 14.128 9.45 ± 0.71

Benchmark Results: Fibonacci

Command Mean [ms] Min [ms] Max [ms] Relative
revm_Fibonacci 204.4 ± 2.9 199.0 211.1 1.00
levm_Fibonacci 794.5 ± 14.7 784.6 834.2 3.89 ± 0.09

Benchmark Results: ManyHashes

Command Mean [ms] Min [ms] Max [ms] Relative
revm_ManyHashes 9.0 ± 0.1 8.8 9.2 1.00
levm_ManyHashes 17.0 ± 0.7 16.5 18.8 1.89 ± 0.08

Benchmark Results: BubbleSort

Command Mean [s] Min [s] Max [s] Relative
revm_BubbleSort 3.211 ± 0.016 3.187 3.245 1.00
levm_BubbleSort 5.656 ± 0.038 5.597 5.713 1.76 ± 0.01

Benchmark Results: ERC20 - Transfer

Command Mean [ms] Min [ms] Max [ms] Relative
revm_ERC20Transfer 249.2 ± 6.5 244.6 263.6 1.00
levm_ERC20Transfer 483.8 ± 4.5 476.7 491.3 1.94 ± 0.05

Benchmark Results: ERC20 - Mint

Command Mean [ms] Min [ms] Max [ms] Relative
revm_ERC20Mint 142.1 ± 1.2 140.7 144.9 1.00
levm_ERC20Mint 317.3 ± 6.3 312.3 332.8 2.23 ± 0.05

Benchmark Results: ERC20 - Approval

Command Mean [s] Min [s] Max [s] Relative
revm_ERC20Approval 1.049 ± 0.012 1.037 1.067 1.00
levm_ERC20Approval 1.847 ± 0.015 1.825 1.877 1.76 ± 0.03

Main Results

Benchmark Results: Factorial

Command Mean [ms] Min [ms] Max [ms] Relative
revm_Factorial 234.7 ± 0.5 233.8 235.4 1.00
levm_Factorial 919.2 ± 15.4 906.7 959.4 3.92 ± 0.07

Benchmark Results: Factorial - Recursive

Command Mean [s] Min [s] Max [s] Relative
revm_FactorialRecursive 1.458 ± 0.065 1.402 1.575 1.00
levm_FactorialRecursive 15.789 ± 0.033 15.733 15.831 10.83 ± 0.49

Benchmark Results: Fibonacci

Command Mean [ms] Min [ms] Max [ms] Relative
revm_Fibonacci 202.0 ± 1.3 200.8 204.8 1.00
levm_Fibonacci 915.6 ± 4.4 909.1 922.6 4.53 ± 0.04

Benchmark Results: ManyHashes

Command Mean [ms] Min [ms] Max [ms] Relative
revm_ManyHashes 8.7 ± 0.1 8.6 8.9 1.00
levm_ManyHashes 18.4 ± 0.3 18.1 19.1 2.10 ± 0.04

Benchmark Results: BubbleSort

Command Mean [s] Min [s] Max [s] Relative
revm_BubbleSort 3.223 ± 0.019 3.209 3.273 1.00
levm_BubbleSort 6.168 ± 0.050 6.111 6.269 1.91 ± 0.02

Benchmark Results: ERC20 - Transfer

Command Mean [ms] Min [ms] Max [ms] Relative
revm_ERC20Transfer 247.7 ± 4.2 245.1 257.0 1.00
levm_ERC20Transfer 535.9 ± 9.6 528.4 561.6 2.16 ± 0.05

Benchmark Results: ERC20 - Mint

Command Mean [ms] Min [ms] Max [ms] Relative
revm_ERC20Mint 140.6 ± 0.6 140.0 142.0 1.00
levm_ERC20Mint 348.5 ± 2.0 345.3 351.6 2.48 ± 0.02

Benchmark Results: ERC20 - Approval

Command Mean [s] Min [s] Max [s] Relative
revm_ERC20Approval 1.037 ± 0.003 1.030 1.041 1.00
levm_ERC20Approval 2.004 ± 0.015 1.985 2.025 1.93 ± 0.02

jrchatruc and others added 8 commits April 3, 2025 11:37
…ambdaclass/ethrex into introduce-generalized-database-to-levm
**Motivation**
Add support for hoodi testnet
**Motivation**
Use the principle of least privilege and don't grand write permissions
that are then forwarded to potentially malicious actions.
**Motivation**
The codebase (mainly rpc) currently interacts with the synced by trying
to acquire its lock, which works if we only need to know if the synced
is busy, but no longer works if we need more precise information about
the sync such as what is the mode of the current sync. This PR
introduces the `SyncSupervisor` who is in charge of storing the latest
fcu head, starting and restarting sync cycles and informing the current
sync status at all times
<!-- Why does this pull request exist? What are its goals? -->

**Description**

<!-- A clear and concise general description of the changes this PR
introduces -->

<!-- Link to issues: Resolves #111, Resolves #222 -->

Closes #2282
**Motivation**
Previous changes have sped up other components of the snap sync process,
making faults in the byte code fetcher more evident. The byte code
fetcher used the same batch size as storage requests, 300, which is far
more than the byte codes normally returned by a peer request, causing
the byte code fetcher to keep on fetching the last batches when all
other fetchers have already finished.
This PR reduces the batch size down to 70 so that it coincides with the
amount of byte codes regularly returned by peers
<!-- Why does this pull request exist? What are its goals? -->

**Description**
* Rename constant `BATCH_SIZE` -> `STORAGE_BATCH_SIZE`
* Add constant `BYTECODE_BATCH_SIZE`
<!-- A clear and concise general description of the changes this PR
introduces -->

<!-- Link to issues: Resolves #111, Resolves #222 -->

Closes #issue_number
**Motivation**
Avoid update to the cli code to end up in an outdated README

**Description**
- Added a job that checks that the help output in the ethrex command
that is in the README is in sync with the code.

Closes #2247
@jrchatruc jrchatruc merged commit 0bfbe71 into fix-leak-zkvm Apr 3, 2025
48 checks passed
@jrchatruc jrchatruc deleted the introduce-generalized-database-to-levm branch April 3, 2025 15:08
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants