feat(tests): add gas-repricings SLOAD tests #1769

jochem-brouwer · 2025-11-10T00:18:31Z

🗒️ Description

Part of #1755.
NOTE: this is built on top of #1734 (but I can't figure out how to point this PR at that branch, maybe because it is on a different remote?)

🔗 Related Issues or PRs

#1755

Implements parametrized storage benchmarks to measure gas costs for different storage state transitions:

- Cold writes (0 -> non-zero): ~20,000 gas per slot
- Warm updates (non-zero -> non-zero): ~2,900 gas per slot
- Storage clearing (non-zero -> 0): ~2,900 gas + refund

The implementation uses a single contract that accepts parameters via calldata, pre-deployed with 500 filled slots to enable testing all transition types. This provides accurate gas measurements with minimal loop overhead (~39 gas/slot).

Tests are marked as stateful to avoid gas exhaustion expectations and work with the execution-specs testing framework.
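To put the quoted per-slot figures in perspective, here is a rough sketch that turns them into an execution-gas estimate. The constants are the approximate numbers from the description above, not measurements, and the helper names (`estimate_execution_gas`, `overhead_share`) are made up for illustration. Intrinsic transaction gas and refunds are deliberately left out.

```python
# Approximate per-slot costs quoted in the PR description (not measured here).
COST_PER_SLOT = {
    "cold_write": 20_000,   # 0 -> non-zero
    "warm_update": 2_900,   # non-zero -> non-zero
    "clear": 2_900,         # non-zero -> 0, before any refund
}
LOOP_OVERHEAD_PER_SLOT = 39  # loop overhead quoted in the description


def estimate_execution_gas(transition: str, num_slots: int) -> int:
    """Rough execution gas for num_slots slots, excluding intrinsic gas and refunds."""
    return num_slots * (COST_PER_SLOT[transition] + LOOP_OVERHEAD_PER_SLOT)


def overhead_share(transition: str) -> float:
    """Fraction of the per-slot cost that is loop overhead rather than storage work."""
    per_slot = COST_PER_SLOT[transition] + LOOP_OVERHEAD_PER_SLOT
    return LOOP_OVERHEAD_PER_SLOT / per_slot
```

With these figures the loop overhead stays well under 2% of the per-slot cost for every transition type, which is the sense in which the overhead is "minimal".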

LouisTsai-Csie

Thanks @jochem-brouwer for this PR. Please run tox -e static and resolve the linting issues! I only reviewed test_vector_sload.py, since I already reviewed the remaining files in CPerezz's PR.

LouisTsai-Csie · 2025-11-10T12:45:56Z

tests/benchmark/stateful/vector_storage/test_vector_sload.py

+"""
+abstract: Vector storage benchmark with single parametrized contract,
+targeting SLOAD.
+
+This parametrized test takes these arguments:
+- The amount of slots to load
+- The slot key incrementer
+
+The final value is used in the test as boolean: if 0 is used,
+the key is not incremented, and thus the same key is read each time.
+
+Each test is also tested against these keys in the access list (or not).
+This thus marks if the target slots are warm or cold.
+"""
+
+import pytest
+from execution_testing import (
+    Account,
+    Alloc,
+    Block,
+    BlockchainTestFiller,
+    Bytecode,
+    Fork,
+    Op,
+    Storage,
+    Transaction,
+    While
+)


This seems to duplicate the imports and docstring below!

LouisTsai-Csie · 2025-11-10T13:55:32Z

tests/benchmark/stateful/vector_storage/test_vector_sload.py

+    bytecode += Op.ADD(Op.CALLDATALOAD(Op.PUSH0))
+
+    bytecode += Op.SWAP1
+
+    bytecode += Op.PUSH1(start_marker)
+
+    bytecode += Op.JUMPDEST  # end_marker
+    bytecode += Op.STOP


I am not sure I understand this correctly.

Initial stack here: [current_slot, entries_left]

PUSH0 # [0, current_slot, entries_left]
CALLDATALOAD # [incrementer, current_slot, entries_left]
ADD # [current_slot + incrementer, entries_left]
SWAP1 # [entries_left, current_slot + incrementer]
PUSH1 marker # [marker, entries_left, current_slot + incrementer]
JUMPDEST #
STOP

When execution reaches JUMPDEST, shouldn't we jump back to the start marker via JUMPI? A DUP also seems to be missing somewhere. Or maybe a parameter is missing: Op.ADD(Op.CALLDATALOAD(Op.PUSH0)) should be Op.ADD(Op.CALLDATALOAD(Op.PUSH0), N)
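The stack trace in this comment can be replayed with a tiny stand-alone simulator. This is only a sketch: it models the six opcodes in the quoted hunk, assumes calldata word 0 holds the incrementer, and uses a hypothetical helper name (`trace_loop_tail`). It does not use the test framework's `Op` classes.

```python
from typing import List, Tuple, Union

StackItem = Union[int, str]


def trace_loop_tail(
    incrementer: int, current_slot: int, entries_left: int
) -> List[Tuple[str, List[StackItem]]]:
    """Replay the quoted opcodes and record the stack after each one.

    Index 0 of the list is the top of the stack.
    """
    stack: List[StackItem] = [current_slot, entries_left]
    steps: List[Tuple[str, List[StackItem]]] = []

    def record(op: str) -> None:
        steps.append((op, list(stack)))

    stack.insert(0, 0)                      # PUSH0 pushes the calldata offset 0
    record("PUSH0")

    stack.pop(0)                            # CALLDATALOAD pops the offset...
    stack.insert(0, incrementer)            # ...and pushes calldata word 0 (assumed incrementer)
    record("CALLDATALOAD")

    a, b = stack.pop(0), stack.pop(0)       # ADD pops two items, pushes their sum mod 2**256
    stack.insert(0, (a + b) % 2**256)
    record("ADD")

    stack[0], stack[1] = stack[1], stack[0]  # SWAP1 exchanges the top two items
    record("SWAP1")

    stack.insert(0, "start_marker")         # PUSH1(start_marker), kept symbolic
    record("PUSH1")

    record("JUMPDEST")                      # JUMPDEST leaves the stack untouched
    record("STOP")                          # execution halts, nothing jumped back
    return steps
```

Running it with, say, `trace_loop_tail(3, 10, 5)` ends on `STOP` with the marker still sitting on top of the stack and no `JUMP`/`JUMPI` executed, which matches the concern raised here.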

LouisTsai-Csie · 2025-11-10T13:56:24Z

tests/benchmark/stateful/vector_storage/test_vector_sload.py

+# for a way how I believe we can do this (using 7702 accounts with prefilled
+# storage and then executing code on there, which we can change because its a 7702 account)
+@pytest.mark.valid_from("Prague")
+@pytest.mark.stateful  # Mark as stateful instead of benchmark


All tests under the stateful folder automatically inherit this marker.

LouisTsai-Csie · 2025-11-10T13:59:17Z

tests/benchmark/stateful/vector_storage/test_vector_sload.py

+
+    # Create transaction to call the contract
+    # Use a reasonable gas limit that covers the operation
+    gas_limit = 21000 + 10000 + (num_slots * 50000)


I don't think we need to specify the gas limit for this transaction. By default, our framework uses the transaction gas limit cap or the block gas limit.
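For reference, a quick sketch of what the explicit formula in the hunk evaluates to for the `num_slots` values parametrized in this PR. The comparison value is an assumption: it uses 2**24, the transaction gas limit cap from EIP-7825, and whether the framework applies that cap for a given fork is not confirmed here.

```python
NUM_SLOTS_VALUES = [1, 10, 50, 100, 200]  # parametrization used in the PR

# Assumption: 2**24 is the EIP-7825 transaction gas limit cap; the framework's
# actual default for a given fork may differ.
ASSUMED_TX_GAS_LIMIT_CAP = 2**24


def explicit_gas_limit(num_slots: int) -> int:
    """The gas limit formula from the reviewed hunk."""
    return 21000 + 10000 + (num_slots * 50000)


GAS_LIMITS = {n: explicit_gas_limit(n) for n in NUM_SLOTS_VALUES}

for n, limit in GAS_LIMITS.items():
    headroom = ASSUMED_TX_GAS_LIMIT_CAP - limit
    print(f"num_slots={n:>3}  gas_limit={limit:>10,}  headroom={headroom:>10,}")
```

Under that assumption every parametrized case fits below the cap, so dropping the explicit value in favor of the default should not make any case run out of gas.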

spencer-tb

Added some small comments. Thanks.

spencer-tb · 2025-11-13T17:32:55Z

tests/benchmark/stateful/vector_storage/test_vector_sload.py

+        sender=sender,
+    )
+
+    blockchain_test(


These should use the benchmark_test format now! Please see here for usage and the skip validation field.

spencer-tb · 2025-11-13T17:34:34Z

tests/benchmark/stateful/vector_storage/test_vector_storage.py

+        storage_contract: Account(storage=expected_storage),
+    }
+
+    blockchain_test(


Similar here for benchmark_test

spencer-tb · 2025-11-13T17:35:11Z

tests/benchmark/stateful/vector_storage/test_vector_sload.py

+
+    calldata = incrementer.to_bytes(32, "big") + num_slots.to_bytes(32, "big")
+
+    access_lists: List[AccessList] = []


I think this is a missing import.
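On the import: `List` comes from `typing`, and `AccessList` would presumably be imported from the test framework alongside the other names (an assumption, not checked against the package). The calldata line in the same hunk can be checked in isolation. The sketch below uses only the standard library, and its helper names (`encode_calldata`, `calldataload`) are illustrative.

```python
WORD = 32  # EVM word size in bytes


def encode_calldata(incrementer: int, num_slots: int) -> bytes:
    """Same layout as the hunk: word 0 = incrementer, word 1 = num_slots."""
    return incrementer.to_bytes(WORD, "big") + num_slots.to_bytes(WORD, "big")


def calldataload(calldata: bytes, offset: int) -> int:
    """Mimic CALLDATALOAD: read 32 bytes at offset, zero-padded past the end."""
    chunk = calldata[offset:offset + WORD].ljust(WORD, b"\x00")
    return int.from_bytes(chunk, "big")
```

So `CALLDATALOAD(0)` yields the incrementer and `CALLDATALOAD(32)` yields the slot count, which is the layout the contract bytecode has to agree with.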

spencer-tb · 2025-11-13T17:37:25Z

tests/benchmark/stateful/vector_storage/test_vector_sload.py

+# storage and then executing code on there, which we can change because its a 7702 account)
+@pytest.mark.valid_from("Prague")
+@pytest.mark.stateful  # Mark as stateful instead of benchmark
+@pytest.mark.parametrize("num_slots", [1, 10, 50, 100, 200])


Can we reduce the parameterization slightly and maybe drop the 10/50?

spencer-tb · 2025-11-13T17:42:37Z

tests/benchmark/stateful/vector_storage/test_vector_sload.py

+    bytecode += Op.ADD(Op.CALLDATALOAD(Op.PUSH0))
+
+    bytecode += Op.SWAP1
+
+    bytecode += Op.PUSH1(start_marker)
+
+    bytecode += Op.JUMPDEST  # end_marker
+    bytecode += Op.STOP


Yeah I think the bytecode is missing both correct stack initialization and the JUMP instruction to complete the loop.

spencer-tb · 2025-11-13T17:47:46Z

tests/benchmark/stateful/vector_storage/test_vector_sload.py

+    if storage_keys_set:
+        key = 0
+        for i in range(num_slots):
+            initial_storage[key] = 1
+            slots.add(key)
+            key += incrementer


When storage_keys_set=False and warm_slots=True, I think the test should make slots warm (via access list) even though they don't exist in storage, but it doesn't because slots is empty. As a result, I think the access list contains no storage keys, so the slots remain cold instead of being warmed.
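One possible way to decouple the two flags, sketched in plain Python: compute the target keys first, then decide separately whether they are pre-filled and whether they go into the access list. The function name `plan_slots` and its return shape are hypothetical. Only the variable names `num_slots`, `incrementer`, `storage_keys_set` and `warm_slots` come from the PR.

```python
from typing import Dict, List, Tuple


def plan_slots(
    num_slots: int,
    incrementer: int,
    storage_keys_set: bool,
    warm_slots: bool,
) -> Tuple[Dict[int, int], List[int]]:
    """Return (initial_storage, access_list_keys) for one test case."""
    # Keys the contract will read, regardless of whether they hold a value.
    # dict.fromkeys de-duplicates while keeping order (incrementer == 0 gives one key).
    keys = list(dict.fromkeys(i * incrementer for i in range(num_slots)))

    # Pre-fill storage only when requested.
    initial_storage = {key: 1 for key in keys} if storage_keys_set else {}

    # Warm the keys via the access list only when requested, even if they are empty.
    access_list_keys = keys if warm_slots else []
    return initial_storage, access_list_keys
```

With this shape, `plan_slots(3, 2, storage_keys_set=False, warm_slots=True)` returns empty storage but still lists keys 0, 2 and 4 for the access list, which is the combination described above as currently staying cold.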

CPerezz and others added 2 commits November 3, 2025 15:57

feat(tests): add SLOAD benchmark

2adf4ea

LouisTsai-Csie requested changes Nov 10, 2025

LouisTsai-Csie added labels Nov 10, 2025: A-test-benchmark (Area: Tests Benchmarks, performance measurement, e.g. `tests/benchmark/*`, `p/t/s/e/benchmark/*`), C-test (Category: test), P-high

parithosh mentioned this pull request Nov 11, 2025

Get SSTORE/SLOAD benchmark data ASAP, @louis will be looking into this and following up with carlos ethpandaops/gas-lighting-tracker#7

Open

spencer-tb mentioned this pull request Nov 13, 2025

Add extra storage read & write benchmark cases #1755

Open

7 tasks

spencer-tb reviewed Nov 13, 2025

4 participants