Gas Optimization

Overview for Developing Gas-Efficient Smart Contracts

PreviousFoundry NextResources

Last updated 11 months ago

Was this helpful?

Gas Optimization

Overview for Developing Gas-Efficient Smart Contracts

NOTE: This overview is a formalization of my notes from one of courses. I highly recommend taking any of his or .

Introduction

To master Ethereum development, there are three areas that one should be very competent in:

Design Patterns
Security
Gas Optimization
- This will be the primary focus of this overview.

It is important for smart contract engineers to be conscious of gas optimization, as our design choices impact not only our users, but the entire Ethereum network. This overview is meant to aid in grasping the broader implications of these choices.

Gas Basics

The gas cost of a transaction depends on exactly 5 factors:

The transaction data that was sent.
The amount of memory that was used.
The state changes that took place.
The opcodes that were executed.
The current gas price.

The gas units of a transaction are static and determined by 1-4 above. We can think of gas units as measured units of computation.

To calculate the gas cost of a transaction, we simply multiply its gas units by the gas price.

GasCost_{gwei}=GasUnits*GasPrice_{gwei}

GasCost_{usd}=\frac{GasUnits*GasPrice_{gwei}*EthPrice_{usd}}{10^9}

Gas optimization focuses on taking advantage of controllable factors to reduce the cost of transactions. For example, by making better design decisions, we can reduce the gas units of a particular on-chain action by 50%, but we can't control the gas prices or the price of ETH.

The cheapest transaction we can submit to the Ethereum network is a native transfer (e.g., Alice transfers 5 ETH to Bob), which costs 21,000 gas units. This means that all transactions on Ethereum must cost at least 21,000 gas units.

Block Limit

The block limit is simply the maximum block size. For Bitcoin, the block limit is 1 MB. Ethereum, does not have an explicit byte limit. Instead, Ethereum limits the amount of computations per block, which means the Ethereum block limit is defined in gas units. The current Ethereum block limit is 30 million gas units.

Theoretically, a block limit of 30 million gas units can fit 1,428 native transfer transactions, since each costs 21,000 gas units. On the other extreme, this limit can only fit 30 tornado cash transactions, since each costs ~1 million gas units.

Throughput

The throughput of the network is defined as the number of transactions per second (TPS) that can be verified.

A new Ethereum block is generated every 15 seconds. This means that if all transactions were native transfers, the throughput would be 95 TPS.

\frac{1428 \; TXs}{1 \; Block}*\frac{1 \; Block}{15 \; seconds} = 95 \; TPS

Similarly, if all transactions were tornado cash transactions, the throughput would be 2 TPS.

At time of writing this, the actual Ethereum throughput ranges from 15-25 TPS.

Implications

There cannot be more than 1428 transactions in one block.
The max throughput for Ethereum is 95 TPS.
The highest bidders will get their transactions included in the block. This is why gas prices fluctuate.
One native transfer transaction (21,000 gas units) is 0.07% of Ethereum's computational capacity per 15 seconds.
If a transaction requires more than 30 million gas units to execute, it will never execute because it does not fit in a single block.
Your design choices as a smart contract engineer impact not only your users but the entire Ethereum network.

Storage

Storage Layout

This refers to the way data is organized and accessed within a smart contract. Smart contracts on Ethereum have a key-value storage model, where each key is a 256-bit number, and each value is also 256 bits. The storage layout dictates how different variables (like integers, addresses, or more complex data structures) are mapped to these 256-bit keys.

Storage Slots

Each 256-bit key in the key-value storage model is referred to as a "slot". Every slot can store a value up to 256 bits. The values stored in these slots correspond to the state variables defined in the smart contract. The order of state variable declarations and the type of each variable impacts storage efficiency and the cost of reading from and writing to these variables.

Accessing Slots in Solidity

The EVM uses storage slots to know which values to access. In other words, the EVM does not care what we name our variables because it only understands the location of our variables.

To better understand the nature of storage slots, let's use some code. The smart contract below contains one state variable, a, and a function that returns the storage slot of a.

contract Storage {
    uint256 private a;
    
    function storageLocation() external pure returns(uint256) {
        uint256 slotLocation;
        
        assembly {
            slotLocation := a.slot
        }
        
        return slotLocation:
    }
}

If we called storageLocation(), it would return 0. If we declared a variable before a, then storageLocation() would return 1.

Now, if we wanted to fetch the value of a variable given a storage slot, we can refactor our smart contract as shown below.

contract Storage {
    uint256 private a = 99;
    uint256 private b = 13;
    uint256 private c = 2;
    
    function getSlotValue(uint256 slot) external view returns(uint256) {
        uint256 value;
        
        assembly {
            value := sload(slot)
        }
        
        return value:
    }
}

Calling getSlotValue(0) would return 99, getSlotValue(1) would return 13, and getSlotValue(2) would return 2.

This roughly demonstrates how the order of variable declarations in a smart contract impacts the location of their storage slots. To expand on this a bit further, if we declared a combination of different variable types, such as uint256, uint8, bool, and address[], it's important to be more cautious with how we declare our storage variables. For example, we could pack the uint8 and bool variables in the same slot. However, packing these variables together is only advantageous if these two variables are always used in conjunction. If they are not, the storage gas savings does not outweigh the increased cost of reading from or writing to only one of the two variables in the slot.

Opcode Basics

To understand what opcodes are, let's consider the smart contracts below. From a semantic perspective, the language of the Solidity code is very straight forward. However, this language is extremely foreign to a computer. Since the EVM is a computer, we need to translate the Solidity code to a language that the EVM can understand by compiling the Solidity code. The EVM understands assembly code, which is a set of highly specific and concise instructions designed to carry out specific tasks. These instructions are called opcodes.

contract TestContractOne {
    uint256 private a = 3;
    
    function doTheThing() external view returns(uint256) {
        return a + 1;
    }
}

Given the code above, the following is a simplified breakdown of the most relevant opcodes that will execute on the EVM when doTheThing() is called.

Opcode

Description

Stack State

PUSH1 00

Push 0 onto the stack.

SLOAD

Treat 0 as a storage slot and load the corresponding value.

PUSH1 01

Push 1 onto the stack.

3 1 <- top of stack

ADD

Add 1 and 3.

Here is a slightly more complex example:

contract TestContracTwo {
    uint256 private a = 3;
    uint256 private b = 6;
    
    function doTheThing() external view returns(uint256) {
        return 5 * a + 4 * b;
    }
}

The simplified opcode breakdown looks like this:

Opcode

Description

Stack State

PUSH1 05

Push 5 onto the stack.

PUSH1 00

Push 0 onto the stack.

5 0 <- top of stack

SLOAD

Treat 0 as a storage slot and load the corresponding value.

3 <- top of stack

PUSH1 04

Push 4 onto the stack.

4 <- top of stack

PUSH1 01

Push 1 onto the stack.

1 <- top of stack

SLOAD

Treat 1 as a storage slot and load the corresponding value.

6 <- top of stack

MUL

Multiply 6 and 4.

24 <- top of stack

SWAP2

Swap the topmost value in the stack with the value located two positions below it.

5 <- top of stack

MUL

Multiply 5 and 3.

15 <- top of stack

ADD

Add 15 and 24.

Each EVM opcode has a specific cost in terms of gas units. Some are more expensive than others, depending on the complexity of the operation. The majority of the gas cost associated with executing a transaction is simply the sum of all the opcodes executed within that transaction.

For a full, comprehensive list of opcodes and their associated cost, here are two helpful resources:

Function Selectors

To understand what a function selector is, we first need to understand what a function signature is in the context of Ethereum smart contracts.

Function Signature

A concise representation of a function that includes the function's name and the types of its input parameters
Ex: transfer(address,uint256)
Ex: someFunc(bool[],bytes)

Function Selector

The first 4 bytes of the keccak-256 hash of the function signature, used to identify functions in bytecode.
Ex: keccak256("transfer(address,uint256)") -> 0xa9059cbb
Ex: keccak256("someFunc(bool[],bytes)") -> 0x7a2cf21d

Now, let’s recall that there are two types of function calls that we can make to smart contracts:

read: functions that only read data without making any state changes.
write: functions that alter the contract’s state or involve ETH transfers, which requires a transaction to be published.

When calling a write function on a smart contract, we are simply sending a transaction to the address of the smart contract. However, for the smart contract to know which function we want to interact with, the function selector and the ABI-encoded function parameter values must be included in the input field of the transaction object. If the function has no parameters, only the function selector must be included. This, of course, is abstracted away from users, but it's important to understand.

For reference, here is an example of a transaction object.

{
    "from": "0x1923f626bb8dc025849e00f99c25fe2b2f7fb0db",
    "gas": "0x55555",
    "maxFeePerGas": "0x1234",
    "maxPriorityFeePerGas": "0x1234",
    "input": "0xabcd",
    "nonce": "0x0",
    "to": "0x07a565b7ed7d7a678680a4c162885bedbb695fe0",
    "value": "0x1234"
 }

Now, let's consider the smart contract below.

contract TestContract {
    function doNothing(uint256 someNumber) external payable {
        // do nothing
    }
}

Say we want to call doNothing(uint256 someNumber) on this smart contract and send it 100 Wei. The following two Foundry commands are equivalent ways to do accomplish this.

cast send 0xc5Ae07D32067005CC098240A44828Cd7A087d4FC "doNothing(uint256)" 100 --value 0.0000000000000001ether --rpc-url <RPC_URL> --private-key <PRIVATE_KEY>

cast send 0xc5Ae07D32067005CC098240A44828Cd7A087d4FC 0xdce1d5ba0000000000000000000000000000000000000000000000000000000000000064 --value 0.0000000000000001ether --rpc-url <RPC_URL> --private-key <PRIVATE_KEY>

Clearly, the first command is easier to understand, but the second command shows that we can get the same result by explicitly providing the function selector and ABI-encoded parameter in the transaction’s input field.

To verify this, the smart contract above was deployed to Gnosis, and two transactions were published with the commands above.

Both transactions resulted in the same outcome. Below are screenshots of what the input data looks like for both transactions on Gnosisscan. The first image is the default (decoded) view, while the second is the actual hex data that was included in the input field.

Additionally, notice that the hex data in the images below is the same as the 2nd Foundry command above.

More Coming Soon...

Coming soon...

PreviousFoundry NextResources

Last updated 11 months ago

Was this helpful?

NOTE: This overview is a formalization of my notes from one of courses. I highly recommend taking any of his or .

Introduction

To master Ethereum development, there are three areas that one should be very competent in:

Design Patterns
Security
Gas Optimization
- This will be the primary focus of this overview.

Gas Basics

The gas cost of a transaction depends on exactly 5 factors:

The transaction data that was sent.
The amount of memory that was used.
The state changes that took place.
The opcodes that were executed.
The current gas price.

The gas units of a transaction are static and determined by 1-4 above. We can think of gas units as measured units of computation.

The gas price varies based on network congestion. If a lot of people are using the network, you'll have to pay more to get you're transaction processed by a validator. Real-time gas prices can be obtained from any online.

To calculate the gas cost of a transaction, we simply multiply its gas units by the gas price.

GasCost_{gwei}=GasUnits*GasPrice_{gwei}

GasCost_{usd}=\frac{GasUnits*GasPrice_{gwei}*EthPrice_{usd}}{10^9}

Block Limit

Throughput

The throughput of the network is defined as the number of transactions per second (TPS) that can be verified.

A new Ethereum block is generated every 15 seconds. This means that if all transactions were native transfers, the throughput would be 95 TPS.

\frac{1428 \; TXs}{1 \; Block}*\frac{1 \; Block}{15 \; seconds} = 95 \; TPS

Similarly, if all transactions were tornado cash transactions, the throughput would be 2 TPS.

At time of writing this, the actual Ethereum throughput ranges from 15-25 TPS.

Implications

There cannot be more than 1428 transactions in one block.
The max throughput for Ethereum is 95 TPS.
The highest bidders will get their transactions included in the block. This is why gas prices fluctuate.
One native transfer transaction (21,000 gas units) is 0.07% of Ethereum's computational capacity per 15 seconds.
If a transaction requires more than 30 million gas units to execute, it will never execute because it does not fit in a single block.
Your design choices as a smart contract engineer impact not only your users but the entire Ethereum network.

Storage

Ethereum smart contracts employ a storage model that allows for persistent storage of state variables in the . This model consists of two important concepts, storage layout and storage slots. Understanding these is key for gas optimization and data integrity in smart contracts, as mismanagement can result in vulnerabilities and high gas costs. Below are two brief explanations of storage layout and slots, but further is highly encouraged.

Storage Layout

Storage Slots

Accessing Slots in Solidity

The EVM uses storage slots to know which values to access. In other words, the EVM does not care what we name our variables because it only understands the location of our variables.

To better understand the nature of storage slots, let's use some code. The smart contract below contains one state variable, a, and a function that returns the storage slot of a.

Note that we use an block to fetch the storage slot of variable a.

contract Storage {
    uint256 private a;
    
    function storageLocation() external pure returns(uint256) {
        uint256 slotLocation;
        
        assembly {
            slotLocation := a.slot
        }
        
        return slotLocation:
    }
}

If we called storageLocation(), it would return 0. If we declared a variable before a, then storageLocation() would return 1.

Now, if we wanted to fetch the value of a variable given a storage slot, we can refactor our smart contract as shown below.

contract Storage {
    uint256 private a = 99;
    uint256 private b = 13;
    uint256 private c = 2;
    
    function getSlotValue(uint256 slot) external view returns(uint256) {
        uint256 value;
        
        assembly {
            value := sload(slot)
        }
        
        return value:
    }
}

Calling getSlotValue(0) would return 99, getSlotValue(1) would return 13, and getSlotValue(2) would return 2.

Lastly, it's crucial to remember that once a smart contract is deployed, the storage slots for its variables are fixed. This is especially important in contracts that use a , where meticulous management of storage slots is vital to avoid storage collisions.

For a more in-depth understanding of storage, checkout

Opcode Basics

NOTE: The EVM is a stack-based machine. This means that a is the primary data structure used for handling low-level instructions (opcodes) in a sequential, order.

contract TestContractOne {
    uint256 private a = 3;
    
    function doTheThing() external view returns(uint256) {
        return a + 1;
    }
}

Given the code above, the following is a simplified breakdown of the most relevant opcodes that will execute on the EVM when doTheThing() is called.

Opcode

Description

Stack State

PUSH1 00

Push 0 onto the stack.

SLOAD

Treat 0 as a storage slot and load the corresponding value.

PUSH1 01

Push 1 onto the stack.

3 1 <- top of stack

ADD

Add 1 and 3.

Here is a slightly more complex example:

contract TestContracTwo {
    uint256 private a = 3;
    uint256 private b = 6;
    
    function doTheThing() external view returns(uint256) {
        return 5 * a + 4 * b;
    }
}

The simplified opcode breakdown looks like this:

Opcode

Description

Stack State

PUSH1 05

Push 5 onto the stack.

PUSH1 00

Push 0 onto the stack.

5 0 <- top of stack

SLOAD

Treat 0 as a storage slot and load the corresponding value.

3 <- top of stack

PUSH1 04

Push 4 onto the stack.

4 <- top of stack

PUSH1 01

Push 1 onto the stack.

1 <- top of stack

SLOAD

Treat 1 as a storage slot and load the corresponding value.

6 <- top of stack

MUL

Multiply 6 and 4.

24 <- top of stack

SWAP2

Swap the topmost value in the stack with the value located two positions below it.

5 <- top of stack

MUL

Multiply 5 and 3.

15 <- top of stack

ADD

Add 15 and 24.

For a full, comprehensive list of opcodes and their associated cost, here are two helpful resources:

Function Selectors

To understand what a function selector is, we first need to understand what a function signature is in the context of Ethereum smart contracts.

Function Signature

A concise representation of a function that includes the function's name and the types of its input parameters
Ex: transfer(address,uint256)
Ex: someFunc(bool[],bytes)

Function Selector

The first 4 bytes of the keccak-256 hash of the function signature, used to identify functions in bytecode.
Ex: keccak256("transfer(address,uint256)") -> 0xa9059cbb
Ex: keccak256("someFunc(bool[],bytes)") -> 0x7a2cf21d

Now, let’s recall that there are two types of function calls that we can make to smart contracts:

read: functions that only read data without making any state changes.
write: functions that alter the contract’s state or involve ETH transfers, which requires a transaction to be published.

For reference, here is an example of a transaction object.

{
    "from": "0x1923f626bb8dc025849e00f99c25fe2b2f7fb0db",
    "gas": "0x55555",
    "maxFeePerGas": "0x1234",
    "maxPriorityFeePerGas": "0x1234",
    "input": "0xabcd",
    "nonce": "0x0",
    "to": "0x07a565b7ed7d7a678680a4c162885bedbb695fe0",
    "value": "0x1234"
 }

Now, let's consider the smart contract below.

contract TestContract {
    function doNothing(uint256 someNumber) external payable {
        // do nothing
    }
}

Say we want to call doNothing(uint256 someNumber) on this smart contract and send it 100 Wei. The following two Foundry commands are equivalent ways to do accomplish this.

cast send 0xc5Ae07D32067005CC098240A44828Cd7A087d4FC "doNothing(uint256)" 100 --value 0.0000000000000001ether --rpc-url <RPC_URL> --private-key <PRIVATE_KEY>

cast send 0xc5Ae07D32067005CC098240A44828Cd7A087d4FC 0xdce1d5ba0000000000000000000000000000000000000000000000000000000000000064 --value 0.0000000000000001ether --rpc-url <RPC_URL> --private-key <PRIVATE_KEY>

To verify this, the smart contract above was deployed to Gnosis, and two transactions were published with the commands above.

Deployed contract:

TX sent with 1st Command:

TX sent with 2nd Command:

Additionally, notice that the hex data in the images below is the same as the 2nd Foundry command above.

More Coming Soon...

Coming soon...