EIP-663: Unlimited SWAP and DUP instructions

ekpyron · November 29, 2022, 12:32pm

That would be fully contrary to the intention of this EIP of making more of the stack space addressable, though.

ekpyron · November 29, 2022, 12:35pm

What conceptual complexity, resp. what confusion and footguns, do you mean exactly? Generally, I’d not expect people to decipher opcodes by hand - and disassembled, you’d just print SWAP_N 0 as SWAP17. But in any case, either way is completely fine - the additional 16 items are indeed of no real concern. So if there is any strong preference to starting from zero, as far as I’m concerned we can just go with that as well.

charles-cooper · November 30, 2022, 2:23am

It’s a conceptual hurdle for anybody learning the EVM, or writing a low level tool/assembler/disassembler to understand, maintain and debug. I don’t think it’s necessarily a huge hurdle, but it is a nonzero one which IMO is not outweighed by the additional space addressability (I don’t think the extra 16 items will be useful in practice, although if I see a compelling example I could have my mind changed).

wjmelements · November 30, 2022, 3:57am

I like that the spec reads the stack index inline, as PUSH opcodes do.

“If the code is legacy bytecode, both of these instructions result in an exceptional halt”

I don’t want to prevent non-EOF code from using this, because I don’t plan to adopt EOF. Contracts shouldn’t be using invalid opcodes to revert, so it shouldn’t matter if we break such behavior for legacy contracts. We haven’t done the same for other opcodes introduced in the past. Why is it being done here?

axic · November 30, 2022, 3:59am

Exactly because of the property you like:

This cannot be achieved on legacy code, due to jumpdest-analysis and that existing code on chain can contain this instruction already (here’s one explainer on the current thread). The use of immediate (in-line) arguments is made possible by EOF.
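To illustrate the jumpdest-analysis point, here is a minimal sketch of a legacy-style scan for valid jump destinations. The opcode value `NEW_OP` and the `immediate_sizes` parameter are placeholders assumed for illustration, not values taken from the spec. It shows how giving an existing byte an immediate argument changes which offsets count as a JUMPDEST in code that is already deployed.

```python
JUMPDEST = 0x5B
PUSH1, PUSH32 = 0x60, 0x7F
NEW_OP = 0xE7  # placeholder opcode value, assumed for illustration only


def valid_jumpdests(code: bytes, immediate_sizes: dict) -> set:
    """Return offsets of JUMPDEST bytes that are not inside immediate data."""
    dests = set()
    pc = 0
    while pc < len(code):
        op = code[pc]
        if op == JUMPDEST:
            dests.add(pc)
        if PUSH1 <= op <= PUSH32:
            pc += op - PUSH1 + 1  # skip the PUSH data bytes
        else:
            # skip immediates of any other opcode declared to carry them
            pc += immediate_sizes.get(op, 0)
        pc += 1
    return dests


# Hypothetical already-deployed code: an undefined byte, then JUMPDEST, then STOP.
code = bytes([NEW_OP, JUMPDEST, 0x00])

legacy = valid_jumpdests(code, {})            # today's rules: only PUSH has data
changed = valid_jumpdests(code, {NEW_OP: 1})  # if NEW_OP took a 1-byte immediate
```

Under the current rules offset 1 is a valid jump target; once `NEW_OP` is treated as carrying an immediate, the same byte becomes data and the target disappears. That is the kind of retroactive change that cannot happen inside EOF, where code is validated at deploy time.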

ekpyron · November 30, 2022, 9:49am

Yeah, fair enough. And yeah, as I said, you’re right, reaching the additional 16 items is definitely not overly relevant in practice. The (weak) argument for starting at 17 was rather to avoid having duplicated, resp. “useless” opcodes (i.e. SWAP_N 0...SWAP_N 16 would never be used, as long as we still have SWAP1...SWAP16). But yeah, if we want to keep the option to deprecate or remove the old swaps eventually (even though I’m not sure that’ll ever actually happen), resp. since there’s concern about starting at 17 making it harder to maintain tools (even though I also don’t think that’s that significant), I see no problem with starting from 0 instead.
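A rough sketch of the two encodings being compared, assuming a one-byte immediate. The function names are hypothetical, and reading "starting from zero" as immediate 0 meaning the same depth as SWAP1 is my assumption, chosen because it matches the "additional 16 items" figure above.

```python
def swap_depth(immediate: int, first_depth: int) -> int:
    """Stack depth addressed by SWAP_N, where immediate 0 maps to first_depth."""
    if not 0 <= immediate <= 0xFF:
        raise ValueError("immediate must fit in one byte")
    return immediate + first_depth


def mnemonic(immediate: int, first_depth: int) -> str:
    """How a disassembler could print the instruction."""
    return f"SWAP{swap_depth(immediate, first_depth)}"


# Variant A: start where the legacy opcodes stop (SWAP_N 0 prints as SWAP17).
reach_plus17 = swap_depth(0xFF, first_depth=17)

# Variant B (assumed): immediate 0 overlaps with legacy SWAP1.
reach_from_start = swap_depth(0xFF, first_depth=1)

extra_items = reach_plus17 - reach_from_start
```

With these assumptions the +17 variant reaches depth 272 instead of 256, so the whole disagreement is about 16 stack slots versus one constant in every assembler and disassembler.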

axic · November 30, 2022, 3:18pm

This clarification to the spec was merged, but the +17 idea was not included. That is tracked in this branch now, pending decision: GitHub - ipsilon/EIPs at eip-663-plus17

axic · December 5, 2022, 11:14pm

For what it’s worth, there was some feedback on Twitter from people wanting SWAPMN:
https://twitter.com/alexberegszaszi/status/1598124647723433984

If it were to happen, XCHG may sound like an alternative name.

However, some questioned how frequent the use case for SWAPMN would be:
https://twitter.com/recmo/status/1598215821125304321

charles-cooper · December 6, 2022, 5:36pm

A couple thoughts:

3-4 SWAP_N instructions will cover the entire addressable space of 1024 stack items.

We could have potentially 1-3 SWAP_N_M instructions which take different numbers of immediates to address codesize concerns.
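The arithmetic behind the opcode counts can be sketched as follows, assuming each opcode covers a contiguous, non-overlapping range of depths and that immediates are whole bytes. `opcodes_needed` is a hypothetical helper, not anything from the EIP.

```python
import math

STACK_LIMIT = 1024  # maximum number of items on the EVM stack


def opcodes_needed(immediate_bytes: int, depths: int = STACK_LIMIT) -> int:
    """Number of distinct opcodes needed to address `depths` stack positions."""
    per_opcode = 256 ** immediate_bytes  # positions one opcode can address
    return math.ceil(depths / per_opcode)


one_byte = opcodes_needed(1)  # four opcodes, each with a 1-byte immediate
two_byte = opcodes_needed(2)  # a single opcode with a 2-byte immediate
```

The trade-off is code size: four opcodes with one-byte immediates keep every instruction at two bytes, while a single opcode with a two-byte immediate costs three bytes per use.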

green · January 31, 2023, 10:44am

What would the tradeoffs be for, instead of using immediate opcodes, using the first element(s) of the stack for the DUP / SWAP as suggested here? EIP-? : Introduce Opcodes B0 DUPN and B1 SWAPN - #2 by gumb0

The immediate argument idea seems simpler to implement, so I assume it’s the best choice.
SWAP_N_M as suggested by @charles-cooper would allow optimizing and cleaning a lot of EVM compiler code. Would it be in scope to add these three?

DUP_N
SWAP_N
SWAP_N_M

Philogy · February 3, 2023, 1:18pm

The big tradeoff with using stack values as parameters vs immediate values is that stack parameters add overhead, will probably cost more gas, and allow less upfront validation and analysis.

shemnon · February 3, 2023, 9:25pm

What we lose with stack reading is provably static dups and swaps. With some code flow analysis you can prove some stack based loads are static, but it opens the door to dynamic swaps and dups (which may be useful for on-stack arrays). Dynamic swaps and dups, however, nerf almost all useful register mapping schemes. It also complicates the stack proving requirements in EOF.
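A minimal sketch of what "provably static" buys, for straight-line code only. The instruction tuples are a made-up representation, and the height rules are assumptions modelled on the legacy opcodes: DUP_N n needs n items, SWAP_N n needs n + 1. Overflow checks are omitted.

```python
def validate_straight_line(instructions, initial_height=0):
    """Check stack underflow before execution, using only the bytecode."""
    height = initial_height
    for name, *args in instructions:
        if name == "PUSH":
            height += 1
        elif name == "POP":
            if height < 1:
                return False
            height -= 1
        elif name == "DUP_N":
            (n,) = args  # depth comes from the immediate, so it is known here
            if height < n:
                return False
            height += 1
        elif name == "SWAP_N":
            (n,) = args
            if height < n + 1:
                return False
        else:
            raise ValueError(f"unknown instruction: {name}")
    return True


ok = validate_straight_line([("PUSH",), ("PUSH",), ("SWAP_N", 1)])
underflow = validate_straight_line([("PUSH",), ("SWAP_N", 1)])
```

If the depth were popped from the stack instead, `n` would be a runtime value and this check would need value analysis of whatever produced it, which is the extra burden on stack validation mentioned above.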

gcolvin · February 14, 2023, 2:48am

My understanding is that static stack machine code is essentially already in SSA form.

k06a · March 10, 2023, 9:01am

I would prefer to have this EIP accepted instead of having stack allocation in memory, which is what happens in the Solidity compiler with the “viaIR” option.

charles-cooper · April 27, 2023, 3:23pm

I don’t think having dynamic stack access (i.e. using the first element of the stack to define which stack item(s) is being accessed) is a good idea. It complicates code analysis and does not provide much benefit, besides being compatible with non-EOF contracts.

I do agree though that this EIP should be expanded to 1 or even 2 DUP_N and SWAP_N instructions, and also SWAP_N_M instructions which will cover the entire addressable range of the stack.

charles-cooper · September 9, 2023, 3:00pm

The use case for SWAP_N_M is very clear, to the point that I think this EIP is only really useful if it includes SWAP_N_M or some of the variants I proposed above. It is used by compilers for stack scheduling. Right now, in order to swap the n’th and m’th items of the stack (for instance, when the 2nd and 3rd items need to be swapped for an ADD instruction), you need to issue SWAPN SWAPM SWAPN, which also costs 9 gas. Having a single instruction for this would not be more costly in the VM implementation and would simplify a lot of bytecode.
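The three-swap sequence can be checked with a small simulation. This is a sketch using a Python list as the stack (top at the end); `swap_top` and `exchange` are hypothetical helper names, and the gas figure is the 3 gas that each legacy SWAP costs.

```python
SWAP_GAS = 3  # gas cost of one legacy SWAP instruction


def swap_top(stack, n):
    """Legacy SWAPn: exchange the top item with the item n positions below it."""
    stack[-1], stack[-1 - n] = stack[-1 - n], stack[-1]


def exchange(stack, n, m):
    """Swap the items at depths n and m using SWAPn SWAPm SWAPn; return gas used."""
    for depth in (n, m, n):
        swap_top(stack, depth)
    return 3 * SWAP_GAS


stack = ["d", "c", "b", "a"]  # "a" is the top of the stack
gas = exchange(stack, 1, 2)   # swap the 2nd and 3rd items from the top
```

Afterwards the top item is back in place and only "b" and "c" have traded positions, at a cost of three instructions and 9 gas. A single SWAP_N_M would express the same permutation in one instruction.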