Reduction semantics #109

dhil · 2025-01-22T10:19:51Z

This patch populates the "Execution" section of the Explainer document with the reduction rules for stack switching.

Resolves #91.

rossberg

Would also make sense to have Maxime take a look, since he's mechanising this right now.

proposals/stack-switching/Explainer.md

mlegoupil

Most of these comments are either
(1) points already discussed on Zoom on the 6th of February, or
(2) sidenotes about small, insignificant differences between this and the Iris-WasmFX mechanisation. I do not think it is necessary to incorporate these comments into the Explainer document, but I figured I would share the details.

The main exception is the comment on line 856 which might well require our attention.

mlegoupil · 2025-02-06T14:18:16Z

proposals/stack-switching/Explainer.md

+
+* `(prompt{<hdl>*} <instr>* end)` represents an active handler
+  - `(prompt{hdl*}? instr* end) : [t1*] -> [t2*]`
+    - iff `instr* : [t1*] -> [t2*]`


This explanation does not mention what typing context is used. Here, the body instr* of the prompt instruction should be typechecked under the empty context. This enforces that its body is closed, which is necessary since continuations live in the store and store objects should be closed.

mlegoupil · 2025-02-06T14:23:50Z

proposals/stack-switching/Explainer.md

+The administrative structure `hdl` is defined as
+```
+hdl ::= (<tagaddr> $l) | (<tagaddr> switch)
+```


The resume instruction needs a list of tags, whereas prompt needs a list of (desugared) tag addresses. Hence we need either to define two separate notions, hdl and hdlnew, where hdl is as shown above and is used by prompt, and hdlnew has tags instead of tag addresses and is used by resume; or to keep a single hdl and allow it to take either tags or tag addresses as inputs. The former is the solution adopted by the Iris-WasmFX mechanisation.

Even if we define this as a separate syntactic class, I'd suggest mirroring the syntax of the index-based notation, i.e., keeping the `on` keyword.

mlegoupil · 2025-02-06T14:26:39Z

proposals/stack-switching/Explainer.md

+
+* `S; F; v^n (ref.cont ca) (resume $ct hdl*)  -->  S'; F; prompt{hdl*} E[v^n] end`
+  - iff `S.conts[ca] = (E : n)`
+  - and `S' = S with conts[ca] = epsilon`


hdl must be desugared here: on the LHS it contains tags and on the RHS it should contain tag addresses. The field F.tags in the frame converts one to the other.

mlegoupil · 2025-02-06T14:28:55Z

proposals/stack-switching/Explainer.md

+* `S; F; v^m (ref.cont ca) (resume_throw $ct $e hdl*)  -->  S'; F; prompt{hdl*} E[v^m (throw $e)] end`
+  - iff `S.conts[ca] = (E : n)`
+  - and `S.tags[F.tags[$e]].type ~~ [t1^m] -> [t2*]`
+  - and `S' = S with conts[ca] = epsilon`


Same comment as for resume: the list hdl must be desugared using F.tags

mlegoupil · 2025-02-06T14:35:28Z

proposals/stack-switching/Explainer.md

+
+* `S; F; (prompt{hdl1* (ea $l) hdl2*} H^ea[v^n (suspend $e)] end)  --> S'; F; v^n (ref.cont |S.conts|) (br $l)`
+  - iff `ea notin tagaddr(hdl1*)`
+  - and `ea = F.tags[$e]`


This is wrong: F is not always the right frame to use. Instead, the innermost frame in H should be used (in case H contains nested frame instructions). I can suggest two solutions.

The first is to write a function innermost_frame that explores H and returns the innermost frame F_i from the innermost frame instruction in H; if no frame instruction is found, the function should return F (the current top-level frame). I find this tedious.

The second solution is to have two instructions, suspend and suspend_desugared, the first taking a tag $e as an immediate argument, the second taking a tag address ea as an argument. The rule above should then mention suspend_desugared ea instead of suspend $e, and we would need to add a reduction rule that reduces S; F; suspend $e --> S; F; suspend_desugared ea when F.tags[$e] = ea, conveniently using the closest frame F without needing to define a function that explores the context. For simplicity, we could also use a single instruction suspend that takes either a tag or a tag address as an immediate argument, instead of two separate instructions. The Iris-WasmFX mechanisation uses two instructions, as it is convenient to consider suspend as a basic instruction and suspend_desugared as an administrative instruction.
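To make the second solution concrete, here is a minimal Python sketch of the desugaring step. It is only an illustration: the class and function names (`Frame`, `Suspend`, `SuspendDesugared`, `desugar_suspend`) are hypothetical, and a frame is reduced to nothing more than its tag-index-to-tag-address table. The point is that the rule fires on the suspend instruction itself, so the frame it sees is the closest enclosing one.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Frame:
    # Hypothetical model of a frame: position i holds the address of tag index i.
    tags: tuple


@dataclass(frozen=True)
class Suspend:
    # Source-level instruction: carries a tag index.
    tagidx: int


@dataclass(frozen=True)
class SuspendDesugared:
    # Administrative instruction: carries a tag address.
    tagaddr: int


def desugar_suspend(frame, instr):
    """S; F; suspend $e --> S; F; suspend_desugared ea, iff F.tags[$e] = ea."""
    return SuspendDesugared(frame.tags[instr.tagidx])


# Two frames whose tag index 0 maps to different addresses.
caller_frame = Frame(tags=(10, 11))
callee_frame = Frame(tags=(20, 21))

# A suspend executed inside the callee is desugared with the callee's frame,
# not with the frame that encloses the prompt.
step = desugar_suspend(callee_frame, Suspend(tagidx=0))
print(step)  # SuspendDesugared(tagaddr=20)
```

Had the outer frame been used, the same instruction would have resolved to address 10, which is the mismatch described above.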

mlegoupil · 2025-02-06T14:45:04Z

proposals/stack-switching/Explainer.md

+* `(ref.cont a)` represents a continuation value, where `a` is a *continuation address* indexing into the store's `conts` component
+  - `ref.cont a : [] -> [(ref $ct)]`
+    - iff `S.conts[a] = epsilon \/ S.conts[a] = (E : n)`
+      + iff `E[val^n] : t2*`


There are two ways of doing this. The first is the one displayed here, where type-checking the ref.cont instructions requires typechecking the body of the continuation here and now. The other one (which is the one used in the Iris-WasmFX mechanisation) is to merely read a type annotation here; and instead add a clause to the (unshown here) store_typing predicate that describes a well-formed state, mandating that all continuations in the store must have a body that type-checks. From a theoretical point of view, I prefer the second solution. Besides, the second approach is the one used when typechecking the invoke administrative instruction.

Hm, I'd generally prefer the first option, since that is a more faithful reflection of the intended runtime representation that erases these types. We really want to know that this is sound, so ideally, even a mechanised soundness proof would model the store without introducing additional type information that may affect the result in subtle ways.

Invoke is different in that functions are already type-annotated in the source program, and these types are in fact kept around in real implementations (e.g., to perform link-time type-checks).

Does typechecking the instructions inside the continuation require a type context C? If so, where does the context come from?

In the WasmFX-Cert mechanisation in the Rocq proof assistant, we use the empty context. This is ultimately irrelevant, since all continuations start off as a function call, and the body of a function call is typechecked using a different typing context as per the typing rules of (plain) WebAssembly.

mlegoupil · 2025-02-06T14:49:26Z

proposals/stack-switching/Explainer.md

+  - `S ::= {..., conts <cont>?*}`
+
+* A continuation is a context annotated with its hole's arity
+  - `cont ::= (E : n)`


Sidenote: the Iris-WasmFX mechanisation stores more than just the arity n together with the context E, it stores the actual expected type t1* -> t2*. Transforming the presentation from the mechanisation to this one is simple (n = length(t1*)).

That is interesting. Is that merely for convenience (i.e., not having to guess the type non-deterministically in the proof), or would soundness actually break without fixing the types?

This is so that the logical relation can later have more to go on. Type soundness could be proved in a mechanisation that only decorates contexts with the arity n, with minor changes to the proofs.

mlegoupil · 2025-02-06T14:52:51Z

proposals/stack-switching/Explainer.md

+    - and `$ct ~~ cont $ft`
+    - and `$ft ~~ [t1^n] -> [t2*]`
+
+* `(prompt{<hdl>*} <instr>* end)` represents an active handler


Sidenote: the Iris-WasmFX mechanisation adds one more immediate argument to the prompt instruction: the type t* expected for the body <instr>*. This is necessary to define the behaviour of the suspend instruction, since the mechanisation stores each continuation together with its expected type t1* -> t2*. If the type annotation were not present in the prompt instruction, it would be impossible to know the return type t2* of the captured continuation when reducing suspend. It is easy to transform the presentation from the mechanisation into this one (just forget the type annotation).

mlegoupil · 2025-02-06T14:54:24Z

proposals/stack-switching/Explainer.md

+    - and `$ft ~~ [t1^n] -> [t2*]`
+
+* `(prompt{<hdl>*} <instr>* end)` represents an active handler
+  - `(prompt{hdl*}? instr* end) : [t1*] -> [t2*]`


Can [t1*] be non-empty? In the Iris-WasmFX mechanisation, this list is always empty… There is a mistake either here or in the mechanisation.

You're correct, it's always empty, just like for other administrative block instructions.

mlegoupil · 2025-02-06T14:57:06Z

proposals/stack-switching/Explainer.md

+  - and `$ct2 ~~ cont $ft2`
+  - and `$ft2 ~~ [t1'^m] -> [t2'*]`
+  - and `S' = S with conts[ca] = epsilon`
+  - and `S'' = S' with conts += (H^ea : m)`


Sidenote: the Iris-WasmFX mechanisation does not yet have the switch instruction. I will add it shortly, but cannot at present comment on this reduction rule. However, at first glance, it appears that this reduction rule might suffer from the same issue as the suspend rule: the tag $e should be desugared not with frame F but with the innermost frame of H.

Update: the Iris-WasmFX mechanisation now has the switch instruction and type soundness has been proven. The issue of using the correct frame when desugaring $e is indeed present.

mlegoupil

The Iris-WasmFX mechanisation now includes the switch instruction. I have added extra comments pertaining to this.

mlegoupil · 2025-02-13T11:39:51Z

proposals/stack-switching/Explainer.md

+    - and `([te2*] -> [t2*] <: $ft')*`
+
+* `(a switch)` represents a tag-switch association
+ - `(a switch)` and `(S.tags[b].type ~~ [] -> [te2*])*`


These two lines make no sense; perhaps what was meant is "(a switch) : [t2*] iff S.tags[a].type ~~ [] -> [t2*]"? I'm not sure what the typing rule is meant to be…

Yes. It is a typo/mistake. It should be the trivial typing rule as you noted.

mlegoupil · 2025-02-14T11:42:12Z

proposals/stack-switching/Explainer.md

+  - and `$ct2 ~~ cont $ft2`
+  - and `$ft2 ~~ [t1'^m] -> [t2'*]`
+  - and `S' = S with conts[ca] = epsilon`
+  - and `S'' = S' with conts += (H^ea : m)`


Update: the Iris-WasmFX mechanisation now has the switch instruction and type soundness has been proven. The issue of using the correct frame when desugaring $e is indeed present.

Incorporated comments from before. The biggest thing I am unsure of is lines 814 and 826. This pertains to the point at which the tag argument of suspend and switch is translated into a tag address. Doing it in the reduction rule for resume (as was the case in the previous version of Explainer.md) is wrong, for reasons explained in the comment I wrote at the time. I believe I was told that, in the actual implementation, the change happens during the validation phase, which is why I propose these lines 814 and 826; however, my phrasing can perhaps be improved.

mlegoupil · 2025-08-25T14:09:43Z

I have gone into the files to make all the modifications discussed in the comments previously. I believe the operational semantics is now fixed.

The only uncertainty I have is lines 814 and 826, where a better phrasing might more accurately reflect the way the implementation works. The crux is that the suspend instruction takes a tag index as an immediate argument, but when reducing a suspension, the tags in the prompt instruction are tag addresses (into the store), not indices. When does this translation from indices to addresses happen? In the previous version of the explainer, it happened within the reduction rule for suspend itself, using the wasm frame F from S, F, prompt H[suspend]. This is wrong, since the context H may (and in practice always will) contain a function call, and hence the frame to use is not F but rather the innermost wasm frame in H.

I can see two solutions, and if I correctly remember an oral conversation I had months ago with Sam and Andres, the second solution is the one used by the actual implementation. The first solution (the one implemented by the Rocq formalisation WasmFXCert) is to have two instructions, suspend (available to the programmer, uses tag indices) and suspend.desugared (only exists at runtime, uses tag addresses), and a desugaring reduction rule (which can now directly use the wasm frame F, since the rule is focused on the suspend instruction itself rather than on suspend inside some context H). The second solution is for the change from tag indices to tag addresses to happen during the validation (type-checking) phase. This is what I wrote on lines 814 and 826, but my phrasing might be too vague, probably because I am not certain myself of what exactly happens in practice.

Apart from that point, I believe all other changes I have made should be uncontroversial and the updated explainer.md is now fixed.

tlively · 2025-08-26T21:20:51Z

Real implementations will not know the tag addresses at validation or compile time because they will avoid generating different code for different instantiations of the same compiled module. The compiled code will only have the tag index, and it will have to look up the corresponding tag address on the instance at runtime. The desugaring solution therefore makes more sense to me.
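A small Python sketch of this point, under simplifying assumptions: "compiled code" is modelled as a closure that captures only the tag index, and an instance as an object holding its own tag address table. The names `Instance` and `compile_suspend` are hypothetical and do not correspond to any engine's API.

```python
class Instance:
    """Hypothetical module instance: maps tag indices to store addresses."""

    def __init__(self, tag_addrs):
        self.tags = list(tag_addrs)


def compile_suspend(tagidx):
    """Model of code compiled once per module: it only knows the tag index."""

    def run(instance):
        # The tag address is looked up on the instance at run time.
        return instance.tags[tagidx]

    return run


# The same compiled code is shared by two instantiations of the module.
code = compile_suspend(0)
instance_a = Instance([7])
instance_b = Instance([42])

addr_a = code(instance_a)
addr_b = code(instance_b)
print(addr_a, addr_b)  # 7 42
```

Since the two instantiations allocate different tag addresses, no single address could have been baked into `code` at validation or compile time.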

Incorporating Thomas Lively's comment, I have rectified the spot where suspend and switch translate the tag index into a tag address, by creating new instructions suspend.addr and switch.addr that take addresses (rather than indices) as arguments. This allows for a rule focused on translating suspend to suspend.addr (and likewise for switch), enforcing that the correct wasm frame F is used.

mlegoupil · 2025-08-29T12:23:50Z

With this latest commit, I now believe the explainer to be fully correct.

Thank you Thomas for your comment! What you said makes a lot of sense and I have now changed to the other solution I had suggested. Sam and I took the time to double-check that this behaviour is the one exhibited by the reference implementation.

tlively

Would things be slightly simpler if we didn't store the arity of the holes in the store, either? The arities should always be computable from instruction immediates.

tlively · 2025-08-29T16:47:30Z

proposals/stack-switching/Explainer.md

+#### Administrative instructions
+
+* `(ref.cont a)` represents a continuation value, where `a` is a *continuation address* indexing into the store's `conts` component
+  - `ref.cont a : [] -> [(ref $ct)]`


There may be many valid choices of $ct here, which violates our principal typing rules. Or are those not intended to apply to administrative instructions?

@rossberg, can you answer this?

tlively · 2025-08-29T17:30:14Z

proposals/stack-switching/Explainer.md

+  - and `hdl'*` is obtained by translating the `<tagidx>` from `hdl*` into `<tagaddr>` using `F.tag`:
+       - if `on $a $l` is in `hdl*` and `F.tags[$e]=ea`, then `ea $l` is in `hdl'*`
+       - if `on $a switch` is in `hdl'*` and `F.tags[$e]=ea`, then `ea switch` is in `hdl'*`


This formulation doesn't seem to preserve the order of handlers, but the order can be important if there are multiple handlers for the same tag.

Also, I believe F.tags[$e] should be F.module.tags[$e]

For types, the spec defines a notion of inst_m(t) that substitutes all occurrences of type indices in t with respective defined types from the moduleinst m. We could generalise this notion to tag indices, then it would just be inst_F.module(hdl)* after generalising tagidx to taguse in the AST.

Should F.tags[$e] be F.tags[$a]? $e comes from nowhere. (Also applies to resume_throw.)

Agreed, the formulation can be improved to make it explicit that the order of the handlers should be preserved. F.tags should be F.module.tags, and $e should be $a
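One way to state the translation so that handler order is visibly preserved is as a pointwise map over the handler list. The Python sketch below assumes a handler clause is a pair of a tag index and a target (a label name or the marker `"switch"`), and that `module_tags` stands in for F.module.tags; the name `translate_handlers` is hypothetical.

```python
SWITCH = "switch"


def translate_handlers(module_tags, hdls):
    """Map each (tag index, target) clause to (tag address, target), in order."""
    return [(module_tags[tagidx], target) for tagidx, target in hdls]


# Stand-in for F.module.tags: tag index 0 lives at address 100, index 1 at 200.
module_tags = [100, 200]

# Two clauses for the same tag; the first one must stay first.
hdls = [(0, "$l1"), (1, SWITCH), (0, "$l2")]

translated = translate_handlers(module_tags, hdls)
print(translated)  # [(100, '$l1'), (200, 'switch'), (100, '$l2')]
```

Because the result is built position by position, the i-th translated clause always comes from the i-th source clause, which a set-style "is in" formulation does not guarantee.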

tlively · 2025-08-29T18:02:17Z

proposals/stack-switching/Explainer.md

+* `S; F; (prompt{hdl1* (ea switch) hdl2*} H^ea[v^n (ref.cont ca) (switch.addr $ct ea)] end) --> S''; F; prompt{hdl1* (ea switch) hdl2*} E[v^n (ref.cont |S.conts|)] end`
+  - iff  `S.conts[ca] = (E : n')`
+  - and `n' = 1 + n`
+  - and `ea notin tagaddr(hdl1*)`


This should be (ea switch) notin hdl1*, I think. It's fine if there is an (ea $l) in hdl1*. suspend needs a similar fix.
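The difference between the two side conditions can be sketched in Python. This reflects my reading of the comment rather than the final rules: a clause is modelled as a pair of tag address and target, a switch only looks for an `(ea switch)` clause, and a suspend only looks for the first label clause for `ea`. The function names are hypothetical.

```python
SWITCH = "switch"


def find_suspend_label(hdls, ea):
    """Return the label of the first (ea $l) clause, skipping (ea switch)."""
    for tagaddr, target in hdls:
        if tagaddr == ea and target != SWITCH:
            return target
    return None


def handles_switch(hdls, ea):
    """A prompt handles a switch on ea iff it has an (ea switch) clause."""
    return (ea, SWITCH) in hdls


# An (ea $l) clause before (ea switch) does not stop the switch from matching.
hdls_a = [(5, "$l"), (5, SWITCH), (6, "$k")]
switch_ok = handles_switch(hdls_a, 5)

# Symmetrically, an earlier (ea switch) clause does not hide a later (ea $l).
hdls_b = [(5, SWITCH), (5, "$l")]
label = find_suspend_label(hdls_b, 5)

print(switch_ok, label)  # True $l
```

With the condition written as `ea notin tagaddr(hdl1*)`, both of these cases would be rejected, since clauses of the other kind would count as a match for the tag address.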

Co-authored-by: Thomas Lively <[email protected]>

rossberg

Re the index vs address issue: for types, an analogous problem exists. The way I addressed this without introducing whole lotta duplicated syntax and clumsy rules is by generalising type indices to "type uses" in the AST, which can be either indices or concrete types. During reduction, the indices are then substituted.

For tags we should do the same, that is, introduce taguse ::= tagidx | tagaddr and use that in appropriate places of the AST. Then instantiation again can simply perform substitution. (As @tlively says, this substitution cannot happen at compile time, but only after the tags are actually allocated. It would happen in the same places during reduction where you currently introduce the new syntax forms like suspend.addr and hdl'.)
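As a rough illustration of the taguse idea, the Python sketch below models a tag use as either an index or an address and defines substitution as a function that replaces indices and leaves addresses untouched. The names `TagIdx`, `TagAddr` and `inst_tag` are hypothetical, and `module_tags` is a stand-in for the tag table of a module instance.

```python
from dataclasses import dataclass
from typing import Union


@dataclass(frozen=True)
class TagIdx:
    idx: int


@dataclass(frozen=True)
class TagAddr:
    addr: int


# taguse ::= tagidx | tagaddr
TagUse = Union[TagIdx, TagAddr]


def inst_tag(module_tags, use):
    """Substitute a tag index with its address; addresses are left as they are."""
    if isinstance(use, TagIdx):
        return TagAddr(module_tags[use.idx])
    return use


module_tags = [100, 200]

once = inst_tag(module_tags, TagIdx(1))
twice = inst_tag(module_tags, once)
print(once, twice)  # TagAddr(addr=200) TagAddr(addr=200)
```

With a single syntactic class, instructions and handler clauses keep one form each, and applying the substitution to an already substituted term changes nothing.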

rossberg · 2025-09-02T10:28:17Z

proposals/stack-switching/Explainer.md

+  - and `hdl'*` is obtained by translating the `<tagidx>` from `hdl*` into `<tagaddr>` using `F.tag`:
+       - if `on $a $l` is in `hdl*` and `F.tags[$e]=ea`, then `ea $l` is in `hdl'*`
+       - if `on $a switch` is in `hdl'*` and `F.tags[$e]=ea`, then `ea switch` is in `hdl'*`


For types, the spec defines a notion of inst_m(t) that substitutes all occurrences of type indices in t with respective defined types from the moduleinst m. We could generalise this notion to tag indices, then it would just be inst_F.module(hdl)* after generalising tagidx to taguse in the AST.

rossberg · 2025-09-02T10:29:54Z

proposals/stack-switching/Explainer.md

  label_n{instr*} H^ea end
  frame_n{F} H^ea end
  catch{...} H^ea end
-  prompt{hdl*} H^ea end   (iff ea notin tagaddr(hdl*))


Not sure I understand this change, you need to first compute the set of free tag addresses.

tlively · 2025-09-19T01:04:41Z

@mlegoupil, will you have time to push this over the finish line soon?

Alan-Liang · 2025-09-24T18:59:52Z

proposals/stack-switching/Explainer.md

+  - `S ::= {..., conts <cont>?*}`
+
+* A continuation is a context annotated with its hole's arity
+  - `cont ::= (E : n)`


Maybe bikeshedding: should this be called continst instead of cont? Other components in the store are all named like fooinst.

Yes, continst would be more consistent.

proposals/stack-switching/Explainer.md

Committing tlively's suggested change Co-authored-by: Thomas Lively <[email protected]>

Applied all suggested changes except taguse which is the last remaining issue

mlegoupil · 2025-10-02T17:38:49Z

I have now incorporated all the changes mentioned above. From where I stand the PR is ready to be merged.

tlively · 2025-10-02T18:52:02Z

proposals/stack-switching/Explainer.md

+#### Administrative instructions
+
+* `(ref.cont a)` represents a continuation value, where `a` is a *continuation address* indexing into the store's `conts` component
+  - `ref.cont a : [] -> [(ref $ct)]`


@rossberg, can you answer this?

tlively

Looks good to me, but I still wonder if we can simplify the rules (and make them better match implementations) by computing arities from immediates where they are needed rather than by looking them up and storing them in the store. Happy to discuss or consider that as a follow-up.

slindley · 2025-10-03T17:19:37Z

We can still bikeshed details in further issues / PRs, but now that we've converged on something coherent I've merged it into main.

Reduction semantics

9ad8d55

This patch populates the "Execution" section of the Explainer document with the reduction rules for stack switching.

dhil requested review from rossberg and tlively January 22, 2025 10:19

dhil added 3 commits January 22, 2025 10:20

Some type expansions

db96405

Fix minor typo, hdl => hdl1

2330933

Fix typo: stray '

d8ad9ed

rossberg reviewed Jan 22, 2025

dhil added 2 commits February 6, 2025 09:48

Address Andreas' feedback

79e59f6

Remove stray ^

d7e176d

mlegoupil reviewed Feb 6, 2025

mlegoupil reviewed Feb 14, 2025

tlively reviewed Aug 29, 2025

Update proposals/stack-switching/Explainer.md

515e806

Co-authored-by: Thomas Lively <[email protected]>

rossberg reviewed Sep 2, 2025

Alan-Liang reviewed Sep 24, 2025

Alan-Liang reviewed Sep 27, 2025

Alan-Liang reviewed Sep 27, 2025

Fix resume_throw semantics

12d2624

rossberg reviewed Oct 2, 2025

mlegoupil and others added 5 commits October 2, 2025 14:11

Update proposals/stack-switching/Explainer.md

b09f2fa

Committing tlively's suggested change Co-authored-by: Thomas Lively <[email protected]>

Update proposals/stack-switching/Explainer.md

12ad434

Committing tlively's suggested change Co-authored-by: Thomas Lively <[email protected]>

Applied all suggested changes except taguse

b570d38

Applied all suggested changes except taguse which is the last remaining issue

Added tag uses

6082dc5

using taguse in hdl too

b304d86

mlegoupil added 2 commits October 2, 2025 18:18

factored typing rules and added keyword 'on' where now necessary

ff3c5e0

Typo in typing rule for tag addresses

f44c8d1

tlively reviewed Oct 2, 2025

corrected $a to $e and added switch failure rules

7335969

tlively approved these changes Oct 3, 2025

slindley merged commit 389f6cc into WebAssembly:main Oct 3, 2025

dhil deleted the reduction-semantics branch October 6, 2025 09:35
