
Experimental AutoBatch pass #8530

Draft

kripken wants to merge 53 commits into WebAssembly:main from kripken:autobatch

Conversation


@kripken kripken commented Mar 26, 2026

Background:

WebAssembly/component-model#371 (comment)

We pay a cost each time we cross the wasm-JS boundary, and if that happens often the cost can be significant. One way to avoid such boundary crossings is to batch calls: build up a buffer of serialized instructions, then call into JS once to read the buffer from linear memory and execute it. This approach is taken by Emscripten's GL proxying and by webcc. When there are many short calls, this can speed things up in some cases.
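To make the batching idea concrete, here is a minimal sketch in plain JS. It simulates linear memory with an Int32Array and uses an invented instruction format of [opcode, argCount, ...args]; all names (OP_*, emit, runBatch) are hypothetical, not the PR's actual encoding.

```javascript
// Hypothetical opcodes for two void calls we want to batch.
const OP_SET_PIXEL = 1;
const OP_CLEAR = 2;

// "Wasm side": instead of crossing the boundary per call,
// append [opcode, argCount, ...args] to a buffer in linear memory.
const batch = new Int32Array(1024); // stands in for linear memory
let batchLen = 0;
function emit(op, ...args) {
  batch[batchLen++] = op;
  batch[batchLen++] = args.length;
  for (const a of args) batch[batchLen++] = a;
}

// "JS side": a single boundary crossing decodes and executes
// everything that was queued since the last flush.
const calls = [];
function runBatch() {
  let i = 0;
  while (i < batchLen) {
    const op = batch[i++];
    const n = batch[i++];
    const args = Array.from(batch.subarray(i, i + n));
    i += n;
    if (op === OP_CLEAR) calls.push(`clear(${args})`);
    else if (op === OP_SET_PIXEL) calls.push(`setPixel(${args})`);
  }
  batchLen = 0; // buffer fully consumed
}

emit(OP_CLEAR, 0);
emit(OP_SET_PIXEL, 3, 4, 255);
runBatch(); // one crossing executes both queued calls, in order
```

The point is that N short calls cost one boundary crossing plus a tight decode loop, rather than N crossings.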

This PR does something related but more general: it takes an input wasm and automatically applies batching to every call where it can. Calls that do not return a value are batched, while calls that do return a value flush the buffer and then run normally. The pass also autogenerates JS deserialization code that matches the serialization; you paste that into the JS side and that's it.
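The flush rule above can be sketched as follows: void imports are merely queued, while any value-returning import must first flush the queue so that the observable order of effects is preserved. All names here (batchedVoidCall, directCall, the GL function names) are illustrative, not the PR's generated code.

```javascript
const queue = [];  // pending void calls, oldest first
const log = [];    // records actual execution order

function execute(name) { log.push(name); }

// Drain everything queued so far, in order.
function flush() {
  for (const name of queue) execute(name);
  queue.length = 0;
}

// A call with no return value: just record it, no boundary crossing yet.
function batchedVoidCall(name) { queue.push(name); }

// A call that returns a value: flush pending work first, then run
// synchronously so the caller gets its result with correct ordering.
function directCall(name, result) {
  flush();
  execute(name);
  return result;
}

batchedVoidCall("glUniform1f");
batchedVoidCall("glDrawArrays");
const err = directCall("glGetError", 0); // forces a flush before running
```

Because the flush happens before the value-returning call executes, the earlier void calls cannot be reordered past it.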

This is not safe in general, because of issues like reentrancy (wasm->js->wasm->js) and stale data (if a pointer is serialized for later use, the data it points to must not be modified in the meantime). If we decide to productionize this, there would need to be user control over what is autobatched and what is not, etc. (In Emscripten specifically we could, for example, enable this by default on all proxy: async methods; other toolchains might have similar options.) For now, however, this makes it easy to get benchmark numbers.

I measured three things:

  • A trivial microbenchmark. This becomes 2x faster.
  • The webcc benchmark. This uses embind, so it is actually going through an inefficient and unrecommended path for speed-intensive code, but still interesting I think. It becomes 1.5x faster.
  • A glgears benchmark which tests WebGL. This shows no speedup, and I confirmed in the profiler that there isn't really significant js/wasm boundary overhead here.

(These measurements are total time - I didn't measure the cost of individual wasm->js calls. But obviously this reduces that overhead to essentially 0, if you have enough calls being batched.)

So this does show a large speedup, as expected, when making large numbers of js/wasm boundary crossings for small amounts of work. However, I don't know how common that is in practice - the last benchmark I tested, the WebGL one where I saw no speedup, is probably representative of most WebGL code out there (where proper shader and buffer usage avoids js/wasm overhead anyway).
