
perf: use unordered_set for Name sets for better compile speed#8586

Open
Changqing-JING wants to merge 2 commits into WebAssembly:main from Changqing-JING:opt/compile-speed

Conversation

@Changqing-JING
Contributor

@Changqing-JING Changqing-JING commented Apr 9, 2026

wasm::Name has an O(1) pointer-based hash but operator< does O(n) memcmp, making std::set<Name> unnecessarily slow. On large workloads, ~35% of wasm-opt CPU time was spent in __memcmp_evex_movbe inside EffectAnalyzer::walk called from CodeFolding. Switching the four std::set<Name> fields in EffectAnalyzer, NameSet in branch-utils.h, and the local containers in CodeFolding to their unordered equivalents eliminates the bottleneck.

@Changqing-JING Changqing-JING marked this pull request as draft April 9, 2026 05:32
@Changqing-JING Changqing-JING requested a review from a team as a code owner April 9, 2026 05:32
@Changqing-JING Changqing-JING requested review from tlively and removed request for a team April 9, 2026 05:32
@Changqing-JING Changqing-JING marked this pull request as ready for review April 9, 2026 09:38
@Changqing-JING Changqing-JING marked this pull request as draft April 9, 2026 10:17
@Changqing-JING Changqing-JING marked this pull request as ready for review April 9, 2026 10:34
@kripken
Member

kripken commented Apr 9, 2026

@Changqing-JING what workloads did you test on?

I ran a test with a large Java testcase. Instruction counts, branches, and walltime were within noise.

If you are seeing 35% on this code, perhaps there is something special in your testcase? In general, the number of globals and break targets is very small, so a normal set can do well (by saving the time it takes to do hashing).

@Changqing-JING
Contributor Author

Changqing-JING commented Apr 10, 2026

@kripken Thank you for the review.

  1. Yes, it reproduces when a br_table has a large number of targets. The background is that AssemblyScript's GC uses a br_table over type_rtid to dispatch the GC visitor. In a large app with a large number of types, CodeFolding becomes very slow.
    It can be reproduced with this testcase:
    https://github.com/Changqing-JING/assemblyscript/blob/slow-compile/test.ts
time node ./bin/asc.js -O2 -o build/test.wasm ./test.ts 

real    6m27.324s
user    6m19.690s
sys     0m11.712s

To make the problem easier to understand, I created an example that emulates the AssemblyScript case:
https://github.com/Changqing-JING/BinaryenLearn/blob/binaryen-slow-pass/flamegraph.sh

  2. I can share the flamegraph:
    (flamegraph image)

  3. The flamegraph shows that even though set and map save the hashing time, operator< of wasm::Name costs more. The time a sorted container saves on hashing mainly benefits numeric keys such as wasm::Index, where comparing keys is cheap; with string keys that is not the case, especially when the strings are long and share a common prefix.

  4. Based on 3, the problem is hard to reproduce with wasm-opt alone, because wasm-opt uses auto-indexed label names like $block1. But when Binaryen is used as a library by a frontend compiler, names can be long, like $__inlined_func$~lib/rt/itcms/Object#unlink$81, and then the string comparison costs much more time.

