Use IRBuilder in the binary parser #6963

tlively · 2024-09-21T04:22:15Z

IRBuilder is a utility for turning arbitrary valid streams of Wasm instructions into valid Binaryen IR. It is already used in the text parser, so now use it in the binary parser as well. Since the IRBuilder API for building each intruction requires only the information that the binary and text formats include as immediates to that instruction, the parser is now much simpler than before. In particular, it does not need to manage a stack of instructions to figure out what the children of each expression should be; IRBuilder handles this instead.

There are some differences between the IR constructed by IRBuilder and the IR the binary parser constructed before this change. Most importantly, IRBuilder generates better multivalue code because it avoids eagerly breaking up multivalue results into individual components that might need to be immediately reassembled into a tuple. It also parses try-delegate more correctly, allowing the delegate to target arbitrary labels, not just other trys. There are also a couple superficial differences in the generated label and scratch local names.

As part of this change, add support for recording binary source locations in IRBuilder.

In preparation for using IRBuilder in the binary parser, eagerly create Functions when parsing the function section so that they are already created once we parse the code section. IRBuilder will require the functions to exist when parsing calls so it can figure out what type each call should have, even when there is a call to a function whose body has not been parsed yet. NFC except that some error messages change to include the new empty functions.

The purpose of the datacount section is to pre-declare how many data segments there will be so that engines can allocate space for them and not have to back patch subsequent instructions in the code section that refer to them. Once we use IRBuilder in the binary parser, we will have to have the data segments available by the time we parse instructions that use them, so eagerly construct the data segments when parsing the datacount section.

The binary parser generally does not know the final names of module elements when it parses them, or even when it parses instructions that refer to them, since the name section comes at the end of a binary. The parser previously kept a list of pointers to locations where each module element's name would have to be used, then it patched those locations after parsing the names section to discover the final names. When the binary parser starts using IRBuilder, the parsed expressions will be constructed and managed by IRBuilder rather than by the parser itself. This means that the parser will no longer be able to collect pointers to places where module element names are used; it won't have access to the instructions at all. Since the strategy of collecting locations to patch will no longer work, switch to a strategy of traversing the module to find and update names instead. This is generally less efficient because the locations have to be found before they can be updated, but on the other hand it only happens when preserving debug info and it is parallelizable anyway.

…ame-fixup

IRBuilder is a utility for turning arbitrary valid streams of Wasm instructions into valid Binaryen IR. It is already used in the text parser, so now use it in the binary parser as well. Since the IRBuilder API for building each intruction requires only the information that the binary and text formats include as immediates to that instruction, the parser is now much simpler than before. In particular, it does not need to manage a stack of instructions to figure out what the children of each expression should be; IRBuilder handles this instead. There are some differences between the IR constructed by IRBuilder and the IR the binary parser constructed before this change. Most importantly, IRBuilder generates better multivalue code because it avoids eagerly breaking up multivalue results into individual components that might need to be immediately reassembled into a tuple. It also parses try-delegate more correctly, allowing the delegate to target arbitrary labels, not just other `try`s. There are also a couple superficial differences in the generated label and scratch local names. There are two remaining bugs: First, support for creating DWARF location spans is missing because IRBuilder does not have an API for that yet (but source map locations work fine). Second, IRBuilder generates pops inside nameless blocks in some circumstances involving stacky code. This is currently an IR validation error, so #6950 will have to be resolved before this can land. This change also makes the binary parser significantly slower (by about 50%). The lowest hanging performance fruit seems to be tracking branch targets in IRBuilder to avoid having to scan for branches when finalizing blocks.

There were previously two separate code paths for printing function signatures, one for imported functions and one for declared functions. The only intended difference was that parameter names were printed for declared functions but not for imported functions. Reduce duplication by consolidating the code paths, and add support for printing names for imported function parameters that have them. Also fix a bug where empty names were printed as `$` rather than the correct `$""`.

Rather than back-patching names when we get to the names section in the binary reader, skip ahead to read the names section before anything else so we can use the final names right away. This is a prerequisite for using IRBuilder in the binary reader. The only functional change is that we now allow empty local names. Empty names are perfectly valid.

Previously the interpreter only executed overflow and bounds checks for memory.grow on 32-bit memories. Run the checks on 64-bit memories as well.

CodeFolding previously did not consider br_on_* instructions at all, so it would happily merge tails even if there were br_on_* branches to the same label with non-matching tails. Fix the bug by making any label targeted by any instruction not explicitly handled by CodeFolding unoptimizable. This will gracefully handle other branching instructions like `resume` and `resume_throw` as well. Folding these branches properly is left as future work. Also rename the test file from code-folding_enable-threads.wast to just code-folding.wast and enable all features instead of just threads. The old name was left over from when the test was originally ported to lit, and the new feature is necessary because the new test uses GC instructions.

CodeFolding previously only worked on blocks that did not produce values. It worked on Ifs that produced values, but only by accident; the logic for folding matching tails was not written to support tails producing concrete values, but it happened to work for Ifs because subsequent ReFinalize runs fixed all the incorrect types it produced. Improve the power of the optimization by explicitly handling tails that produce concrete values for both blocks and ifs. Now that the core logic handles concrete values correctly, remove the unnecessary ReFinalize run.

tlively added 30 commits September 18, 2024 18:37

skip unparsed functions when printing

8f4ca9f

update and fix test of datacount error

8d8eb13

Merge branch 'main' into binary-parser-eager-funcs

945a807

Merge branch 'binary-parser-eager-funcs' into binary-parser-eager-data

4d0deb1

Merge branch 'binary-parser-eager-data' into binary-parser-refactor-n…

70585fc

…ame-fixup

Merge branch 'main' into binary-parser-refactor-name-fixup

098bd4f

Merge branch 'main' into binary-parser-refactor-name-fixup

b2b011c

Merge branch 'binary-parser-refactor-name-fixup' into binary-ir-builder

95758e0

Merge branch 'main' into binary-parser-refactor-name-fixup

2f8f294

Merge branch 'binary-parser-refactor-name-fixup' into binary-ir-builder

9c91006

Merge branch 'main' into binary-parser-refactor-name-fixup

e31a2b0

Merge branch 'binary-parser-refactor-name-fixup' into binary-ir-builder

a39fde5

Merge branch 'main' into binary-parser-refactor-name-fixup

b8e35d2

Merge branch 'binary-parser-refactor-name-fixup' into binary-ir-builder

25dd297

Merge branch 'main' into binary-parser-refactor-name-fixup

54ebeb3

Merge branch 'binary-parser-refactor-name-fixup' into binary-ir-builder

c06e531

Merge branch 'main' into binary-parser-refactor-name-fixup

a22dd5f

Merge branch 'binary-parser-refactor-name-fixup' into binary-ir-builder

f2be566

Merge branch 'binary-parser-names-first' into binary-ir-builder

ff3b653

address comments

2d1d914

Merge branch 'binary-parser-eager-local-names' into binary-ir-builder

24fb433

Merge branch 'main' into binary-parser-names-first

33efda5

Merge branch 'binary-parser-eager-local-names' into binary-ir-builder

a00a621

Merge branch 'main' into binary-ir-builder

c9436a5

tlively added 27 commits November 22, 2024 20:31

Merge branch 'relax-unreachable-if' into binary-ir-builder

e36d65a

Fix memory.grow bounds and overflow checks for mem64

640a101

Previously the interpreter only executed overflow and bounds checks for memory.grow on 32-bit memories. Run the checks on 64-bit memories as well.

Use BranchUtils

4ba1882

Merge branch 'code-folding-br-on' into binary-ir-builder

2a6c25b

Merge branch 'mem-grow-i64-checks' into binary-ir-builder

990d956

Merge branch 'main' into code-folding-no-concrete

4076665

Merge branch 'code-folding-no-concrete' into relax-unreachable-if

82533fb

Merge branch 'relax-unreachable-if' into binary-ir-builder

0c0dca3

Merge branch 'main' into code-folding-no-concrete

3bf4661

Merge branch 'relax-unreachable-if' into binary-ir-builder

83ad344

Merge branch 'code-folding-no-concrete' into relax-unreachable-if

2c3ddeb

remove redundant opt that would need refinalize

0800007

update tests

0d9d994

Merge branch 'code-folding-concrete' into binary-ir-builder

55f093f

cleanup empty ifs

c243013

Skip ifs with unreachable conditions in CodeFolding

3c0f5dd

Merge branch 'main' into code-folding-concrete

d400bcf

Merge branch 'code-folding-concrete' into relax-unreachable-if

30be648

Merge branch 'main' into code-folding-concrete

fa9fb83

Merge branch 'code-folding-concrete' into relax-unreachable-if

6f624aa

Merge branch 'relax-unreachable-if' into binary-ir-builder

75c7681

Merge branch 'main' into relax-unreachable-if

5d69456

Merge branch 'relax-unreachable-if' into binary-ir-builder

2dc8fc8

update tests

2ef8d9f

Merge branch 'relax-unreachable-if' into binary-ir-builder

1c22acf

tlively merged commit f8e1622 into main Nov 27, 2024
13 checks passed

tlively deleted the binary-ir-builder branch November 27, 2024 05:57

kripken mentioned this pull request Dec 2, 2024

[Old EH] "pop's location is not valid"-error since Emscripten 3.1.73 #7127

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use IRBuilder in the binary parser #6963

Use IRBuilder in the binary parser #6963

tlively commented Sep 21, 2024 •

edited

Loading

Use IRBuilder in the binary parser #6963

Use IRBuilder in the binary parser #6963

Conversation

tlively commented Sep 21, 2024 • edited Loading

tlively commented Sep 21, 2024 •

edited

Loading