Ladybird WebAssembly Update #2

Nearly a month has passed since my latest update on the WebAssembly implementation of the Ladybird browser. My intention is to get the Wasm virtual machine to be fully spec-compliant by the end of the summer. My original post explains a little bit more about what that means as well as my motivations behind it.

A lot has happened in the last few weeks, so I'll do my best to show my progress in this post. I also have an exciting announcement to make regarding the scope of this challenge!

Improvements

Right now, we're sitting at a mere 10 tests that do not pass on my machine.1 Here's a list of the PRs I've made, from most to least recent:

  1. Fix a tiny memory.grow bug (see PR)
  2. Fix loop arity for certain blocktypes (see PR)
  3. Implement SIMD bitwise operations (see PR)
  4. Fully validate section lengths (see PR)
  5. Give names to functions exposed to JS via ref.func (see PR)
  6. Clean up some unused value type variants (see PR)
  7. Make float operations fully spec-compliant (see PR)
  8. Validate edge-case if instruction (see PR)

In the following subsections, I'll go into a little bit more detail about a few of the notable PRs.

Floats

This was a very, very painful thing to work on. The main problem was this:

In the IEEE754 spec, there are many float bit patterns that mean "not-a-number".2 However, most JS implementations canonicalize them, meaning that two JS NaNs will always have the same bit pattern, even though they technically could have different internal representations and nobody would be mad.3 For example, 0x7fc00000 and 0x7fb71688 are both "not-a-number" by IEEE754's standard, but JS NaNs are always the former.4

"Wait we're talking about Wasm, so why is this a problem?" Well, the Ladybird Wasm testsuite uses JavaScript as a harness to run the tests! This means that we sometimes need to pass NaNs from JS to Wasm. But sometimes, we want to pass a NaN that is not 0x7fc00000. There is literally no way to do that in JS.

So, in short, because Ladybird's JS implementation canonicalizes NaNs to have the same bit pattern, we lose vital information when passing arguments to Wasm tests. Technically, this is not a problem with the Wasm VM itself. The VM is pretty much fine. It's a problem with how the tests are run.

The Fix

To fix this, instead of passing NaN from JavaScript (and losing information), we now pass a Uint8Array that contains the specific bit pattern of the NaN in question. Then, when JS values are being translated to Wasm values, we check if it's a Uint8Array, and if it is, we memcpy the data into a float (we're in C++ land at that point, and C++ doesn't canonicalize anything).

This wasn't an easy problem to solve, nor was it to find! You can read the code here.

SIMD Bitwise Operations

I saw that some SIMD instructions such as:

weren't implemented, and they seemed like low-hanging fruit, since they're just bitwise operations on a u128. I've been staying away from SIMD, but this was too hard to just ignore! I implemented them in a few minutes and a bunch of tests pass now.

This leads me into my next announcement...

Expanding the Goal

We've reached nearly full spec-compliance (minus SIMD) much faster than I thought we would.5 I initially decided to ignore SIMD tests because very few compilers take advantage of Wasm SIMD, even though it's part of the specification. But, since we have the extra time, why not go for 100% for real? The whole WebAssembly 2.0!

I did an experiment with SIMD a few weeks ago, and I feel comfortable enough to implement it. Another Ladybird contributor has been stuck on a big SIMD PR for a while now, so I think the first thing to do is to start there!

Wrap-Up

With a good amount of things just pushed to the to-do list, I once again invite you to help out! We can definitely get to 100% by the end of the summer; the existing progress has been a testament to that.

From a personal perspective, I've really enjoyed contributing to the project. I've learned a lot of modern C++ (even a little bit of template magic!), and I truly believe in the project's mission.

As always, if you have any issues, questions, or feedback about the post, feel free to post an issue on this website's GitHub repo!


  1. This does not include SIMD tests, but you'll read more about that later :) ↩︎

  2. I'd recommend checking out float.exposed to play around with the number of NaNs there are! ↩︎

  3. This isn't fully true. Most JS implementations (including Ladybird's) do something called NaN boxing, which essentially exploit the idea that NaN bit patterns are arbitrary to the effect of storing extra information inside NaN values. That's actually the reason why NaNs are canonicalized in the first place (so there's "one true NaN")! ↩︎

  4. This canonicalization behavior is not part of the JS standard, but is common among JS implementations. ↩︎

  5. We're only about halfway through the summer! ↩︎