JsPerf: ParseInt vs Plus conversion

W

2

13

I've try to probe that plus (+) conversion is faster than parseInt with the following jsperf, and the results surprised me:

Parse vs Plus

Preparation code

<script>
  Benchmark.prototype.setup = function() {
    var x = "5555";
  };
</script>

Parse Sample

var y = parseInt(x); //<---80 million loops

Plus Sample

var y = +x; //<--- 33 million loops

The reason is because I'm using "Benchmark.prototype.setup" in order to declare my variable, but I don't understand why

See the second example:

Parse vs Plus (local variable)

<script>
  Benchmark.prototype.setup = function() {
    x = "5555";
  };
</script>

Parse Sample

var y = parseInt(x); //<---89 million loops

Plus Sample

var y = +x; //<--- 633 million loops

Can someone explain the results?

Thanks

Wildermuth answered 11/2, 2015 at 15:4 Comment(4)

You will get a better response here if the core code is here, rather than referred by a link (I gave you a +1 BTW, since it is an interesting question). – Stylolite 11/2, 2015 at 15:9

An interesting question, but it's more about jsperf and test cases than about parseInt vs +. I think your title is a little misleading. – Damson 11/2, 2015 at 15:11

I edited the question. I hope now it better – Wildermuth 11/2, 2015 at 16:2

nobody knows why this results? – Wildermuth 11/2, 2015 at 21:2

H

32

In the second case + is faster because in that case V8 actually moves it out of the benchmarking loop - making benchmarking loop empty.

This happens due to certain peculiarities of the current optimization pipeline. But before we get to the gory details I would like to remind how Benchmark.js works.

To measure the test case you wrote it takes Benchmark.prototype.setup that you also provided and the test case itself and dynamically generates a function that looks approximately like this (I am skipping some irrelevant details):

function (n) {
  var start = Date.now();

  /* Benchmark.prototype.setup body here */
  while (n--) {
    /* test body here */
  }

  return Date.now() - start;
}

Once the function is created Benchmark.js calls it to measure your op for a certain number of iterations n. This process is repeated several times: generate a new function, call it to collect a measurement sample. Number of iterations is adjusted between samples to ensure that function runs long enough to give meaningful measurement.

Important things to notice here is that

both your case and Benchmark.prototype.setup are the textually inlined;
there is a loop around the operation you want to measure;

Essentially we discussing why the code below with a local variable x

function f(n) {
  var start = Date.now();

  var x = "5555"
  while (n--) {
    var y = +x
  }

  return Date.now() - start;
}

runs slower than the code with global variable x

function g(n) {
  var start = Date.now();

  x = "5555"
  while (n--) {
    var y = +x
  }

  return Date.now() - start;
}

(Note: this case is called local variable in the question itself, but this is not the case, x is global)

What happens when you execute these functions with a large enough values of n, for example f(1e6)?

Current optimization pipeline implements OSR in a peculiar fashion. Instead of generating an OSR specific version of the optimized code and discarding it later, it generates a version that can be used for both OSR and normal entry and can even be reused if we need to perform OSR at the same loop. This is done by injecting a special OSR entry block into the right spot in the control flow graph.

OSR version of the control flow graph

OSR entry block is injected while SSA IR for the function is built and it eagerly copies all local variables out of the incoming OSR state. As a result V8 fails to see that local x is actually a constant and even looses any information about its type. For subsequent optimization passes x2 looks like it can be anything.

As x2 can be anything expression +x2 can also have arbitrary side-effects (e.g. it can be an object with valueOf attached to it). This prevents loop-invariant code motion pass from moving +x2 out of the loop.

Why is g faster than? V8 pulls a trick here. It tracks global variables that contain constants: e.g. in this benchmark global x always contains "5555" so V8 just replaces x access with its value and marks this optimized code as dependent on the value of x. If somebody replaces x value with something different than all dependent code will be deoptimized. Global variables are also not part of the OSR state and do not participate in SSA renaming so V8 is not confused by "spurious" φ-functions merging OSR and normal entry states. That's why when V8 optimizes g it ends up generating the following IR in the loop body (red stripe on the left shows the loop):

IR before LICM

Note: +x is compiled to x * 1, but this is just an implementation detail.

Later LICM would just take this operation and move it out of the loop leaving nothing of interest in the loop itself. This becomes possible because now V8 knows that both operands of the * are primitives - so there can be no side-effects.

IR after LICM

And that's why g is faster, because empty loop is quite obviously faster than a non-empty one.

This also means that the second version of benchmark does not actually measure what you would like it to measure, and while the first version did actually grasp some of the differences between parseInt(x) and +x performance that was more by luck: you hit a limitation in V8's current optimization pipeline (Crankshaft) that prevented it from eating the whole microbenchmark away.

Helwig answered 13/2, 2015 at 13:39 Comment(1)

...wow. From graph paper flow charts to a complex OSR V8 SSAIR optimization graph thing. This seemed like such a simple question, yet this is one of the most complicated answers I have ever seen. +1 – Felsite 17/12, 2015 at 7:38

H

-1

I believe the reason is because parseInt looks for more than just a conversion to an integer. It also strips any remaining text off of the string like when parsing a pixel value:

var width = parseInt(element.style.width);//return width as integer

whereas the plus sign could not handle this case:

var width = +element.style.width;//returns NaN

The plus sign does an implicit conversion from string to number and only that conversion. parseInt tries to make sense out of the string first (like with integers tagged with a measurement).

Howey answered 12/2, 2015 at 23:40 Comment(1)

the question is why in the first case parseInt seams to be faster than plus – Wildermuth 13/2, 2015 at 6:39

Recommended topics

Hot tags