use global offset for buffered chunks #8
Conversation
For split matchers that occur rarely in a stream with many chunks, resetting the search offset inside the handler makes for very bad performance (e.g. for a matcher that doesn't occur at all, the runtime is O(N²) where N is the total number of chunks in the stream). Moving the offset outside of the stream-chunk-handler restores linear runtime.
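A minimal sketch of the idea (hypothetical code, not the actual `index.js`, which operates on Buffers via `bops`): the search `offset` lives outside the per-chunk handler, so bytes already known to be match-free are never re-scanned. Resetting `offset` to 0 inside the handler would re-scan the entire buffered data on every chunk, which is the O(N²) behavior described above.

```javascript
// Sketch of a chunk-buffering splitter with a persistent search offset.
// `offset` persists across write() calls -- the fix this PR proposes.
function makeSplitter (matcher) {
  var buf = ''
  var offset = 0 // persists across chunks; resetting it per chunk => O(N^2)
  var out = []
  function write (chunk) {
    buf += chunk
    var idx
    while ((idx = buf.indexOf(matcher, offset)) !== -1) {
      out.push(buf.slice(0, idx))
      buf = buf.slice(idx + matcher.length)
      offset = 0 // only reset after consuming up to the match
    }
    // Everything scanned so far is match-free, except that a match may
    // still straddle the next chunk boundary, so back off accordingly.
    offset = Math.max(0, buf.length - matcher.length + 1)
  }
  function end () {
    if (buf.length) out.push(buf)
    return out
  }
  return { write: write, end: end }
}

// Usage:
var s = makeSplitter('\n')
s.write('foo\nba')
s.write('r\nbaz')
console.log(s.end()) // -> [ 'foo', 'bar', 'baz' ]
```

With a matcher that never occurs, each incoming chunk only scans its own new bytes, keeping total work linear in the stream length.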
index.js (outdated diff):

```js
@@ -37,6 +37,7 @@ function BinarySplit (matcher) {
  } else {
    if (offset >= buf.length) {
```
Honestly, I don't see how this clause can ever be evaluated true (it doesn't in the tests and any scenario I can think of). So, the offset = 0 two lines below is kind of a guess…
Hmm, I'm also not sure whether this block is necessary there. @maxogden is it safe to remove or does it have a special meaning that's not obvious?
Can't remember :P Probably me just being overly defensive
This looks great to me. Unfortunately we don't have any real stress tests in the test suite to find potential regressions in changes like this, but if the change works fine with something like a big Tile-Reduce job, let's merge.
(this one fails before max-mapper#8)
OK, my initial approach turned out not to be good enough, so I've replaced it with something more robust, including some tests.
index.js (outdated diff):

```js
  }
} else if (idx === 0) {
  buf = bops.subarray(buf, offset + matcher.length)
if (typeof idx === 'number' && idx < buf.length) {
```
`idx` can only be either `false` or a number, so the `typeof` check is not necessary here; `idx !== false` suffices. Although for consistency with JS's `indexOf`, I'd return -1 instead of `false`.
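A tiny sketch of the two styles being compared (hypothetical helper names, strings instead of Buffers):

```javascript
// Style under review: returns false when the matcher is absent,
// forcing call sites into `typeof idx === 'number'` checks.
function findReturningFalse (haystack, needle, offset) {
  var idx = haystack.indexOf(needle, offset)
  return idx === -1 ? false : idx
}

// Suggested style: pass the indexOf convention (-1 for "not found")
// straight through, so callers just test `idx !== -1`.
function findReturningMinusOne (haystack, needle, offset) {
  return haystack.indexOf(needle, offset)
}

var idx = findReturningMinusOne('foo\nbar', '\n', 0)
if (idx !== -1) {
  // handle the match at position idx
}
```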
(this one fails before max-mapper#8)
Great, looks good now! Maybe let's squash into one commit?
Rebased and merged.
Nope, had to revert: I ran a local benchmark on a big text file with short lines and it's 60% slower. We need to investigate this.
My benchmark script, using Node 5.8.0:

```js
var fs = require('fs');
var split = require('./');

console.time('split');
fs.createReadStream('../mbtiles/latest.planet-z12.txt')
  .pipe(split())
  .on('data', function () {})
  .on('end', function () {
    console.timeEnd('split');
  });
```

The text file is a 28M list of line-delimited tile numbers (e.g. …).
use global offset for buffered chunks
Performance is great now.
Yeah. I didn't think that adding one additional …
@tyrasd it's O(1) but happens a lot with short lines, in a hot loop. |
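To illustrate that point (a hedged sketch with assumed numbers, not the real benchmark): any per-match O(1) operation executes once per delimiter, so with short lines the number of calls grows with the line count, and the per-call constant, not the asymptotics, sets the runtime.

```javascript
// Sketch: the number of times a per-match O(1) step runs equals the
// number of delimiters. Assuming lines of roughly 8 bytes, a 28M file
// has on the order of 3.5 million of them, so even a cheap extra step
// per match (an allocation, a function call) shows up in a benchmark.
function countMatches (text, matcher) {
  var count = 0
  var idx = text.indexOf(matcher)
  while (idx !== -1) {
    count++ // stand-in for the once-per-line O(1) work in the hot loop
    idx = text.indexOf(matcher, idx + matcher.length)
  }
  return count
}

// Short lines mean many matches per byte of input:
console.log(countMatches('12/34/56\n12/34/57\n12/34/58\n', '\n')) // 3
```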