
Conversation

@a-roberts
Contributor

What changes were proposed in this pull request?

This change is very similar to my pull request for improving PartitionedPairAppendOnlyMap: #15735

Summarising (more detail above): we avoid the slow iterator wrapping in favour of helping the inliner. We observed that this, when combined with the above change, leads to a 3% performance increase on the HiBench large PageRank benchmark with both IBM's SDK for Java and OpenJDK 8.

How was this patch tested?

Existing unit tests, plus the HiBench large profile (specifically the PageRank benchmark) with both IBM's SDK for Java and OpenJDK 8.

@SparkQA

SparkQA commented Nov 2, 2016

Test build #67984 has finished for PR 15736 at commit 0d1411c.

  • This patch fails Scala style tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

} else
new Comparator[(Int, K)] {
override def compare(a: (Int, K), b: (Int, K)): Int = {
val partitionDiff = a._1 - b._1
Member

There are some indentation problems here and the else clause is missing a brace. I think you can omit the type of comparator; no space before the colon in any event.

This subtraction can overflow in theory and give the wrong answer, but the existing code does it, so, pass on that.
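To illustrate the overflow concern, here is a minimal sketch (not code from this PR) of why returning `a - b` from a comparator can give the wrong sign:

```scala
// Minimal sketch (not from this PR) of why a subtraction-based comparator
// result can overflow: Int subtraction wraps around on underflow.
object OverflowSketch {
  def main(args: Array[String]): Unit = {
    val a = -2000000000 // conceptually less than b
    val b = 2000000000
    // Mathematically a - b = -4000000000, which doesn't fit in an Int and
    // wraps to a positive value, so `a - b` wrongly signals a > b.
    assert(a - b > 0)
    // Integer.compare sidesteps the wraparound and gives the correct sign.
    assert(Integer.compare(a, b) < 0)
  }
}
```

In this code the operands are partition IDs, which are small and non-negative, so the subtraction is safe in practice; hence "in theory" above.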

While optimizing, do you want to call keyComparator.get outside the class definition?

There's a similar construct in PartitionedAppendOnlyMap that should be changed too. Can this be refactored maybe?

Can the method partitionKeyComparator go away? I think the whole WritablePartitionedPairCollection object goes away after this if you care to 'inline' it too in the one refactored instance.

@a-roberts
Contributor Author

Will be adding the commit from #15735 here upon addressing the feedback

Inline benefit with this approach as we avoid the bad iterator wrapping
@a-roberts
Contributor Author

Addressed the scalastyle comments and added the PartitionedAppendOnlyMap change here as per the above suggestions; I'll look at the review comments next.

Two unrelated asides:

  1. I'm wary of hogging the build machines; it would be useful not to autotest every time.
  2. dev/scalastyle should accept a parameter so we can quickly check just the one file; it typically takes a long time.

@SparkQA

SparkQA commented Nov 2, 2016

Test build #67986 has finished for PR 15736 at commit a3d85b6.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

partitionDiff
} else {
keyComparator.get.compare(a._2, b._2)
}
Contributor

You can dereference the Option up front to avoid get in the inner loop.
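A small sketch of that suggestion (hypothetical names): resolve the Option once when building the Comparator, rather than calling .get inside every compare():

```scala
import java.util.Comparator

// Hypothetical sketch of the review suggestion: dereference the Option once,
// outside the hot compare() path, rather than calling keyComparator.get on
// every comparison.
object HoistGetSketch {
  def partitionKeyComparator[K](keyComparator: Option[Comparator[K]]): Comparator[(Int, K)] = {
    val keyComp = keyComparator.get // resolved once, up front
    new Comparator[(Int, K)] {
      override def compare(a: (Int, K), b: (Int, K)): Int = {
        val partitionDiff = a._1 - b._1
        if (partitionDiff != 0) partitionDiff else keyComp.compare(a._2, b._2)
      }
    }
  }

  def main(args: Array[String]): Unit = {
    val cmp = partitionKeyComparator(Some(Comparator.naturalOrder[String]()))
    assert(cmp.compare((0, "b"), (1, "a")) < 0) // different partitions: partition wins
    assert(cmp.compare((1, "a"), (1, "b")) < 0) // same partition: key comparator decides
  }
}
```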

Member

Yeah, I think GitHub collapsed it, but there's this and other suggestions at ... #15736 (review)

@SparkQA

SparkQA commented Nov 2, 2016

Test build #67987 has finished for PR 15736 at commit af4aea3.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@srowen
Member

srowen commented Nov 5, 2016

To recap some of my feedback here, I think this will be a fine change but it can be refactored further.

I think we can refactor this logic, which appears twice, into one place, perhaps in object WritablePartitionedPairCollection? That's where its support code currently lives, anyway.

Since the two methods there are only used by these call sites that are changing, they can be 'inlined' into the one common implementation, which might open up more optimization.

It'd be nice to fix the subtraction issue while we're here unless someone is convinced the difference can never overflow.

keyComparator.get can be lifted out of the compare() method.

@srowen
Member

srowen commented Nov 9, 2016

@a-roberts this does look like a worthy change, what do you think of the further simplifications here?

@a-roberts
Contributor Author

Sean, they are great suggestions, thanks -- I'll find the time (as with the other outstanding pull requests) to get your feedback integrated, tested and profiled; I'm currently caught up in packaging our own Apache Spark releases for both 1.6.3 and 2.0.2. I also plan to create a JIRA proposing regular performance runs using the latest Spark snapshot builds to track regressions (I have this all set up with scripts already).

@srowen
Member

srowen commented Nov 19, 2016

I know you're busy but this does look like a good change to finish off. That it's a win is self-evident, just a question of how much, and benchmarks you have already show it is an improvement. I can take it on (credit remains with you) or will just wait if you're getting back to it.

@a-roberts
Contributor Author

I'm resuming the work for all of these related PRs this week, after the London Spark meetup on Wednesday. If you are keen to take it on, I'm more than happy to help out, and I'll share some information here that you and others should find useful.

Useful tools
As well as simple microbenchmarking we use the Linux perf tools, tprof with Visual Performance Analyzer and also IBM's Healthcenter for Java for method profiling (this is bundled with the JDK and you provide -Xhealthcenter as a driver/executor option then open the files in said tool).

Benchmarks
We'd want to run our improvement ideas with and without the changes using HiBench 6 (large profile) and SparkSqlPerf against all 100 TPCDS queries.

@a-roberts
Contributor Author

Back to working on the performance-related JIRAs now; based on the above helpful comments, here's what I'll do.

Remove the .get.compare from the loop as suggested above - we'll do a .get upfront to get our comparator to use, eliminating the .get later

Move the duplicated code into the WritablePartitionedPairCollection object so the two methods optimised here will call the above new method (let's say it's called getComparator) before returning accordingly (both methods are the same apart from the final few lines).

PartitionedAppendOnlyMap returns

destructiveSortedIterator(comparator)

and PartitionedPairBuffer returns:

new Sorter(new KVArraySortDataFormat[(Int, K), AnyRef]).sort(data, 0, curSize, comparator)
iterator

I'll then build/test/profile this again

}
}

/* Takes an optional parameter (keyComparator), use if provided
Member

Javadoc/scaladoc starts with /** and usually you leave that alone on one line and start documentation on the next.

* and returns a comparator for the partitions
*/
def getComparator[K](keyComparator: Option[Comparator[K]]) : Comparator[(Int, K)] = {
val comparator : Comparator[(Int, K)] =
Member

comparator is now entirely redundant. The whole body is just the if statement

: Iterator[((Int, K), V)] = {
val comparator = keyComparator.map(partitionKeyComparator).getOrElse(partitionComparator)
new Sorter(new KVArraySortDataFormat[(Int, K), AnyRef]).sort(data, 0, curSize, comparator)
new Sorter(new KVArraySortDataFormat[(Int, K),
Member

I think breaking the line here is odd. If necessary, pull out the result of getComparator to a statement above to shorten this line, like it was before.

} else {
new Comparator[(Int, K)] {
// We know we have a non-empty comparator here
val ourKeyComp = keyComparator.get
Member

I think this should be outside the body of the anonymous class. You don't need a reference to the Option here even in the anonymous class.

/* Takes an optional parameter (keyComparator), use if provided
* and returns a comparator for the partitions
*/
def getComparator[K](keyComparator: Option[Comparator[K]]) : Comparator[(Int, K)] = {
Member

Nit: no space before colon

*/
def getComparator[K](keyComparator: Option[Comparator[K]]) : Comparator[(Int, K)] = {
val comparator : Comparator[(Int, K)] =
if (keyComparator.isEmpty) {
Member

isDefined is probably a tiny bit more conventional (and then flip the logic here of course)

comparator
}


Member

You can delete partitionKeyComparator below now, right?

def getComparator[K](keyComparator: Option[Comparator[K]]) : Comparator[(Int, K)] = {
val comparator : Comparator[(Int, K)] =
if (keyComparator.isEmpty) {
partitionComparator
Member

This can be inlined now

// We know we have a non-empty comparator here
val ourKeyComp = keyComparator.get
override def compare(a: (Int, K), b: (Int, K)): Int = {
val partitionDiff = a._1 - b._1
Member

I'm still not thrilled about the subtraction here but maybe leave it for now

@SparkQA

SparkQA commented Nov 25, 2016

Test build #69164 has finished for PR 15736 at commit fc8f98e.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA

SparkQA commented Nov 25, 2016

Test build #69165 has finished for PR 15736 at commit d342394.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA

SparkQA commented Nov 25, 2016

Test build #69172 has finished for PR 15736 at commit 53ed170.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@a-roberts
Contributor Author

I've conducted a lot of performance tests and gathered .hcd files so I can investigate this next week, but it looks like either the first commit is the best for performance or my current configuration with this benchmark results in us being unable to infer if our changes really make a difference.

Sharing some raw data, the format is as follows.

Benchmark name, date, time, data size in bytes (the same each run), the elapsed time and the throughput (bytes per second).

With the above suggestions for Partitioned*Buffer

ScalaSparkPagerank 2016-11-25 18:49:23 259928115            49.577               5242917              
ScalaSparkPagerank 2016-11-25 18:56:55 259928115            49.946               5204182              
ScalaSparkPagerank 2016-11-25 19:00:04 259928115            46.510               5588650              
ScalaSparkPagerank 2016-11-25 19:02:23 259928115            49.018               5302707              
ScalaSparkPagerank 2016-11-25 19:05:25 259928115            49.270               5275585              

Vanilla, no changes at all

ScalaSparkPagerank 2016-11-25 19:08:45 259928115            48.068               5407508              
ScalaSparkPagerank 2016-11-25 19:11:20 259928115            47.712               5447856              
ScalaSparkPagerank 2016-11-25 19:13:50 259928115            44.517               5838850              
ScalaSparkPagerank 2016-11-25 19:16:07 259928115            49.942               5204599              
ScalaSparkPagerank 2016-11-25 19:19:08 259928115            48.521               5357023              

Original commit

ScalaSparkPagerank 2016-11-25 19:47:59 259928115            45.486               5714464              
ScalaSparkPagerank 2016-11-25 19:50:48 259928115            48.507               5358569              
ScalaSparkPagerank 2016-11-25 19:53:09 259928115            47.063               5522982              
ScalaSparkPagerank 2016-11-25 19:56:58 259928115            46.154               5631757              
ScalaSparkPagerank 2016-11-25 20:00:01 259928115            48.935               5311701        

In Healthcenter I do see that these methods are still great candidates for optimisation as they are all very commonly used.

Open to more suggestions, I have exclusive access to lots of hardware, can easily churn out more custom builds and have lots of profiling software we can use. I'll be committing code for the SizeEstimator soon as that's a good candidate for optimisation here as well.

Member

@srowen srowen left a comment


Hm, I don't see why this would be slower than the original version. It should be nearly identical anyway or better, as it further inlines a few things. It could be some weird interactions with the JIT and benchmark or whatever, or maybe some difference in how it was tested.

Try one more round of changes here and benchmark again. In any event it would be worthwhile just for the code streamlining.

keyComparator.compare(a._2, b._2)
def getComparator[K](keyComparator: Option[Comparator[K]]): Comparator[(Int, K)] = {
if (!keyComparator.isDefined) return partitionComparator
else {
Member

Style is off here -- you need braces in both clauses, return is redundant, and there's no point in inverting the condition as opposed to just flipping the clauses.

} else {
keyComparator.compare(a._2, b._2)
def getComparator[K](keyComparator: Option[Comparator[K]]): Comparator[(Int, K)] = {
if (!keyComparator.isDefined) return partitionComparator
Member

inline and remove partitionComparator, as I think it's not used

// We know we have a non-empty comparator here
override def compare(a: (Int, K), b: (Int, K)): Int = {
val partitionDiff = a._1 - b._1
if (partitionDiff != 0) {
Member

Probably very very slightly better to say

if (a._1 == b._1) {
  theKeyComp.compare(a._2, b._2)
} else {
  a._1 - b._1
}

@a-roberts
Contributor Author

Before progressing, I've discussed what I'm seeing with our JIT compiler team. With the refactoring to reduce code duplication, the following occurs, which solves some of the mystery -- although it's bad news as, like you, I wanted to remove the duplicate method.

Summarising:

By having both of these classes share the getComparator method one level up in the hierarchy, the JIT profiling won't function as expected.

Let's assume we have A (calling method) -> getComparator -> B (returns the comparator) and then you have C (another calling method) -> getComparator -> D (returns the comparator).

A and C are the actual methods calling getComparator; B and D are the comparators that are returned.

As a JIT compiler, if we profile getComparator on its own, we will see it calling B and calling D, since profiling in most JITs is context insensitive, and we think that's what's happening here.

When we inline getComparator into A and into C, the JIT doesn't know whether it should inline B or D, and given that inlining is critical for performance, we see the slight drop in performance. Inlining is critical for eliminating call overheads, improving code locality, and widening the scope for optimisation.
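A sketch of the shape being described (illustrative names only, not the Spark code): two call sites funnel through one shared factory, so the compare() call site consuming its result sees two receiver classes and a context-insensitive profile can't tell them apart.

```scala
import java.util.Comparator

// Illustrative only: one shared factory returns two different anonymous
// Comparator classes ("B" and "D"), so a context-insensitive profile of the
// downstream compare() call site sees both receivers and the JIT cannot
// confidently inline either one.
object ProfilingSketch {
  def getComparator(byKey: Boolean): Comparator[(Int, String)] =
    if (byKey) {
      new Comparator[(Int, String)] { // "B": partition, then key
        override def compare(a: (Int, String), b: (Int, String)): Int =
          if (a._1 != b._1) a._1 - b._1 else a._2.compareTo(b._2)
      }
    } else {
      new Comparator[(Int, String)] { // "D": partition only
        override def compare(a: (Int, String), b: (Int, String)): Int =
          a._1 - b._1
      }
    }

  def main(args: Array[String]): Unit = {
    // Call sites "A" and "C" both go through getComparator, so at runtime
    // the receiver of compare() alternates between B and D.
    val d = getComparator(byKey = false)
    val b = getComparator(byKey = true)
    assert(d.compare((1, "b"), (1, "a")) == 0) // D ignores keys
    assert(b.compare((1, "b"), (1, "a")) > 0)  // B falls back to key compare
  }
}
```

Duplicating the construction at each call site keeps each compare() call site monomorphic, which is consistent with the JIT team's explanation above.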

@srowen
Member

srowen commented Nov 29, 2016

If that's true then again doesn't my suggestion to inline partitionComparator fix it?

@a-roberts
Contributor Author

@srowen how about this for profiling?

private[spark] object WritablePartitionedPairCollection {
  /**
   * Takes an optional parameter (keyComparator), use if provided
   * and returns a comparator for the partitions
   */
  def getComparator[K](keyComparator: Option[Comparator[K]]): Comparator[(Int, K)] = {
    if (keyComparator.isDefined) {
      val theKeyComp = keyComparator.get
      new Comparator[(Int, K)] {
        // We know we have a non-empty comparator here
        override def compare(a: (Int, K), b: (Int, K)): Int = {
          if (a._1 == b._1) {
            theKeyComp.compare(a._2, b._2)
          } else {
            a._1 - b._1
          }
        }
      }
    } else return new Comparator[(Int, K)] {
      override def compare(a: (Int, K), b: (Int, K)): Int = {
        a._1 - b._1
      }
    }
  }
}

@srowen
Member

srowen commented Nov 30, 2016

Looks right except you just want to write

if (...) {
  ...
} else {
  new Comparator...
}

@a-roberts
Contributor Author

Good point, done. So I can go ahead and profile the code below? It builds fine with no scalastyle problems.

  def getComparator[K](keyComparator: Option[Comparator[K]]): Comparator[(Int, K)] = {
    if (keyComparator.isDefined) {
      val theKeyComp = keyComparator.get
      new Comparator[(Int, K)] {
        // We know we have a non-empty comparator here
        override def compare(a: (Int, K), b: (Int, K)): Int = {
          if (a._1 == b._1) {
            theKeyComp.compare(a._2, b._2)
          } else {
            a._1 - b._1
          }
        }
      }
    } else {
      new Comparator[(Int, K)] {
        override def compare(a: (Int, K), b: (Int, K)): Int = {
          a._1 - b._1
        }
      }
    }
  }

@mridulm
Contributor

mridulm commented Nov 30, 2016

(Particularly) as the number of partitions increases, "if (a._1 != b._1)" might be better for branch-prediction reasons.

@a-roberts
Contributor Author

I see, so we do the != comparison first, which is likely to be true more of the time, so we're not consistently failing this check and then entering the else.

  def getComparator[K](keyComparator: Option[Comparator[K]]): Comparator[(Int, K)] = {
    if (keyComparator.isDefined) {
      val theKeyComp = keyComparator.get
      new Comparator[(Int, K)] {
        // We know we have a non-empty comparator here
        override def compare(a: (Int, K), b: (Int, K)): Int = {
          if (a._1 != b._1) {
            a._1 - b._1
          } else {
            theKeyComp.compare(a._2, b._2)
          }
        }
      }
    } else {
      new Comparator[(Int, K)] {
        override def compare(a: (Int, K), b: (Int, K)): Int = {
          a._1 - b._1
        }
      }
    }
  }

Again that builds fine

@srowen
Member

srowen commented Nov 30, 2016

Hm why does the order matter - maybe helps branch prediction? I doubt we even know how the bytecode orders this let alone how it is JITted and whether it will gather branching info on this one branch. Either way. I usually prefer == for code clarity all else equal. No need to benchmark both just pick one.

@a-roberts
Contributor Author

Passed on your question to our JIT developers

The sense* of the test can have an impact on VM interpreter performance, but that is not usually much of a component of actual throughput, since important methods will be JIT'd very quickly regardless of which specific JVM you use. The J9 VM is capable of profiling the branch and flipping the sense when JITting the code.

The sense refers to the way a code branches, so either down the equals branch or not equals branch

Numbers for us

Refactored further as above

ScalaSparkPagerank 2016-11-30 14:27:49 259928115            49.841               5215146
ScalaSparkPagerank 2016-11-30 14:29:52 259928115            51.310               5065837
ScalaSparkPagerank 2016-11-30 14:31:59 259928115            52.086               4990364
ScalaSparkPagerank 2016-11-30 14:34:05 259928115            50.667               5130126
ScalaSparkPagerank 2016-11-30 14:36:04 259928115            47.096               5519112
ScalaSparkPagerank 2016-11-30 14:38:04 259928115            48.244               5387781
ScalaSparkPagerank 2016-11-30 14:40:10 259928115            48.734               5333609
ScalaSparkPagerank 2016-11-30 14:42:12 259928115            49.295               5272910
397.273 / 8 = 49.659 sec average

initial commit

ScalaSparkPagerank 2016-11-30 14:48:01 259928115            46.442               5596832
ScalaSparkPagerank 2016-11-30 14:50:06 259928115            50.016               5196899
ScalaSparkPagerank 2016-11-30 14:52:12 259928115            51.113               5085362
ScalaSparkPagerank 2016-11-30 14:54:12 259928115            46.424               5599002
ScalaSparkPagerank 2016-11-30 14:56:15 259928115            47.604               5460215
ScalaSparkPagerank 2016-11-30 14:58:14 259928115            46.802               5553782
ScalaSparkPagerank 2016-11-30 15:00:16 259928115            47.021               5527915
ScalaSparkPagerank 2016-11-30 15:02:16 259928115            47.072               5521926
382.494 / 8 = 47.811s average

The first commit performs better on average. Next, I'd like to add the improved compare code as above and "push it down" into the subclasses to see how that performs.

@srowen
Member

srowen commented Dec 4, 2016

I'd certainly be curious to see a benchmark of the 'final' version with inlined comparator. I would honestly be surprised if that's not fastest of all.

@a-roberts
Contributor Author

New data for us, inlined comparator scores here (code provided below to check I've not profiled something useless!):

ScalaSparkPagerank 2016-12-05 13:44:41 259928115            48.149               5398411
ScalaSparkPagerank 2016-12-05 13:46:43 259928115            46.897               5542531
ScalaSparkPagerank 2016-12-05 13:48:46 259928115            49.130               5290619
ScalaSparkPagerank 2016-12-05 13:50:49 259928115            49.793               5220173
ScalaSparkPagerank 2016-12-05 13:52:50 259928115            48.061               5408296
ScalaSparkPagerank 2016-12-05 13:54:52 259928115            46.468               5593701
ScalaSparkPagerank 2016-12-05 13:56:56 259928115            51.385               5058443
ScalaSparkPagerank 2016-12-05 13:58:59 259928115            47.857               5431349
ScalaSparkPagerank 2016-12-05 14:00:59 259928115            46.515               5588049
ScalaSparkPagerank 2016-12-05 14:03:03 259928115            47.791               5438850
Avg 48.2046s

Remember our "vanilla" average time is 47.752s and our first commit averaged 47.229s (so not much of a difference really).

I think we're splitting hairs and I've got another PR I am seeing good results on that I plan to focus on instead: the SizeEstimator.

This is what I've benchmarked (PartitionedAppendOnlyMap first), so let me know if there are any further suggestions; otherwise I propose leaving this one for later, as against the Spark master codebase I'm not noticing anything exciting.

  def partitionedDestructiveSortedIterator(keyComparator: Option[Comparator[K]])
    : Iterator[((Int, K), V)] = {
    val comparator = {
      if (keyComparator.isDefined) {
        val theKeyComp = keyComparator.get
        new Comparator[(Int, K)] {
          // We know we have a non-empty comparator here
          override def compare(a: (Int, K), b: (Int, K)): Int = {
            if (a._1 != b._1) {
              a._1 - b._1
            } else {
              theKeyComp.compare(a._2, b._2)
            }
          }
        }
      } else {
        new Comparator[(Int, K)] {
          override def compare(a: (Int, K), b: (Int, K)): Int = {
            a._1 - b._1
          }
        }
      }
    }
    destructiveSortedIterator(comparator)
  }

In PartitionedPairBuffer


  /** Iterate through the data in a given order. For this class this is not really destructive. */
  override def partitionedDestructiveSortedIterator(keyComparator: Option[Comparator[K]])
    : Iterator[((Int, K), V)] = {
    val comparator = {
      if (keyComparator.isDefined) {
        val theKeyComp = keyComparator.get
        new Comparator[(Int, K)] {
          // We know we have a non-empty comparator here
          override def compare(a: (Int, K), b: (Int, K)): Int = {
            if (a._1 != b._1) {
              a._1 - b._1
            } else {
              theKeyComp.compare(a._2, b._2)
            }
          }
        }
      } else {
        new Comparator[(Int, K)] {
          override def compare(a: (Int, K), b: (Int, K)): Int = {
            a._1 - b._1
          }
        }
      }
    }
    new Sorter(new KVArraySortDataFormat[(Int, K), AnyRef]).sort(data, 0, curSize, comparator)
    iterator
  }

WritablePartitionedPairCollection remains unchanged.

@srowen
Member

srowen commented Dec 6, 2016

It does seem like nice cleanup in any event. I am not sure why the first commit was faster as this seems like a 'superset' of optimization. We can't use that one in any event. If you want to update the PR with what you posted above, I think it'd be OK to commit just for the code simplification.

@srowen
Member

srowen commented Dec 11, 2016

@a-roberts let's either finish the thought and merge this as mostly a code cleanup and maybe marginal win, or just close it.

@srowen
Member

srowen commented Dec 19, 2016

Ping @a-roberts to resolve this

@srowen
Member

srowen commented Dec 30, 2016

I'm going to manually close this

srowen added a commit to srowen/spark that referenced this pull request Feb 2, 2017
@srowen srowen mentioned this pull request Feb 2, 2017
@asfgit asfgit closed this in 20b4ca1 Feb 3, 2017